Skip to content

some problems for test_ray_air.py #7

Open
@fcyi

Description

@fcyi

run the code, then meet the follow bugs(filter some details for my privacy),If you could provide assistance, I would greatly appreciate it:

2025-05-05 17:29:04,506 ERROR trial_runner.py:1450 -- Trial XGBoostTrainer_61244_00000: Error happened when processing _ExecutorEventType.TRAINING_RESULT.
ray.exceptions.RayTaskError(TypeError): ray::_Inner.train() (pid=635952, ip=198.18.0.1, repr=XGBoostTrainer)
TypeError: getaddrinfo() argument 1 must be string or None
2025-05-05 17:29:04,509 ERROR tune.py:941 -- Trials did not complete: [XGBoostTrainer_61244_00000]
2025-05-05 17:29:04,509 INFO tune.py:945 -- Total run time: 2.99 seconds (2.98 seconds for the tuning loop).
ray.exceptions.RayTaskError(TypeError): ray::_Inner.train() (pid=635952, ip=198.18.0.1, repr=XGBoostTrainer)
TypeError: getaddrinfo() argument 1 must be string or None
ray.train.base_trainer.TrainingFailedError: The Ray Train run failed. Please inspect the previous error messages for a cause. After fixing the issue (assuming that the error is not caused by your own application logic, but rather an error such as OOM), you can restart the run from scratch or continue this run.
To continue this run, you can use: trainer = XGBoostTrainer.restore("/home/hqd/ray_results/XGBoostTrainer_2025-05-05_17-29-01").
To start a new run that will retry on training failures, set air.RunConfig(failure_config=air.FailureConfig(max_failures)) in the Trainer's run_config with max_failures > 0, or max_failures = -1 for unlimited retries.

my environment is:
python==3.10.1
ray[all]==2.4.0
pydantic==1.10.1
xgboost-ray
catkin-tools==0.9.4
catkin-pkg==0.3.0
matplotlib==3.9.4
pyparsing==2.3.1
tqdm
numpy==1.25.1

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions