The combination of segmentation and pose with strange result #19501
Replies: 2 comments 2 replies
-
👋 Hello @Ichiruchan, thank you for sharing your detailed query and interest in the Ultralytics repository 🚀! Your results and observations are quite intriguing. We recommend referring to the Docs for detailed guidance on custom setups like this. For additional context, you can review the sections on model configurations and custom training to ensure all modifications align with best practices. If this is a 🐛 Bug Report related to the model behavior, please share a minimum reproducible example (MRE). For example, please include the following:
If this is a ❓ Question about custom training adjustments, please provide:
UpgradeIt's also a good idea to ensure your environment is up to date. You can upgrade to the latest pip install -U ultralytics EnvironmentsUsing a verified and stable environment can help debug more reliably. Supported environments include:
Community and ResourcesTo connect with the community and gain further insights, consider joining:
StatusLastly, you can check the current CI status of the repository here: This is an automated response 😊, but don’t worry—an Ultralytics engineer will take a closer look at your issue and assist you shortly! |
Beta Was this translation helpful? Give feedback.
-
@Ichiruchan for multi-task learning combining segmentation and pose estimation, we recommend: 1) Verifying your annotation alignment between segmentation masks and keypoints 2) Adjusting loss weights specifically for pose estimation using the hyp.pose parameter in your hyperparameters file 3) Considering separate task-specific models as combined architectures can create conflicting gradients. The Pose Estimation Documentation shows proper implementation for individual tasks. For custom architectures, you might need to modify the loss balancing - see our pose loss implementation in ultralytics/utils/loss.py. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I’m currently facing a challenge with my model, where I’ve combined the segmentation head and pose head into a single structure. I’ve adjusted the data reading process and modified the loss function to train the new model with the default hyperparameters. However, the predictions seem off, and the metrics are not performing well (MAP50-95 is about 0.91). For instance, the keypoints are appearing outside the bounding boxes, and both the segmentation and detection components are underperforming

Interestingly, when I remove the keypoint annotations and train on segmentation, the model performs well (MAP50-95 is nearly 0.955).

Could anyone provide suggestions on how to improve this situation?
Beta Was this translation helpful? Give feedback.
All reactions