UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

CVPR 2025 Highlight

⭐ If UniPose is helpful to your projects, please help star this repo. Thanks! 🤗

This repository contains the official implementation of UniPose. UniPose is a multi-modal LLM to understand, generate and edit 3D human poses. It enables seamless transformation of human poses across multi-modalities (visual, text, SMPL) and multi-tasks (comprehension, generation, editing).

🔧 Dependencies and Installation

Clone this repo and navigate to UniPose folder

git clone https://github.com/liyiheng23/UniPose.git
cd UniPose

Create Conda Environment and Install Package

conda create -n unipose python=3.10 -y
conda activate unipose
pip3 install -r requirements.txt

Download Pre-trained Models and put these models under ./cache folder.

Base Model:

CLIP-ViT: openai/clip-vit-large-patch14-336
LLaVA: liuhaotian/llava-v1.6-mistral-7b
smpl_models: smplh

Ours provided Model:

unipose: L-yiheng/UniPose
tokenhmr_model: tokenhmr_model
pose-vqvae: pose-vqvae

Now under ./cache, you should be able to see the following:

cache
├── clip-vit-large-patch14-336
├── llava-v1.6-mistral-7b
├── pose_vqvae
│   └── best_MPJPE.ckpt
├── smpl_models
│   └── smplh
│       ├── SMPLH_FEMALE.npz
│       ├── SMPLH_MALE.npz
│       └── SMPLH_NEUTRAL.npz
├── tokenhmr_model.ckpt
└── unipose

⚡ Inference

python inference.py \
    --model-path cache/unipose \
    --model-base cache/llava-v1.6-mistral-7b \
    --config configs/inference.py

🪪 License

The provided code and pre-trained weights are licensed under the Apache 2.0 license.

🤗 Acknowledgement

This code is based on ChatPose, LISA, LLaVA and PoseScript. Some code are brought from TokenHMR, 4D-Humans. We thank the authors for their awesome work.

📧 Contact

If you have any questions, please feel free to reach me out at liyiheng23@gmail.com.

📖 Citation

If you find our work useful for your research, please consider citing our paper:

@article{li2024unipose,
  title={UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing},
  author={Li, Yiheng and Hou, Ruibing and Chang, Hong and Shan, Shiguang and Chen, Xilin},
  journal={arXiv preprint arXiv:2411.16781},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
assets		assets
configs		configs
posegpt		posegpt
scripts/instructions		scripts/instructions
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

🔧 Dependencies and Installation

Base Model:

Ours provided Model:

⚡ Inference

🪪 License

🤗 Acknowledgement

📧 Contact

📖 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

liyiheng23/UniPose

Folders and files

Latest commit

History

Repository files navigation

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

🔧 Dependencies and Installation

Base Model:

Ours provided Model:

⚡ Inference

🪪 License

🤗 Acknowledgement

📧 Contact

📖 Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages