Unlock Multi-Person 3D Motion Capture with CoMotion: A Comprehensive Guide
Tired of single-person pose estimation? CoMotion revolutionizes 3D motion capture by detecting and tracking multiple people simultaneously from a single camera. This guide provides everything you need to get started with this groundbreaking technology, perfect for researchers and developers alike.
What is CoMotion and Why Should You Care?
CoMotion is an innovative approach to multi-person 3D motion capture. Unlike traditional methods, it excels in crowded scenes and handles complex poses with ease; plus, it provides significant upgrades:
- Real-Time Tracking: CoMotion updates poses directly from new images, enabling robust online tracking, even through occlusions.
- Single Camera Simplicity: Forget complex multi-camera setups. CoMotion works with a single, monocular camera.
- Detailed 3D Poses: Capture accurate and detailed 3D poses of multiple individuals, opening doors to advanced motion analysis.
Quick Start: Installing and Running CoMotion
Ready to jump in? Follow these simple steps to install and run CoMotion on your system using the detailed CoMotion installation guide:
- Create a Conda Environment:
- Download Pre-trained Models: This downloads the necessary checkpoints for both the detection and refinement stages.
- Get the SMPL Body Model:
- Download the neutral SMPL body model (version 1.1.0) from the official SMPL website.
- Copy
basicmodel_neutral_lbs_10_207_0_v1.1.0.pkl
tosrc/comotion_demo/data/smpl/SMPL_NEUTRAL.pkl
(renaming the file).
Running the CoMotion Demo: Video and Image Examples
CoMotion offers flexible demo options for both video and image inputs. Here's how to run them:
-
Video Demo:
Customize your runs by using arguments like
--start-frame
and--num-frames
. Disable visualization with--skip-visualization
. -
Single Image Demo:
CoMotion automatically detects image inputs based on the file suffix (.png, .jpeg, .jpg). The 3D pose estimation will be processed.
Troubleshooting Common Issues
Encountering problems? Here are quick solutions to common issues:
- "libc++.1.dylib not found" Error: Resolve this by running
conda install libcxx
. - "XOpenDisplay: cannot open display" Error (Headless Rendering):
- Start a virtual display using
Xvfb :0 -screen 0 640x480x24 & export DISPLAY=:0.0
. - Install Xvfb if needed:
apt install xvfb
.
- Start a virtual display using
Standalone Single-Image Multi-Person Pose Estimation
Utilize the detection stage independently from CoMotion for single-image multi-person pose estimation tasks. No tracking required! The comotion_detection_checkpoint.pt is perfect for leveraging the existing multi-person 3D motion capture base.
Citing CoMotion
If you find CoMotion useful in your research, please cite the following paper:
@inproceedings{newell2025comotion,
title = {CoMotion: Concurrent Multi-person 3D Motion},
author = {Alejandro Newell and Peiyun Hu and Lahav Lipson and Stephan R. Richter and Vladlen Koltun},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://openreview.net/forum?id=qKu6KWPgxt},
}
License Information
The sample code is released under the LICENSE terms, and the model weights are released under the MODEL LICENSE terms. Please refer to these licenses for usage details.