Reconstructing Global Multi-Person 3D Motions From A Single Moving Camera
dc.contributor.author | Zhao, Boxin | |
dc.contributor.copyright-release | Not Applicable | |
dc.contributor.degree | Master of Applied Science | |
dc.contributor.department | Department of Electrical & Computer Engineering | |
dc.contributor.ethics-approval | Not Applicable | |
dc.contributor.external-examiner | n/a | |
dc.contributor.manuscripts | Not Applicable | |
dc.contributor.thesis-reader | Kamal El-Sankary | |
dc.contributor.thesis-reader | Srinivas Sampalli | |
dc.contributor.thesis-supervisor | Jason Gu | |
dc.date.accessioned | 2025-04-17T15:08:23Z | |
dc.date.available | 2025-04-17T15:08:23Z | |
dc.date.defence | 2025-04-11 | |
dc.date.issued | 2025-04-15 | |
dc.description | To address 3D human reconstruction in multiple dynamic scenarios, a new optimization-based framework, Opti-Pose3D, is proposed to recover global multi-person motion from a single moving camera by integrating SLAM techniques with a learned human motion prior. To detect and associate individuals across frames, a robust tracking algorithm is employed to handle occlusions effectively. In addition, a learned motion smoothness prior is incorporated into the optimization pipeline to enhance temporal consistency. Using the initial camera parameters estimated by SLAM, human motion trajectories are reconstructed by jointly optimizing the SMPL model and camera parameters to align the projected 3D poses with the detected 2D observations. | |
dc.description.abstract | The reconstruction of 3D human models from in-the-wild videos has emerged as a highly promising research direction, with increasingly practical and widespread applications. To address the challenges of depth ambiguity and human occlusions in monocular videos, an optimization-based framework is proposed that combines SLAM techniques with human motion priors to recover global multi-person motion from a single moving camera. Camera parameters are estimated using SLAM, while a robust tracking algorithm is adopted to mitigate occlusion effects. A learned motion smoothness prior is further integrated into the optimization process. The SMPL model and camera parameters are jointly optimized to align the reconstructed 3D poses with 2D detections. Experimental results on the 3DPW dataset demonstrate that the proposed method outperforms existing approaches in both accuracy and robustness. | |
dc.identifier.uri | https://hdl.handle.net/10222/85014 | |
dc.language.iso | en | |
dc.subject | Reconstruction | |
dc.subject | SLAM | |
dc.subject | SmoothNet | |
dc.title | Reconstructing Global Multi-Person 3D Motions From A Single Moving Camera |