Abstract
This work presents a comparative evaluation of markerless upper-limb motion tracking methods, namely, Medi-aPipe (monocular), Fusion (multi-view fusion of MediaPipe out-puts), and Triangulation (3D reconstruction via calibrated multi-view geometry), against a marker-based ground truth acquired with OptiTrack. The objective is to assess the feasibility and accuracy of these approaches in reconstructing 3D joint positions during human motion imitation tasks. Data were collected using RGB-D cameras and compared with OptiTrack data after time-based synchronization. This study uses only RGB data. Four representative gestures, Drinking, Eating with a utensil, Closing a zipper, and Touching the nose, were analyzed. The evaluation focuses on the positional error of the shoulder, elbow, and wrist joints, using RMSE as the primary metric. Preliminary results indicate that triangulation achieves the highest spatial accuracy when MediaPipe detections are consistent across views, but its performance degrades rapidly under occlusions or missing key-points. In contrast, MediaPipe and multi-view fusion offer greater robustness in such conditions, maintaining functional accuracy even when visibility is compromised. These findings inform the design of adaptable, markerless motion capture pipelines suitable for different human-robot interaction scenarios.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering (MetroXRAINE) |
| Publisher | IEEE Institute of Electrical and Electronic Engineers |
| Pages | 818-823 |
| Number of pages | 6 |
| ISBN (Electronic) | 979-8-3315-0279-9 |
| DOIs | |
| Publication status | Published - 2025 |
| MoE publication type | A4 Article in a conference publication |
| Event | 4th IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2025 - Ancona, Italy Duration: 22 Oct 2025 → 24 Oct 2025 |
Conference
| Conference | 4th IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2025 |
|---|---|
| Country/Territory | Italy |
| City | Ancona |
| Period | 22/10/25 → 24/10/25 |
Funding
This research received external funding from the project FOCAAL, FOg Computing in Ambient Assisted Living, funded by MIMIT, the Ministry of Enterprise and Made in Italy, under the Life Sciences Innovation Agreement DM 05/03/2018. GB is also supported by the PNRR MUR project PE0000013-FAIR.
Keywords
- Human Motion Imitation
- Human-Robot Interaction
- Markerless Motion Capture
- Motion Analysis
- Multi-view Triangulation
- Pose Estimation
- Robotic Perception
- Upper Limb Kinematics
Fingerprint
Dive into the research topics of 'Markerless Upper Limb Motion Tracking: A Comparative Evaluation of Multi-View Approaches'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver