nicetoolbox.detectors.method_detectors.mmpose.mmpose_framework_3d¶
MMPose 3D frameworks and specific algorithm definitions.
Classes
MMPose3D handles native 3D pose lifters (e.g. |
|
Handler for MotionBERT models trained on Human3.6M (17 Keypoints). |
- class nicetoolbox.detectors.method_detectors.mmpose.mmpose_framework_3d.MMPose3D(io: SequenceIO, data: SequenceData, sequence_context: SequenceRuntimeConfig, algorithm_instance: str)[source]¶
MMPose3D handles native 3D pose lifters (e.g. MotionBERT). Post-inference applies optional temporal filtering and confidence masking on stored 2D/3D arrays; 3D stays in the lifter’s root-relative frame (no PnP or camera alignment).
Initialize base method detector with references.
- class nicetoolbox.detectors.method_detectors.mmpose.mmpose_framework_3d.MotionBERT(io: SequenceIO, data: SequenceData, sequence_context: SequenceRuntimeConfig, algorithm_instance: str)[source]¶
Handler for MotionBERT models trained on Human3.6M (17 Keypoints).
Initialize base method detector with references.
- compute_viz_folder(_visualize: bool) str | None[source]¶
Return None so base __init__ does not mkdir the visualization folder.
The subprocess creates the per-camera frame folders itself; the main process assembles the mp4s in visualization() after inference.
- get_per_component_keypoint_mapping(keypoints_indices)[source]¶
This method extracts the keypoint indices and descriptions for each pose estimation component.
It has to be implemented by the derived classes associated to the available pose estimation algorithms. (See HRNetw48 and Vitpose classes below)
Available algorithms are: hrnetw48, vitpose
- Parameters:
keypoints_indices (_type_) – _description_
- visualization(_)[source]¶
Generates a visualization video for each camera from the processed image frames.
This method takes the processed image frames for each camera and compiles them into a video file. It uses the frames_to_video() function from utils.video.py. The success of the video creation is tracked, and the method logs the outcome of the visualization process for each camera. :param data: An instance of a class that stores all data related
information, including the frame rate (fps) for the video and the starting frame number (video_start).
Note
The method assumes that the processed image frames are named in a
specific format (%09d.jpg), where each frame’s name is a zero-padded five-digit number representing its sequence in the video.