Advances in Probabilistic Modeling and Efficient 3D Representations

The recent advancements in 3D vision and human pose estimation have seen a significant shift towards probabilistic modeling, efficient 3D representations, and the integration of multi-modal data. Probabilistic approaches, particularly those leveraging normalizing flows and non-Euclidean geometries, are gaining traction for tasks like 3D human pose estimation, addressing the inherent ambiguities and uncertainties in these tasks. Efficient 3D representations, such as Fourier Occupancy Fields, are being developed to balance computational efficiency with high-quality results, enabling real-time applications. Multi-modal frameworks are also emerging, combining various data sources like RGB images, depth maps, and motion data to enhance the robustness and accuracy of human motion transfer and garment simulation. Additionally, there is a growing focus on self-calibration and low-rank adaptation techniques for 3D geometric models, improving their generalization capabilities in diverse and challenging scenarios. These developments collectively push the boundaries of what is possible in 3D vision, offering more accurate, efficient, and versatile solutions for a range of applications from telepresence to garment design.

Noteworthy papers include: 1) ProPLIKS for its innovative use of probabilistic modeling in 3D human pose estimation. 2) FOF-X for its efficient Fourier Occupancy Field representation enabling real-time detailed human reconstruction. 3) LoRA3D for its low-rank self-calibration method significantly improving 3D reconstruction performance.

Sources

ProPLIKS: Probablistic 3D human body pose estimation

PanoDreamer: 3D Panorama Synthesis from a Single Image

FOF-X: Towards Real-time Detailed Human Reconstruction from a Single Image

One-shot Human Motion Transfer via Occlusion-Robust Flow Prediction and Neural Texturing

Neural Garment Dynamic Super-Resolution

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models

Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data

BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation

Coherent3D: Coherent 3D Portrait Video Reconstruction via Triplane Fusion

FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Built with on top of