Advances in Unsupervised and Test-Time Adaptation for Vision and Autonomous Systems

The recent advancements in computer vision and autonomous driving have seen a shift towards more robust and versatile models, particularly in the areas of image translation, semantic scene completion, and 3D semantic occupancy prediction. Innovations in LDR-to-HDR image translation now focus on leveraging unpaired datasets and semantic consistency, moving away from the reliance on high-quality paired data. Semantic Scene Completion (SSC) has seen significant improvements with the introduction of test-time adaptation methods that utilize temporal and spatial information from driving environments. The field of 3D semantic occupancy prediction is expanding to include off-road environments, with the introduction of new benchmarks and multi-modal frameworks that enhance prediction accuracy. Additionally, unsupervised domain adaptation in LiDAR-based semantic segmentation is advancing, with new approaches that bridge domain gaps using cross-modal adversarial training. Finally, unsupervised semantic segmentation of high-density multispectral point clouds is making strides, with methods that minimize labeling efforts while maintaining high accuracy.

Noteworthy papers include one that introduces a novel cycle-consistent adversarial architecture for unpaired LDR-to-HDR image translation, achieving state-of-the-art results. Another paper presents a test-time adaptation approach for SSC that significantly improves performance by leveraging temporal observations. A third paper introduces the first benchmark for off-road 3D semantic occupancy prediction, contributing to the expansion of this field into new environments.

Sources

A Cycle Ride to HDR: Semantics Aware Self-Supervised Framework for Unpaired LDR-to-HDR Image Translation

TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training

Unsupervised semantic segmentation of urban high-density multispectral point clouds

Built with on top of