Abstract: Despite promising results, recent RGB-D camera-based category-level object pose estimation methods have limited applicability because they rely heavily on depth sensors. RGB-only ...
Abstract: End-to-end DETR-based cross-modal fusion in 3-D object detection has achieved promising performance on many benchmarks. However, these methods either implement cross-modal fusion in a single ...