Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Paper • 2603.18002 • Published 10 days ago • 13
nvidia/segformer-b5-finetuned-cityscapes-1024-1024 Image Segmentation • Updated Aug 9, 2022 • 62.1k • • 40