FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 13 days ago • 302
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published 8 days ago • 92
MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization Paper • 2603.12743 • Published 20 days ago • 3
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding Paper • 2307.00862 • Published Jul 3, 2023 • 1