Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published 2 days ago • 34
PEARL: Personalized Streaming Video Understanding Model Paper • 2603.20422 • Published 5 days ago • 36
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding Paper • 2603.22458 • Published 2 days ago • 110
Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck Paper • 2603.08462 • Published 16 days ago • 19
Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding Article • 6 days ago • 43
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 11 days ago • 72
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 3 days ago • 33
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Paper • 2603.21383 • Published 3 days ago • 13
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding Paper • 2603.22285 • Published 2 days ago • 45
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD Paper • 2603.20155 • Published 5 days ago • 7
LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric Article • 8 days ago • 14
WorldAgents: Can Foundation Image Models be Agents for 3D World Models? Paper • 2603.19708 • Published 5 days ago • 10
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation Paper • 2603.20192 • Published 5 days ago • 22
FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow Paper • 2603.19598 • Published 6 days ago • 31
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer Paper • 2603.19227 • Published 6 days ago • 40
LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition Paper • 2603.17965 • Published 7 days ago • 5
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens Paper • 2603.19232 • Published 6 days ago • 32
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing Paper • 2603.17942 • Published 7 days ago • 6