PEARL: Personalized Streaming Video Understanding Model Paper • 2603.20422 • Published 11 days ago • 40
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 8 days ago • 337k • 1.89k
Qwen3.5 Omni Online Demo • Running • 44 • Chat with a multimodal AI using text, image, audio, or video
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 22 days ago • 21
MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Paper • 2508.05772 • Published Aug 7, 2025 • 3
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16 Text Generation • 124B • Updated 17 days ago • 11.3k • 26
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought Paper • 2505.19877 • Published May 26, 2025 • 2
nvidia/Cosmos-Embed1-448p-anomaly-detection Video Classification • 1B • Updated 21 days ago • 376 • 5