-
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
Paper • 2508.21113 • Published • 110 -
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Paper • 2508.16949 • Published • 24 -
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 78 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12
Jeff Nyzio
TheOneTrueNiz
AI & ML interests
None yet
Recent Activity
liked a model about 1 hour ago
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF liked a model 1 day ago
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled liked a model 18 days ago
microsoft/Fara-7BOrganizations
None yet