-
MIBURI: Towards Expressive Interactive Gesture Synthesis
Paper • 2603.03282 • Published • 4 -
BBQ-to-Image: Numeric Bounding Box and Qolor Control in Large-Scale Text-to-Image Models
Paper • 2602.20672 • Published • 9 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
Paper • 2603.03447 • Published • 36
Collections
Discover the best community collections!
Collections including paper arxiv:2603.10055
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 3 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 188
-
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models
Paper • 2507.23313 • Published • 1 -
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Paper • 2508.03448 • Published • 6 -
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor
Paper • 2508.01311 • Published • 2 -
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model
Paper • 2505.21179 • Published • 13
-
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Paper • 2603.12228 • Published • 12 -
Meta-Reinforcement Learning with Self-Reflection for Agentic Search
Paper • 2603.11327 • Published • 8 -
Training Language Models via Neural Cellular Automata
Paper • 2603.10055 • Published • 7 -
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks
Paper • 2603.11487 • Published • 2
-
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification
Paper • 2509.15591 • Published • 45 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 94 -
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
Paper • 2602.03120 • Published • 1 -
TADA! Tuning Audio Diffusion Models through Activation Steering
Paper • 2602.11910 • Published • 2
-
Depth Anything V2
Paper • 2406.09414 • Published • 103 -
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper • 2406.09415 • Published • 51 -
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
Paper • 2406.04338 • Published • 39 -
SAM 2: Segment Anything in Images and Videos
Paper • 2408.00714 • Published • 120
-
MIBURI: Towards Expressive Interactive Gesture Synthesis
Paper • 2603.03282 • Published • 4 -
BBQ-to-Image: Numeric Bounding Box and Qolor Control in Large-Scale Text-to-Image Models
Paper • 2602.20672 • Published • 9 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions
Paper • 2603.03447 • Published • 36
-
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Paper • 2603.12228 • Published • 12 -
Meta-Reinforcement Learning with Self-Reflection for Agentic Search
Paper • 2603.11327 • Published • 8 -
Training Language Models via Neural Cellular Automata
Paper • 2603.10055 • Published • 7 -
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks
Paper • 2603.11487 • Published • 2
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 3 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 188
-
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification
Paper • 2509.15591 • Published • 45 -
A Survey on Latent Reasoning
Paper • 2507.06203 • Published • 94 -
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
Paper • 2602.03120 • Published • 1 -
TADA! Tuning Audio Diffusion Models through Activation Steering
Paper • 2602.11910 • Published • 2
-
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models
Paper • 2507.23313 • Published • 1 -
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Paper • 2508.03448 • Published • 6 -
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor
Paper • 2508.01311 • Published • 2 -
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model
Paper • 2505.21179 • Published • 13
-
Depth Anything V2
Paper • 2406.09414 • Published • 103 -
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper • 2406.09415 • Published • 51 -
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
Paper • 2406.04338 • Published • 39 -
SAM 2: Segment Anything in Images and Videos
Paper • 2408.00714 • Published • 120