arxiv:2603.19220
Dongfu Jiang
DongfuJiang
AI & ML interests
Large Language Model, Modality Reasoning and their evaluation
Recent Activity
authored a paper about 12 hours ago
EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning authored a paper about 12 hours ago
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation authored a paper about 12 hours ago
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis