Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

authored a paper about 12 hours ago

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

authored a paper about 12 hours ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

authored a paper about 12 hours ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

View all activity

Organizations

Papers 20

arxiv:2603.19220

arxiv:2603.20278

arxiv:2603.12698

arxiv:2509.22824

models 48

DongfuJiang/nano_v3_search_incorrect_only_347_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_correct_only_347_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_200_steps

32B • Updated Jan 28 • 1

DongfuJiang/nano_v3_search_347_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_incorrect_only_200_steps

32B • Updated Jan 28

DongfuJiang/nano_v3_search_correct_only_200_steps

32B • Updated Jan 28

DongfuJiang/vs2_qwen2_5vl_sft_17k_1e-5

Image-Text-to-Text • 8B • Updated Jul 17, 2025

DongfuJiang/vs2_qwen2_5vl_sft_17k

Image-Text-to-Text • 8B • Updated Jul 16, 2025

DongfuJiang/math_ct_adapt_qwen2.5_1.5B

2B • Updated Mar 6, 2025 • 1

DongfuJiang/math_ct_qwen2.5_1.5B

2B • Updated Mar 6, 2025

datasets 18

DongfuJiang/GPT-OSS-Search

Viewer • Updated Jan 20 • 97.6k • 5 • 1

DongfuJiang/aime_2025

Viewer • Updated Oct 3, 2025 • 30 • 312

DongfuJiang/math_35

Viewer • Updated Sep 22, 2025 • 8.52k • 11

DongfuJiang/livecodebench

Viewer • Updated Sep 13, 2025 • 454 • 936

DongfuJiang/hle_text_only

Viewer • Updated Aug 27, 2025 • 2.16k • 1.96k

DongfuJiang/LVBench

Viewer • Updated Apr 7, 2025 • 1.55k • 164

DongfuJiang/Big-Math-RL-Verified-CT

Viewer • Updated Mar 7, 2025 • 17.8k • 17

DongfuJiang/PRM_SFT

Viewer • Updated Dec 1, 2024 • 4.01M • 6

DongfuJiang/zeroeval

Viewer • Updated Nov 27, 2024 • 13.5k • 155

DongfuJiang/PRM_eval

Viewer • Updated Nov 27, 2024 • 9.54k • 6

View 18 datasets