Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
41
222
53
KABI
dongguanting
Follow
lixiaoxi45's profile picture
SnowNation's profile picture
MrDevolver's profile picture
68 followers
·
106 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
1 day ago
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
commented
on
a paper
2 days ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
upvoted
a
paper
2 days ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
View all activity
Organizations
dongguanting
's models
16
Sort: Recently updated
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
about 1 month ago
•
13
•
2
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
Dec 20, 2025
•
5
•
2
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
•
33B
•
Updated
Dec 20, 2025
•
6
•
1
dongguanting/QwQ-32B-ARPO-DeepSearch
33B
•
Updated
Dec 20, 2025
•
6
•
1
dongguanting/aepo_light
8B
•
Updated
Nov 3, 2025
•
3
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
Oct 27, 2025
•
3
•
1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
Oct 21, 2025
•
4
•
1
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19, 2025
•
332
•
2
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12, 2025
•
2
•
1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12, 2025
•
4
•
3
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12, 2025
•
9
•
5
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29, 2025
•
83
•
2
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30, 2025
•
8
•
2
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28, 2025
•
30
•
4
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6, 2025
•
5
•
1
dongguanting/Tool-Star-Qwen-3B
Text Generation
•
3B
•
Updated
May 25, 2025
•
12
•
5