Junfeng Tian
rgtjf
AI & ML interests
None yet
Organizations
None yet
Models 13
rgtjf/ppo-Pyramids
Reinforcement Learning • Updated • 14
rgtjf/ppo-SnowballTarget
Reinforcement Learning • Updated • 5
rgtjf/Reinforce-2048
Reinforcement Learning • Updated
rgtjf/Qwen2-UtK-72B-128K
73B • Updated • 1
rgtjf/LLama3.1-UtK-8B-128K
8B • Updated • 1
rgtjf/Qwen2-UtK-ChatQA2-7B-128K
8B • Updated • 1
rgtjf/Qwen2-UtK-ChatQA2-72B-128K
73B • Updated • 3
rgtjf/Qwen2-UtK-7B-128K
8B • Updated • 2
rgtjf/Reinforce-1024
Reinforcement Learning • Updated
rgtjf/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning • Updated • 16
Datasets 0
None public yet