AI & ML interests
Reward models
reciprocate/mistral-7b-gsm8k-code-rm
Text Classification
• 7B • Updated • 4 • 3
reciprocate/mistral-7b-rm
Text Classification
• Updated • 8 • 2
reciprocate/rm_beluga-7b_hh-full
Text Classification
• Updated • 6
reciprocate/rm-llama2-7b-gsm8k
Text Generation
• Updated • 3
reciprocate/llama2-7b-gsm8k
Text Generation
• Updated • 4 • 1
Text Generation
• Updated • 6 • 1
Text Generation
• Updated • 3
reciprocate/vicuna-13b_rm_oasst-hh
Text Classification
• Updated • 1
reciprocate/openllama-13b-rlhf-v0
Text Generation
• Updated • 4
reciprocate/openllama-13b_rm_oasst-hh
Text Classification
• Updated • 3
reciprocate/gpt-j_rm_format-oa
Text Classification
• Updated • 17 • 1
reciprocate/dahoas-gptj-rm-static
Text Classification
• Updated • 7
reciprocate/gpt2-simulacra
Text Generation
• Updated • 2
Text Generation
• Updated • 4 • 6
reciprocate/ppo_hh_pythia-125M
Text Generation
• Updated • 2
reciprocate/ppo_hh_neox-20B
Text Generation
• Updated • 3 • 2
reciprocate/ppo_hh_pythia-1B
Text Generation
• Updated • 12
reciprocate/ppo_hh_pythia-6B
Text Generation
• Updated • 4 • 6