vikarti-anatra 's Collections Interesting ones
updated
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
• 2310.20624
• Published • 13
Unleashing the Power of Pre-trained Language Models for Offline
Reinforcement Learning
Paper
• 2310.20587
• Published • 18
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Paper
• 2311.00117
• Published
VideoFusion: Decomposed Diffusion Models for High-Quality Video
Generation
Paper
• 2303.08320
• Published • 3
Vikhrmodels/Vikhr-7B-instruct_0.4
Text Generation
• 8B • Updated • 329
• 35
IlyaGusev/saiga_llama3_8b
Text Generation
• 8B • Updated • 397k
• • 137
QuixiAI/wizard_vicuna_70k_unfiltered
Viewer
• Updated • 34.6k • 257
• 175
failspy/llama-3-70B-Instruct-abliterated
Text Generation
• Updated • 8.73k
• • 126
Zoyd/Sao10K_L3-8B-Stheno-v3.1-8_0bpw_exl2
Text Generation
• Updated • 2
• 3
Zoyd/Sao10K_L3-8B-Stheno-v3.1-6_5bpw_exl2
Text Generation
• Updated • 4
• 1
sophosympatheia/Aurora-Nights-70B-v1.0
Text Generation
• 69B • Updated • 865
• 22
PygmalionAI/mythalion-13b
Text Generation
• 13B • Updated • 968
• • 162
Nitral-AI/Poppy_Porpoise-1.0-L3-8B
Text Generation
• 8B • Updated • 15
• 27
NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
Text Generation
• 47B • Updated • 43
• 38
microsoft/Phi-3-medium-128k-instruct
Text Generation
• Updated • 17k
• 387
Text Generation
• 8B • Updated • 5
• 3
Lewdiculous/Poppy_Porpoise-1.0-L3-8B-GGUF-IQ-Imatrix
8B • Updated • 82
• 15
ACECODER: Acing Coder RL via Automated Test-Case Synthesis
Paper
• 2502.01718
• Published • 28
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training
Tokens
Paper
• 2504.07096
• Published • 77