F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World Paper • 2603.19223 • Published 6 days ago • 28
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated about 23 hours ago • 100
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated about 23 hours ago • 37
Omnilingual MT: Machine Translation for 1,600 Languages Paper • 2603.16309 • Published 8 days ago • 19
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 69
Effective Distillation to Hybrid xLSTM Architectures Paper • 2603.15590 • Published 9 days ago • 32 • 5
🥨 Bavarian NLP Papers Collection Awesome papers about Bavarian NLP • 13 items • Updated 8 days ago • 2