FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
Models
Paper
• 2402.10986
• Published • 82
Aria Everyday Activities Dataset
Paper
• 2402.13349
• Published • 31
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Paper
• 2403.04132
• Published • 40
SaulLM-7B: A pioneering Large Language Model for Law
Paper
• 2403.03883
• Published • 90
VideoAgent: Long-form Video Understanding with Large Language Model as
Agent
Paper
• 2403.10517
• Published • 37
RAFT: Adapting Language Model to Domain Specific RAG
Paper
• 2403.10131
• Published • 72
Med42-v2: A Suite of Clinical LLMs
Paper
• 2408.06142
• Published • 52
The AI Scientist: Towards Fully Automated Open-Ended Scientific
Discovery
Paper
• 2408.06292
• Published • 128
Sapiens: Foundation for Human Vision Models
Paper
• 2408.12569
• Published • 94
Law of Vision Representation in MLLMs
Paper
• 2408.16357
• Published • 95
CogVLM2: Visual Language Models for Image and Video Understanding
Paper
• 2408.16500
• Published • 57
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper
• 2408.15545
• Published • 38
From MOOC to MAIC: Reshaping Online Teaching and Learning through
LLM-driven Agents
Paper
• 2409.03512
• Published • 29
WildVision: Evaluating Vision-Language Models in the Wild with Human
Preferences
Paper
• 2406.11069
• Published • 14
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at
Any Resolution
Paper
• 2409.12191
• Published • 79
LLMs + Persona-Plug = Personalized LLMs
Paper
• 2409.11901
• Published • 35
Vista3D: Unravel the 3D Darkside of a Single Image
Paper
• 2409.12193
• Published • 10
bartowski/Sky-T1-32B-Preview-GGUF
Text Generation
• 33B • Updated • 634
• 82
Paper
• 2502.06049
• Published • 31
Text Generation
• Updated • 1.58k
• • 533
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning
for LLMs
Paper
• 2510.11696
• Published • 182
Paper
• 2510.18212
• Published • 36
MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment
Paper
• 2512.09636
• Published • 26
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics
Paper
• 2512.13660
• Published • 37
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
Paper
• 2601.06002
• Published • 58
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper
• 2602.07845
• Published • 71
Text Generation
• 754B • Updated • 284k
• • 1.93k
GLM-5: from Vibe Coding to Agentic Engineering
Paper
• 2602.15763
• Published • 120
MiMo-V2-Flash Technical Report
Paper
• 2601.02780
• Published • 37