stuff - a imjb Collection

updated Mar 2

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 82
Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20, 2024 • 31
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 40
SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 90
VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Paper • 2403.10517 • Published Mar 15, 2024 • 37
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72
Med42-v2: A Suite of Clinical LLMs

Paper • 2408.06142 • Published Aug 12, 2024 • 52
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 128
Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 94
Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95
CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 57
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38
From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents

Paper • 2409.03512 • Published Sep 5, 2024 • 29
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Paper • 2406.11069 • Published Jun 16, 2024 • 14
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 79
LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18, 2024 • 35
Vista3D: Unravel the 3D Darkside of a Single Image

Paper • 2409.12193 • Published Sep 18, 2024 • 10
bartowski/Sky-T1-32B-Preview-GGUF

Text Generation • 33B • Updated Jan 11, 2025 • 634 • 82
LM2: Large Memory Models

Paper • 2502.06049 • Published Feb 9, 2025 • 31
inclusionAI/Ling-1T

Text Generation • Updated Nov 4, 2025 • 1.58k • 533
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182
A Definition of AGI

Paper • 2510.18212 • Published Oct 21, 2025 • 36
MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

Paper • 2512.09636 • Published Dec 10, 2025 • 26
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Paper • 2512.13660 • Published Dec 15, 2025 • 37
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 58
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Paper • 2602.07845 • Published Feb 8 • 71
zai-org/GLM-5

Text Generation • 754B • Updated 1 day ago • 284k • 1.93k
GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 120
MiMo-V2-Flash Technical Report

Paper • 2601.02780 • Published Jan 6 • 37