ResearchClawBench: Evaluating AI Agents for Automated Research from Re-Discovery to New-Discovery
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
InternAgent aims to facilitate AI+X innovative research across diverse scientific domains
-
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
Paper • 2602.08990 • Published • 76 -
InternScience/OmniCaptioner
8B • Updated • 6 • 18 -
InternScience/StructTable-base
Image-to-Text • 0.3B • Updated • 48 • 9 -
InternScience/MME-Reasoning
Preview • Updated • 199 • 8
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
-
SGI-Bench Leaderboard
🥇7Scientific General Intelligence of LLMs/vLLMs
-
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 120 -
InternScience/SGI-DeepResearch
Viewer • Updated • 318 • 500 • 6 -
InternScience/SGI-IdeaGeneration
Viewer • Updated • 315 • 410 • 3
ResearchClawBench: Evaluating AI Agents for Automated Research from Re-Discovery to New-Discovery
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
-
SGI-Bench Leaderboard
🥇7Scientific General Intelligence of LLMs/vLLMs
-
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 120 -
InternScience/SGI-DeepResearch
Viewer • Updated • 318 • 500 • 6 -
InternScience/SGI-IdeaGeneration
Viewer • Updated • 315 • 410 • 3
InternAgent aims to facilitate AI+X innovative research across diverse scientific domains
-
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
Paper • 2602.08990 • Published • 76 -
InternScience/OmniCaptioner
8B • Updated • 6 • 18 -
InternScience/StructTable-base
Image-to-Text • 0.3B • Updated • 48 • 9 -
InternScience/MME-Reasoning
Preview • Updated • 199 • 8