-
Can LLMs Follow Simple Rules?
Paper • 2311.04235 • Published • 13 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 87
Kevin-Brian N'Diaye
kevin-nd
·
AI & ML interests
- Computer Vision
- Vision-Language-Action Models
Recent Activity
updated a model 5 days ago
kevin-nd/nanoVLM published a model 5 days ago
kevin-nd/nanoVLM upvoted a paper 11 days ago
ViT-5: Vision Transformers for The Mid-2020sOrganizations
None yet