Kuan Lu
kuanlu
AI & ML interests
Machine learning
Recent Activity
updated a collection 6 days ago: LLM
updated a collection 6 days ago: LLM
updated a collection 9 days ago: AI OCR
Organizations
None yet
AI OCR
- docling-project/SmolDocling-256M-preview
  Image-Text-to-Text • Updated • 63.3k • 1.61k
- microsoft/Florence-2-large
  Image-Text-to-Text • 0.8B • Updated • 1.19M • 1.79k
- mistral-experimental/pixtral-12b
  Image-Text-to-Text • 13B • Updated • 224k • 103
- microsoft/Phi-4-multimodal-instruct
  Automatic Speech Recognition • 6B • Updated • 308k • 1.58k
Text-to-Audio
models 0
None public yet
datasets 0
None public yet