ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 264k • 558 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 118k • 343 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 18 days ago • 336k • 528 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 9 days ago • 7.79k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 150 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 296k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 296k • 1.58k
ocr tencent/HunyuanOCR Image-Text-to-Text • Updated Jan 13 • 264k • 558 opendatalab/MinerU2.5-2509-1.2B Image-Text-to-Text • 1B • Updated Sep 29, 2025 • 118k • 343 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 18 days ago • 336k • 528 PaddlePaddle/PaddleOCR-VL Image-Text-to-Text • 1.0B • Updated 9 days ago • 7.79k • 1.58k
asr FireRedTeam/FireRedASR-AED-L Automatic Speech Recognition • Updated Mar 5, 2025 • 150 • 68 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 296k • 1.58k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 296k • 1.58k