GitHub - facebookresearch/multimodal at a33a8b888a542a4578b16972aecd072eff02c1a6
🥤 Cola [NeurIPS 2023]
Large Language Models are Visual Reasoning Coordinators
Liangyu Chen*,†,♠ Bo Li*,♠ Sheng Shen♣ Jingkang Yang♠
Chunyuan Li♦ Kurt Keutzer♣ Trevor Darrell♣ Ziwei Liu‡,♠
♠S-Lab, Nanyang Technological University
♣University of California, Berkeley ♦Microsoft Research, Redmond
*Equal Contribution †Project Lead ‡Corresponding Author
cliangyu • GitHub - cliangyu/Cola: [NeurIPS 2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"
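The idea the title points at, in rough outline: several vision-language models each propose an answer, and an LLM reasons over the proposals to produce the final one. A schematic sketch of that coordination pattern in plain Python; the `answer` and `generate` methods are hypothetical stand-ins, not the repo's actual API:

```python
def coordinate(question: str, image, vlms: list, llm) -> str:
    # Each vision-language model proposes a candidate answer
    # (hypothetical .answer() interface, for illustration only).
    candidates = [vlm.answer(image, question) for vlm in vlms]
    # The LLM sees all proposals and coordinates them into one answer.
    prompt = (
        f"Question: {question}\n"
        + "\n".join(f"Model {i} answers: {c}" for i, c in enumerate(candidates))
        + "\nConsidering the answers above, give the final answer:"
    )
    return llm.generate(prompt)
```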
Towhee is a cutting-edge framework designed to streamline the processing of unstructured data through Large Language Model (LLM) based pipeline orchestration. It is uniquely positioned to extract invaluable insights from diverse unstructured data types, including lengthy text, images, and audio and video files.
towhee-io • GitHub - towhee-io/towhee: Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
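For flavor, a minimal Towhee pipeline that decodes an image file and embeds it with a timm backbone, following the chaining pattern in the repo's README; the operator names are assumed from the Towhee hub, so treat this as an untested sketch:

```python
from towhee import pipe, ops

# Chain operators into a pipeline: decode an image, then embed it.
img_embed = (
    pipe.input('path')
        .map('path', 'img', ops.image_decode())
        .map('img', 'vec', ops.image_embedding.timm(model_name='resnet50'))
        .output('vec')
)

vec = img_embed('example.jpg').get()[0]  # 2048-d vector for resnet50
```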
voyage-multimodal-3: all-in-one embedding model for interleaved text, images, and screenshots
Voyage AI • blog.voyageai.com
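A minimal sketch of calling the model through the voyageai Python client, assuming a VOYAGE_API_KEY in the environment; method and field names follow Voyage's documentation at the time of writing, so verify against the current client:

```python
import voyageai
from PIL import Image

vo = voyageai.Client()  # reads VOYAGE_API_KEY from the environment

# Each input is an interleaved sequence of text strings and PIL images.
inputs = [["A screenshot of the quarterly dashboard", Image.open("dash.png")]]

result = vo.multimodal_embed(
    inputs=inputs,
    model="voyage-multimodal-3",
    input_type="document",  # use "query" when embedding search queries
)
print(len(result.embeddings[0]))  # one vector per interleaved input
```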
txtai
neuml.github.io
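txtai is NeuML's open-source embeddings database for semantic search and LLM workflows. A minimal sketch of its core API in recent txtai releases; the model name is an arbitrary sentence-transformers choice, not a txtai requirement:

```python
from txtai import Embeddings

# Build an in-memory semantic index over a handful of documents.
embeddings = Embeddings(path="sentence-transformers/all-MiniLM-L6-v2")
docs = [
    "US tops 5 million confirmed virus cases",
    "Canada's last fully intact ice shelf has suddenly collapsed",
]
embeddings.index(docs)

# Returns (id, score) pairs; the id indexes into docs.
print(embeddings.search("melting arctic ice", 1))
```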
The human-centric platform for production ML & AI. Access data easily, scale compute cost-efficiently, and ship to production confidently with fully managed infrastructure, running securely in your cloud.
Infrastructure for ML, AI, and Data Science | Outerbounds
LLaVA v1.5 is a new open-source multimodal model stepping onto the scene as a contender against GPT-4's multimodal capabilities. It uses a simple projection module to connect the pre-trained CLIP ViT-L/14 vision encoder with the Vicuna LLM, resulting in a robust model that can handle images and text. The model is trained in two stages: first, only the projection is updated on image-text pairs to align visual features with the language model's embedding space; then, the projection and the LLM are fine-tuned together on visual instruction data.
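To make the connector concrete, here is a minimal PyTorch sketch of a LLaVA-style projector mapping CLIP ViT-L/14 patch features into the LLM's token-embedding space. LLaVA v1 used a single linear layer and v1.5 a two-layer MLP; the dimensions below are the usual ones for ViT-L/14-336 and Vicuna-7B, and this is a sketch, not the reference implementation:

```python
import torch
import torch.nn as nn

class LlavaStyleProjector(nn.Module):
    """Two-layer MLP connector in the LLaVA v1.5 style (sketch)."""

    def __init__(self, vision_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vision_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, patch_features: torch.Tensor) -> torch.Tensor:
        # patch_features: (batch, num_patches, vision_dim) from CLIP ViT-L/14.
        # The output lives in the LLM embedding space and is prepended to the
        # text embeddings as "visual tokens".
        return self.proj(patch_features)

# 576 patches = (336 / 14) ** 2 for ViT-L/14 at 336px input.
tokens = LlavaStyleProjector()(torch.randn(1, 576, 1024))  # (1, 576, 4096)
```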