GitHub - lyuchenyang/Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
Large Language Models Understand and Can be Enhanced by Emotional Stimuli
Cheng Li, Jindong Wang, Yixuan Zhang, Kaijie Zhu, Wenxin Hou, Jianxun Lian, Fang Luo, Qiang Yang, Xing Xie

LLaVA v1.5 is a new open-source multimodal model stepping onto the scene as a contender against GPT-4's multimodal capabilities. It uses a simple projection matrix to connect the pre-trained CLIP ViT-L/14 vision encoder with the Vicuna LLM, resulting in a robust model that can handle both images and text. The model is trained in two stages: first, only the projection matrix is updated to align visual features with the language model's embedding space; then the model is fine-tuned end-to-end on visual instruction-following data.
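
A minimal PyTorch sketch of that connector idea: a single linear layer mapping vision-encoder patch features into the LLM's token-embedding space, with the projected "visual tokens" prepended to the text embeddings. The dimensions and tensor shapes below are illustrative assumptions, not LLaVA's actual code.

```python
import torch
import torch.nn as nn

# Illustrative dimensions (assumptions): CLIP ViT-L/14 produces 1024-d patch
# features; Vicuna-13B uses 5120-d token embeddings.
CLIP_DIM, LLM_DIM = 1024, 5120

# The connector: a linear projection from vision-feature space into the
# LLM's token-embedding space.
projector = nn.Linear(CLIP_DIM, LLM_DIM)

# Stand-in for a batch of CLIP patch features: (batch, num_patches, CLIP_DIM).
image_features = torch.randn(2, 256, CLIP_DIM)

# Project image patches into "visual tokens". In stage 1, only `projector`
# is trained (vision encoder and LLM frozen); in stage 2, the LLM is
# fine-tuned as well.
visual_tokens = projector(image_features)  # (2, 256, LLM_DIM)

# Visual tokens are concatenated with the text token embeddings and fed
# to the LLM as one sequence.
text_embeddings = torch.randn(2, 32, LLM_DIM)  # stand-in for embedded prompt
llm_input = torch.cat([visual_tokens, text_embeddings], dim=1)
print(llm_input.shape)  # torch.Size([2, 288, 5120])
```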
