Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

A survey paper that investigates the taxonomy and evolution of multimodal foundation models, focusing on their transition from specialized models to general-purpose assistants in computer vision and vision-language domains.

arxiv.org

Multi-Modal Chatbot

A Survey on Large Language Model based Autonomous Agents

The paper surveys large language model-based autonomous agents, discussing their construction, applications across various domains, and evaluation strategies, while proposing a unified framework and identifying future research directions.

arxiv.org

Donald Metzler Rethinking Search: Making Domain Experts out of Dilettantes

Huge “foundation models” are turbo-charging AI progress

economist.com
Thumbnail of Huge “foundation models” are turbo-charging AI progress

Ernest Davis Rebooting AI: Building Artificial Intelligence We Can Trust

Ben Auffarth Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT, and other LLMs