Community Paper Reading

by Darren LI ยท updated 1y ago

  • The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

    An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.

    by Zicheng Liu

    Darren LI added 1y ago

  • GPT-4V Safety and Deployment Preparation

    Analysis of safety preparations and evaluations for GPT-4V, a multimodal language model with image analysis capabilities, including early access testing, red teaming, and mitigations for potential risks and limitations.

    by OpenAI.

    1 highlight

    Darren LI added 1y ago

  • AgentBench: Evaluating LLMs as Agents

    Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.

    by Chenhui Zhang

    2 highlights

    Darren LI added 1y ago

  • Ideas related to this collection