Community Paper Reading

by Darren LI · updated 1y ago

  • The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

    An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.

    by Zicheng Liu

    Darren LI added 1y ago

  • Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    5 highlights

    Darren LI added 1y ago

  • GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

    by mit-han-lab

    2 highlights

    Darren LI added 1y ago

  • GPT-4V Safety and Deployment Preparation

    Analysis of safety preparations and evaluations for GPT-4V, a multimodal language model with image analysis capabilities, including early access testing, red teaming, and mitigations for potential risks and limitations.

    by OpenAI

    1 highlight

    Darren LI added 1y ago

  • AgentBench: Evaluating LLMs as Agents

    Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.

    by Chenhui Zhang

    2 highlights

    Darren LI added 1y ago

  • Gorilla

    2 highlights

    Darren LI added 1y ago
