Community Paper Reading
by Darren LI ยท updated 1y ago
Community Paper Reading
by Darren LI ยท updated 1y ago
An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.
by Zicheng Liu
Darren LI added 1y ago
5 highlights
Darren LI added 1y ago
by mit-han-lab
2 highlights
Darren LI added 1y ago
Analysis of safety preparations and evaluations for GPT-4V, a multimodal language model with image analysis capabilities, including early access testing, red teaming, and mitigations for potential risks and limitations.
by OpenAI.
1 highlight
Darren LI added 1y ago
Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.
2 highlights
Darren LI added 1y ago
Darren LI added 1y ago
Ideas related to this collection