Sublime
An inspiration engine for ideas
Today, every Nomic-Embed-Text embedding becomes multimodal. Introducing Nomic-Embed-Vision:
- a high-quality, unified embedding space for image, text, and multimodal tasks
- outperforms both OpenAI CLIP and text-embedding-3-small
- open weights and code to enable indie hacking, research, ...
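The pitch is a single vector space shared by text and images. A minimal sketch of what that buys you, with hypothetical embed_text / embed_image helpers standing in for whatever Nomic client or endpoint you actually call:

```python
import numpy as np

# Hypothetical stand-ins for your embedding client calls to
# nomic-embed-text / nomic-embed-vision; the only requirement is that
# both return vectors living in the same unified embedding space.
def embed_text(text: str) -> np.ndarray:
    raise NotImplementedError("wire up your embedding client here")

def embed_image(path: str) -> np.ndarray:
    raise NotImplementedError("wire up your embedding client here")

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Cross-modal retrieval then reduces to cosine similarity in the shared space:
# query = embed_text("a red bicycle leaning against a brick wall")
# scores = {p: cosine(query, embed_image(p)) for p in ["a.jpg", "b.jpg"]}
```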
EVA-CLIP: Improved Training Techniques for CLIP at Scale
Proposes EVA-CLIP, a series of models that significantly improve the efficiency and effectiveness of CLIP training.
proj: https://t.co/LNOE9rKSdq
abs: https://t.co/lgBvsgHAtC https://t.co/IrxwzNcTku
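For context on what such training recipes optimize: CLIP-style models are trained with a symmetric contrastive (InfoNCE) loss over paired image and text embeddings. A minimal PyTorch sketch of that baseline objective (EVA-CLIP's specific efficiency tricks are not shown here):

```python
import torch
import torch.nn.functional as F

def clip_contrastive_loss(image_emb: torch.Tensor,
                          text_emb: torch.Tensor,
                          temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss over a batch of matched image/text embeddings."""
    # L2-normalize so dot products are cosine similarities.
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)

    # Pairwise similarity matrix, scaled by temperature.
    logits = image_emb @ text_emb.t() / temperature

    # The matching caption for image i sits in column i.
    targets = torch.arange(logits.size(0), device=logits.device)

    # Average the image->text and text->image cross-entropy terms.
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2

# Random embeddings standing in for encoder outputs.
print(clip_contrastive_loss(torch.randn(8, 512), torch.randn(8, 512)))
```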

OpenAI releases GPT-4V(ision) system card
paper: https://t.co/lWqSHhlCUP
GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities (such as image...
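With the current openai Python SDK, an image is passed alongside text as a content part of a chat message; the model name and image URL below are placeholders:

```python
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

response = client.chat.completions.create(
    model="gpt-4o",  # placeholder: any vision-capable model
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe what is in this image."},
            {"type": "image_url",
             "image_url": {"url": "https://example.com/photo.jpg"}},  # placeholder URL
        ],
    }],
)
print(response.choices[0].message.content)
```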

LLMs can now self-optimize.
A new method allows an AI to rewrite its own prompts to achieve up to 35x greater efficiency, outperforming both Reinforcement Learning and Fine-Tuning for complex reasoning.
UC Berkeley, Stanford, and Databricks introduce a new method called GEPA...
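GEPA's actual algorithm (reflective, Pareto-based prompt evolution) is more involved; the sketch below only illustrates the core loop of scoring a prompt, asking an LLM to rewrite it, and keeping the best variant. Both score_fn and rewrite_fn are hypothetical hooks you would supply.

```python
from typing import Callable

def optimize_prompt(seed_prompt: str,
                    score_fn: Callable[[str], float],        # hypothetical: run the task suite, return a score
                    rewrite_fn: Callable[[str, float], str],  # hypothetical: ask an LLM to propose an edit
                    rounds: int = 10) -> str:
    """Greedy self-optimization: keep whichever prompt variant scores best."""
    best_prompt, best_score = seed_prompt, score_fn(seed_prompt)
    for _ in range(rounds):
        candidate = rewrite_fn(best_prompt, best_score)
        candidate_score = score_fn(candidate)
        if candidate_score > best_score:
            best_prompt, best_score = candidate, candidate_score
    return best_prompt

# Toy usage: the "score" just rewards prompts near a 40-character budget.
print(optimize_prompt("Solve step by step.",
                      score_fn=lambda p: -abs(len(p) - 40),
                      rewrite_fn=lambda p, s: p + " Show your reasoning."))
```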

Improved baselines for vision-language pre-training
Finds that a simple CLIP baseline can be improved by up to 25% (relative) on downstream zero-shot tasks by using well-known training techniques that are popular in other subfields.
https://t.co/gfDb2AT2At https://t.co/idLYLH3iay
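Those zero-shot numbers come from classification-by-retrieval: encode the image and a set of text prompts, then pick the most similar caption. With Hugging Face transformers and the reference OpenAI CLIP checkpoint (the image path is a placeholder):

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]
image = Image.open("photo.jpg")  # placeholder path

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    logits = model(**inputs).logits_per_image  # (1, num_labels) similarity scores
probs = logits.softmax(dim=-1)[0]
print({label: round(p.item(), 3) for label, p in zip(labels, probs)})
```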

It's @openai o3-pro launch day!
our high-taste guest tester @benhylak has been previewing for the past week and found an interesting pattern: (link in reply)
o3-pro doesn't noticeably outperform in normal situations, but it's just really, really, REALLY good at consuming ALL your context...
3D-GPT: Procedural 3D Modeling with Large Language Models
paper page: https://t.co/4UPUNNB3UG
In the pursuit of efficient automated content creation, procedural generation, leveraging modifiable parameters and rule-based systems, emerges as a promising approach. Nonetheless, it could be a...
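The core idea, an LLM translating a free-form scene request into the modifiable parameters of a rule-based generator, can be sketched as follows; ask_llm, the parameter schema, and build_scene are all hypothetical placeholders, not 3D-GPT's actual agent framework.

```python
import json

def ask_llm(prompt: str) -> str:
    """Hypothetical wrapper around whatever chat model you have access to."""
    raise NotImplementedError("plug in your LLM client here")

SCHEMA_HINT = (
    "Reply with JSON only, using keys: terrain_roughness (0-1), "
    "tree_density (0-1), fog (0-1), season (spring|summer|autumn|winter)."
)

def scene_parameters(description: str) -> dict:
    """Map a natural-language scene description to procedural-generator parameters."""
    reply = ask_llm(f"{SCHEMA_HINT}\nScene: {description}")
    return json.loads(reply)

# A rule-based backend would then consume the parameters, e.g.:
# params = scene_parameters("a foggy autumn forest on rolling hills")
# build_scene(**params)  # hypothetical procedural-modeling entry point
```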
🚨 Long-CLIP: Unlocking the Long-Text Capability of CLIP
proj: https://t.co/5QF2Mo0Ow7
abs: https://t.co/YRH7CG0As0
A plug-and-play alternative to CLIP that supports long-text input, retains its zero-shot generalizability, and aligns the CLIP latent space https://t.co/E744EfGikn
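CLIP's text tower is trained with only 77 token positions, so any long-text variant has to grow that table somehow. The sketch below shows plain linear interpolation of the positional embeddings as an illustration only; Long-CLIP's published recipe (a knowledge-preserving stretch of the positional embeddings plus fine-tuning on long captions) is more careful than this.

```python
import torch
import torch.nn.functional as F

def stretch_positional_embeddings(pos_emb: torch.Tensor, new_len: int) -> torch.Tensor:
    """Resize a (seq_len, dim) positional-embedding table to a longer sequence
    length via 1D linear interpolation. Illustration only."""
    x = pos_emb.t().unsqueeze(0)                                   # (1, dim, seq_len)
    x = F.interpolate(x, size=new_len, mode="linear", align_corners=True)
    return x.squeeze(0).t()                                        # (new_len, dim)

# CLIP's text encoder ships with 77 positions; stretch to, say, 248.
old_table = torch.randn(77, 512)
print(stretch_positional_embeddings(old_table, 248).shape)  # torch.Size([248, 512])
```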