Sublime
An inspiration engine for ideas
To recognize speech, for example, one must abstract away from the specifics of the speaker’s voice. This is achieved by forcing a neural network to use the same connections in different frequency bands, whether the voice is low or high. Reducing the number of parameters that must be adjusted leads to greater speeds and better generalization to new
... See moreStanislas Dehaene • How We Learn: Why Brains Learn Better Than Any Machine . . . for Now

LangChain meets Google Gemini 🔥
Google just launched Gemini Pro as an API for developers to use in their applications.
LangChain without any wait quickly released a new Python package to access Google's multimodal LLM -Gemini!
All you need to do to get started:... See more
AI workflows
Dave King • 2 cards

🎤 Are you ready for a revolutionary breakthrough in audio technology? Say hello to AudioGPT!👋
This incredible tool allows LLMs to process complex audio information & conduct spoken conversations. Let's explore this game-changing innovation and try the official @Gradio demo.👇 https://t.co/V71OxkE1ZI




We shipped an alpha version of the new Surya OCR model. No hype, just facts:
- 90+ languages (focus on en, romance langs, zh, ar, ja, ko)
- LaTeX and formatting
- Char/word/line bboxes
- ~500M non-embed params
- 10-20 pages/s https://t.co/jtKDinWhec
Let's goo! F5-TTS 🔊
> Trained on 100K hours of data
> Zero-shot voice cloning
> Speed control (based on total duration)
> Emotion based synthesis
> Long-form synthesis
> Supports code-switching
> Best part: CC-BY license (commercially... See more
Vaibhav (VB) Srivastavx.com

