Sublime

An inspiration engine for ideas


To recognize speech, for example, one must abstract away from the specifics of the speaker’s voice. This is achieved by forcing a neural network to use the same connections in different frequency bands, whether the voice is low or high. Reducing the number of parameters that must be adjusted leads to greater speeds and better generalization to new

Stanislas Dehaene • How We Learn: Why Brains Learn Better Than Any Machine . . . for Now
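The weight sharing described in the quote above can be illustrated with a small sketch. This is not from the book: the function names `shared_filter_response` and `parameter_counts`, the toy spectra, and the kernel values are all hypothetical, chosen only to show one small set of weights being reused across every frequency band.

```python
def shared_filter_response(spectrum, kernel):
    """Slide one small kernel across all frequency bands (shared weights)."""
    k = len(kernel)
    return [
        sum(kernel[j] * spectrum[i + j] for j in range(k))
        for i in range(len(spectrum) - k + 1)
    ]


def parameter_counts(n_bands, kernel_size):
    """Compare adjustable weights with and without sharing across bands."""
    n_outputs = n_bands - kernel_size + 1
    shared = kernel_size                 # one kernel reused at every position
    unshared = n_outputs * kernel_size   # a separate kernel per position
    return shared, unshared


# Toy spectra (assumed values): the same pattern at a low or a high frequency.
pattern = [1.0, 2.0, 1.0]
low_voice = pattern + [0.0] * 5
high_voice = [0.0] * 5 + pattern

kernel = [1.0, 2.0, 1.0]  # hypothetical learned detector for the pattern
low_resp = shared_filter_response(low_voice, kernel)
high_resp = shared_filter_response(high_voice, kernel)

# The peak response is identical; only its position along the axis differs.
print(max(low_resp), low_resp.index(max(low_resp)))
print(max(high_resp), high_resp.index(max(high_resp)))
print(parameter_counts(n_bands=8, kernel_size=3))
```

In this toy setup the same three weights detect the pattern wherever it sits on the frequency axis, and the number of adjustable parameters stays at the kernel size instead of growing with the number of bands.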


LangChain meets Google Gemini 🔥 Google just launched Gemini Pro as an API for developers to use in their applications. LangChain without any wait quickly released a new Python package to access Google's multimodal LLM - Gemini! All you need to do to get started:...

Shubham Saboo

x.com

AI workflows

Dave King • 2 cards


instant voice cloning with style and language control https://t.co/110OehFjGW

Tom Dörr

x.com

Text-to-speech and speech recognition tools https://t.co/wl0rZ4fw1u

Tom Dörr

x.com

🎤 Are you ready for a revolutionary breakthrough in audio technology? Say hello to AudioGPT!👋 This incredible tool allows LLMs to process complex audio information & conduct spoken conversations. Let's explore this game-changing innovation and try the official @Gradio demo.👇 https://t.co/V71OxkE1ZI

Yuvi

x.com

We shipped an alpha version of the new Surya OCR model. No hype, just facts:
- 90+ languages (focus on en, romance langs, zh, ar, ja, ko)
- LaTeX and formatting
- Char/word/line bboxes
- ~500M non-embed params
- 10-20 pages/s
https://t.co/jtKDinWhec

Vik Paruchuri

x.com

Let's goo! F5-TTS 🔊
> Trained on 100K hours of data
> Zero-shot voice cloning
> Speed control (based on total duration)
> Emotion based synthesis
> Long-form synthesis
> Supports code-switching
> Best part: CC-BY license (commercially...

Vaibhav (VB) Srivastav

x.com

Gist: https://t.co/0d9n6h1ZwT https://t.co/aWwjcwl1Ns

kwindla

x.com