Gemini 2.0 Flash: An outstanding multi-modal LLM with a sci-fi streaming mode

Introducing the new Realtime Multimodal API, powered by Gemini 2.0 Flash!
You can stream audio, video, and text in, while dynamic tool calls happen in the background (Search, code execution, & function calling). Truly a wild experience, try it right now in AI Studio 🤯 https://t.co/SnMc9N4lI0

Google raised the bar with Realtime AI.
It enables AI to see your screen and chat with you in real-time, and users are discovering incredible use cases.
10 wild examples: https://t.co/lwIQ8a8vba
If you try nothing else today, give the demo at https://t.co/kUyz0fxIfE a go - it lets you stream video and audio directly to Gemini 2.0 Flash and get audio back, so you can have a real-time audio conversation about what you can see with the model
Feels like science fiction! https://t.co/EKKQ06rgmV
Simon Willisonx.comGoogle Gemini 2.0 realtime AI is insane.
Watch me turn it into a live code tutor just by sharing my screen and talking to it.
We’re living in future.
I’m speechless. https://t.co/MTaJYVwzl5
Mckay Wrigleyx.comWoow
Google has just rolled out the AI model Flash Thinking 2.0
This is the first reasoning model capable of accessing YouTube and it changes everything:
- Search for a video on your topic
- Ask Gemini to think about the video
- You'll... See more
Paul Couvertx.com