How AI-generated text is poisoning the internet
While LLMs continue to devour web-scraped data, they’ll increasingly consume their own digital progeny as AI-generated content continues to flood the internet. This recursive loop, experimentally confirmed, erodes the true data landscape. Rare events vanish first. Models churn out likely sequences from the original pool while injecting their own... See more
Azeem Azhar • 🔮 Open-source AI surge; UBI surprises; AI eats itself; Murdoch’s empire drama & the internet’s Balkanisation ++ #484
The problem may even get worse. Generative AI is producing vast amounts of questionable content that contaminates the datasets on which future AIs will be trained.
Joe Smith • The Optimized Marketer: Writing with AI: Future-proof Your Talent and Position Your Business for a World Transformed by AI (The Optimized Self)
GenAI isn't just a technology; it's an informational pollutant—a pervasive cognitive smog that touches and corrupts every aspect of the Internet. It's not just a productivity tool; it's a kind of digital acid rain, silently eroding the value of all information.
Every image is no longer a glimpse of reality, but a... See more
François Cholletx.comthe worst of all possible worlds: generative AI manages to pollute the internet with cheap synthetic data, manages to make being a human artist / creator harder