Cool experiment where researchers assemble an AI translation “company” with AI agents with simulated backgrounds filling various roles, from editors to proofreaders.
The AI “company” creates accurate translations of Chinese web novels that people prefer to GPT-4, and human, ones https://t.co/7lxg2jEjZi
Why Chat With PDF Is Hard And How ChatLLM Gets It Right
Chatting on long docs is hard because most LLMs other than Gemini don't have a large context.
However, even with Gemini's 1M context length, in-context learning is hard, and if you stuff the doc in the context, it doesn't do a good job.... See more
Claude generated this w/o modification: imo a pareidolic masterpiece folding together varied artists/styles from 4-dimensional hyperbolic geometry into some kind of organic cathedral tribute to the complex multifaceted nature of identity https://t.co/uboatRVi3Y
DeepSeekV2 is a big deal. Not only because its significant improvements to both key components of Transformer: the Attention layer and FFN layer.
It has also completed disrupted the Chines LLM market and forcing the competitors to drop the price to 1% of the original price.
⬇️... See more
A single fairly unknown Dutch company makes maybe the most expensive and complex non-military device ($200M) that builds on 40 years of Physics and has a monopoly responsible for all AI advancement today.
Here's the story of ASML, the company powering Moore's Law..
1/9... See more
🎥 New talk: "How Might We Learn?"
A (proto-?)vision talk of sorts—a first attempt at a broader picture of the future of learning I want to create, particularly given developments in AI.
Thanks to @HaijunXia and @ProfHollan for hosting me! 🙇♂️
(YT link in thread... See more
Excited to introduce a new project I've been working on called Payman!
Payman is an AI Agent tool that gives Agents the ability to pay people for tasks they cannot do themselves.
While many people imagine a future where humans pay AI agents for services they want completed, I believe... See more