llms

The community remains puzzled about whether these models genuinely generalize to unseen tasks, or seemingly succeed by memorizing the training data. This paper makes important strides in addressing this question. It constructs a suite of carefully designed counterfactual evaluations, providing fresh insights into the capabilities of...

Zhaofeng Wu • Reasoning skills of large language models are often overestimated

https://news.mit.edu/2024/reasoning-skills-large-language-models-often-overestimated-0711

If consciousness really can arise in a jumble of silicon chips, we run the risk of creating countless AIs — beings, really — that can not only intelligently perform tasks, but develop feelings about their lives.

…

Rather than asking if each new AI system is finally the one that has conscious experience, focusing on the more fundamental question of