Reasoning skills of large language models are often overestimated

Gary Marcus Deep Learning Is Hitting a Wall