Introducing LisanBench
LisanBench is a simple, scalable, and precise benchmark designed to evaluate large language models on knowledge, forward-planning, constraint adherence, memory and attention, and long context reasoning and "stamina".
"I see possible futures, all at once. Our enemies... See more