N
Hacker Next
new
show
ask
jobs
submit
login
Simple tasks showing reasoning breakdown in state-of-the-art LLMs
arxiv.org
375 points by
tosh
115 days ago
|
380 comments
add comment