I've used a wide variety of the "best" models, and I've mostly settled on Opus 4... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		taormina 7 months ago \| parent \| context \| favorite \| on: Why LLMs can't really build software I've used a wide variety of the "best" models, and I've mostly settled on Opus 4 and Sonnet 4 with Claude Code, but they don't ever actually get better. Grok 3-4 and GPT4 were worse, but like, at a certain point you don't get brownie points for not tripping over how low the bar is set.

generalizations 7 months ago [–]

People have actually been basing their assertions on 4o. The bar is really low and people are still completely missing it.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact