Discussion about this post

User's avatar
JP's avatar

SDD makes a lot of sense given where the models actually are. GPT-5.4 just dropped and its raw coding score barely moved (57.7% on SWE-Bench Pro vs 55.6% for 5.2). Where it did improve is tool search efficiency and multi-step orchestration. Which is basically the infrastructure that makes spec-driven workflows viable. Covered the full picture here: https://reading.sh/gpt-5-4-just-dropped-heres-your-explainer-8fcc0126d84d?sk=ad5982c9f3b9382ff8fea9c32491a811

No posts

Ready for more?