In brief GLM-5.2 trails Claude Opus 4.8 by just 1% on FrontierSWE—a benchmark measuring multi-hour autonomous engineering projects—while beating GPT-5.5…
Sign in to your account
Remember me