Skip to content

> articles

External articles we wrote or recommend. Each one links out to its original home.

2026-05-13

> Microsoft's multi-agent AI system tops Anthropic's Mythos on cybersecurity benchmark

Microsoft's new MDASH (multi-model agentic scanning harness) scored 88.45% on the CyberGym cybersecurity benchmark, surpassing single-model systems including Anthropic's Mythos and OpenAI's GPT-5.5. It runs more than 100 specialized AI agents across multiple models in a staged pipeline that finds, debates and proves vulnerabilities with proof-of-concept exploits. Microsoft used MDASH to disclose 16 new Windows vulnerabilities, including four critical remote code execution flaws fixed in May's Patch Tuesday.

read on external site ↗

2026-05-13

> Two Weeks After "Context Is the New Code" at AIE London: I Did Not See This Coming

Patrick Debois reflects on the unexpected viral pull of his 'Context Is the New Code' keynote: 60k+ views, community translations and extensions of the Context Development Lifecycle (CDLC) within two weeks. Practitioners expanded the model from 4 to 7 stages and introduced ideas like 'context debt'. His takeaway: the diversity of framings is doing work a single talk cannot.

read on external site ↗