June 2026
- Jun 4 THU AI News CMA forces Google opt-out, Alphabet drops 4% on $85B raise, Muse Spark 4th 13 min
- Jun 4 THU AI Research Dharma-AI cuts OCR loops 59%, Nemotron learns eval design, HKUST audits AI ideas 13 min
- Jun 4 THU AI Tech Gemma 4 skips vision guards, Reachy Mini trusts MCP, Android verifies in-stack 11 min
- Jun 3 WED AI Tech Microsoft ships MAI on cost, Holo3.1 lands at 1/10, Willison picks MicroPython 11 min
- Jun 3 WED AI News Microsoft's MAI stack, Anthropic's Glasswing at 150 orgs, OpenAI recasts Codex 14 min
- Jun 3 WED AI Research MiniMax-M2 fails repro, Anthropic logs 832 bans, Gemini Embedding 2 lands 12 min
- Jun 2 TUE AI News Anthropic files S-1 at $965B, OpenAI lands on AWS, model cracks Erdős 12 min
- Jun 2 TUE AI Tech GM cuts crash sims to 60s, IBM scaffolds enterprise LLMs, Google answers MCP 13 min
- Jun 2 TUE AI Research Sleep loops triple Rule 110, QUEST claims frontier parity, CUA-Gym hits 72.6% 13 min
- Jun 1 MON AI News Anthropic hits $965B, Brockovich maps data centers, clinicians reject Levie 11 min
- Jun 1 MON AI Tech NVIDIA bets on Cosmos 3, Willison endorses 'cancel AI,' Datasette ships 1.0a32 9 min
- Jun 1 MON AI Research SkillOpt sweeps 52-for-52, ByteDance's Shannon law, Pion rescues Muon RLVR 14 min
May 2026
- May 31 SUN AI Tech Anthropic patches Claude sandbox, DeepSWE flags git log, Datasette in Pyodide 11 min
- May 31 SUN AI News Copilot meters to $600, Whitacre quits OSS, SoftBank pledges €75B to France 13 min
- May 30 SAT AI Tech Hugging Face profiles torch, Antigravity wipes drives, Codex skips SWE-Bench 10 min
- May 30 SAT AI Research OpenAI publishes its eval playbook; AISI and Apollo flag what it doesn't share 4 min
- May 30 SAT AI News OpenAI gates Rosalind, Gemini Omni decays at turn 4, Shift films NYC homes 13 min
- May 29 FRI AI Tech Opus 4.8 abstains, jqwik plants a delete prompt, Devin's 80% won't travel 12 min
- May 29 FRI AI News Outside numbers undercut Anthropic's $965B, Google's agent pivot, OpenAI's 33% 14 min
- May 29 FRI AI Research Qwen3.5 buys 88.6% false claims, ESMFold2 tops AF3, Muon doubles on rare tokens 13 min
- May 28 THU AI Research Anthropic agent survey, IBM SRE benchmark at 47%, HRM-Text trained for $1,500 13 min
- May 28 THU AI News Anthropic, OpenAI bill by token, SQLite bans agent PRs, YouTube auto-labels AI 14 min
- May 28 THU AI Tech TRL cuts RL sync to 6 seconds, Reachy Mini drops cloud voice APIs 8 min
- May 27 WED AI Tech Approval-prompt defenses crack at Microsoft Copilot and Anthropic Claude 6 min
- May 27 WED AI News GLM-5.1 ties frontier on SWE-bench, curl swamped by AI reports, BadHost narrows 13 min
- May 27 WED AI Research GEPA, AutoResearchClaw, Gaperon: each headline turns on its verification step 14 min
- May 26 TUE AI Tech Hugging Face's agent glossary makes the harness a first-class layer 3 min
- May 26 TUE AI News Pope hands Anthropic the dais, Folha suit ends in deal, ClickUp cuts 286 13 min
- May 26 TUE AI Research Code harness lifts GPT-5.5 26pt, Llama skips tool calls, TDDev's 65.8% rebutted 13 min
- May 25 MON AI Research AIRA short of Hymba, UI traces ID 14 agents, MANSU holds unlearning at 4-bit 14 min
- May 25 MON AI Tech Datasette wires fixtures, Ronacher cuts bugs to 4 lines, Claude ports a 1983 PDF 11 min
- May 25 MON AI News Personas jailbreak GPT-5.1, Bee's LED inverts, Google shrugs at Gemini RCE 11 min
- May 24 SUN AI Tech ARIA 1.3's definition-list roles die in draft, leaving `<dl>` where it started 3 min
- May 24 SUN AI News OpenAI pilots stall 88%, xAI sued over Memphis turbines, Google holds Omni edit 12 min
- May 23 SAT AI News Google's $916 OS audited, NTSB halts AI pilot voices, startup ARR off 300% 12 min
- May 23 SAT AI Tech AI's HBM buildout shows up on entry-phone BOMs and Transsion's P&L 4 min
- May 23 SAT AI Research Mythos finds 10K vulns, Nemotron 8B hits 4× AR, Dharma 3B beats Opus 52× 12 min
- May 22 FRI AI Tech Datasette Agent self-heals SQL, Daytona boots sandboxes in 60ms 8 min
- May 22 FRI AI News FTC fines fake AI, CEOs kill Trump's order, Protect Working Musicians Act filed 12 min
- May 22 FRI AI Research SU-01 wins IMO gold, WildClawBench: 18pt harness gap, Lighthouse strips sparsity 13 min
- May 21 THU AI Research Orthrus drafts Qwen3 blocks, MinT serves million LoRAs, CFD agent gates with VLM 13 min
- May 21 THU AI Tech Ramp's Inspect merges 55%, Railway token wipes prod, tokenspeed reframes t/s 9 min
- May 21 THU AI News xAI bills Anthropic $1.25B/mo, OpenAI cracks Erdős, Google rebuilds on agents 14 min
- May 20 WED AI Research Gemini reaches 17 US labs, Pokémon harness self-edits, RL resurfaces known facts 14 min
- May 20 WED AI Tech Gemini 3.5 Flash GAs 3× pricier, Ettin 17M tops MiniLM, co-scientists retrieve 13 min
- May 20 WED AI News Google I/O 2026 leans on Flash as prices triple and agent safeguards crack 14 min
- May 19 TUE AI News Anthropic absorbs Stainless, jury clears OpenAI, Willison times the agent leap 13 min
- May 19 TUE AI Tech IBM's leaderboard rebutted, NVIDIA's rank-8 enough, PaddleOCR slides to third 11 min
- May 19 TUE AI Research Single neuron jailbreaks LLMs, MIT ELF cuts data 10×, Qwen-Image hits 4 steps 12 min
- May 18 MON AI Research Meta cuts BLT 77%, ReasonMaxxer skips RL at $25, MV-Split hits 1000 DiT layers 14 min
- May 18 MON AI News Siri routes to Gemini, NHS hides code from Mythos, Wendy's swaps wages for AI 13 min
- May 17 SUN AI News Brockman runs OpenAI product, DeepSeek trails 8 months, Malta gifts ChatGPT Plus 13 min
- May 17 SUN AI Tech Anthropic loses OpenClaw, Evans drops Tailwind, CFOs distrust agent data 9 min
- May 16 SAT AI Tech Anthropic admits 40% token bug, Abridge hits Epic, OpenAI Codex trails Gemini 12 min
- May 16 SAT AI News ChatGPT wires 12,000 banks, PwC trains 30,000 on Claude, arXiv bans AI slop 12 min
- May 15 FRI AI News Anthropic loses DC and Microsoft as OpenAI patches ChatGPT mid-trial 11 min
- May 15 FRI AI Tech Bun ports to Rust, Granite tops sub-100M MTEB, Transformers ships async batching 13 min
- May 15 FRI AI Research Google's math harness at 48%, CMU swarm +38.7%, Chinchilla adds repeat penalty 12 min
- May 14 THU AI Tech OpenAI rebuilds Codex sandbox on Windows, Willison ships Codex-built blog 7 min
- May 14 THU AI News Sutskever memo and TanStack worm rattle OpenAI as Anthropic courts SMBs 14 min
- May 14 THU AI Research SWE-WebDevBench caps at 60%, φ_first beats voting, Qwen3-4B splits think/speak 12 min
- May 13 WED AI Tech llm 0.32a2 adopts Responses API, CSP tool gates fetches, Codex repros segfault 9 min
- May 13 WED AI News OpenAI faces overdose suit, CMS pays AI care agents, Google merges ChromeOS 13 min
- May 13 WED AI Research Parameter Golf, SymptomAI, Workspace-Bench post wins on unaudited evals 14 min
- May 12 TUE AI Research MolmoAct2 tops GPT-5, Meta clears CWM on lax evals, Stanford caps counting at 2K 13 min
- May 12 TUE AI News OpenAI promises PE 17.5%, Thinking Machines voice MoE trails, Google blames LLM 11 min
- May 12 TUE AI Tech Shopify hits 77% merges, Willison ships shebang scripts, Shore prices the bill 12 min
- May 11 MON AI Research Agibot's LWD beats SFT, Odysseus tops GPT-5.4 5×, TreeFlow reframes boosting 11 min
- May 11 MON AI News NYT runs AI-faked quote, Google Finance ships at 43%, Anthropic blames sci-fi 12 min
- May 11 MON AI Tech Quinn's 10 MB FST beats a 3 GB SQLite because Finnish shares affixes 3 min
- May 10 SUN AI News DOJ probes Nvidia's $40B, Congress targets AI toys, Sarvam beats Wispr by 50pts 13 min
- May 10 SUN AI Research OncoAgent posts 100% on a retrieval proxy, skips the NCCN check 4 min
- May 10 SUN AI Tech Tossell's Codex-built Gmail client ships agent hooks, skips the injection audit 3 min
- May 9 SAT AI Research Anthropic rewrites honeypots, Ai2 routes documents, CyberSecQwen-4B beats Cisco 10 min
- May 9 SAT AI News Anthropic pays $570K, Google adds AI Overview citations, GPT-Realtime-2 at 1.12s 12 min
- May 9 SAT AI Tech Claude pushes HTML at 18×, OpenAI ships Codex safety post, Curley flags WebRTC 11 min
- May 8 FRI AI Research Anthropic NLAs probed, AlphaEvolve cloned cheaper, Nemotron loses voice lead 12 min
- May 8 FRI AI News China gap at 3-9 months, OpenAI ties Gemini voice, Anthropic divests Petri 13 min
- May 8 FRI AI Tech Mozilla files 423 CVEs, Gemini Flash-Lite up 3.75×, Willison ships commit tool 11 min
- May 7 THU AI Research Anthropic Institute opens, ElementsClaw verifies old hits, DXRG cuts 57% to 3% 12 min
- May 7 THU AI News Musk leases Colossus to Anthropic, Terafab pegged at $5T, OpenAI suit narrows 14 min
- May 7 THU AI Tech OpenAI donates MRC, Google strips Gemma 4 heads, ServiceNow rolls back vLLM V1 12 min
- May 6 WED AI Tech Andon agent forges permits, Hugging Face hides ASR audio, Willison patches llm 12 min
- May 6 WED AI Research GPT-5.2 leans on physicists, Sylph defers benchmarks, RecursiveMAS skips a rival 12 min
- May 6 WED AI News OpenAI claims the default and the ad slot; Anthropic claims FactSet's 13 min
- May 5 TUE AI News Anthropic and OpenAI's three plays: services JVs, distillation law, 2028 odds 13 min
- May 5 TUE AI Tech OpenAI's UDP voice relay, antirez ships Redis ARGREP, Gemini adds webhooks 11 min
- May 5 TUE AI Research Meta locks Sapiens2's license, Tuna-2 drops encoders, Apple routes KV layers 12 min
- May 4 MON AI Tech Anthropic's 9% sycophancy figure sits 3-5× under outside benchmarks 4 min
- May 4 MON AI News Beth Israel hedges o1-preview, KC Green fights Artisan, DualShot audits Claude 10 min
- May 4 MON AI Research Sessa beats Transformers, CHAI beats Gemini, agent survey regrades Sora 12 min
- May 3 SUN AI News Oscars price AI out of acting, Grok cuts tokens 60%, o1 runs hit $2,800 12 min
- May 3 SUN AI Tech Willison merges a blog feature from his phone; Wispr Flow idles at 800MB 7 min
- May 2 SAT AI Research AISI clocks GPT-5.5 jailbroken in 6 hours; Oxford ties warmth to 10-30pt errors 7 min
- May 2 SAT AI Tech Claude Code for web, demonstrated: Willison's campsite iNaturalist viewer 3 min
- May 2 SAT AI News DoD freezes out Anthropic, Musk's emails sink his case, Meta drops 9.4% 13 min
- May 1 FRI AI Research DeepMind's wrong test, Church recodes E. coli, Princeton's MoE tradeoff 14 min
- May 1 FRI AI Tech GPT-5.5 cracked in 6 hours, /goal loops unchecked, app feeds need an installer 11 min
- May 1 FRI AI News Musk testifies on Grok, Goodfire debuts Silico, Stripe tokenizes agent pay 12 min
April 2026
- Apr 30 THU AI Tech OpenAI's goblin fix, evals as bottleneck, Willison's `llm` goes typed 11 min
- Apr 30 THU AI Research Sycophancy, agent-coded bugs, research agents: outside audits widen each gap 13 min
- Apr 30 THU AI News Three AI pitches shrink under outside scrutiny: Anthropic, OpenAI, Zig 14 min
- Apr 29 WED AI Tech Nemotron Nano Omni is a sub-agent; Codex's prompt is a post-mortem 7 min
- Apr 29 WED AI Research Bio, skills, and judges: three benchmarks debut with the cracks already mapped 14 min
- Apr 29 WED AI News OpenAI's unverified plan, Seoul's contradiction, rural America's veto 13 min
- Apr 28 TUE AI Research Agent benchmarks on trial: gaming, unreliability, and self-graded wins 13 min
- Apr 28 TUE AI News The week AI's platform alliances got renegotiated in public 13 min
- Apr 28 TUE AI Tech New tooling, same bottlenecks — just moved one step downstream 12 min
- Apr 27 MON AI News The frontier labs are rewriting the contracts beneath them 10 min
- Apr 27 MON AI Tech OpenAI's release cadence is outrunning its deployment story 11 min
- Apr 27 MON AI Research When the scaffold outweighs the model: a day of harness-defined results 13 min
- Apr 26 SUN AI Tech Around the weights: gating, silicon, and scaffolding take over 11 min
- Apr 26 SUN AI Research DeepSeek-V4's Ascend pivot: cheaper tokens, shakier answers 4 min
- Apr 26 SUN AI News Lock-in day: OpenAI consolidates, DeepSeek picks Huawei, Google backs Anthropic 10 min
- Apr 25 SAT AI News Compute, capital, and access: the real shape of the AI frontier today 12 min
- Apr 25 SAT AI Research DeepSeek-V4 closes the open-weights capability gap — and reopens others 4 min
- Apr 25 SAT AI Tech The model isn't the variable anymore — access, harness, and silicon are 11 min
- Apr 24 FRI AI Tech The frontier splits three ways: pricier, cheaper, and sorrier 13 min
- Apr 24 FRI AI News Repositioning day: OpenAI productizes, Google bifurcates, Anthropic rations 13 min
- Apr 24 FRI AI Research Scaffolding, not weights: where AI research is actually moving today 13 min
- Apr 23 THU AI Tech Vendors ship the wins; deployers inherit the work 11 min
- Apr 23 THU AI News What Google, OpenAI, and Anthropic aren't saying out loud 13 min
- Apr 23 THU AI Research The work is migrating outward from the weights 13 min
- Apr 22 WED AI Research Long-horizon agents are outrunning their yardsticks 13 min
- Apr 22 WED AI Tech The pitches are confident; the people checking them aren't keeping up 11 min
- Apr 22 WED AI News Scaffolding and subsidies, not weights, are carrying AI's headline numbers 14 min
- Apr 21 TUE AI Research Agent research moves from leaderboard scores to the trace itself 13 min
- Apr 21 TUE AI Tech Open weights win the technical debate, lose the governance one 3 min
- Apr 21 TUE AI News OpenAI plays defense as open weights and agents redraw the coding stack 10 min
- Apr 20 MON AI Tech Agent harnesses and local leaderboards mature — and quietly hide their tradeoffs 8 min
- Apr 20 MON AI News Anthropic's Mega-Deal Day, and the Tiers Beneath It 11 min
- Apr 20 MON AI Research Clean in the lab, brittle in production: a day of disappearing wins 13 min