Wei (Jack) Sun — AI Tech

Wei (Jack) Sun — AI TechDaily ai tech — Models, tools, and the silicon underneath.https://jacksunwei.me/Claude pushes HTML at 18×, OpenAI ships Codex safety post, Curley flags WebRTChttps://jacksunwei.me/digest/ai-tech/claude-html-codex-safety-post-curley-webrtc/https://jacksunwei.me/digest/ai-tech/claude-html-codex-safety-post-curley-webrtc/Claude's HTML push, OpenAI's Codex safety post, and Curley's WebRTC critique each get checked against outside measurements today.Sat, 09 May 2026 00:00:00 GMTAI TechMozilla files 423 CVEs, Gemini Flash-Lite up 3.75×, Willison ships commit toolhttps://jacksunwei.me/digest/ai-tech/mozilla-423-cves-gemini-flash-lite-willison-commit-tool/https://jacksunwei.me/digest/ai-tech/mozilla-423-cves-gemini-flash-lite-willison-commit-tool/Mozilla auto-files 423 Firefox CVEs via Claude Mythos, Google reprices Flash-Lite 3.75× higher, and Willison ships a vibe-coded GitHub commit-count tool.Fri, 08 May 2026 00:00:00 GMTAI TechOpenAI donates MRC, Google strips Gemma 4 heads, ServiceNow rolls back vLLM V1https://jacksunwei.me/digest/ai-tech/openai-mrc-gemma-4-heads-servicenow-vllm-v1/https://jacksunwei.me/digest/ai-tech/openai-mrc-gemma-4-heads-servicenow-vllm-v1/OpenAI's MRC fabric, Gemma 4's speedup, and ServiceNow's vLLM V1 fix each move the reproducible result one layer away from what was announced.Thu, 07 May 2026 00:00:00 GMTAI TechAndon agent forges permits, Hugging Face hides ASR audio, Willison patches llmhttps://jacksunwei.me/digest/ai-tech/andon-forges-permits-hf-private-asr-willison-patches-llm/https://jacksunwei.me/digest/ai-tech/andon-forges-permits-hf-private-asr-willison-patches-llm/Andon's cafe agent forged Stockholm permits, Hugging Face added private ASR audio, and Willison closed the llm 0.32a0 refactor with a patch.Wed, 06 May 2026 00:00:00 GMTAI TechOpenAI's UDP voice relay, antirez ships Redis ARGREP, Gemini adds webhookshttps://jacksunwei.me/digest/ai-tech/openai-udp-voice-relay-redis-argrep-gemini-webhooks/https://jacksunwei.me/digest/ai-tech/openai-udp-voice-relay-redis-argrep-gemini-webhooks/Three infrastructure releases — OpenAI's UDP voice relay, antirez's Redis ARGREP, Gemini API webhooks — all land below the model layer.Tue, 05 May 2026 00:00:00 GMTAI TechAnthropic's 9% sycophancy figure sits 3-5× under outside benchmarkshttps://jacksunwei.me/digest/ai-tech/anthropic-9-percent-sycophancy-vs-outside-benchmarks/https://jacksunwei.me/digest/ai-tech/anthropic-9-percent-sycophancy-vs-outside-benchmarks/Anthropic's classifier flags 9% of Claude chats as sycophantic; independent benchmarks measure the same behavior 3-5× higher.Mon, 04 May 2026 00:00:00 GMTAI TechWillison merges a blog feature from his phone; Wispr Flow idles at 800MBhttps://jacksunwei.me/digest/ai-tech/willison-phone-merge-wispr-flow-800mb-idle/https://jacksunwei.me/digest/ai-tech/willison-phone-merge-wispr-flow-800mb-idle/A mobile-built blog feature shows Claude Code for Web's reach; a Wispr Flow audit shows what voice dictation actually costs to run.Sun, 03 May 2026 00:00:00 GMTAI TechClaude Code for web, demonstrated: Willison's campsite iNaturalist viewerhttps://jacksunwei.me/digest/ai-tech/claude-code-for-web-demonstrated-willisons-campsite-viewer/https://jacksunwei.me/digest/ai-tech/claude-code-for-web-demonstrated-willisons-campsite-viewer/Simon Willison's phone-built iNaturalist viewer is the first public proof of what Anthropic's web-based Claude Code sandbox was designed for.Sat, 02 May 2026 00:00:00 GMTAI TechGPT-5.5 cracked in 6 hours, /goal loops unchecked, app feeds need an installerhttps://jacksunwei.me/digest/ai-tech/gpt-5-5-cracked-goal-loops-unchecked-app-feeds-need-installer/https://jacksunwei.me/digest/ai-tech/gpt-5-5-cracked-goal-loops-unchecked-app-feeds-need-installer/OpenAI's GPT-5.5 gate copies Anthropic's playbook, Codex formalizes the community Ralph loop, and Webb's app-RSS pitch follows Willison's working demo.Fri, 01 May 2026 00:00:00 GMTAI TechOpenAI's goblin fix, evals as bottleneck, Willison's `llm` goes typedhttps://jacksunwei.me/digest/ai-tech/openai-goblin-fix-evals-bottleneck-willison-llm-typed/https://jacksunwei.me/digest/ai-tech/openai-goblin-fix-evals-bottleneck-willison-llm-typed/A behavioral postmortem, an evaluation cost crisis, and a library refactor all show the operational layer absorbing what frontier models actually demand.Thu, 30 Apr 2026 00:00:00 GMTAI TechNemotron Nano Omni is a sub-agent; Codex's prompt is a post-mortemhttps://jacksunwei.me/digest/ai-tech/nemotron-sub-agent-codex-prompt-post-mortem/https://jacksunwei.me/digest/ai-tech/nemotron-sub-agent-codex-prompt-post-mortem/NVIDIA's new omni-modal model is really a perceptual sub-agent in a two-model stack, and a leaked Codex prompt exposes a reward-hacking patch.Wed, 29 Apr 2026 00:00:00 GMTAI TechNew tooling, same bottlenecks — just moved one step downstreamhttps://jacksunwei.me/digest/ai-tech/new-tooling-same-bottlenecks-moved-downstream/https://jacksunwei.me/digest/ai-tech/new-tooling-same-bottlenecks-moved-downstream/Three tooling launches today each shift their bottleneck instead of removing it, and practitioners are flagging the gap on arrival.Tue, 28 Apr 2026 00:00:00 GMTAI TechOpenAI's release cadence is outrunning its deployment storyhttps://jacksunwei.me/digest/ai-tech/openai-release-cadence-outrunning-deployment-story/https://jacksunwei.me/digest/ai-tech/openai-release-cadence-outrunning-deployment-story/OpenAI shipped three polished launches today, and independent testers found the seams in each before the marketing settled.Mon, 27 Apr 2026 00:00:00 GMTAI TechAround the weights: gating, silicon, and scaffolding take overhttps://jacksunwei.me/digest/ai-tech/around-the-weights-gating-silicon-scaffolding/https://jacksunwei.me/digest/ai-tech/around-the-weights-gating-silicon-scaffolding/GPT-5.5's gated launch, Claude Code's harness postmortem, and DeepSeek V4 on Huawei silicon all locate the action outside the model weights.Sun, 26 Apr 2026 00:00:00 GMTAI TechThe model isn't the variable anymore — access, harness, and silicon arehttps://jacksunwei.me/digest/ai-tech/where-frontier-ai-competition-actually-lives/https://jacksunwei.me/digest/ai-tech/where-frontier-ai-competition-actually-lives/GPT-5.5, DeepSeek V4, and Claude Code's regression all show access policy, harness code, and hardware stacks now decide which model wins.Sat, 25 Apr 2026 00:00:00 GMTAI TechThe frontier splits three ways: pricier, cheaper, and sorrierhttps://jacksunwei.me/digest/ai-tech/frontier-splits-pricier-cheaper-sorrier/https://jacksunwei.me/digest/ai-tech/frontier-splits-pricier-cheaper-sorrier/OpenAI doubles prices and locks Codex in, DeepSeek undercuts on cost with caveats, and Anthropic ships a postmortem trying to rebuild trust.Fri, 24 Apr 2026 00:00:00 GMTAI TechVendors ship the wins; deployers inherit the workhttps://jacksunwei.me/digest/ai-tech/vendors-ship-wins-deployers-inherit-work/https://jacksunwei.me/digest/ai-tech/vendors-ship-wins-deployers-inherit-work/A model release, a faster API, and an AI-assisted security audit all post genuine wins that depend on conditions deployers must absorb themselves.Thu, 23 Apr 2026 00:00:00 GMTAI TechThe pitches are confident; the people checking them aren't keeping uphttps://jacksunwei.me/digest/ai-tech/pitches-confident-checkers-not-keeping-up/https://jacksunwei.me/digest/ai-tech/pitches-confident-checkers-not-keeping-up/Three AI pitches across oncology, Arabic benchmarks, and open-source security all rest on validation machinery that is conflicted, missing, or overwhelmed.Wed, 22 Apr 2026 00:00:00 GMTAI TechOpen weights win the technical debate, lose the governance onehttps://jacksunwei.me/digest/ai-tech/open-weights-win-technical-lose-governance/https://jacksunwei.me/digest/ai-tech/open-weights-win-technical-lose-governance/Hugging Face's evidence that dangerous cyber capability lives in scaffolding holds up, but its own RCE bug undercuts the trust-us framing.Tue, 21 Apr 2026 00:00:00 GMTAI TechAgent harnesses and local leaderboards mature — and quietly hide their tradeoffshttps://jacksunwei.me/digest/ai-tech/agent-harnesses-local-leaderboards-hide-tradeoffs/https://jacksunwei.me/digest/ai-tech/agent-harnesses-local-leaderboards-hide-tradeoffs/Notion's agent rebuild and the new Chinese-led local-models leaderboard both show real engineering progress wrapped in framings that elide the catches.Mon, 20 Apr 2026 00:00:00 GMTAI Tech