JS Wei (Jack) Sun

Tossell's Codex-built Gmail client ships agent hooks, skips the injection audit

Every URL the pipeline pulled into ranking for this issue — primary sources plus the supporting and contradicting findings each Researcher returned. Inline citations in the issue point back here.

← Back to the issue

Sources

Ben’s Builds #3 - an email app bensbites.com

They have some challenges 😅

References

Factory.ai Terminal-Bench announcement factory.ai

Factory’s Droids currently hold the top position with a 63.1% success rate, narrowly edging out the OpenAI Codex CLI at 60.4%

Matsuoka — Factory AI CodeDroid review hyperdev.matsuoka.com

developers have reported burning through entire monthly allowances on single, complex features that require multiple iterations to stabilize

Search Engine Journal — Google guidance on building for agents searchenginejournal.com

a site’s ‘accessibility tree’—originally designed for screen readers—serves as the primary high-fidelity map for AI agents

AtomicMail — Email AI privacy analysis atomicmail.io

Superhuman’s AI could be manipulated by a malicious incoming email to exfiltrate a user’s entire inbox contents to an external attacker via hidden Markdown image requests

msgvault.io architecture docs msgvault.io

history records are typically only available for one week; if a client stays offline longer, a full sync must be re-triggered

Hoangyell — Defuddle explained hoangyell.com

Defuddle is optimized to render pages in under 50 milliseconds by utilizing a site-specific ‘Extractor Registry’ for complex domains

Jack Sun

Jack Sun, writing.

Engineer · Bay Area

Hands-on with agentic AI all day — building frameworks, reading what industry ships, occasionally writing them down.

Digest
All · AI Tech · AI Research · AI News
Writing
Essays
Elsewhere
Subscribe
All · AI Tech · AI Research · AI News · Essays

© 2026 Wei (Jack) Sun · jacksunwei.me Built on Astro · hosted on Cloudflare