Daily digest

AI updates of 6 May, 2026

GPT-5.5 Instant lands; Gemma 4 gets 3× speedup; Sierra hits $15B

TL;DR

MUST READ OpenAI flips ChatGPT to GPT-5.5 Instant; Google ships Gemma 4 MTP drafters for 3× free inference; Sierra hits $15B.

WORTH A LOOK Coding agents move: Claude Code 2.1.128, Codex /goal, Cursor SDK + Security Review, GitHub MCP secret-scan GA.

NOTED 8 more items: Mistral Medium 3.5, OpenAI WebRTC infra, async-Rust criticism, Anthropic FS agents, more.

The feed

MUST READ DEV MGR NON-DEV ARCH

GPT-5.5 Instant ships as ChatGPT default; `chat-latest` alias for devs

techcrunch.com · 2026-05-05

Replaces 5.3 Instant. AIME 2025 65.4 → 81.2, MMMU-Pro 69.2 → 76, lower hallucination claim, same low latency. Plus/Pro on web today; mobile soon.

For non-devs: ChatGPT’s default brain just got smarter and less likely to make stuff up — no setting to change.

#model #release #OpenAI

MUST READ DEV ARCH

Gemma 4 gets MTP drafters: up to 3× faster inference, no quality loss

blog.google · 2026-05-05

Speculative decoding pair for Gemma 4. Apache 2.0 on Hugging Face/Kaggle/Ollama; works with vLLM, SGLang, MLX, transformers.

For non-devs: Google made its open Gemma 4 model run up to 3× faster, free for anyone to use.

#model #OSS #inference

MUST READ DEV MGR

Sierra raises $950M at $15B; agents now in nearly half the Fortune 50

siliconangle.com · 2026-05-04

Tiger + GV led, Benchmark/Sequoia/Greenoaks in. $150M ARR. Sierra’s Agent SDK + Studio is now a serious competitor for Anthropic.

For non-devs: Bret Taylor’s customer-service AI agent company just hit a $15B valuation — agents are eating enterprise software.

#funding #agents

WORTH A LOOK DEV ARCH

Claude Code v2.1.128: `workspace` reserved MCP name, .zip plugin archives

github.com · 2026-05-04

/mcp shows tool counts and flags zero-tool servers; --plugin-dir takes .zip; OpenTelemetry envs no longer leak to subprocesses.

#ClaudeCode #release #MCP

WORTH A LOOK DEV

Codex CLI 0.128 ships persisted `/goal` workflows + sandbox profiles

github.com · 2026-04-30

Goals now survive resumes via app-server APIs; new TUI controls (create/pause/resume/clear). Bedrock + Windows sandbox stability up.

npm i -g @openai/codex

#Codex #release

WORTH A LOOK DEV ARCH

Cursor SDK launches: build agents on Cursor’s runtime + harness

cursor.com · 2026-04-29

TypeScript SDK in public beta. Cloud Agents API gets run-scoped ops + SSE streaming. Token-based pricing.

#Cursor #agents #release

WORTH A LOOK DEV MGR ARCH

Cursor Security Review (beta): PR-time vuln scans + dependency auditing

cursor.com · 2026-04-30

Reviewer agents flag auth/injection issues at PR time. Scheduled scans for known CVEs and outdated deps.

#Cursor #security

WORTH A LOOK DEV ARCH

OpenAI publishes its WebRTC architecture for real-time voice

openai.com · 2026-05-04

Split-relay + transceiver design for one-port-per-session media termination, stateful ICE/DTLS, and global routing. Rare infra read.

#voice #infra #OpenAI

WORTH A LOOK DEV MGR ARCH

GitHub MCP Server: secret-scanning hits GA across Copilot CLI + VS Code

github.blog · 2026-05-05

Pre-commit secret detection through any MCP-compatible agent. Honors push-protection customization on repos with Secret Protection enabled.

#MCP #GitHub #security

WORTH A LOOK DEV ARCH

Mistral Medium 3.5: dense 128B, 256k context, single weights

mistral.ai · 2026-05-05

First “merged” flagship — replaces Magistral, Medium 3.1 and Devstral 2 in Vibe. Native function calling, JSON, image input.

#model #Mistral #release

WORTH A LOOK DEV MGR ARCH

“Computer Use is 45× more expensive than structured APIs”

reflex.dev · 2026-05-05 · HN 306pts

Hard data on the cost of pixel-driven agents vs. clean APIs. Read it before greenlighting your next computer-use demo.

#agents #eval #cost

WORTH A LOOK DEV MGR

Anthropic: Agents for financial services + Blackstone/Goldman JV

anthropic.com · 2026-05-04 / 05

Two-day combo: vertical “Agents for FS” launch and a new enterprise-AI services company with Blackstone, H&F, Goldman.

#Anthropic #vertical #agents

NOTED · 8 items

Claude Code v2.1.126: project purge, gateway model picker, OAuth-paste login · 2026-05-01
opencode v1.14.39: HTTP_PROXY support, system CA trust in desktop · 2026-05-05
Mistral Medium 3.5 weights up on Hugging Face · 2026-05-05
GLM-5V-Turbo: native foundation model for multimodal agents · arxiv · HN 109pts
“Async Rust never left the MVP state” · tweedegolf.nl · 424pts
Zig → Rust porting guide from the Bun team · 698pts
xAI bringing Grok Voice to Apple CarPlay · 2026-05-04
NVIDIA’s Jensen Huang unveils “Ising” quantum AI model · 2026-05-03

Trending repos & tools

AI / dev-tools projects gaining traction this week.

warpdotdev/warp ★ +28.5k

Agentic dev environment, born out of the terminal. Rust.

TauricResearch/TradingAgents ★ +14.7k

Multi-agent LLM framework for financial trading. Python.

AIDC-AI/Pixelle-Video ★ +4.2k

AI-powered short-video generation engine. Python.

forrestchang/andrej-karpathy-skills ★ +2.4k

Single CLAUDE.md distilled from Karpathy’s coding observations.

virattt/dexter ★ +2.0k

Autonomous agent for deep financial research. TypeScript.

mattpocock/skills ★ +25.4k

Curated Claude Code skills for “real engineers.”

ruvnet/ruflo ★ +9.2k

Claude orchestration with multi-agent swarms. TypeScript.

ComposioHQ/awesome-codex-skills ★ +3.4k

Curated Codex skills for CLI + API workflow automation.

Hmbown/DeepSeek-TUI ★ +2.4k

DeepSeek-driven coding agent in your terminal. Rust.

zed-industries/zed ★ +1.9k

Multiplayer code editor — sustained breakout. Rust.

On the radar — people & vendors

Sam Altman — GPT-5.5 Instant becomes ChatGPT default · 2026-05-05
Dario Amodei — FS agents + Blackstone/Goldman JV; Claude Code 2.1.128 · 2026-05-04/05
Demis Hassabis — Gemma 4 MTP drafters: 3× speedup · 2026-05-05
Elon Musk — Grok Voice headed to Apple CarPlay · 2026-05-04
Jensen Huang — NVIDIA “Ising” quantum AI model unveiled · 2026-05-03
Bret Taylor — Sierra raises $950M at $15B · 2026-05-04
Mistral — Medium 3.5: dense 128B, 256k context · 2026-05-05
Cursor — Cursor SDK + Security Review beta · 2026-04-29/30

Full timeline

2026-05-05 · GPT-5.5 Instant default — replaces 5.3 in ChatGPT
2026-05-05 · Gemma 4 MTP drafters — up to 3× faster, Apache 2.0
2026-05-05 · GitHub MCP secret scanning GA — Copilot CLI + VS Code
2026-05-05 · Anthropic agents for FS — vertical agent push
2026-05-05 · Mistral Medium 3.5 — dense 128B, 256k context
2026-05-05 · opencode v1.14.39 — HTTP_PROXY + system CA support
2026-05-04 · Sierra raises $950M — $15B valuation, agent SDK leader
2026-05-04 · Claude Code v2.1.128 — workspace MCP + .zip plugins
2026-05-04 · OpenAI WebRTC deep dive — split-relay architecture
2026-05-04 · Anthropic + Blackstone JV — enterprise AI services co
2026-05-04 · Grok Voice → CarPlay — non-Tesla rollout incoming
2026-05-03 · NVIDIA “Ising” — open-source quantum platform
2026-05-01 · Claude Code v2.1.126 — project purge, gateway picker
2026-05-01 · Musk: xAI distilled OpenAI models — courtroom admission
2026-04-30 · Codex CLI 0.128 — persisted /goal workflows
2026-04-30 · Cursor Security Review beta — PR-time vuln scanning
2026-04-29 · Cursor SDK launches — TS SDK in public beta

Compiled by an autonomous Claude agent. Sources linked inline.

Archit Dwivedi