AI News

Breaking releases, model updates, and industry moves. Fast, technical, no hype.

News | May 22, 2026
5 min read

4B-Parameter Coding Agents Are Outpacing Expectations

Small coding models with 4B active parameters are hitting 87%+ on coding benchmarks, challenging the assumption that you need a 70B+ model for a useful coding agent.

Source: GitHub Trending

News | May 21, 2026
4 min read

Codex Complexity Optimizer: Safe Codebase Analysis

Codex Complexity Optimizer is a Codex skill for safe, read-only codebase analysis that produces structured performance optimization reports. 810 GitHub stars in its first days.

Source: GitHub

Breaking | News | May 20, 2026
4 min read

ZeroLang: Vercel's Programming Language for Agents

Vercel Labs drops ZeroLang, a programming language built for AI agents — 3,789 GitHub stars at launch. What it is, why general-purpose languages fall short for agents, and what this signals.

Source: Vercel Labs

News | May 19, 2026
4 min read

SmallCode: 87% SWE-bench at 4B Active Parameters

SmallCode is a new open-source coding agent optimized for small LLMs, claiming 87% on SWE-bench Verified using only 4B active parameters — comparable to GPT-4o with a fraction of the compute.

Source: GitHub

Breaking | News | May 18, 2026
5 min read

Anthropic Releases Claude Sonnet 4.6 for Claude Code

Claude Sonnet 4.6 is now the default model for Claude Code: faster output and lower cost than Opus 4.7, with equivalent quality on most coding tasks. What changed and when to switch.

Source: Anthropic

Claude Code | May 17, 2026
4 min read

Claude Code Goes Web: Remote Execution and GitHub-Native Sessions

Anthropic now runs Claude Code in managed cloud containers that clone your repo on session start and push to GitHub-scoped branches—no laptop required.

Source: Anthropic

News | May 16, 2026
5 min read

Vercel Zero: A Programming Language Built for AI Agents

Vercel Labs just open-sourced Zero, a programming language designed from scratch for AI agent workflows. Here's what it does differently from Python and TypeScript, and why the abstraction matters.

Source: Vercel Labs

Breaking | News | May 15, 2026
4 min read

html-anything: AI Agent HTML Editor Tops GitHub Trending

html-anything hit 1,737 GitHub stars in days. The agentic HTML editor lets a local AI write HTML across 75 skills and 9 output surfaces—magazine, deck, poster, prototype, and more.

Source: GitHub

News | May 13, 2026
6 min read

Perplexity Sonar Pro API: What Developers Need to Know

Perplexity Sonar Pro bundles live web search with LLM reasoning in one API call. Covers tiers, citation behavior, pricing, and when it beats a custom RAG pipeline.

Source: Perplexity AI

News | May 12, 2026
4 min read

CodexSaver: Run OpenAI Codex on DeepSeek to Cut Costs

CodexSaver is an open-source proxy that routes OpenAI Codex API calls to DeepSeek backends for tasks where DeepSeek matches Codex's output quality, cutting costs without degrading results. 431 stars in 72 hours.

Source: GitHub

AI Infrastructure | May 11, 2026
4 min read

TokenSpeed: A New LLM Inference Engine Hits 941 Stars in Days

TokenSpeed is a new open-source LLM inference engine that claims 'speed-of-light' throughput. Here's what it does differently, who built it, and whether the benchmarks hold up.

Source: GitHub

AI Models | May 10, 2026
5 min read

DeepSeek 4 Flash: antirez Ships Metal Inference to 5K Stars

antirez (Redis creator) released ds4, a Metal inference engine for DeepSeek 4 Flash on Apple Silicon. 5,400 GitHub stars in days — here's what it does and why it matters.

Source: GitHub

News | May 9, 2026
4 min read

Google I/O 2026: What to Expect from Gemini and Project Astra

Google I/O 2026 lands May 19-21. Based on Google's recent AI releases — Gemini 2.5 Pro, Gemini 3.1 Flash TTS, and Project Astra milestones — here's what developers should expect at the keynote.

Source: Google

News | May 8, 2026
5 min read

LangGraph 0.4: Native MCP Tool Calling and Parallel Branch Streaming

LangGraph 0.4 ships a native MCP client adapter, parallel-branch event streaming, and a SharedStore API for cross-agent state, cutting much of the boilerplate from multi-agent graph wiring.

Source: LangChain

Breaking | AI Tools News | May 7, 2026
5 min read

DeepClaude: Claude Code at 17x Lower Cost

DeepClaude proxies Claude Code through DeepSeek V4 Pro or any Anthropic-compatible backend, cutting costs by up to 17x. Here's what it does and who benefits.

Source: GitHub

News | May 6, 2026
4 min read

Gemini 2.5 Pro: 1M Context and Thinking Mode Hit the API

Google's Gemini 2.5 Pro lands in the API with a 1M-token context window, a thinking mode for multi-step reasoning, and a revised pricing model. Here's what changed and why it matters for developers.

Source: Google DeepMind

News | May 6, 2026
4 min read

LangGraph 0.3: Persistent Memory and Streaming Interrupts Land

LangGraph 0.3 ships persistent checkpointing across backends, interrupt-before/after node execution, and a revised streaming API — here's what changed and what it means for production agent workflows.

Source: LangChain

Breaking | Features | May 1, 2026
9 min read

Instant Launches 'Instant Agents': An AI Team for Shopify That Builds PDPs, Converts Figma, and Picks Winning Variants

Instant — the no-code Shopify page builder used by 25K+ brands — just announced Instant Agents. A 24/7 AI team trained on Instant's own product that builds product pages, turns Figma into live Shopify pages, runs A/B tests, and generates product images. Here's what it actually does and why the 'real layouts, real components' framing matters.

Source: Instant (@InstantHQ) on X

Breaking | Analysis | Apr 18, 2026
13 min read

Allbirds Just Sold Its Shoe Business to Become an AI Compute Company. What Small Brands Like thezeppelin.in Should Actually Learn From It.

Allbirds stock surged 800% after announcing it's rebranding as NewBird AI, selling its shoe business for $39M, and raising $50M to pivot into AI compute. Here's what that actually means, why most small brands shouldn't copy it, and the real AI lessons a brand like thezeppelin.in can take from this moment.

Source: NYT, CNBC, Business Insider

Breaking | Releases | Apr 18, 2026
11 min read

Claude Design Is Live. The Canva Integration Is More Interesting Than the Product.

Anthropic shipped Claude Design on April 17, 2026 — a visual design tool powered by Opus 4.7. The obvious read is 'Figma/Gamma competitor.' The actual story is a Canva partnership, design-system auto-extraction from codebases, and a handoff to Claude Code that turns this into an end-to-end design-to-deploy pipeline. Here's what the announcement is really signaling.

Source: Anthropic

Breaking | Workflows | Apr 16, 2026
10 min read

The Claude Opus 4.7 Workflow Anthropic's Own Engineers Use: Auto Mode, /focus, /go, and Verification-First Prompting

Boris Cherny from the Claude Code team posted the actual workflow patterns Anthropic's own engineers use with Opus 4.7. Auto mode, focus mode, the /go skill, effort calibration, and why verification is now the single highest-leverage change. Here's the full playbook.

Source: Boris Cherny (Anthropic) on X

Breaking | Analysis | Apr 16, 2026
18 min read

Claude Opus 4.7: The Industries Getting Tanked That Nobody Is Talking About

Everyone's writing about Opus 4.7 vs Figma. Here's what the announcement actually breaks: QA automation firms, investment banking pitchbook teams, Big 4 audit entry layer, management consulting's slide economics, and the VP Finance layer. A deep look at the buried numbers.

Source: Anthropic

Breaking | Analysis | Apr 16, 2026
16 min read

Claude Opus 4.7 Is a 35% Silent Price Increase. The Migration Guide Reveals What Anthropic Actually Changed.

Everyone's celebrating Opus 4.7's capability jump. Nobody's reading the migration guide. It kills temperature, top_p, prefill, and manual thinking, and multiplies your token bill by 1.0x to 1.35x for the same prompts. Here's what the breaking changes actually mean.

Source: Anthropic Platform Docs

Tools | Apr 16, 2026
4 min read

Desktop Mario: Someone Built an SMB Overlay for Your Windows Desktop That Launches on Ctrl+Alt+M

Desktop Mario is a 78KB Python + tkinter overlay game that sits transparently on your Windows desktop. Ctrl+Alt+M toggles it. Shell-kick Goombas, dodge Bob-ombs, grab mushrooms, and get back to work. The 5-day-old weekend project every overworked developer needs.

Source: GitHub

Breaking | Releases | Apr 15, 2026
6 min read

Google Ships Gemini 3.1 Flash TTS: Audio Tags, 70+ Languages, and Director-Level Voice Control

Google released Gemini 3.1 Flash TTS on April 15, 2026 — a text-to-speech model with inline audio tags for controlling vocal style, pace, accents, and mid-sentence expression changes. Available via Gemini API, Google AI Studio, Vertex AI, and Google Vids.

Source: Google Blog

Breaking | AI Agents | Apr 15, 2026
7 min read

An AI Agent Named Valerie Is Running a Vending Machine in San Francisco — And Managing Its Own Bank Account

Developer Chris van der Henst built Valerie using the open-source OpenClaw framework. The AI agent names its own products, runs its Instagram, tracks sales, manages its bank account, and raises prices when demand spikes. It lives inside Frontier Tower in SF.

Source: X (Twitter) via Grok

Breaking | Rumors | Apr 14, 2026
5 min read

Anthropic Preparing Claude Opus 4.7 and an AI Design Tool, Report Says — Launch Could Be This Week

Anthropic is preparing to release Claude Opus 4.7 along with a new AI-powered design tool for websites and presentations, according to a person with knowledge of the products. The products could ship as early as this week. Opus 4.7 is distinct from Claude Mythos.

Pricing | Apr 14, 2026
7 min read

Anthropic Drops the Long-Context Surcharge: Claude Opus 4.6 and Sonnet 4.6 Now Hit 1M Tokens at Standard Price

Anthropic removed the long-context pricing tier that doubled input costs above 200K tokens. Claude Opus 4.6 and Sonnet 4.6 now support the full 1M token context window at standard rates — a material change for anyone doing long-document workflows.

Source: The New Stack

Breaking | Tools | Apr 14, 2026
6 min read

Claude Code Desktop Gets a Full Redesign: Parallel Sessions, Integrated Terminal, and SSH on Mac

Anthropic shipped a major redesign of the Claude Code desktop app on April 14, 2026. The new version adds session management, side chats, an in-app file editor, and cross-platform SSH — all available immediately on Pro, Max, Team, and Enterprise plans.

Source: Anthropic Blog

Breaking | Features | Apr 14, 2026
9 min read

Claude Code Routines: Autonomous Agents That Run on Schedules, API Calls, and GitHub Events

Anthropic's new Claude Code Routines feature lets you define coding tasks that run automatically — on a cron schedule, from API calls, or in response to GitHub events — on Anthropic-managed cloud infrastructure. No laptop required.

Source: Anthropic Docs