Day 14 — OpenAI tapes out Jalapeño, Codex cuts v0.142.x stable
— Gemini grows native eyes, Anthropic's flagships sit dark on Liccardo's deadline.
Day fourteen of the Fable 5/ Mythos 5 freeze, and what actually ships on the day Rep. Sam Liccardo's Jun 26 bipartisan-letter deadline for Commerce to explain the export-control directive lapses is the single largest OpenAI infrastructure move of the year, the harness train cuts its first stable in twelve days, and Google turns computer use into a default tool inside its enterprise flagship. OpenAI and Broadcom unveil Jalapeño — an LLM-optimised inference ASIC, the first chip in a multi-generation compute platform the two companies are now building together — taped out from initial design in nine months with engineering samples already running GPT-5.3-Codex-Spark in the lab at production target frequency and power, with initial deployment scheduled end-2026. Per OpenAI's announcement page, the TechCrunch exclusive and the Broadcom investor release, the cycle is presented as the fastest high-performance ASIC bring-up on the public record and the chip is rated substantially better performance per watt than the current state of the art. On the harness rim, openai/codex tags rust-v0.142.1 at Thu Jun 25 00:36 UTC and rust-v0.142.2 at Thu Jun 25 07:32 UTC — the first stable patches off the v0.142.0 cut of Mon Jun 22, ending the twelve-day stable freeze that had run alongside the v0.143 alpha train; v0.142.2 flips MCP tool-search on by default, macOS authentication clients now honour system proxy / PAC / WPAD, and remote stdio MCP servers accept absolute working directories in the remote platform's path format. The v0.143 pre-release train does not pause: alpha.16 (03:41 UTC) through alpha.25 (21:55 UTC) stack on Jun 25 — ten alphas in ~22 hours, twenty-five alphas now sitting in the v0.143 line, with v0.143 stable still uncut. anthropics/claude-code v2.1.193 ships the day's matching governance lift: autoMode.classifyAllShell routes every Bash/PowerShell command through the auto-mode classifier (not just heuristically risky ones), claude_code.assistant_response lands as a first-class OpenTelemetry log event with redaction controls, live file-path autocomplete appears in bash mode, and a startup notice now flags MCP servers that require authentication. Google turns computer use — the screen-see-click-type loop until now living in a standalone Gemini 2.5 model — into a native built-in tool inside Gemini 3.5 Flash, the same model developers already call for function-calling, Search grounding and Maps; on OSWorld-Verified the new build posts 78.4 against GPT-5.5's 78.7 at $1.50 / $9 per M input/output versus GPT-5.5's $5 / $30, with adversarial prompt-injection training shipped alongside. On the flagship-freeze side of the ledger, Day 14 arrives with the Liccardo / Obernolte / Lieu / Franklin Jun 18 letter's Jun 26 deadline for Commerce to file written legal authorities, technical evaluations and restoration criteria — and with the Jun 26 Polymarket restoration contract pinned in the mid-teens as that book closes; API calls to claude-fable-5 and claude-mythos-5 continue to return errors. And the funding tape stacks the year's largest single day of vertical-AI-agent capital: Assort Health closes a $120M Series C led by Menlo Ventures at a $1.2B valuation; Taktile takes a $110M Series C led by Goldman Sachs Alternatives for the agentic decision layer that banks and insurers route high-stakes calls through; Runlayer closes $30M Series A led by Felicis (Vinod Khosla publicly wanted "every available dollar"); Coval takes $28M Series A led by Norwest for the simulation-and-evals layer that Zoom and Deepgram run their voice agents against. Throughline: on the day the flagship-availability negotiation is being run by lawyers, the infrastructure negotiation is being run by a tape-out, a stable, an OTel log event and a built-in computer-use tool — four production-grade moves that compound without asking Commerce for permission.
OpenAI tapes out its own silicon — Jalapeño, the first chip in a multi-generation compute platform with Broadcom, lands as the largest single OpenAI infrastructure move of the year and the fastest high-performance ASIC bring-up on the public record
OpenAI and Broadcom unveil Jalapeño on Wed Jun 24 — OpenAI's first LLM-optimised inference ASIC, taped out from initial design in nine months, with engineering samples already running ML workloads in the lab at production target frequency and power, including GPT-5.3-Codex-Spark; rated substantially better performance per watt than current state-of-the-art; first deployment scheduled end-2026 and presented as chip 1 of a multi-generation compute platform OpenAI and Broadcom are now building together
Jun 24The single largest OpenAI infrastructure move of the year, and the first time the lab has publicly owned a silicon design. Per the OpenAI announcement page, the Broadcom investor release, TechCrunch's exclusive, CNBC's on-the-record file, VentureBeat's infrastructure deep-read and the Tom's Hardware tape-out note, the architecture is purpose-built for inference: kernels, memory movement, networking and serving patterns are tuned around the workloads frontier serving actually runs (long-context KV traffic, tool-call control flow, multi-agent fan-out), not a general training-and- inference ASIC. The nine-month-design-to-tape-out cadence is the eye-catching number — Broadcom's release frames it as the fastest reticle-scale high-performance ASIC bring-up on the public record, accelerated by OpenAI's own models being used inside the design loop. Two reads. (1) The multi-generation platform framing is the clearest signal that Jalapeño is the first step of a roadmap, not a hedge — Broadcom says the partnership is now sized to ship gigawatt-scale data centres with Microsoft and other partners beginning in 2026, the same window the Jalapeño first deployment is scheduled for. The inference-tier capacity question that has gated GPT-5.6, the Codex background-agent fleet and every OpenAI consumer surface since Apr now has a first-party answer in the timeline, even if it doesn't deliver wafers tomorrow. (2) The GPT-5.3-Codex-Spark-already-running detail is the under-the-radar tell: OpenAI is not framing Jalapeño as a future-tense bet — engineering samples are running production-class model code at production-target frequency and power in the lab, and the model name picked for the demo is a Codex-tuned Spark variant, not a flagship. The harness layer is the surface getting validated.
The Anthropic flagships sit dark on Day 14 — Liccardo's bipartisan Jun-26 deadline for Commerce to explain the export-control directive lapses today, the Polymarket June-26 restoration contract closes in the mid-teens, and Anthropic's API still returns errors on claude-fable-5 and claude-mythos-5
Update — Day 14 of the Fable 5 / Mythos 5 freeze: Reps. Sam Liccardo (D-CA), Jay Obernolte (R-CA), Ted Lieu (D-CA) and Scott Franklin (R-FL) had set Jun 26 as the deadline for Commerce Secretary Howard Lutnick to file in writing the legal authorities, technical evaluations, review process and restoration criteria for the Jun 12 directive — the deadline lapses today; Anthropic is publicly running the talks through co-founder Tom Brown and policy chief Sarah Heck rather than CEO Dario Amodei; the Jun-26 Polymarket restoration contract closes pinned in the mid-teens and API calls to claude-fable-5 and claude-mythos-5 continue to return errors
Jun 26The first congressional deadline of the freeze meets the first Polymarket restoration date. Per the Liccardo office's Jun 18 bipartisan letter to Commerce Secretary Howard Lutnick — co-signed by Obernolte, Lieu and Franklin — the asks are specific: (a) the legal authority being used (the letter formally names 14 C.F.R. § 744.22(b)'s "is informed" mechanism under the Export Control Reform Act of 2018's "emerging and foundational technologies" clause), (b) the technical evaluation that drew the line, (c) the review process, and (d) the restoration / licensing criteria — all in writing by Jun 26. Two reads. (1) The letter-on-the-record shape turns Commerce's private-channel directive into a discoverable object: even if Lutnick answers under Touhy protections, the statutory hook argument now has a House-of-Representatives file attached to it, which is the same surface Anthropic's Jun-10 Senate Banking letter covered on Day 13 exited onto. (2) The Polymarket-Jun-26-mid-teens close is the cleanest external read on whether the Tom-Brown-replaces-Dario delegation move that lifted next-week's restoration contract is being priced as this-week: it isn't. The Jul-1 book is still where the weight sits. The flagship-availability negotiation is now publicly-receipted on both sides — industry-side in a Senate Banking hearing letter, congressional-side in a Commerce-direct demand letter — and Day 14 is the day the first deadline ticks past.
The harness train cuts its first stable in twelve days — openai/codex tags rust-v0.142.1 and rust-v0.142.2 stable on Jun 25, the v0.143 alpha train stacks ten more pre-releases the same day, and anthropics/claude-code v2.1.193 ships an auto-mode classifier for every shell command plus an OpenTelemetry assistant-response log event
openai/codex tags rust-v0.142.1 at Thu Jun 25 00:36 UTC and rust-v0.142.2 at Thu Jun 25 07:32 UTC — the first stable patches off the v0.142.0 cut of Mon Jun 22, ending the twelve-day stable freeze that had run alongside the v0.143 alpha train; v0.142.2 flips MCP tool-search on by default while preserving compatibility with older models, macOS authentication clients now honour system proxy / PAC / WPAD settings, remote stdio MCP servers accept absolute working directories in the remote platform's path format, expired Bedrock credentials produce actionable recovery guidance, and PowerShell commands containing certain executable AST regions now require approval; v0.142.1 ships opt-in Windows system proxy support for authentication (PAC, WPAD, static proxies, bypass rules)
Jun 25The harness team breaks the freeze on the stable channel while the alpha train keeps stacking. Per the openai/codex release pages: rust-v0.142.1 at 00:36 UTC is a tightly scoped Windows-proxy patch from @canvrno-oai via PR #26708; rust-v0.142.2 at 07:32 UTC is the full maintenance roll-up — MCP tool-search is now the default tool-discovery mode (with compat fall-back for older models), the macOS auth client joins the Windows one in respecting system proxy / PAC / WPAD, plugins gain a dark-mode logo hook through local manifests and remote catalogs, and apps can render richer safety-buffering UI using server-provided visibility and faster-model metadata. Two reads. (1) MCP-tool-search-default is the meaningful one for harness consumers: it flips the contract for how Codex walks an MCP server's surface area from list-everything-up-front to search-on-demand, which is the same architectural lift the MCP 2026-07-28 RC made at the protocol layer — Codex is now harness-default on the new shape. (2) The fact that two separate stables ship in seven hours after a twelve-day stable freeze is the clearest signal that the v0.143 alpha train is shielding a separate release branch from user-facing churn — Codex is running a two-track ship cadence now (stable patch + alpha pre-release), not a single-track one.
openai/codex stacks rust-v0.143.0-alpha.16 through alpha.25 on Thu Jun 25 — ten more v0.143 pre-release alphas in ~22 hours (03:41 UTC to 21:55 UTC), bringing the v0.143 alpha line to twenty-five tags off the v0.142.0 stable cut of Mon Jun 22, with v0.143 stable still uncut; the densest single-day pre-release cadence on the public Codex record and the clearest external read on the v0.143 surface area being held until the headline features land at once rather than dribbled
Jun 25The alpha half of the new two-track cadence stays at full speed. The v0.143.0-alpha.10 tag landed on Sat Jun 20, alpha.15 on Wed Jun 24 19:41 UTC, and the alpha.16–alpha.25 window now stacks ten more in ~22 hours — a per-hour pre-release rate above anything OpenAI's Rust harness has run on the public record, with v0.143 stable still uncut twelve days after the window opened. Two reads. (1) Holding v0.143 stable off the channel while v0.142.x takes the user-visible patch traffic is a release-engineering choice that only makes sense if the v0.143 ship has a single headline surface being gated — GPT-5.6-class backend routing, a model-tier swap, or a first-party computer-use mode on the harness side. Twenty-five alphas is not churn; it's integration. (2) The cadence itself is the productisation tell: the Mythos-class capability gap that Anthropic opened on Jun 9 with Fable 5's ship — the gap that took Mythos 5 out of public hands three days later — is being closed by harness-side velocity at OpenAI, not by a single GPT-5.6 ship.
anthropics/claude-code v2.1.193 lands the day's harness-governance lift — autoMode.classifyAllShell routes every Bash/PowerShell command through the auto-mode classifier instead of only heuristically risky ones, a first-class claude_code.assistant_response OpenTelemetry log event ships with redaction controls, live file-path autocomplete appears in bash mode, auto-mode denial reasons now surface in transcripts / toasts / the /permissions recent-denials list, and a startup notice flags MCP servers that require authentication so the OAuth handshake doesn't fail silently on first call
Jun 25The matching Claude Code move, shipped the same day Codex breaks its stable freeze. Per the anthropics/claude-code changelog entry for v2.1.193: autoMode.classifyAllShell is the headline — the setting forces every shell command through the classifier, removing the heuristic-on-suspicious bypass that has historically let deterministically-safe-looking commands skip the auto-mode model call. The claude_code.assistant_response OTel event finally puts a structured, redactable surface under model output in production telemetry, where prior versions left consumers reading raw transcripts to reconstruct what was said. Two reads. (1) classifyAllShell is a pricing move as much as a security move: the auto-mode classifier is a model call, and routing every Bash tool invocation through it changes the per-action cost shape of any agent the harness runs. Putting the option behind an opt-in setting is Anthropic letting enterprise policies eat the budget on their own terms. (2) The OTel log event paired with auto-mode denial reasons in /permissions recent-denials is the cleanest read on the compliance surface Anthropic is now building around the harness — the same surface the Claude Tag Slack rollout on Day 13 requires under enterprise admin scoping.
Gemini grows native eyes — Google promotes computer use from a standalone Gemini 2.5 model into a built-in tool inside Gemini 3.5 Flash, the same model developers already call for function-calling, Search grounding and Maps, with OSWorld-Verified 78.4 against GPT-5.5's 78.7 at a fifth of the input price and a third of the output
Google ships Gemini 3.5 Flash native computer use on Wed Jun 25 — the screen-see-click-type loop that until now lived in a standalone Gemini 2.5 model launched in Oct 2025 is now a built-in tool inside Gemini 3.5 Flash, exposed through the Gemini API and the Gemini Enterprise Agent Platform; on OSWorld-Verified the new build posts 78.4 against GPT-5.5's 78.7, priced at $1.50 per M input tokens and $9 per M output tokens versus GPT-5.5's $5 / $30; Google ships adversarial prompt-injection training specifically tuned for live computer-use sessions
Jun 25The cleanest read this week on where the computer-use surface consolidates. Per the blog.google announcement, the TechTimes file and the Cybersecurity News write-up, a single Gemini 3.5 Flash agent can now see a screen, look something up on Search, and interact with a map without routing requests between models — the same model that already exposes function-calling, Search grounding and Maps. Two reads. (1) The built-in-not-bolt-on shape is the meaningful product move: the Oct-2025 Gemini 2.5 computer-use model shipped as a standalone surface, which forced agent authors to multi-model around it; making computer use a default tool inside the same Flash model developers already call collapses the routing layer and changes the cost shape of a browser-driving agent at the API tier. (2) The 78.4 vs 78.7 OSWorld gap at $1.50/$9 vs $5/$30 is the under-the-radar tell: Google is comfortable shipping an approximately-parity computer-use benchmark at roughly a fifth of the GPT-5.5 input price and a third of the output price, which is the commodity-tier pricing move the OpenAI harness train and the Anthropic Slack/Tag surface have been ducking on the computer-use tier.
The vertical-AI-agent funding tape stacks the year's largest single 48 hours of capital — Assort Health closes $120M Series C at $1.2B, Taktile takes $110M Series C from Goldman Sachs Alternatives for high-stakes bank/insurer decisioning, Runlayer takes $30M Series A led by Felicis after Vinod Khosla publicly wanted "every available dollar", and Coval lands $28M Series A led by Norwest for voice-agent simulation and evals
Assort Health closes a $120M Series C on Thu Jun 25 led by Menlo Ventures at a $1.2B valuation, with participation from Lightspeed Venture Partners, Felicis, First Round Capital, Chemistry, Joe Montana, Tau Ventures and Quiet Capital — total disclosed capital now $222M+; the company runs voice / chat AI agents across the patient journey (scheduling, intake, referrals, forms, document processing, medication refills, payments), reports 190M patient voice interactions to date and a 20× revenue trajectory over the past 15 months; the financing moves the platform from voice-AI-patient-access into a stated "agentic operating system" for the patient journey
Jun 25The single largest vertical-AI-agent round of the week. Per the PRNewswire release, the FierceHealthcare file, HITConsultant's investor read and citybiz's valuation confirmation, the $1.2B mark is a unicorn-conversion step for a company whose 190M-patient-voice- interactions footprint is the cleanest production-scale reference for an enterprise voice-agent platform in U.S. healthcare. The "agentic operating system for the patient journey" framing signals the next product surface: expansion from patient access (scheduling/intake) into document processing, medication refills and payments — the same Box-Skills-class action-on-content rim covered Day 13, applied to a regulated vertical with 20× ARR growth as the financing thesis.
Taktile secures a $110M Series C on Wed Jun 24 led by Growth Equity at Goldman Sachs Alternatives, with participation from Balderton Capital, Index Ventures, Tiger Global, Y Combinator and Dig Ventures — total disclosed capital now $184M; the company runs an "agentic decision platform" that banks and insurers route high-stakes calls through (underwriting business loans, assessing claims, catching financial crime); one disclosed insurer customer is running multiple use cases with projected cost efficiencies over $90M in claims processing alone; the financing capitalises a Series-A-to-Series-C agent-decisioning bet for regulated finance
Jun 24Per the Taktile release, the Axios Pro Rata file, the FinTech.Global write-up and the Yahoo Finance deal-page, Goldman Sachs Alternatives leading the round is the meaningful detail: the $90M-projected-savings-at-one- insurer data point is the kind of single-customer ROI receipt growth-equity tickets ride on, and the round positions Taktile as the decision-layer incumbent for U.S./EMEA/LATAM banks and insurers ahead of the EU-AI-Act-Aug-2-cutover the spec community has been counting down to. The $110M-out-of-$184M-total shape is the read on how growth-tier investors are sizing the agentic-finance-decisions opportunity now that banks-and-insurers are the actual deploying customer, not an NDA-bound reference.
Runlayer closes a $30M Series A on Wed Jun 24 led by Felicis with participation from Khosla Ventures — Fortune's exclusive reports Vinod Khosla publicly wanted "every available dollar" of the round — bringing total capital to $42M; the product is an enterprise "control layer" for how employees and AI agents connect to tools, data and policies across a company: a corporate app-store / control-room surface where employees can use any AI tool against pre-approved guardrails while IT and security teams get visibility into what every agent is doing and what it costs
Jun 24The governance-layer bet of the week. Per Fortune's exclusive, the Techmeme roundup, the Finsmes file, the SaaSNews deal record and the Crypto Briefing write-up, Felicis leading with Khosla Ventures already in positions the round at the intersection of enterprise-AI-procurement and agent-governance-and-observability — the same surface OpenAI-MCP-OAuth-hardening and Anthropic's per-channel-data-and-tool-scope in Claude Tag are now both shipping on the model-and-harness side. The "every-available-dollar" quote from Khosla is the investor-side equivalent of the Claude-Tag-65%-of-product-team-code disclosure: a credibility deposit at the governance moment of the vertical, not the experimentation moment.
Coval closes a $28M Series A on Wed Jun 24 led by Norwest Venture Partners with participation from Base10 Partners, Twilio Ventures and Y Combinator — bringing total capital to ~$31M since 2024 launch; the platform runs simulation, observability, labeling and review for voice-AI and chat agents, with customers including Zoom and Deepgram — positioned as the "define safety and reliability" layer for autonomous voice agents before they fail on real customer calls
Jun 24The evals-and-simulation bet that closes the week's vertical-agent funding stack. Per the PRNewswire release, SiliconANGLE's production-pipeline read, TheNextWeb's product file and the Finsmes deal record, Coval's pitch is the production-side counter to Assort Health's patient-journey deployment: simulate the agent against thousands of synthetic customer-call branches before it ships, observe and label what happens in production, and review the regressions between releases. Zoom and Deepgram as named customers map cleanly to the voice-AI surface that OpenAI's GPT-Realtime-2 / Translate / Whisper GA earlier this month opened up at price/$M-token-tier. Norwest leading positions Coval as the evaluation-platform incumbent for the realtime-voice surface the way Snorkel's $100M Series D positioned it on the text-agent rim.
← Back to all Spotlight editions
