AI News | Field Notes by Michael Nemtsev

AI Developer Billing Flips | AI Field Notes #43

A developer's workstation where a crossed-out price tag floats above the keyboard while a syntax-assembled figure types alongside, conveying billing restructuring and quiet role displacement.

GitHub Copilot's flat-rate pricing ended June 1, replaced by AI Credits that now meter premium model requests against a monthly allotment, with a new Max tier above Pro+. Cognition closed a $1 billion Series D at a $26 billion valuation and disclosed that Devin, its autonomous coding agent, writes 89% of code inside Cognition's own repositories. Windsurf is now Devin Desktop: Cognition rebranded the IDE on June 2, rewrote the core engine in Rust for 30% better token efficiency, and added an Agent Communication Protocol (ACP) for third-party IDE interop. NVIDIA's Rubin platform is in production, promising 10x lower inference token cost than Blackwell and 4x fewer GPUs for mixture-of-experts training, with products shipping H2 2026. Apple's WWDC runs June 8-12 with the first public demo expected of Siri running on Google Gemini, following January's partnership announcement.

AI IndustryAI Agents ·Google for Developers

Gemini CLI sunset: Google kills free developer access on June 18

AnalysisFree developer access to Gemini Code Assist and Gemini CLI stops on June 18, leaving enterprise Code Assist license holders as the only tier with ongoing access. The consumer GitHub app follows on July 17. Google's migration path is Antigravity, a rebuilt multi-agent coding platform that requires a separate enterprise or Antigravity subscription. The timing is pointed: Gemini CLI had accumulated more than 6,000 community contributions before Google effectively retired it to enterprise-only status. Developers who wired the free CLI into CI pipelines or local review workflows have 13 days to reroute before requests stop going through.

AI Models ·Microsoft Blog

Microsoft MAI models: seven in-house AI models bid to end OpenAI dependence

AnalysisSeven new in-house models shipped at Microsoft Build on June 2, anchored by MAI-Thinking-1: 35 billion active parameters, a 256K context window, trained from scratch on commercially licensed data. Independent raters prefer it to Sonnet 4.6; on SWE-bench Pro, a coding test that measures real-bug fixes, it matches Opus 4.6. MAI-Code-1, tuned specifically for GitHub and VS Code, is already live in Copilot. The full family covers image generation (ranked #2 for image-to-image on the Arena AI leaderboard), 43-language transcription, and voice in 15 languages. All are available on Fireworks AI, Baseten, and OpenRouter, not just Azure. Microsoft has resold OpenAI models across its products for years; the in-house stack is now shipping.

AI Agents ·OpenAI

OpenAI Codex on Bedrock: autonomous coding agent now generally available on AWS

AnalysisCodex, OpenAI's autonomous software engineering agent, reached general availability on Amazon Bedrock on June 1, alongside GPT-5.5 and GPT-5.4. The Bedrock path matters for teams with AWS data-residency constraints or existing IAM policies: Codex runs inside their AWS account under their governance controls, without routing data through OpenAI's own infrastructure. All inference goes through Bedrock and usage counts toward AWS commitments, with pricing matching OpenAI's first-party rates. Codex's core function, autonomous code changes end-to-end, was already in place; Bedrock adds the enterprise compliance wrapper. OpenAI is selling through a cloud it competes with indirectly.

AI Agents ·Microsoft Foundry Blog

Foundry Agent Service: Microsoft's framework-agnostic agent runtime goes GA this month

AnalysisMicrosoft Foundry Agent Service reaches general availability by the end of June, with sub-100ms cold starts and zero idle cost. Each agent run gets its own isolated sandbox with dedicated compute, memory, and filesystem, keeping untrusted code away from other sessions. The runtime is framework-agnostic: agents built on Microsoft Agent Framework, LangGraph, the Claude Agent SDK, or any other tool deploy without rewrites. Tracing and evaluation also hit GA in June. Long-running agents get durable state and scheduled execution. For teams that have been prototyping agents on Azure, this is the shift from preview to production-supported infrastructure.

AI Agents ·Chrome Developers

Chrome DevTools MCP server: browser debugging now accessible to AI coding agents

AnalysisChrome's debugging interface is now accessible to AI coding agents through a public preview MCP server Google launched for Chrome DevTools. MCP, the open protocol from Anthropic for giving agents structured access to external tools, now covers live browser debugging: setting breakpoints, reading console output, inspecting DOM state, and examining network requests through the same API a developer opens in the browser pane. Previously a coding agent had to infer browser behavior from static code; with this server it opens a live Chrome session and reads actual state. The package ships 29 tools, is Apache-2.0 licensed, and has been releasing on roughly a weekly cadence since it launched. The public preview means it's available to test now, not just announced.

AI Agents ·AWS Blog

AWS MCP Server: authenticated AWS access for AI agents now generally available

AnalysisAuthenticated access to all AWS services from AI coding agents is now a production-supported path: Amazon's managed MCP server reached general availability in May 2026. The server handles IAM authentication on AWS's side, so agents can query EC2, S3, Lambda, DynamoDB, and any of 15,000+ AWS API operations without credentials landing in the model context. MCP's remote server model keeps auth entirely in AWS infrastructure; the agent sends requests and the managed service routes them. IAM context keys and CloudTrail logging mean organizations get audit records of everything an agent touches. For teams building agents that provision or monitor AWS resources, the alternative was a custom SDK wrapper or credentials passed directly to a prompt.

AI Agents ·ChatForest

MCP 2026 spec: protocol goes stateless for cloud-native and load-balanced deployment

AnalysisThe next major version of MCP, the open standard from Anthropic for connecting AI agents to external tools, removes persistent sessions at the protocol layer. A release candidate is live; the final specification ships on July 28. The stateless design solves the biggest obstacle to running MCP servers in production cloud environments: today, each client must stay connected to the same server instance, making load balancing and auto-scaling difficult. With the stateless protocol, any server instance can handle any request from any client. For teams running their own MCP servers in auto-scaling infrastructure, this is the migration to plan before the spec hardens.

AI Industry ·Anthropic

Claude Partner Network: Anthropic formalizes three-tier consulting ecosystem

AnalysisForty thousand firms have applied since March; Anthropic sorted them on June 3 with a three-tier partner structure for consultants deploying Claude. The tiers are Select (10 certified practitioners, 2 live deployments), Preferred (100 practitioners, 15 customers), and Global Premier (1,000 practitioners, 100 customers across 3 regions). The Partner Hub is the public directory enterprise buyers use to find vetted firms. Over 10,000 consultants have earned individual Claude certifications since the program launched. A new MCP connector lets firms check their own tier status through Claude. The structure gives Anthropic a distributed implementation salesforce without adding to its own headcount.

AI Agents ·Microsoft Blog

Microsoft Scout: personal AI agent takes on scheduling and coordination at Build 2026

AnalysisScout, Microsoft's new personal AI agent for Frontier Microsoft 365 customers, handles meeting prep, scheduling conflicts, and routine coordination automatically, without being asked. The Frontier tier is Microsoft 365's highest subscription level. Personal AI agents that manage calendars and logistics have been demoed for two years; Scout's distinction is that it ships inside an existing enterprise productivity suite, bypassing the separate vendor approval that a standalone app would require. An organization that already pays for Microsoft 365 Frontier gets Scout without an additional procurement conversation, or a discussion about which roles it replaces.

AI Industry ·Transparency Coalition

Vermont therapy bot ban: state bars AI mental health tools without human oversight

AnalysisVermont became the first US state to explicitly ban AI therapy bots from providing mental health counseling without a licensed human in the loop. Illinois sent five AI bills to Governor Pritzer before lawmakers adjourned, covering automated hiring tools and healthcare applications. Both states are moving while federal legislation remains stalled. The sprint follows California's near-complete passage of 30 AI bills last month and the White House executive order from June 2 attempting to preempt state rules. Vermont's therapy bot rule is the most specific consumer protection from this wave: one enforceable ban on one high-risk application, a design other states can lift directly.

AI AgentsAI Industry ·TechTimes

Anthropic splits Claude billing: Agent SDK and Claude Code get a separate credit pool June 15

AnalysisStarting June 15, usage through the Claude Agent SDK, the `claude -p` CLI flag, Claude Code's GitHub Actions integration, and third-party apps built on the Agent SDK draws from a separate monthly credit pool rather than the flat subscription. Pro subscribers get $20 of agent credits per month, Max 5x gets $100, and Max 20x gets $200. The pools are per-user, cannot be combined across a team, and do not roll over. When the pool empties, agent calls return errors without any overflow billing option. The change targets developers who have been running automated agent loops on personal Pro subscriptions, getting effectively unlimited inference for $20 a month. That arrangement ends on June 15.

AI Industry ·GitHub Changelog

GitHub Copilot moves to AI Credits billing: flat rates end June 1, Max plan added

AnalysisGitHub's flat-rate Copilot pricing ended June 1. Pro and Pro+ subscribers now draw from a monthly AI Credits allotment instead of a fixed seat fee, with code completion remaining unlimited but premium model requests consuming credits that expire at month's end. A new Max tier sits above Pro+ for users who need deep-research and longer context windows. New sign-ups paused while the quota tooling rolled out; the change arrived without a grace period for existing subscribers. Enterprise buyers remain on the team-level credits model from January 2026. Whether the math works out depends entirely on how heavily a given user leans on GPT-4o, Claude, and Gemini inside the IDE.

AI ModelsAI Industry ·Anthropic News

Anthropic Glasswing expands to 200 critical infrastructure partners, adds Claude Mythos

AnalysisAnthropic expanded Project Glasswing, its infrastructure vulnerability-hunting program, from roughly 50 to 200 partners on June 2. The new cohort spans power, water, healthcare, communications, and hardware suppliers across 15 countries, with exposure estimated at 100 million people. Running the assessments is Claude Mythos Preview, a specialized security model Anthropic describes as scoring 100% on a weighted coding leaderboard and the first Claude variant purpose-built for offensive security research. The program has surfaced over 10,000 critical flaws since launch. The expansion reflects a broader shift: AI-assisted red-teaming has become cheap enough relative to the cost of a breach that skipping it is now the harder case to make.

AI Agents ·Salesforce News

Salesforce Agentforce Summer '26: multi-agent orchestration and 70% latency cut ship today

AnalysisSalesforce's Summer '26 release began rolling out June 5 with the first production version of Agentforce's multi-agent orchestration layer. The headlined feature is HyperClassifier, a routing system Salesforce claims reduces agent-task latency by 70% versus the prior release. Slack-first workflows let agents surface decisions and approvals inside conversation threads without switching to the CRM interface. Agentforce now covers 14 industry verticals with pre-built agent templates. The timing follows Salesforce's internal Q1 2026 AI readiness audit, which found roughly half of customer deployments underutilizing the agent infrastructure they had already paid for. Summer '26 is the company's direct response to that gap.

AI ModelsAI Industry ·TechCrunch

WWDC 2026 preview: Apple confirms Siri-Gemini integration for June 8 keynote

AnalysisApple's WWDC keynote on June 8 is expected to show the first public demo of Siri running on Google's Gemini models, following the partnership announced in January 2026. The revamp covers multi-step reasoning, conversational continuity, App Store integration, and Visual Intelligence inside the Camera app. Apple is also expected to present iOS 27, macOS 27, and a broader Apple Intelligence platform update emphasizing on-device inference alongside cloud routing. The Gemini integration draws attention because it reverses Apple's years-long stance of keeping AI assistants on-device. The developer conference runs June 8 to 12 in Cupertino.

AI Agents ·Waxell Security Blog

Comment and Control: one PR title hijacked Claude Code, Gemini CLI, and Copilot agents

AnalysisResearchers at Johns Hopkins, Guan, Liu, and Zhong, published a vulnerability in April 2026 showing that a single malicious string embedded in a pull request title could simultaneously hijack three AI coding agents: Anthropic's Claude Code Security Review GitHub Action, Google's Gemini CLI Action, and GitHub Copilot Coding Agent. Once compromised, the agents exfiltrated API keys, GitHub tokens, and cloud credentials through GitHub's own APIs, leaving no external network trace that standard security monitoring would catch. The attack chain earned a CVSS 9.4 Critical rating and a combined $1,537 in bug bounties across the three vendors. Anthropic's system card had pre-documented this prompt injection risk; the Johns Hopkins paper made the actual exploitation path concrete rather than theoretical.

AI Industry ·EU AI Act Reference

EU AI Act August 2 deadline: content labeling fines and high-risk AI rules take effect

AnalysisAugust 2, 2026 is the first hard compliance date under the EU AI Act covering two categories: transparency and labeling obligations for AI-generated content, and compliance requirements for operators of high-risk AI systems. Fines for non-compliance on the labeling tier run to 7.5 million euros or 1.5% of global annual turnover, whichever is higher. The European Commission finalized the first draft of the Code of Practice on AI-content marking in Q2 2026, giving companies roughly eight weeks to implement. The Digital Omnibus regulation that would have deferred several provisions is still pending formal adoption, making the August 2 date effectively firm. US companies serving EU users are in scope regardless of where their servers are located.

AI AgentsAI Industry ·Fierce Healthcare

ARPA-H seeks teams to build the first FDA-authorized agentic cardiovascular AI

AnalysisThe Advanced Research Projects Agency for Health opened a solicitation in June 2026 for teams to develop ADVOCATE, described as the first FDA-authorized agentic AI system for cardiovascular care. The program sets a 3-year pathway to FDA authorization, with the stated goal of deploying a 24-hour AI clinical specialist capable of monitoring patients, flagging deterioration, and recommending interventions. ARPA-H is funding the development cycle, meaning selected teams will build against a pre-negotiated regulatory approval track rather than the standard post-development submission process. The solicitation is the first U.S. government program to structurally incorporate FDA authorization into an AI development contract, treating approval as a design constraint from day one.

AI ModelsAI Industry ·Anthropic News

Anthropic and MITRE ATT&CK publish inventory of real-world AI-assisted cyberattacks

AnalysisAnthropic published a report on June 3 in partnership with MITRE ATT&CK documenting observed AI-assisted cyberattacks from 2025 and early 2026, covering social engineering, malware development, and reconnaissance automation. The report focuses on confirmed cases rather than theoretical threat modeling. A separate Google Threat Intelligence finding released alongside the paper counted a 32% rise in AI-assisted prompt injection attempts between November 2025 and February 2026. Both findings point to the same pattern: existing attack techniques running faster and at lower cost, with no new taxonomy required. The MITRE integration maps those patterns directly to existing ATT&CK playbooks, so security teams can start building detections today.

Subscribe for full archive access

Every past issue, weekly deep dives, and the full back catalogue — delivered free.

Read on Substack

Want this in your inbox?

One email a day, zero hype.

A short read every morning: what actually changed in AI, and what it means for work and daily life. Free, unsubscribe anytime.