Cast AI said its autonomous Kimchi Coding agent is the first to offer MiniMax M3, making it the default builder model in Kimchi's orchestration layer. Cast AI cited M3's 59% score on SWE-bench Pro and its MiniMax Sparse Attention architecture, which it says cuts per-token compute at one-million-token context to 1/20th of prior levels with 15x faster decoding. Access is rolling out via an Early Access program.
Your Daily AI Briefing
AI News Today
Looking for today's AI news? This is a fast, no-noise feed of the most important developments in artificial intelligence — covering new AI models, AI agents, research, chips and hardware, security, and how enterprises are putting AI to work. Filter by topic, search the headlines, and click through to the original source for the full story.
The latest dispatches, as of Jun 16, 2026
ACE Robotics' open Kairos world model tops embodied-AI benchmarks
ACE Robotics said its open-source Kairos world model ranked first among evaluated world models and vision-language-action systems across four global embodied-intelligence benchmarks — RoboTwin 2.0, LIBERO-Plus, WorldModelBench Robot and DreamGen — as of June 12. The company says Kairos leads on complex robotic manipulation, scene-level generalization, physical-world modeling and zero-shot transfer, and is openly available on GitHub, Hugging Face and ModelScope.
US government directive forces Anthropic to suspend Claude Fable 5 and Mythos 5
Anthropic launched Claude Fable 5 (a generally available, safety-tuned model) and the restricted Claude Mythos 5 on June 9, but said on June 12 it was suspending access to both after the US government issued an export control directive. Anthropic apologized for the disruption and said it was working to restore access; other Claude models such as Opus 4.8 remain available.
OpenAI backs EU code on AI-content transparency and provenance
OpenAI announced support for the European Commission's Code of Practice on Transparency of AI-Generated Content, an early step in implementing the EU AI Act. OpenAI pointed to its provenance work since 2024, including C2PA metadata in image tools and SynthID-style marking and detection, and said it will comply with the transparency requirements that apply to its products.
OpenAI to acquire Ona to run Codex agents in customer clouds
OpenAI said it will acquire Ona to bring secure cloud execution and orchestration into its Codex ecosystem, letting long-running agents operate inside an organization's own cloud while OpenAI provides the intelligence. The company says the deal expands Codex beyond a single device or session and is subject to customary closing conditions and regulatory approvals.
Cohere open-sources North Mini Code, its first agentic coding model
Cohere launched North Mini Code under an Apache 2.0 license — a 30B-total / 3B-active mixture-of-experts model with a 256K context window aimed at code generation, agentic software engineering, and terminal tasks. Cohere says it is the first of a new generation of models and is available on Hugging Face, the Cohere API, Model Vault and OpenRouter, running on a single H100 at FP8.
Google ships Gemini 3.5 Live Translate for real-time speech in 70+ languages
Google launched Gemini 3.5 Live Translate, an audio model that delivers near real-time speech-to-speech translation across more than 70 languages while preserving the speaker's intonation, pacing and pitch. It is rolling out to developers via the Gemini Live API and AI Studio, to enterprises in Google Meet, and to everyone through the Google Translate app on Android and iOS.
NVIDIA releases open 550B Nemotron 3 Ultra for long-running agents
NVIDIA released Nemotron 3 Ultra, a fully open 550B-parameter mixture-of-experts model with 55B active parameters, built to orchestrate complex, long-running agent workflows. It uses hybrid Mamba-Transformer layers and NVFP4 quantization that NVIDIA says delivers up to 5x higher throughput, with a single checkpoint that runs across Hopper, Blackwell and Ampere GPUs. Weights, data and recipes are open.
Google's Gemma 4 12B brings encoder-free multimodal AI to laptops
Google introduced Gemma 4 12B, a unified, encoder-free multimodal model that feeds vision and audio directly into the LLM backbone, with native audio inputs and a 256K context. Google says it nears the performance of its 26B MoE model at less than half the memory footprint and runs locally on laptops with 16GB of RAM, released under an Apache 2.0 license.
Anthropic confidentially files draft S-1 for an IPO
Anthropic said it confidentially submitted a draft S-1 registration statement to the US SEC for a proposed initial public offering, giving it the option to go public after the SEC completes its review. The number of shares and price have not been set, and the company said any offering will depend on market conditions.
NVIDIA launches Cosmos 3, an open foundation model for physical AI
NVIDIA launched Cosmos 3, an open world foundation model for physical AI built on a mixture-of-transformers architecture that combines vision reasoning, world generation and action prediction in one system. NVIDIA describes it as the first fully open omnimodel spanning text, image, video, ambient sound and action, available now as Cosmos 3 Super and Nano, with an Edge variant coming soon.
Cloud Security Alliance details two-wave AI developer supply-chain attack
The Cloud Security Alliance published a May 22 analysis of TeamPCP's Shai-Hulud/Megalodon campaign against AI developer infrastructure. CSA says Mini Shai-Hulud compromised 172 npm packages and 2 PyPI packages across 404 malicious versions, then Megalodon pushed 5,718 malicious commits to 5,561 GitHub repositories in under six hours, with persistence hooks targeting tools including Claude Code and Visual Studio Code.
OpenAI Codex named a Leader in enterprise AI coding agents
OpenAI said Codex was recognized as a Leader in Gartner's 2026 Magic Quadrant for Enterprise AI Coding Agents. The company says Codex is used by more than 4 million people each week, and highlighted enterprise controls including approval gates, RBAC, customizable policies, OS-level sandboxing, auditable workspace governance, IDE and CLI surfaces, SDKs, and cloud orchestration.
Virgin Atlantic says Codex speeds refactors and app testing
OpenAI published a Virgin Atlantic case study saying the airline used Codex to ship a revamped mobile app with near-complete unit test coverage and zero P1 defects at launch. Virgin Atlantic also reported 78% to 80% codebase size reductions on some legacy refactors and said work that once took two weeks can now take about 30 minutes to an hour.
AdventHealth deploys ChatGPT for Healthcare across clinical workflows
OpenAI detailed AdventHealth's deployment of ChatGPT Enterprise and ChatGPT for Healthcare across a hospital system operating in nine states. AdventHealth says the rollout targets administrative burden, utilization-management summaries, structured rationales, and operational workflows, with an 80% reduction in time spent on some administrative tasks and an emphasis on governance and measured adoption.
Hark raises $700M for a universal AI interface and hardware
TechCrunch reported that Hark, the AI lab founded by Figure AI and Archer founder Brett Adcock, raised a $700 million Series A at a $6 billion post-money valuation. Hark says it is building an agentic AI system as a universal interface for the digital world, expects to release multimodal models this summer, and plans custom hardware after that.
Microsoft Foundry Labs ships new open agentic stack and benchmarks
Microsoft Foundry Labs released a May roundup with SocialReasoning-Bench for measuring whether agents act in a user's best interest, plus an open end-to-end agentic stack made up of MagenticLite, MagenticBrain, and Fara 1.5. The stack emphasizes visible reasoning, browser and local-file workflows, sandboxed code execution, human approvals for critical actions, and small computer-use models built on Qwen 3.5.
NVIDIA Vera Rubin NVL72 and Jetson Thor win COMPUTEX AI awards
NVIDIA said its Vera Rubin NVL72 rack-scale AI supercomputer, Jetson Thor edge AI and robotics platform, and Alpamayo autonomous-vehicle platform won COMPUTEX 2026 Best Choice Awards. NVIDIA says Vera Rubin NVL72 is designed for agentic AI, reasoning, and long-context workloads, while Jetson Thor delivers up to 2,070 FP4 teraflops for physical AI and autonomous robots.
OpenAI model disproves long-standing discrete geometry conjecture
OpenAI reported that an internal general-purpose reasoning model disproved a central conjecture in the planar unit distance problem, producing an infinite family of constructions with polynomial improvement over the long-believed square-grid bound. OpenAI says external mathematicians checked the proof and wrote companion remarks, calling the result a milestone for AI-assisted mathematics.
Anthropic hires Andrej Karpathy for Claude pretraining research
OpenAI cofounder and former Tesla AI director Andrej Karpathy said he is joining Anthropic. CNBC reports Karpathy will be part of Anthropic's pretraining team, building a group focused on using Claude to accelerate the research that gives the company's models their core knowledge and capabilities.
Google brings AI agents and generative UI into Search
Google said AI Mode in Search now uses Gemini 3.5 Flash globally and introduced a redesigned AI-powered Search box. New Search agents will monitor the web in the background, send synthesized updates, help with booking tasks, and eventually generate custom interactive layouts, simulations, dashboards, and trackers with Antigravity-powered coding.
Google expands SynthID and Content Credentials verification
Google expanded AI-content verification across Search, Gemini, Chrome, Pixel, and Google Cloud, saying SynthID has watermarked more than 100 billion images and videos and 60,000 years of audio. OpenAI, Kakao, and ElevenLabs are adopting SynthID for more AI-generated content, while a new Google Cloud AI Content Detection API is launching with trusted partners.
Google launches Gemini Omni Flash for multimodal video generation
Google introduced Gemini Omni, a new model family that combines Gemini reasoning with generative media, beginning with video output. The first release, Gemini Omni Flash, can use text, images, video, and audio references to generate or conversationally edit videos, is rolling out to Google AI Plus, Pro, and Ultra subscribers through Gemini and Flow, and will come to developer and enterprise APIs in the coming weeks.
Google previews Gemini Spark as a 24/7 personal AI agent
Google announced Gemini Spark, a cloud-based personal agent powered by Gemini 3.5 and the Antigravity harness. Spark is designed to keep working after a laptop closes, integrate with Gmail, Docs, Slides, and other connected apps, ask before high-stakes actions, and roll out first to trusted testers before a U.S. beta for Google AI Ultra subscribers.
Google releases Gemini 3.5 Flash for agents and coding
At Google I/O 2026, Google introduced Gemini 3.5 as a model family focused on complex agentic workflows, starting with Gemini 3.5 Flash. Google says Flash is now available globally in the Gemini app, AI Mode in Search, Antigravity, the Gemini API, AI Studio, Android Studio, and Gemini Enterprise, with claimed gains on coding and agentic benchmarks plus 4x faster output than other frontier models.
Ocean emerges from stealth with $28M to fight AI phishing
Ocean, an agentic email-security startup founded by former Israeli cybersecurity researcher Shay Shwartz, emerged from stealth with $28 million in total funding led by Lightspeed Venture Partners. The company says AI has automated spear-phishing at much larger scale and that its small language model analyzes billions of emails each month for customers including Kayak, Kingston Technology, and Headspace.
Anthropic acquires Stainless to strengthen agent connectivity
Anthropic acquired Stainless, the SDK and MCP server tooling company that has generated official Anthropic SDKs since the API's early days. Stainless creates SDKs, CLIs, and MCP servers from API specs across TypeScript, Python, Go, Java, and more, and Anthropic says the deal will help Claude agents connect more reliably to external systems.
NVIDIA ships first Vera CPUs to top AI labs
NVIDIA delivered its first standalone Vera CPU systems to Anthropic, OpenAI, SpaceXAI, and Oracle Cloud Infrastructure, moving the agentic-AI processor from announcement to customer evaluation. Vera packs 88 NVIDIA-designed Olympus cores, 1.2TB/s of memory bandwidth, and 50% faster per-core performance for agent sandboxes, tool calls, orchestration, and long-context retrieval workloads.
Anthropic and Gates Foundation commit $200M to beneficial AI programs
Anthropic announced a four-year, $200 million partnership with the Gates Foundation spanning Claude usage credits, technical support, and grant funding. The work targets global health, life sciences, education, and economic mobility, including public health datasets, healthcare AI benchmarks, disease-modeling support, AI tools for neglected diseases, K-12 tutoring, and agricultural productivity applications.
Khosla backs Synthetic with $10M for autonomous AI bookkeeping
Synthetic, founded by former Bench Accounting CEO Ian Crosby, raised a $10 million seed round led by Khosla Ventures to pursue a fully autonomous AI bookkeeper for accrual-based financials. The startup plans to focus on AI and software companies first, while acknowledging that current foundation models still make bookkeeping mistakes and the product remains in the design phase.
Lovable backs Atech to bring vibe coding to hardware prototypes
Danish startup Atech raised an $800,000 pre-seed round with backing from Lovable, a16z scout fund, Sequoia Scout Fund, and Nordic Makers. Atech pairs hardware starter kits with an AI chatbot that turns natural-language prototype ideas into code for working hardware builds, aiming to reduce the engineering barrier for physical products.
OpenAI brings Codex to the ChatGPT mobile app
OpenAI rolled out Codex in preview on iOS and Android so users can follow active coding threads, review diffs and terminal output, approve actions, and redirect long-running agent work from a phone. The update also makes Remote SSH generally available, adds generally available Codex hooks, introduces programmatic access tokens for Business and Enterprise workspaces, and supports eligible HIPAA-compliant local Codex deployments.
OpenAI says two employee devices were hit by TanStack supply-chain attack
After malicious TanStack package versions spread through npm, OpenAI confirmed two employee devices were affected and that a limited subset of internal source-code repositories saw unauthorized credential access. The company said it found no evidence that user data, production systems, intellectual property, or software releases were compromised and began rotating signing certificates as a precaution.
OpenAI updates ChatGPT to better track risk in sensitive conversations
OpenAI detailed new safety updates that help ChatGPT recognize when self-harm, suicide, or harm-to-others risk emerges over time. The system uses short-lived, narrowly scoped safety summaries for rare high-risk cases and improved safe-response performance by 50% in long suicide and self-harm evaluations, 16% in harm-to-others scenarios, and 39% to 52% across multi-conversation GPT-5.5 Instant tests.
Twin Prime raises $10M to build frontier AI for defense and security
London-based Twin Prime landed a $10 million pre-seed round led by Expeditions to develop multimodal AI models for defense and security. The startup is building systems that reason across sensor modalities and compress perception-to-decision workflows for real-time threat response, with plans for a joint venture with European defense prime Theon.
Anthropic launches Claude for Small Business
Announced May 13, Claude for Small Business plugs directly into QuickBooks, PayPal, HubSpot, Canva, DocuSign, Google Workspace, and Microsoft 365. It ships with 15 ready-to-run agentic workflows spanning finance, ops, sales, marketing, HR, and customer service — including automated payroll planning, month-end reconciliation, campaign management, and invoice tracking.
Meta unveils four new MTIA chips for its AI data centers
Meta announced a new MTIA (Meta Training and Inference Accelerator) lineup. MTIA 300 is already deployed for training smaller ranking and recommendation models; MTIA 400, 450, and 500 are in development for generative AI inference and will launch by 2027.
NVIDIA launches Nemotron 3 Nano Omni multimodal model
Nemotron 3 Nano Omni is an open multimodal model unifying vision, audio, and language. NVIDIA reports up to 9× higher throughput than competing open models, targeting more efficient AI agents on commodity hardware.
NVIDIA partners with David Silver's Ineffable Intelligence
NVIDIA announced a collaboration with British AI startup Ineffable Intelligence, founded by former DeepMind RL lead David Silver, to develop systems that learn through reinforcement learning rather than human data. The work will run on NVIDIA's Grace Blackwell and Vera Rubin platforms.
NVIDIA releases Star Elastic: one checkpoint, three reasoning models
NVIDIA Research introduced Star Elastic, a post-training method that embeds nested 30B, 23B, and 12B reasoning submodels inside a single checkpoint with zero-shot slicing. Operators can pick a model size at inference time without retraining.
OpenAI releases GPT-5.5 ("Spud"), its most agentic model yet
Rolled out to paid ChatGPT and Codex users on May 13, GPT-5.5 is tuned for long-running agentic tasks with minimal prompting. API access will follow once additional security guardrails are in place. OpenAI did not publish a score for SWE-bench Verified, a benchmark on which Anthropic's Claude Mythos Preview currently leads at 93.9%.
Thinking Machines Lab loses key talent to Meta, OpenAI, and xAI
After founding employees crossed the one-year cliff and unlocked equity, Thinking Machines Lab saw a wave of departures. Meta reportedly recruited seven founding team members plus a star researcher with compensation packages worth hundreds of millions.
Google DeepMind reimagines the mouse pointer with Gemini
DeepMind unveiled an AI-enabled pointer powered by Gemini that understands on-screen visual context. Users can issue shorthand commands like "Fix this" or "Show me directions" without switching windows or writing long prompts.
Google publishes patterns for long-running enterprise agents
Google's Developers Blog detailed how to build pause-and-resume agents with the Agent Development Kit (ADK). The approach uses durable memory schemas and event-driven dormancy gates — instead of stateless chatbot patterns — to support multi-week workflows like HR onboarding without losing context.
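For readers who want a feel for the pause-and-resume idea, here is a minimal sketch in plain Python. It does not use the ADK or reproduce Google's code: `WorkflowState`, `STEPS`, `advance`, `save`, and `load` are hypothetical names invented for illustration, and a JSON file stands in for whatever durable store a real deployment would use. The point is only that the agent's progress is written out as a small schema, and that a step gated on an event stays dormant until that event arrives.

```python
import json
import tempfile
from dataclasses import asdict, dataclass, field
from pathlib import Path
from typing import List, Optional


@dataclass
class WorkflowState:
    """Hypothetical durable memory schema: all the agent needs to resume."""
    workflow_id: str
    step: int = 0
    waiting_for: Optional[str] = None  # event the dormancy gate is waiting on
    notes: List[str] = field(default_factory=list)


# Hypothetical onboarding steps: (description, event required before running).
STEPS = [
    ("send welcome packet", None),
    ("provision laptop", "paperwork_signed"),
    ("schedule orientation", "laptop_delivered"),
]


def save(state: WorkflowState, path: Path) -> None:
    """Persist the state so the process can exit while the workflow sleeps."""
    path.write_text(json.dumps(asdict(state)))


def load(path: Path) -> WorkflowState:
    """Rebuild the state after a restart, possibly days later."""
    return WorkflowState(**json.loads(path.read_text()))


def advance(state: WorkflowState, event: Optional[str] = None) -> WorkflowState:
    """Run steps until one is gated on an event that has not arrived."""
    if state.waiting_for is not None:
        if event != state.waiting_for:
            return state  # unrelated event: stay dormant
        state.waiting_for = None  # the awaited event arrived: gate opens
    while state.step < len(STEPS):
        description, gate = STEPS[state.step]
        if gate is not None and event != gate:
            state.waiting_for = gate  # go dormant until this event arrives
            return state
        state.notes.append(description)  # stand-in for doing the real work
        state.step += 1
        event = None  # one event opens at most one gate
    return state


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "hire-001.json"
        save(advance(WorkflowState("hire-001")), path)       # runs step 0, then sleeps
        save(advance(load(path), "paperwork_signed"), path)  # wakes, runs step 1
        print(load(path))
```

Because the state is reloaded from disk on every wake-up, nothing has to stay in memory between events, which is what lets a workflow span weeks. A production system would also need locking, retries, and an audit trail, none of which this sketch attempts.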
IBM debuts Red Hat AI Inference and OpenShift Virtualization on IBM Cloud
IBM announced two managed offerings on May 12: Red Hat AI Inference Service and Red Hat OpenShift Virtualization Service on IBM Cloud. Both are aimed at helping enterprises operationalize AI and run virtualized workloads at scale with built-in governance controls.
Microsoft's MDASH agentic security system tops CyberGym
Microsoft's new multi-model security system (codename MDASH) orchestrates 100+ specialized agents and posted an industry-leading 88.45% on the CyberGym benchmark. In the announcement, Microsoft says the system has already discovered 16 new vulnerabilities in Windows, including four critical RCE flaws.
OpenAI introduces "Daybreak" cyber platform
Announced May 12, Daybreak combines OpenAI's language models with Codex's agentic capabilities to automate vulnerability detection, patch validation, and secure software development inside enterprise security workflows. The launch puts OpenAI head-to-head with Anthropic's Mythos in enterprise cyber.
Power Apps MCP server adds closed-loop learning for agents
Microsoft introduced closed-loop learning on the Power Apps MCP server: user corrections automatically improve enterprise agent performance using memory-based optimization and a genetic-Pareto optimization step.
SAP and Anthropic bring Claude to SAP Business AI Platform
At SAP Sapphire, SAP and Anthropic announced plans to embed Claude across the Business AI Platform to advance the "Autonomous Enterprise." Claude will power agentic capabilities such as financial closing, employee leave questions, and supplier order management directly inside SAP systems.
SAP and NVIDIA co-define enterprise-grade agent execution
SAP and NVIDIA detailed a joint framework for secure, auditable, and governable AI agents built on NVIDIA OpenShell. The work focuses on the runtime controls enterprises need before pushing autonomous agents into production.
SAP unveils the Autonomous Enterprise with 50+ Joule Assistants
SAP introduced a unified Business AI Platform and Autonomous Suite, deploying more than 50 domain-specific Joule Assistants across finance, supply chain, and HR. Partnerships span Anthropic, AWS, Google Cloud, Microsoft, NVIDIA, and Palantir. SAP says its Autonomous Close Assistant can compress financial closing from weeks to days.
Microsoft research: AI agents still struggle with long workflows
A Microsoft study using the new DELEGATE-52 benchmark tested frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT-5.4) across 52 professional workflows. The team found models lose ~25% of document content over 20 interactions on average, with severe corruption in 80% of conditions. Only Python programming hit "ready" status at 98%+ accuracy.
OpenAI launches the "OpenAI Deployment Company"
OpenAI launched the OpenAI Deployment Company, a new entity dedicated to helping organizations build and deploy AI for mission-critical work. The Deployment Company starts with $4B in initial backing from 19 global investment firms and consultancies, and absorbs Tomoro to bring on roughly 150 Forward Deployed Engineers and Deployment Specialists.
Reader Questions
AI News — Frequently Asked Questions
How often is the AI news updated?
This page is curated and refreshed regularly with the latest artificial intelligence announcements, model releases, research, and enterprise rollouts. The "last updated" date always reflects our most recent refresh.
What kind of AI news do you cover?
We cover new AI model releases, AI agents, research breakthroughs, AI hardware and chips, AI security, enterprise AI adoption, and notable talent moves — focusing on the stories that matter most to founders, engineers, and technology leaders.
Where do these AI news stories come from?
Every story is compiled from public reporting and primary sources. Each card links directly to the original source so you can verify the details and read more.
How can I get AI news in my inbox?
Subscribe to the weekly DonvitoCodes AI newsletter to get the most important AI news, releases, and analysis delivered to your inbox every week.