DonvitoCodes AI Digest

Major Deals & Partnerships

OpenAI-AWS: The $38 billion bet on multi-cloud AI

$38,000,000,000

OPENAIAWS OpenAI signed a 7-year, $38 billion cloud deal with Amazon Web Services, marking one of the largest cloud commitments in history. This partnership includes expanded use of AWS's Trainium and Inferentia chips for training and inference workloads.

The deal underscores OpenAI's strategic shift toward multi-cloud strategies amid growing compute demands. Rather than relying solely on Microsoft's Azure infrastructure, OpenAI is diversifying its infrastructure footprint to ensure reliability and scale for its rapidly expanding user base, which recently hit 1 million business users—the fastest adoption ever.

AWS's custom silicon plays a central role. Trainium chips for training and Inferentia for inference offer cost advantages over traditional Nvidia GPUs, potentially reducing OpenAI's compute expenses while maintaining performance. This represents a significant validation of AWS's AI chip strategy and could accelerate adoption across the industry.

Major Deals • 4 min read

Apple pays Google $1B annually for Gemini integration

APPLEGOOGLE Apple agreed to pay Google $1 billion annually to integrate Gemini models into Siri, enhancing its AI capabilities for more natural conversations and complex tasks. This deal is part of Apple's broader Siri overhaul expected in early 2026.

The partnership comes amid regulatory scrutiny on Big Tech collaborations, but represents a pragmatic recognition that Apple's internal AI capabilities lag behind competitors. By leveraging Gemini's advanced language understanding, Apple aims to catch up in the assistant wars while maintaining its privacy-first brand positioning.

Regulatory Watch

Lambda-Microsoft multi-billion dollar infrastructure play

LAMBDAMICROSOFT Lambda secured a multi-billion-dollar agreement with Microsoft to supply AI infrastructure, focusing on GPU clusters for Azure-based training. This bolsters Microsoft's position in enterprise AI while Lambda expands beyond its startup customer base into enterprise-grade infrastructure provision.

New Models & Releases

Kimi K2 Thinking: Open-weights champion at $4.6M training cost

MOONSHOT AI Moonshot AI's Kimi K2 Thinking achieves new open-weights state-of-the-art with a score of 67 on Artificial Analysis. The 1-trillion parameter Mixture-of-Experts model runs on INT4 quantization with 256K context window and was trained for approximately $4.6 million—orders of magnitude cheaper than proprietary alternatives.

The model excels in agentic tasks, generating a full Space Invaders game at 15 tokens/second on M3 Ultra hardware. It outperforms proprietary models in complex coding scenarios, validating the open-weights approach for production workloads. This represents a significant milestone: enterprise-grade performance at startup-friendly economics.

Model Performance Comparison

Artificial Analysis benchmark scores

Major model releases: Grok-4, Codex Mini, Aardvark

XAI Grok-4 received major upgrades for prompt injection defense and real-time reasoning, transitioning from "meme model" to enterprise-ready with leading robustness benchmarks.

OPENAI Codex Mini got capacity upgrades, faster inference, and higher rate limits, making it production-viable with prioritized processing for coding tasks.

OPENAI Aardvark is OpenAI's new security-focused model for automated bug hunting, patching, and remediation, currently in beta for internal and partner use. It teases an upcoming reasoning model targeting IMO/IOI gold-level math and coding performance.

Edge AI breakthroughs: Meta's EdgeTAM runs 16 FPS on iPhone

META Meta's EdgeTAM runs 22x faster than SAM2 for real-time segmentation, achieving 16 FPS on iPhone 15 Pro Max. Released under Apache 2.0 license, this on-device AI boost enables new applications in AR, photo editing, and computer vision without cloud dependencies.

ALIBABA Qwen 3 and GOOGLE Gemma 3 lead edge optimization efforts, with Arm emphasizing efficient models for on-device AI deployment. Hardware improvements are enabling rapid deployment of sophisticated models directly on consumer devices.

Edge Computing

Hardware, Tools & Enterprise

Anthropic hits $13B valuation with $7.5B revenue run rate

ANTHROPIC Anthropic reached a $13 billion valuation, driven by Claude's explosive enterprise adoption. The company projects a $7.5 billion annualized revenue run rate by year-end 2025, fueled by deals in finance, healthcare, and software development sectors.

This valuation milestone validates the enterprise-first strategy, with Claude becoming the preferred AI assistant for developers and knowledge workers. The company's focus on safety, reliability, and long-context understanding has resonated with enterprise buyers willing to pay premium prices for production-ready AI.

AI Company Valuations 2025

Leading AI companies by valuation (billions USD)

XPeng IRON humanoid enters mass production

XPENG Mass production of XPeng's IRON humanoid robot is slated for late 2026, featuring customizable bodies with advanced AI for tasks like supermarket operations, warehouse logistics, and customer service. This marks the transition from prototype to commercial robotics at scale.

META Meta announced Artemis Chips—open-source AI hardware designed to reduce Nvidia reliance and democratize access to AI computing. Combined with their software stack, Meta aims to create an accessible ecosystem for AI development.

Developer tools: LangChain, Hugging Face, SkyPilot

Key tool releases this week: LangChain's Agent Builder CLI for rapid agent development; Hugging Face's 214-page LLM training playbook codifying best practices; SkyPilot for multi-cloud GPU orchestration; and Terminal-Bench 2.0 for comprehensive agent evaluation.

QUALCOMM Qualcomm highlighted on-device optimizations including AI accelerators in Snapdragon 8 Elite Gen 5, with partnerships like Memories.ai demonstrating practical applications of edge AI in consumer devices.

Complete Model Release Table

Model / Provider	Key Details	Notable Features
Kimi K2 Thinking (Moonshot AI)	1T-parameter MoE; open-weights; INT4 quantization; 256K context; ~$4.6M training cost	Artificial Analysis: 67 (new SOTA); generates Space Invaders at 15 tok/s on M3 Ultra
Grok-4 (xAI)	Major upgrades for prompt injection defense and real-time reasoning	Leads robustness benchmarks; enterprise-ready
Codex Mini (OpenAI)	Capacity upgrades, faster inference, higher rate limits	Production-viable; prioritized for coding tasks
Aardvark (OpenAI)	Security-focused for bug hunting, patching, remediation	Beta access; teases IMO/IOI gold-level reasoning model
EdgeTAM (Meta)	22x faster than SAM2 for real-time segmentation	16 FPS on iPhone 15 Pro Max; Apache 2.0 licensed
Emu3.5 (ByteDance)	Video generation with improved coherence	Part of wave including Ouro and MotionStream
MiniMax M2 & Agent (MiniMax)	Hybrid linear attention (KDA); new coding CLI	Targets Claude's coding market; strong agent workflows
Qwen 3 (Alibaba)	Edge-optimized deployment	Rapid expansion to edge devices alongside Gemma 3
Other Notable	Tongyi DeepResearch, Ming-Flash-Omni, LongCat-Video, Keep CALM, MASPRM	Research autonomy; video spatial reasoning (Cambrian-S: 30% MLLM gains)

Other Notable Developments

Google's enterprise AI push

Gemini integrations rolled out across Maps, Workspace, Gmail, Docs, Drive, and Finance. Vertex AI agent capabilities boosted for enterprise deployments. However, delays on Gemini 3 disappointed some observers. Google also convened experts on AI consciousness, echoing past controversies.

Enterprise adoption milestones

OpenAI hit 1 million business users (fastest ever). Salesforce reported 84% of tech leaders cite data issues hampering AI ambitions. Verizon-AWS partnership launched fiber network optimized for AI applications, addressing infrastructure bottlenecks.

Controversies and challenges

Studio Ghibli demanded OpenAI stop using its content for training. OpenAI faced backlash over U.S. government loan backstops for infrastructure. Cursor and Windsurf caught fine-tuning Chinese models (likely GLM) without proper attribution, raising transparency concerns.

Ethics Watch

Research highlights

Advances in self-improving LLMs (Multi-Agent Evolve), long-term memory beyond 1M tokens, and safety frameworks (OS-Sentinel for mobile agents). PewDiePie built a 10x4090 local AI lab, demonstrating consumer-grade infrastructure viability.

OpenAI signs historic $38B AWS deal as AI infrastructure wars escalate