tandemly.ai
Briefing · JUN 3 2026

June 3, 2026

AI daily briefing

🎯 Top 3 Things to Know

1. Microsoft launched MAI-Thinking-1, its first in-house reasoning model, trained without distillation from any third party. The 35B sparse Mixture-of-Experts model with a 256K context window posts 97.0 on AIME 2025 and claims parity with Claude Opus 4.6 on SWE-Bench Pro. The story is less about the benchmarks and more about supply chain. Microsoft has run on OpenAI for years; the Foundry team is now publishing a model trained entirely on commercially licensed data and pitching it on price. Relevant for any team building on Azure or evaluating an Azure-resident reasoning model alongside Claude or GPT. Worth running side-by-side on a real reasoning workload before pricing the next contract. Microsoft Build 2026 coverage

2. Trump signed an executive order asking AI companies to hand over frontier models for up to 30 days of pre-release government testing. Participation is voluntary. The NSA will run a classified benchmark for "advanced cyber capabilities," and models that clear a threshold get tagged "covered frontier models." A new AI cybersecurity clearinghouse will pool vulnerabilities across vendors. The order does not preempt state AI law, which the December 2025 order tried to do and failed at. Watch which labs sign up. Anthropic and OpenAI have published responsible-scaling commitments that already overlap with this; whether they accept federal review on the government's timetable is the live question. White House action · NPR

3. NVIDIA opened its agent stack at GTC Taipei, releasing the Agent Toolkit with NemoClaw blueprints, OpenShell runtime, and Nemotron models. NemoClaw is generally available now; OpenShell, a secure runtime for personal agents, is in early preview with Microsoft, Canonical, and Red Hat integrating it. Nemotron 3 Ultra, a 550B parameter model with roughly 5x faster inference and 30% lower cost than its predecessor, ships June 4. The bet is that the bottleneck in enterprise agents has moved from model quality to the sandbox the agent runs in. Worth tracking whether OpenShell becomes the default agent runtime on Windows and Linux the way containerd became the default runtime under Kubernetes. NVIDIA newsroom

🚀 Frontier Models & Features

🔬 Research Worth Reading

🏢 Enterprise in the Wild

🛠️ Tooling & Ecosystem

⚖️ Policy & Regulation

📌 Watch List