tandemly.ai
Briefing · MAY 21 2026

AI daily briefing

🎯 Top 3 Things to Know

1. At I/O, Google launched Gemini 3.5 Flash, the Gemini Spark agent, and Omni, a world model. Gemini 3.5 Flash becomes the default model in the Gemini app, and Google is pitching it at roughly a third of the cost of comparable frontier models. Spark is a long-running personal agent that runs on Google Cloud VMs and reaches across Workspace and the open web. Omni is a video-and-physics world model. The combined message is that Google is willing to compete on price at the small end while pushing autonomy at the high end. Teams running multimodal pipelines should benchmark 3.5 Flash this week on cost at a fixed quality bar, especially for high-volume document and video workloads where a single model could replace a stitched-together pipeline. CNBC · Google Developers Blog

2. Anthropic shipped self-hosted sandboxes and MCP tunnels for Claude Managed Agents. Sandboxes move tool execution off Anthropic's infrastructure and onto a company's own environment, or a managed runner like Cloudflare, Daytona, Modal, or Vercel. The agent loop itself still runs on Anthropic. MCP tunnels open a single outbound, end-to-end encrypted connection from an internal network to the agent, so internal databases and ticketing systems become tools without a public endpoint. This addresses the most common reason agent pilots stall in regulated industries: the security team won't let execution or data leave the perimeter. It is worth re-pricing any agent project that died last quarter on a security review. Anthropic blog · InfoQ

3. The European Parliament and the Council reached a provisional deal to delay big chunks of the AI Act and add two new prohibitions. Use-based high-risk obligations (Annex III) slip from August 2026 to December 2027. Product-regulated high-risk obligations (Annex I) slip from August 2027 to August 2028. Member states get an extra year to stand up regulatory sandboxes. The deal also adds two outright bans: AI used to generate non-consensual intimate material and AI used to generate CSAM. The headline is timeline relief, but the new bans take effect on the original schedule, so anyone shipping image or video generation into the EU has work to do regardless of the delay. Council press release · Covington analysis
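
For teams acting on item 1, one way to make a cost-and-quality comparison concrete is to rank models by dollars spent per passing task on a fixed evaluation set, after filtering out anything below a minimum pass rate. The sketch below is a minimal illustration under stated assumptions: the `ModelRun` fields, the model names, the token counts, and the per-token prices are all hypothetical placeholders, not published Gemini pricing or benchmark results.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ModelRun:
    """One model's results on a fixed evaluation set (all values are placeholders)."""
    name: str
    input_tokens: int              # total prompt tokens consumed across the eval set
    output_tokens: int             # total completion tokens produced
    usd_per_million_input: float   # assumed list price, USD per 1M input tokens
    usd_per_million_output: float  # assumed list price, USD per 1M output tokens
    passed: int                    # tasks that met your own quality bar
    total: int                     # tasks attempted

    def cost_usd(self) -> float:
        """Total spend for the run at the assumed prices."""
        return (self.input_tokens * self.usd_per_million_input
                + self.output_tokens * self.usd_per_million_output) / 1_000_000

    def pass_rate(self) -> float:
        return self.passed / self.total

    def cost_per_pass(self) -> float:
        """Dollars spent per task that met the quality bar."""
        if self.passed == 0:
            return float("inf")
        return self.cost_usd() / self.passed


def rank_by_cost_per_pass(runs: List[ModelRun], min_pass_rate: float) -> List[ModelRun]:
    """Drop models below the quality floor, then sort cheapest-per-pass first."""
    eligible = [r for r in runs if r.pass_rate() >= min_pass_rate]
    return sorted(eligible, key=lambda r: r.cost_per_pass())


if __name__ == "__main__":
    # Hypothetical numbers purely for illustration.
    runs = [
        ModelRun("model_a", 2_000_000, 500_000, 1.0, 4.0, passed=80, total=100),
        ModelRun("model_b", 2_000_000, 500_000, 3.0, 12.0, passed=90, total=100),
    ]
    for run in rank_by_cost_per_pass(runs, min_pass_rate=0.75):
        print(f"{run.name}: pass rate {run.pass_rate():.0%}, "
              f"${run.cost_per_pass():.4f} per passing task")
```

Filtering on a quality floor first keeps a cheap model that fails most tasks from winning the ranking by default. The pass/fail judgment itself is whatever your team already uses to accept output on the workload in question.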

🚀 Frontier Models & Features

🔬 Research Worth Reading

🏢 Enterprise in the Wild

At Knowledge 2026, ServiceNow customers reported concrete deflection numbers from AI specialists running in production. Docusign is targeting autonomous resolution of 90% of internal IT tickets. Honeywell reports its AI assistant has eliminated the majority of service desk conversations. The city of Raleigh reports a 98% deflection rate on employee requests, the equivalent of a month of staff time saved. Across ServiceNow's customer base, AI specialists now resolve 91% of cases without reassignment. The pattern: governed, narrow agents inside a workflow system are clearing real ticket volume, not just answering FAQs in a chat window. Fortune · ServiceNow newsroom

🛠️ Tooling & Ecosystem

⚖️ Policy & Regulation

📌 Watch List