BLOG - AWS re:Invent 2025

From “one Region, one cloud” to resilient, sovereign, and polycloud by design
Laurens van Gunst
December 10, 2025
Reinvent 2025 copy

AWS re:Invent is the industry’s annual reality check. It’s where AWS sets the direction for the next 12–24 months, lands enterprise‑ready capabilities at scale, and—often quietly—redraws the reference architectures most teams will follow. For leaders, it’s a strategic barometer; for builders, a backlog accelerator. The message this year is clear: resilience, sovereignty, and choice are moving from slideware to defaults—backed by agent‑driven operations, simpler networking, and more predictable cost models. Here are the most important takeaways and why I think they matter. This year marked the last Keynote of Werner Vogels, Amazon CTO. I’m curious to see how and by whom this in upcoming years will be done.

Global infrastructure and architecture

“Multi‑Region by design” is no longer a luxury but common sense. Entire Regions can fail; designing at the Region level lowers business risk and turns disaster recovery into a manageable ritual rather than an ad‑hoc crisis. For sensitive data or when predictable AI capacity is needed on‑prem, AWS AI Factories bring GPUs, high‑performance storage, and Bedrock/SageMaker into your data center. That shortens time‑to‑value, keeps data resident, and enables open‑weight models under tighter operational control.

Operations and reliability

AWS DevOps Agent (preview) reduces MTTR by investigating incidents immediately, correlating signals across observability, code, and CI/CD tools, and guiding remediation—fewer late‑night fires, more systematic improvement. AWS Security Agent (preview) continuously enforces security across the SDLC, checks code and configurations against your standards, and can open PRs and tickets—closing gaps faster without choking delivery. On the posture side, AWS Security Hub (GA) unifies findings and adds near‑real‑time risk analytics so teams can prioritize and act faster.

Compute

Long‑running and human‑in‑the‑loop processes get simpler with AWS Lambda Durable Functions: state persistence for up to a year, deterministic resumption, and no charges while waiting. If you need GPUs, large memory, or stable performance, AWS Lambda Managed Instances pairs the Lambda developer experience with EC2 flexibility while AWS runs the fleet. For general workloads, Graviton5 lifts price/performance with minimal code changes. Peak single‑thread performance and large memory come together in Amazon EC2 X8aedz, ideal for EDA and large relational databases. And if you train or fine‑tune models, Trainium3 UltraServers promise faster training at lower cost.

Containers

Platform teams can reduce undifferentiated Kubernetes toil with the new Amazon EKS platform capabilities, which streamline orchestration and cloud resource management and accelerate governance.

Artificial intelligence and agents

Production‑grade agents require guardrails and measurement: Amazon Bedrock AgentCore—policy and evaluations delivers real‑time tool‑call policies and quality metrics so behavior is auditable and reliable. For vector‑driven use cases, Amazon S3 Vectors (GA) brings native vector storage and querying to S3: massive scale at lower cost, simplifying RAG and agent memory without a separate vector database. Amazon Bedrock reinforcement fine‑tuning improves accuracy with fewer labels and less MLOps complexity. On the training side, Amazon SageMaker HyperPod’s checkpointless and elastic training keeps accelerators utilized and recovers faster from failures, cutting cost and wall‑clock time. Finally, Amazon SageMaker AI—serverless customization speeds SFT/DPO/RL via a simple UI, while serverless MLflow removes friction from experiment tracking.

Analytics and collaboration

Collaborative training without compromising privacy becomes more practical with AWS Clean Rooms—synthetic datasets, which preserve statistical utility while protecting individuals—ideal for partner ecosystems.

Databases

Cost efficiency without rigidity gets closer with Database Savings Plans: a single $/hour commitment that discounts across engines, sizes, Regions, and serverless/provisioned modes. Without replatforming, you can lower costs and scale using RDS features for Oracle & SQL Server, like additional storage and license‑sensitive CPU optimization. Vector search becomes faster and cheaper with OpenSearch Service—GPU acceleration, which automatically balances cost and performance.

Storage

For massive datasets (video, seismic, AI), S3 objects up to 50 TB make storage and movement simpler with familiar tools and lifecycle. Operational insight deepens with S3 Storage Lens—performance metrics, billions of prefixes, and export to S3 Tables. Global analytics face less friction with S3 Tables—automatic Iceberg replication. And Amazon FSx for NetApp ONTAP—S3 Access Points makes file data S3‑addressable for AI and analytics—without moving it.

Networking and content delivery

Enforcing TLS‑in‑transit within and across VPCs gets easier with VPC Encryption Controls, reducing compliance burden and misconfigurations. Routing and HA get a cleanup with the Regional NAT Gateway: one gateway per VPC instead of per AZ—often cheaper and less error‑prone. ALB Health Check Logs provide structured visibility for faster root‑cause analysis and better automation. Budgeting becomes more predictable with CloudFront flat‑rate plans: bundles with integrated security and no overages. And polycloud becomes practical with Interconnect – Multicloud (preview), which offers private, resilient links to other clouds without public‑internet variability.

Security, identity, and compliance

For European customers who require strict residency and operational independence, the AWS European Sovereign Cloud provides an EU‑based, separately operated cloud that aligns with sovereignty rules—and pairs logically with Interconnect patterns.

Management and governance

Bringing observability and compliance data together accelerates insights and reduces ETL with CloudWatch—unified data & analytics, including access via S3 Tables/Apache Iceberg.

Developer experience

Making APIs discoverable and governable becomes a no‑ops activity with the Amazon API Gateway Portal: launch a branded, secure portal across accounts in minutes, including access, subscriptions/keys, and onboarding.


AWS is shifting the center of gravity from “one Region, one cloud” to “resilient, sovereign, polycloud by design”—with agents to reduce MTTR, Lambda to simplify long‑running workflows, VPC/ALB/NAT updates that harden the network by default, storage/analytics that scale without ceremony, and pricing constructs (CloudFront flat‑rate, Database Savings Plans) that trade opacity for control.