Aurora Serverless v2: What It Is and When to Use It

Definition

Amazon Aurora Serverless v2 is the on-demand, auto-scaling configuration of Amazon Aurora that sizes database capacity automatically in response to application load. Instead of picking a fixed instance class, you specify a minimum and maximum range of Aurora Capacity Units (ACUs) and Aurora adjusts capacity in 0.5 ACU increments in sub-second transitions, without dropping connections or pausing queries. Serverless v2 supports Aurora MySQL-Compatible Edition and Aurora PostgreSQL-Compatible Edition, and it works alongside all major Aurora features including Multi-AZ, read replicas, and Global Database.

How It Works

One ACU ≈ 2 GiB of memory plus proportional vCPU and networking. When you create or modify an Aurora cluster, you can make any DB instance in the cluster Serverless v2 by choosing a capacity range — for example, min 0.5 ACU and max 16 ACU.

On every metric sample, Aurora Serverless v2 evaluates instance load (CPU, memory pressure, buffer pool behavior) and adjusts capacity up or down in 0.5 ACU steps. Scaling is non-disruptive: connections stay open, transactions continue, and queries in flight are not killed.

Key behaviors:

  • Scale-up latency: sub-second in most cases; Aurora pre-warms capacity slightly ahead of demand.
  • Scale-to-zero: supported on Serverless v2 — the cluster pauses after an idle period you configure, and the next connection triggers a resume. You pay $0 for ACU while paused (storage is still billed).
  • Mixed clusters: Serverless v2 instances and provisioned instances can coexist in the same cluster; the writer can be one type and readers another.
  • Failover: Serverless v2 participates in Aurora's standard failover behavior — a replica (provisioned or Serverless v2) is promoted within seconds.

Serverless v2 shares all infrastructure with provisioned Aurora: the same storage layer (6-way replication across 3 AZs), the same cluster/reader endpoints, the same RDS Data API, and the same Aurora Global Database pattern.

Key Features and Limits

Capacity range

  • Minimum: 0.5 ACU (1 GiB RAM, ~1 vCPU equivalent). With scale-to-zero enabled, can go to 0 ACU.
  • Maximum: up to 256 ACU per instance.
  • Step size: 0.5 ACU.
  • Scaling granularity: sub-second, in-place (no reboot).

Engine support

  • Aurora MySQL-Compatible Edition — MySQL 8.0-compatible versions supported.
  • Aurora PostgreSQL-Compatible Edition — modern PostgreSQL versions supported.
  • Extensions, procedures, and custom parameters all work identically to provisioned Aurora.

Integration with Aurora features

  • Multi-AZ: a Serverless v2 instance plus a Serverless v2 replica in another AZ gives you standard Aurora HA.
  • Read replicas: up to 15 Serverless v2 readers per cluster, each auto-scaling independently.
  • Aurora Global Database: Serverless v2 can participate as primary or secondary Region.
  • Blue/Green deployments, cloning, Backtrack (MySQL only), IAM authentication, and RDS Proxy all work.

Not supported / caveats

  • Aurora Serverless v1 is deprecated in favor of v2 and should not be used for new workloads.
  • Parallel Query on Serverless v2 has version restrictions — confirm current engine compatibility.
  • Very high, sustained, flat workloads may cost more than a right-sized provisioned Reserved Instance.

Common Use Cases

  1. Intermittent / dev-test workloads — databases that sit idle overnight or on weekends scale to 0 ACU and stop accruing compute charges.
  2. Multi-tenant SaaS — one database per tenant scales up during active hours and down (or to zero) when a tenant is quiet; massively reduces fixed per-tenant cost.
  3. Event-driven serverless backends — Lambda + RDS Data API + Serverless v2 gives HTTP-style access with no persistent TCP connections and fully elastic backing capacity.
  4. Unpredictable traffic — startups, viral events, or seasonal businesses (retail spikes, ticketing, tax season) that can't predict peak load.
  5. Analytical sandboxes — data scientists and analysts spin up Postgres-compatible databases that auto-scale up for big queries and back down when idle.
  6. DR secondary Regions — Serverless v2 secondaries in Aurora Global Database stay cheap while idle but can scale instantly when promoted.

Pricing Model

Aurora Serverless v2 bills on four axes:

  1. ACU-hours — per ACU-second consumed, summed hourly. Min-max range is a policy, not a billing floor; billing tracks actual scaled capacity. Scale-to-zero means $0 ACU cost while paused.
  2. Storage — per GB-month of data actually stored in the shared Aurora storage volume (auto-scales up to 128 TiB).
  3. I/O — on Aurora Standard pricing, per million I/O requests. Aurora I/O-Optimized pricing bundles I/O into a higher ACU rate.
  4. Backups, data transfer, Global Database cross-Region replication — standard Aurora rates.

Reserved Instance pricing is not directly applicable to Serverless v2, although Compute Savings Plans can discount Serverless v2 ACU usage in some Regions.

Pros and Cons

Pros

  • Sub-second, non-disruptive scaling in 0.5 ACU steps — no planning, no reboots.
  • Scale-to-zero (v2) eliminates idle cost for intermittent workloads.
  • Full Aurora feature parity: Multi-AZ, 15 replicas, Global Database, Blue/Green, Backtrack, RDS Data API.
  • Perfect fit for SaaS, dev/test, and Lambda-driven workloads.
  • Same endpoints and drivers as provisioned Aurora — migration is a configuration change.

Cons

  • Per-ACU-hour cost can exceed a comparably sized Reserved provisioned instance for steady, high-utilization workloads.
  • Cold start from 0 ACU adds a few seconds to the first query after a pause.
  • Very large max-ACU caps require careful cost controls to avoid runaway auto-scaling.
  • Not available for Aurora engines beyond MySQL and PostgreSQL compatibility (no Oracle/SQL Server).

Comparison with Alternatives

| | Aurora Serverless v2 | Provisioned Aurora | Aurora Serverless v1 | DynamoDB On-Demand | | --- | --- | --- | --- | --- | | Scaling unit | 0.5 ACU, sub-second | Instance class, manual | 1 ACU, 30-60 s | Per request | | Scale to zero | Yes (pause) | No | Yes | Native (no idle cost) | | Multi-AZ | Yes | Yes | Limited | Native (3 AZs) | | Replicas | Up to 15 | Up to 15 | None | N/A | | Global DB | Yes | Yes | No | Global Tables | | Best for | Intermittent/multi-tenant relational | Steady, high-utilization | Legacy (avoid) | Key-value at scale |

Exam Relevance

  • Solutions Architect Associate (SAA-C03) — when to pick Serverless v2 over provisioned Aurora; 0.5 ACU increments; Multi-AZ and replicas on Serverless; scale-to-zero economics for dev/test.
  • Developer Associate (DVA-C02) — Aurora Serverless v2 + RDS Data API + Lambda as a serverless relational stack; IAM authentication.
  • Database Specialty (DBS-C01) — Serverless v1 vs v2 differences, mixed provisioned/serverless clusters, Global Database participation, cost modeling at various utilization levels.

Common exam trap: "unpredictable, spiky SQL workload with long idle periods" → Aurora Serverless v2 (not provisioned Aurora, not DynamoDB, unless the schema is key-value).

Frequently Asked Questions

Q: How is Aurora Serverless v2 different from v1?

A: v1 scaled in whole ACU steps, paused connections during scaling, took tens of seconds to scale, did not support read replicas or Global Database, and had limited engine versions. v2 scales in 0.5 ACU steps in sub-second, non-disruptive transitions, supports full Aurora feature parity (Multi-AZ, up to 15 replicas, Global Database, Blue/Green, Backtrack), and can be mixed with provisioned instances in the same cluster. v1 is deprecated — all new workloads should use v2.

Q: Can Aurora Serverless v2 really scale to zero?

A: Yes. You configure an auto-pause threshold (for example, pause after 5 minutes of inactivity). While paused, the cluster consumes 0 ACU and you pay only for storage. The next connection triggers a resume; the first query after resume may wait a few seconds while the instance warms up. Scale-to-zero is ideal for dev/test, SaaS tenants that are quiet overnight, and DR secondaries.

Q: When should I stick with provisioned Aurora instead?

A: When utilization is steady and high — for example, a production OLTP system that runs at 40%+ CPU 24/7 — a Reserved provisioned instance is typically cheaper than the equivalent ACU-hour cost on Serverless v2. Also prefer provisioned when you need a specific feature not yet supported on Serverless v2 (confirm against current docs), or when you require deterministic capacity for strict latency SLAs where auto-scaling transitions would be unwelcome.


This article reflects AWS features and pricing as of 2026. AWS services evolve rapidly — always verify against the official Amazon Aurora Serverless v2 documentation before making production decisions.

Published: 4/17/2026 / Updated: 4/17/2026

This article is for informational purposes only. AWS services, pricing, and features change frequently — always verify details against the official AWS documentation before making production decisions.

More in Databases