☁︎SAA-C03

S3

S3 — Concept

What it is

Amazon Simple Storage Service (S3) = AWS's highly durable, infinitely scalable object storage. Objects (up to 5 TB) live in buckets identified by a globally unique name and a region.

Why it exists

S3 is the bedrock of AWS: data lake, static website hosting, backups, archive, big-data input/output, content distribution origin. 11 nines (99.999999999 %) of durability and 99.99 % availability target.

Core concepts

  • Bucket — top-level container (region-scoped, globally unique name).
  • Object — file + metadata + version ID (if versioning).
  • Key — full path inside the bucket (e.g. 2026/05/photo.jpg).
  • Prefix — virtual folder (no real hierarchy under the hood).

Storage classes (must memorize)

ClassLatencyMin storageUse
S3 StandardmsFrequent access, hot data
S3 Intelligent-TieringmsUnknown / changing access pattern (auto moves between Frequent/Infrequent/Archive tiers)
S3 Standard-IAms30 dInfrequent but quick access; retrieval fee
S3 One Zone-IAms30 dSingle-AZ, ~20 % cheaper than Std-IA, reproducible data
S3 Glacier Instant Retrievalms90 dArchive that needs occasional fast access
S3 Glacier Flexible Retrievalminutes–hours (1–5 min Expedited, 3–5 h Std, 5–12 h Bulk)90 dBackup / DR
S3 Glacier Deep Archive12 h Std / 48 h Bulk180 dLong-term compliance, cheapest
S3 Reduced Redundancy (RRS)msDeprecated, don't pick on exam

Lifecycle policy

  • Move objects between classes by age or transition (e.g. Standard → IA after 30 d → Glacier after 90 d).
  • Expire (delete) objects after N days.
  • Can also expire incomplete multipart uploads (clean up costs).
  • Filter by prefix or tag.

Versioning & MFA Delete

  • Versioning (bucket-level) keeps all versions of objects; deletes create a delete marker.
  • Once enabled cannot be disabled (only suspended).
  • MFA Delete requires MFA to delete versions or disable versioning (root account only).

Replication

  • Cross-Region Replication (CRR) — DR, latency.
  • Same-Region Replication (SRR) — log aggregation, compliance.
  • Requires versioning enabled on source and destination.
  • Asynchronous; new objects only (use Batch Replication for backfill).
  • Cross-account works.

Encryption (very common exam topic)

ModeWho manages keyNotes
SSE-S3 (default)AWS-managed AES-256Free; transparent
SSE-KMSKMS-managed CMKAudit + key rotation; counts toward KMS API limits
SSE-CCustomer provides key per requestAWS doesn't store the key
DSSE-KMSDual-layer with two distinct KMS keysHigher assurance compliance
Client-sideYou encrypt before uploadE2E control

Buckets can require encryption via bucket policy with s3:x-amz-server-side-encryption condition. Block Public Access (BPA) blocks all public ACLs/policies by default at account & bucket level.

Security & access

  • Bucket policy (resource-based) — fine-grained including conditions like IP, VPC, encryption header, MFA.
  • IAM policy — identity-based.
  • ACLs — legacy; AWS recommends disabling object ACLs (Bucket Owner Enforced).
  • Pre-signed URL — time-bound permission to upload/download via IAM creds.
  • Block Public Access — master switch, defaults to ON for new buckets.

Object Lock & Glacier Vault Lock

  • Object Lock = WORM (write-once-read-many), legal hold or retention period (Governance/Compliance mode).
  • Bucket versioning required.

Performance features

  • Multipart upload — required > 5 GB, recommended > 100 MB; parallel parts; resumable.
  • Byte-range fetches — parallel download.
  • S3 Transfer Acceleration — uploads via CloudFront edge POPs (extra fee).
  • S3 Select / Glacier Select — server-side SQL on individual objects (CSV/JSON/Parquet).
  • Request rate: at least 5,500 GET / 3,500 PUT per prefix per second (scales).

Storage Lens, Inventory, Analytics

  • S3 Inventory = daily CSV/Parquet listing of objects (for batch processing).
  • Storage Lens = org-wide usage & best-practice dashboard.
  • S3 Storage Class Analysis = recommends Std → IA transitions.

When to use vs alternatives

Use ...Instead of ...When ...
S3EFS / EBSObject storage, no POSIX, web-scale
Intelligent-TieringManual lifecycleUnknown / variable access patterns
Glacier Deep ArchiveOn-prem tapeCheapest long-term archive
S3 Transfer AccelerationDirect uploadFar-from-region clients pushing big files

Common exam scenarios

  1. "Cheapest storage for 7-year compliance archive, retrieval rare"Glacier Deep Archive.
  2. "Auto-optimize cost without knowing access pattern"Intelligent-Tiering.
  3. "Static website hosted on S3 with HTTPS and custom domain" → S3 + CloudFront (S3 website is HTTP only).
  4. "Allow upload to bucket from a mobile app without exposing keys"Pre-signed PUT URL.
  5. "Ensure all objects encrypted with KMS" → bucket policy with s3:x-amz-server-side-encryption-aws-kms-key-id condition + default encryption.
  6. "DR replica in another region"CRR with versioning.
  7. "Legal hold / immutable storage for audit logs"Object Lock (Compliance mode) + versioning.
  8. "Block accidental public exposure"Block Public Access ON at account level.

Exam tip

S3 = object storage, never mount as a file system in real solutions. CloudFront is the standard companion for HTTPS / latency. Encryption + Block Public Access are almost always part of "best practice" answers.

References