
| Category | AWS Service / Technology | Purpose |
|---|---|---|
| Big Data Processing | AWS EMR | Process large-scale data workloads using Spark, Hadoop, Hive, etc. |
| Monitoring | Amazon CloudWatch | Track EMR cluster metrics and trigger scaling events. |
| Compute Scaling | AWS Auto Scaling / Lambda | Dynamically adjust cluster nodes based on workload metrics. |
| Storage | Amazon S3 | Store input/output datasets and logs. |
| Security & Access | AWS IAM | Manage secure access and permissions for resources and services. |
| Notification | Amazon SNS / EventBridge | Send alerts when clusters scale or metrics cross thresholds. |
| Workflow Management | AWS Step Functions (Optional) | Orchestrate big data jobs and ensure smooth pipeline execution. |