
| GCP Service | Purpose |
|---|---|
| Cloud Composer | Orchestrates ETL and big data workflows; manages scheduling, retries, and dependencies. |
| Dataflow | Processes large-scale batch or streaming datasets; supports autoscaling and dynamic resource allocation. |
| Cloud Storage | Stores raw, intermediate, and processed datasets for the workflows. |
| BigQuery | Stores structured processed data for analytics and reporting. |
| Cloud Monitoring / Logging | Tracks job performance, resource utilization, and workflow health. |
| Cloud Functions | Triggers auxiliary workflows, notifications, or automated responses based on job events. |
| Pub/Sub | Streams real-time data into Dataflow pipelines for processing. |
| Vertex AI (Optional) | Can be used to predict optimal resource allocation for workflows using historical performance data. |
| Cloud Key Management Service (KMS) | Encrypts sensitive data in pipelines for security and compliance. |
| Looker / Data Studio | Visualizes workflow performance, resource usage, and processed data insights. |