GCP Service | Purpose |
Cloud Composer | Orchestrates ETL and big data workflows; manages scheduling, retries, and dependencies. |
Dataflow | Processes large-scale batch or streaming datasets; supports autoscaling and dynamic resource allocation. |
Cloud Storage | Stores raw, intermediate, and processed datasets for the workflows. |
BigQuery | Stores structured, processed data for analytics and reporting. |
Cloud Monitoring / Cloud Logging | Tracks job performance, resource utilization, and workflow health. |
Cloud Functions | Triggers auxiliary workflows, notifications, or automated responses based on job events. |
Pub/Sub | Ingests real-time event streams and delivers them to Dataflow pipelines for processing. |
Vertex AI (Optional) | Predicts optimal resource allocation for workflows from historical performance data. |
Cloud Key Management Service (KMS) | Manages the encryption keys used to protect sensitive pipeline data for security and compliance. |
Looker / Looker Studio (formerly Data Studio) | Visualizes workflow performance, resource usage, and processed data insights. |
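
To make the orchestration role in the table concrete, the following is a minimal, library-free sketch of the three things attributed to Cloud Composer: scheduling order, retries, and dependencies. It does not call any GCP or Airflow API. The task names, the mapping of tasks to services in the comments, and the helpers `run_with_retries` and `run_pipeline` are all hypothetical, chosen only to illustrate one plausible arrangement of the services listed above.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical tasks; each maps to the set of tasks it depends on.
# The service named in each comment is an assumed assignment, not a GCP API call.
PIPELINE = {
    "ingest_events": set(),                              # Pub/Sub
    "land_raw_files": set(),                             # Cloud Storage
    "transform": {"ingest_events", "land_raw_files"},    # Dataflow
    "load_warehouse": {"transform"},                     # BigQuery
    "refresh_dashboard": {"load_warehouse"},             # Looker
    "notify": {"load_warehouse"},                        # Cloud Functions
}


def run_with_retries(task, action, max_retries=2):
    """Call action(task) until it succeeds; return the number of attempts used.

    Re-raises the last RuntimeError once max_retries retries are exhausted.
    """
    for attempt in range(1, max_retries + 2):
        try:
            action(task)
            return attempt
        except RuntimeError:
            if attempt == max_retries + 1:
                raise


def run_pipeline(graph, action, max_retries=2):
    """Run every task after all of its dependencies; return {task: attempts}."""
    attempts = {}
    for task in TopologicalSorter(graph).static_order():
        attempts[task] = run_with_retries(task, action, max_retries)
    return attempts


# Simulate one transient failure in the transform step.
failures = {"transform": 1}


def flaky(task):
    if failures.get(task, 0) > 0:
        failures[task] -= 1
        raise RuntimeError(f"{task} failed")


result = run_pipeline(PIPELINE, flaky)
print(result)
```

In a real deployment, each dictionary entry would correspond to an Airflow task in a Composer DAG, and the retry loop would be replaced by the orchestrator's own retry settings. The sketch only shows why a dependency graph plus per-task retries is enough to express the workflow behavior the table describes.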