Run prio-processor containers on persistent GKE clusters
Categories
(Data Platform and Tools :: General, enhancement, P2)
Tracking
(Not tracked)
People
(Reporter: amiyaguchi, Assigned: amiyaguchi)
References
(Blocks 1 open bug)
Details
Attachments
(2 files)
From 1664057:
The current structure of Prio in Airflow contains three separate GCP projects: admin, A, and B. The admin project needs to write into buckets located in A and B by running a container. Containers are not run in the shared cluster because of service account permissions. Instead, each project spins up an ephemeral kubernetes cluster.
This will be changed so the cluster is always alive with a g1-small node, with a burstable node pool for processing multiple batches of jobs (e.g. backfill). This should be roughly $5/mo to keep the cluster alive.
The Airflow PR will contain the new cluster configuration and the new pod operator definitions. The cluster definition will likely move into the cloudops infra repository to be managed programmatically for each project.
Comment 1•5 years ago
|
||
Comment 2•5 years ago
|
||
Description
•