Description
When you use Dataproc, cluster and job data is stored on Persistent Disks (PDs) associated with the Compute Engine VMs in your cluster and in a Cloud Storage staging bucket. This PD and bucket data is encrypted using a Google-generated data encryption key (DEK) and key encryption key (KEK). The CMEK feature allows you to create, use, and revoke the key encryption key (KEK). Google still controls the data encryption key (DEK).
Rationaleโ
Cloud services offer the ability to protect data related to those services using encryption keys managed by the customer within Cloud KMS. These encryption keys are called customer-managed encryption keys (CMEK). When you protect data in Google Cloud services with CMEK, the CMEK key is within your control.
Impactโ
Using Customer Managed Keys involves additional overhead in maintenance by administrators.
Auditโ
From Google Cloud Consoleโ
- 
Login to the GCP Console and navigate to the Dataproc Cluster page by visiting https://console.cloud.google.com/dataproc/clusters. 
- 
Select the project from the project dropdown list. 
- 
On the Dataproc Clusterspage, select the cluster and click on the Name attribute value that you want to examine.
- 
On the detailspage, select theConfigurationstab.
- 
On the Configurationstab, check theEncryption typeconfiguration attribute value. If the value is set toGoogle-managed key, then Dataproc Cluster is not encrypted with Customer managed encryption keys.Repeat step no. 3 - 5 for other Dataproc Clusters available in the selected project. 
- 
Change the project from the project dropdown list and repeat the audit procedure for other projects. 
From Google Cloud CLIโ
- 
Run clusters list command to list all the Dataproc Clusters available in the region: gcloud dataproc clusters list --region='us-central1'
- 
Run clusters describe command to get the key details of the selected cluster: gcloud dataproc clusters describe <cluster_name> --region=us-central1 --flatten=config.encryptionConfig.gcePdKmsKeyName
- 
If the above command output return "null", then the selected cluster is not encrypted with Customer managed encryption keys. 
- 
Repeat step no. 2 and 3 for other Dataproc Clusters available in the selected region. Change the region by updating --region and repeat step no. 2 for other clusters available in the project. Change the project by running the below command and repeat the audit procedure for other Dataproc clusters available in other projects: gcloud config set project <project_ID>"