DevOps Engineer Practice Questions

Question #1

You are developing the deployment and testing strategies for your CI/CD pipeline in Google Cloud. You must be able to:

  • Reduce the complexity of release deployments and minimize the duration of deployment rollbacks.
  • Test real production traffic with a gradual increase in the number of affected users.

You want to select a deployment and testing strategy that meets your requirements. What should you do?

  1. Rolling update deployment and A/B testing
  2. Rolling update deployment and shadow testing
  3. Blue/green deployment and canary testing
  4. Recreate deployment and canary testing

 

Question #2

Your organization stores all application logs from multiple Google Cloud projects in a central Cloud Logging project. Your security team wants to enforce a rule that each project team can only view their respective logs, and only the operations team can view all the logs. You need to design a solution that meets the security team’s requirements, while minimizing costs. What should you do?

  1. Export logs to BigQuery tables for each project team. Grant project teams access to their tables. Grant logs writer access to the operations team in the central logging project.
  2. Grant each project team access to the project _Default view in the central logging project. Grant logging viewer access to the operations team in the central logging project.
  3. Create Identity and Access Management (IAM) roles for each project team and restrict access to the _Default log view in their individual Google Cloud project. Grant viewer access to the operations team in the central logging project.
  4. Create log views for each project team, and only show each project team their application logs. Grant the operations team access to the _AllLogs view in the central logging project.

 

Question #3

You work for a global organization and run a service with an availability target of 99% with limited engineering resources. For the current calendar month, you noticed that the service has 99.5% availability. You must ensure that your service meets the defined availability goals and can react to business changes, including the upcoming launch of new features. You also need to reduce technical debt while minimizing operational costs. You want to follow Google-recommended practices. What should you do?

  1. Add N+1 redundancy to your service by adding additional compute resources to the service.
  2. Allocate available engineers to the feature backlog while you ensure that the service remains within the availability target.
  3. Define an error budget for your service level availability and minimize the remaining error budget.
  4. Identify, measure, and eliminate toil by automating repetitive tasks.

 

Question #4

Your CTO has asked you to implement a postmortem policy on every incident for internal use. You want to define what a good postmortem is to ensure that the policy is successful at your company. What should you do?
Choose 2 answers.

  1. Ensure that all postmortems include what caused the incident, identify the person or team responsible for causing the incident, and how to prevent a future occurrence of the incident.
  2. Ensure that all postmortems include what caused the incident, how the incident could have been worse, and how to prevent a future occurrence of the incident.
  3. Ensure that all postmortems include the severity of the incident, how to prevent a future occurrence of the incident, and what caused the incident without naming internal system components.
  4. Ensure that all postmortems include how the incident was resolved and what caused the incident without naming customer information.
  5. Ensure that all postmortems include all incident participants in postmortem authoring and share postmortems as widely as possible.

 

Question #5

Your team is running microservices in Google Kubernetes Engine (GKE). You want to detect consumption of an error budget to protect customers and define release policies. What should you do?

  1. Create SLIs from metrics. Enable Alert Policies if the services do not pass.
  2. Use the metrics from Anthos Service Mesh to measure the health of the microservices.
  3. Create a SLO. Create an Alert Policy on select_slo_burn_rate.
  4. Create a SLO and configure uptime checks for your services. Enable Alert Policies if the services do not pass.