Site Reliability Engineer Interview Questions
2,547 site reliability engineer interview questions shared by candidates
1) HR Interview: A few minutes to introduce ourselves, followed by questions about my current employer, my work status in the US, and so on.
2) Technical Interview: A 30-minute phone interview with the manager. Questions on my resume about my job responsibilities as a Cloud Operations Engineer, the monitoring tools I was working on, and the role I played in setting them up.
3) On-Site Interview: Interviewed by HR, a team member, the manager, the director, and two other members of the Cloud team (around 3-4 hours). Questions on my resume: skills in Python and AWS, and the monitoring tool I was working on at my current company (its architecture, functionality, and so on).
The following thing failed in k8s; how do you debug it?
You are given a game running on a single instance with MySQL installed locally on it. As the game's popularity increases, suggest ways to keep it highly secure and highly available. With every step the interviewer added more requirements, such as: we want to use JWT, should we? How would you handle session maintenance? And so on.
How would you build an app that has to upload images? What if it had to do this? What if it had to do that? Where would the data go in the database? Code extra features into existing software across several files, etc.
Tell me about yourself
I understood that they're still learning and don't have much exposure. I didn't find any of the questions difficult; all of them covered simple DevOps skills.
Where do you see the distinction between operational tasks and developer tasks?
Years of experience working as an SRE or in a very similar role; years of experience working with cloud (AWS); years of experience working with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar); years of experience working with open-source monitoring and logging tools such as Grafana, Prometheus, Elastic/OpenSearch, Loki, and Tempo; years of experience working with Kubernetes, including its core components, deployment methodologies, and monitoring best practices. Strong scripting abilities (Python, Go, or similar) for automating observability tasks. Experience in managing observability: SLIs, SLOs, log transformation, cardinality management, business and resilience metrics, the four golden signals, and distributed tracing. Experience with automated alerting workflows.
How does multi-tenant architecture work in Kubernetes?