Engineering teams at Autonomic practice DevOps. To us, this means that teams own their automations, monitoring, and on-call rotations. On the Site Reliability Team, we focus our efforts on building scalable and reliable infrastructure that keeps our platform running smoothly.
Responsibilities:
Scale our cloud infrastructure to support our growing ecosystem
Deploy reliable and maintainable distributed systems
Adhere to industry standard security best practices
Write automation, monitoring, diagnostic and debugging tools
Participate in our on-call rotation
Minimum Job Requirements:
Can write scripts from scratch in Python and Bash
Can troubleshoot code written in Java, C, Golang or other strongly typed languages
Can debug network and performance issues in large scale distributed systems
Can identify and mitigate reliability risks
7+ years of experience in SRE, DevOps, or similar role
Desirable Skills:
Strong experience with AWS, Azure, or AliCloud (design, SDKs, best practices)
Familiar with design principles of monitoring and alerting systems
Experience implementing industry standard security best practices
Deep knowledge of distributed pub-sub message systems
Your Cover Letter:
In your cover letter, please rate your proficiency in the following areas on a scale from 1 to 7 where 1 is no experience and 7 is expert:
Linux
Shell Scripting
Python
Networking
AWS VPC
AWS IAM
Kubernetes
Apache Kafka
NOTE: A low self rating will not cause your application to be rejected.
Please submit your cover letter and resume in PDF format using the form below. We look forward to hearing from you.
by via developer jobs - Stack Overflow
No comments:
Post a Comment