Placement papers | Freshers Walkin | Jobs daily: Software Engineer - Site Reliability at Datadog (New York, NY)


Search jobs and placement papers

Software Engineer - Site Reliability at Datadog (New York, NY)

If you're not looking for Site Reliability or Infrastructure-focused roles, please check out our careers page.



How do you keep a data-intensive, real-time service that monitors hundreds of thousands of servers up-and-running around the clock?


How do you respond to infrastructure failures or performance issues in a high-volume, low-latency computing environment?


What should the infrastructure look like when Datadog monitors millions of servers and containers? If you these are problems that you find interesting and want to work on, apply to work on the SRE team!


What you will do



  • Keep our service reliable, available and fast as a member of the operations team.

  • Respond to, investigate and fix service issues, whether they be deep in the OS kernel or in the application code.

  • Design, build and maintain the infrastructure we need to support orders of magnitude more customers.


What we're looking for



  • You have a track record as a Software Engineer in the development and maintenance of a large site or distributed cloud system,

  • You value correctness and efficiency; you leave no stone unturned when diagnosing production issues,

  • You handle infrastructure with code because automation lets you focus on the more difficult and rewarding problems,

  • You have production experience with distributed compute/storage tools, e.g. Zookeeper, Cassandra, Postgres, Kafka, Elasticsearch, Redis,

  • You have production experience with one or more cloud platforms


Bonus Points



  • You have a BS/MS/PhD in a scientific field or equivalent experience

  • You have submitted bug fixes to the aforementioned projects

  • You are fully fluent in Python, Ruby OR Go

  • You have Kubernetes and container experience


by via developer jobs - Stack Overflow
 

No comments:

Post a Comment