Lead Site Reliability Engineer
Ref No.: 18-05172
Location: New York, New York
Position Type: Direct Placement
Pay Rate: $180,000.00 - $210,000.00 / Year
This SRE keeps the platform used by our client's 1,100 customers running smoothly and efficiently. We build powerful automation that impacts everything from development and testing through to production deployment, scaling, monitoring, and alerting. Put another way, we eliminate work through automation. We have fun leveraging cutting-edge technologies such as Terraform, Kubernetes, Docker, Istio, Jenkins, and Spinnaker.
Help us scale our business to meet the needs of our growing customer base and develop new products on our platform. You'll be a critical part of our growing company, working on a cross-functional team to implement best practices in technology, architecture, and process. You'll have the chance to work in an open and collaborative environment, shape the engineering culture and have ample opportunities to grow and accelerate your career.

  • Design and build the tools, frameworks, systems, and processes that the engineers use to build, integrate, deploy, scale and manage their software.
  • Automate tasks across the full CI/CD lifecycle to create an efficient developer experience and reduce manual toil.
  • Scale solutions from proofs-of-concept to full production systems.
  • Collaborate effectively with and mentor other engineers on the SRE team and in the larger engineering org.
  • Promote and implement best practices in observability (monitoring, tracking, alerting, logging) and high-availability software engineering.
  • Participate in an on-call rotation to mitigate site disruptions.
  • Minimize the risk of reliability failures affecting durability, availability, performance, and correctness.

Requirements:

  • 3+ years in SRE or DevOps roles, with a focus on tooling, automation, and distributed systems development.
  • 8+ years overall software industry experience.
  • A desire to stay on the cutting edge of infrastructure and automation technologies.
  • Strong software development skills in at least one programming language. We use Go, Python, and .NET Core.
  • Production experience with infrastructure frameworks like Docker, Terraform, and Kubernetes.
  • Production experience with AWS and Linux environments.
  • Experience with configuration management tools like Puppet, Chef, or Ansible.