Previous Job
Previous
Site Reliability Engineer- Infrastructure
Ref No.: 18-02992
Location: New York, New York
Position Type:Full Time
Pay Rate : $ 150,000.00 - 250,000.00 /Year
Our client is a global business and financial information and news leader with a focus on open-source bleeding edge technology. They are currently seeking multiple SRE's for a number of their infrastructure teams.

The ideal candidate will have an expert level of systems(Linux) knowledge, with strong software engineering skills. As an SRE you will be trusted to improve the stability and availability of the production environment through automation. Your team responsible for monitoring, provisioning, configuration management, orchestration, capacity planning, deployment and rollback, incident management, and systems development life cycle practices.

Our Team:
The Systems Automation Engineering group is trusted to build-out core automation components and establish the necessary tooling and architectural standards for all aspects of orchestration and configuration management. This includes Chef, Salt, and a testing framework for both.
What's in It For You:
You'll work with modern, open-source tooling while maintaining mission-critical systems hosting a wide array of applications. System Engineers will trust you as an escalation point and you'll regularly collaborate to maintain the stability and performance of operating systems and servers. We'll depend on you to advise on design, architecture, and utilization of enterprise-class configuration and orchestration systems.
You'll Need to Have:
  •  Demonstrated experience programming and testing Python, Ruby, Go, or C/C++
  •  Experience working in a 24/7 production engineering organization
  •  Ability to listen, communicate, evaluate, problem solve, multi-task, and prioritize in a high-pressure, mission-critical, and rewarding team environment.
  •  Familiarity with configuration / orchestration management tools such as Chef, SaltStack, Puppet or Ansible
  •  Experience programming in Python or Ruby
  •  Ability to create robust testing and certification processes to comprehensively evaluate impact of configuration and orchestration changes to their systems stack
  •  Working knowledge of using source control systems like GIT and working on CICD pipelines utilizing tools like Docker, Jenkins
 
We'd Love to See:
  •  Deep expertise troubleshooting complex distributed systems
  •  Experience with creating and improving documented procedures and/or playbooks
  •  Working knowledge of Chef, Puppet, Ansible, or Salt
  •  Familiarity with open source configuration, orchestration, and CI/CD tools
  •  Deep understanding of TCP/IP and Unix networking, Linux kernel performance (virtual memory and process scheduling)