Site Reliability Engineer
Ref No.: 18-06808
Location: Waltham, Massachusetts
Duration: 12-month contract
Terms: W2


Description:
The OnCommand Insight Team is one of Client's fastest-moving teams. We deliver enterprise software to manage infrastructure and applications. Client's OnCommand Insight is one of our most successful products, with significant sales to large global enterprises. We are seeking Site Reliability Engineers to help us build out our new cloud offering. If you have experience running complex software services and keeping them reliable and available, you are needed here.

Responsibilities:
· Build architecture and operational tools to run a SaaS product
· Build the right processes to run the service and its operation
· Work within the product team to design and improve the architecture of the product
· Daily responsibilities include:
o Coding tools that automate operations processes
o Scripting infrastructure (CloudFormation, SaltStack, etc.)
o Analyzing and improving latency, performance, and availability
o Capacity planning
o Resolving critical and/or high-visibility customer issues
· Apply prior technical experience to contribute actively to day-to-day projects, meet schedules, and resolve problems.
· Leverage expertise to resolve issues of diverse scope through short-term and mid-term planning.
· Work effectively with staff at all levels throughout the organization, including vice presidents.

Requirements:
· Strong oral and written communication skills are essential
· Clear understanding of the product development life cycle, software development methodologies, and technical project management
· Clear understanding of concepts related to computer and software architecture
· Experience in software development, Linux system administration, and network engineering (a mix of these is OK)
· Experience with at least one major cloud provider (AWS preferred)
· Experience with, or a clear understanding of, Docker, Kubernetes, and orchestration tools such as OpenShift and Rancher
· Experience operating a SaaS infrastructure, including:
o Scaling and high availability patterns
o Issue troubleshooting and resolution
o Software deployment and CI/CD pipelines
o Monitoring
· Experience with infrastructure configuration tools such as:
o SaltStack
o Terraform
o Ansible
o Puppet/Chef
· Understanding of microservices architecture and REST interfaces

Education:
Typically requires a minimum of 8 years of related experience with a Bachelor's degree; or 6 years with a Master's degree; or 3 years with a PhD; or equivalent experience.