Reliability Site Engineer
Ref No.: 18-15036
Location: Roseland, New Jersey
Job Description:

Top 3 Skills

1. SRE mindset and awareness, with a focus on automation, availability, and performance of enterprise-level systems
2. Developer background and mindset, with automation experience using shell, Python, Ansible, Java, etc.
3. Operational (Infrastructure as Code) background in both traditional data centers and the cloud (AWS)

Job Profile

A System Reliability Engineer (SRE) is responsible for the availability, performance, and capacity of large-scale distributed systems. This role combines software and systems engineering to develop creative solutions that address challenges in a more efficient, reliable, and automated way.

An ADP SRE is curious, enjoys problem solving, and has a passion for improving quality. SREs automate manual processes, proactively identify future problems, and ensure existing solutions operate to specification.

Principal Responsibilities:

• Design and develop APIs, modules, frameworks, and systems that support scale through automation.
• Design and develop system and software architectures to promote efficiency in a large distributed ecosystem.
• Participate in incident responses, then design and develop remedial solutions.
• Proactively engage with IT partners to understand, collaborate on, and present alternative solutions that improve product reliability.


Minimum Qualifications:

• 5+ years of software engineering experience working with large distributed systems.
• Able to work independently on complex analysis, design, and implementation of large-scale distributed solutions.
• Able to define enhancement specifications through collaboration sessions and architecture context diagrams.
• Expertise in software engineering, including but not limited to object-oriented programming, design patterns, and API development in multiple languages, including Java, Python, and shell.
• Strong understanding of SQL and table design.
• Experience with continuous integration, continuous delivery, configuration management, and automated testing in a virtualized/containerized environment.
• Strong understanding of infrastructure (virtualization, operating systems, load balancers, web stacks, databases, storage solutions, and networking).
• Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.


Preferred Qualifications:

• Fundamental knowledge of AWS and/or other Cloud platforms.
• Experience designing, implementing, and supporting Docker and Kubernetes platforms.
• Experience troubleshooting and tuning Linux, Windows, and JVMs.
• Experience with writing SQL queries and stored procedures.
• Experience with writing Splunk queries and dashboards.
• Experience designing a continuous integration and delivery model with products like Jenkins, so that delivery of new content can be fully automated through pipelines that run automated integration and functional testing.
• Strong understanding of application and system monitoring using black-box and white-box techniques.
