What we are looking for - DevOps Developer Job Description

#Ensure user-visible uptime and quality, providing operational and development expertise so that our systems rarely fail and are fast to fix when they do
#Own the day-to-day health, uptime, monitoring, and reliability of services and server infrastructure
#Participate in architecture and design reviews, recommending improvements to the development teams that raise the reliability and performance of applications
#Minimize manual involvement by imagining and implementing continuous improvements to the operating environment, including the development of new tools, dynamic monitoring, alerting, and automated self-healing and recovery
#Identify and/or analyze problems relating to mission-critical services and implement automation to prevent problem recurrence, with the goal of automating the response to all non-exceptional service conditions
#Engage in application performance analysis, system tuning, and capacity planning
#Perform root cause analysis to identify and implement continuous improvements
#Capable of presenting analyses and recommendations to leadership and of discussing the technical merits of solutions with engineers and architects
#Practice Agile and Scrum methodologies

Work Complexity:
#Moderate level of technical complexity, with experience across multiple integrated applications
#Moderate-to-high degree of judgment involved in decisions regarding triaging and driving production incidents to resolution within the agreed-upon SLE
#Fast-paced, technically complex environment that requires the ability to manage competing priorities
#High degree of interaction with technical personnel (developers, infrastructure support personnel, etc.) and non-technical personnel (Business Users, SME & QA teams)

Analysis & Problem Solving:
#High degree of reasoning required to collaboratively triage production incidents, analyze their root causes, and help drive them to conclusion
#Ability to use original thinking in investigating, analyzing, and resolving production incidents
#Builds on experience with related products and associated systems to identify and resolve problems