Job location: Pune
Roles and Responsibilities:
We are seeking a Senior Service Reliability Engineer who will be responsible for improving and maintaining software development, test, and live infrastructure. The ideal candidate is self-motivated and articulate, has experience with Linux and other *NIX derivatives, and is comfortable working in a fast-paced development environment. Your primary mission as a Senior Service Reliability Engineer is to work closely with the development, technical operations, quality assurance, and product management teams to ensure the timely delivery of robust technical solutions for the company's customers and internal teams.
Support the company's mission-critical front-end and back-end systems and applications in production and preproduction environments.
Identify and drive improvements in infrastructure and system reliability, performance, monitoring, and overall stability of the company's systems.
Perform capacity planning and demand forecasting to meet system demand, identify performance bottlenecks, and devise tuning improvements.
Build tools and automation that eliminate repetitive tasks and prevent incident occurrence.
Create and maintain operational runbooks and documentation.
Participate in 24/7 operational support and on-call rotation.
Requirements:
B.S. in Computer Science or equivalent experience
Minimum of 5 years of experience in production environment engineering, configuration management, and release engineering
Experience supporting, analyzing, and troubleshooting large-scale, distributed, mission-critical systems with a Java backend (MUST)
Systematic problem-solving approach and a strong sense of ownership to drive problems to resolution
Experience configuring and managing web servers (Apache, Tomcat, Nginx) and RESTful web service applications (MUST: at least one of Tomcat, JBoss, or WebLogic)
Strong knowledge of Linux systems administration and architecture (MUST)
Experience with configuring, managing and supporting virtualized environments
Proficiency working with Amazon Web Services (AWS) such as EC2, EBS, ELB, S3, and CloudFormation in a highly available and scalable production environment (not a must, but very important)
Experience with SQL (MySQL) and NoSQL databases (Redis, Couchbase, Cassandra)
Experience with open source technologies (Kafka, Memcached, Redis, Hadoop, HBase, ZooKeeper, Oozie)
Network knowledge (TCP/IP, UDP, DNS, load balancing); prior network administration experience is a big plus.
Extensive scripting experience with Shell, Python or Ruby (MUST)
Experience documenting processes, systems, environments and runbook procedures
Experience with source control tools such as Git/GitHub, SVN, or Perforce