Ford Business Solutions

Site Reliability Engineer

Click Here to Apply

Job Location

Chennai, India

Job Description

Short Description: A site reliability engineer (SRE) is a role that combines software engineering and systems engineering to ensure that a software system is available, scalable, and maintainable 247365 in "Always ON" aspect for the Ford's e-Commerce Platform Description for Internal Candidates Strong background in software development and systems administration, as well as excellent problem-solving and communication skills. Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation Performing root cause analysis of production incidents and implementing preventive measures Responsibilities for Internal Candidates Strong background in software development and systems administration, as well as excellent problem-solving and communication skills. Run the production environment by monitoring availability and taking a holistic view of system health. Developing, improving, and operating the deployment and orchestration of a complex distributed system Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve Provide primary operational and engineering Support for multiple large, distributed software applications Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation Collaborating with development teams to design, build, and operate scalable and resilient software systems Automating deployment, monitoring, and incident response processes Performing root cause analysis of production incidents and implementing preventive measures Conducting performance analysis and optimization of the system Ensuring compliance with security and regulatory standards Implementing and maintaining disaster recovery processes Providing technical guidance and mentorship to other team members Participating in an on-call rotation for incident response and support. Qualifications: 4 Year College Degree in Computer Science or Equivalent. 2-5 years experience with JAVA, J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in Maintenance and Development of multi-tier applications. Understanding of RESTful APIs and microservices platform 2-5 Years of experience with any of APM and other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, PagerDuty. Strong experience with product & development teams to establish error budgets by identifying the right SLOs (Service level objective), SLIs (Service level indicators), KPIs (Key performance indicators) and effectively drive the use of the budget to ensure maximum domain availability/uptime. Regularly review key site technical metrics such as transactions errors, logging, response times, caching strategies, conversion/bounce rates, capacity & resource utilization. Proactively identify stability risks & work with engineering leadership to establish appropriate mitigation plans Experience in solving complex architecture/design & business problems, work to simplify, optimize, remove bottlenecks, etc. Architect, design & develop automation to reduce toil, improve recoverability, availability, latency & scalability of supported applications with understanding of MTTD (Mean Time to Detection) & MTTR (Mean Time to Resolution) Maintain knowledge repository that includes Standard operating procedure, Release checklists, Runbooks for incident recovery Same Posting Description for Internal and External Candidates

Location: Chennai, IN

Posted Date: 4/1/2024
Click Here to Apply
View More Ford Business Solutions Jobs

Contact Information

Contact Human Resources
Ford Business Solutions

Posted

April 1, 2024
UID: 4498518815

InternJobs.com does not guarantee the validity or accuracy of the job information posted in this database. It is the job seeker's responsibility to independently review all posting companies, contracts and job offers.