Talent Trace Consultancy
Site Reliability Engineer - Disaster Recovery
Job Location
in, India
Job Description
Role : SRE Disaster Recovery for a MNC Company Experience : 5-10 Years CTC : 22 LPA Job Location : Bangalore -Hybrid Job Description : We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist. The ideal candidate will be responsible for designing, implementing, and maintaining systems and processes that ensure the reliability, scalability, and disaster resilience of our infrastructure and applications. This role requires a good understanding of both SRE/DevOps practices and Disaster Recovery strategies. - Responsible for managing all activities related to disaster recovery program, to ensure that IT systems are able to recover in the event of a disaster and perform DR testing exercises in AWS environment - Planning, design, documentation, and testing of disaster recovery solutions to meet business or technology requirements. - Implement and manage monitoring, logging, and alerting solutions to ensure system health and performance. - Automate repetitive tasks to improve efficiency and reduce manual intervention in Disaster recovery - Drive continuous improvement initiatives to enhance the effectiveness and efficiency of disaster recovery processes and technologies. - Develop, implement, and maintain disaster recovery plans to ensure the quick restoration of critical systems and data in the event of a disaster. - Conduct regular DR drills and simulations to test the effectiveness of the disaster recovery plan and identify areas for improvement. - Collaborate with stakeholders to identify critical systems and data that require protection under the DR plan. - Manage backup solutions and ensure that data is regularly backed up and can be quickly restored. - Understanding of DevOps principles of continuous delivery, deployment, and improvement - Good knowledge of DevOps Platform tooling (Git based repo, Jenkins, Docker, Kubernetes) - Maintain and enhance infrastructure-as-code (IaC) practices using tools like Terraform, Ansible, or CloudFormation. - Manage containerization and orchestration platforms such as Docker and Kubernetes. Requirements Required Qualifications : - Bachelor's degree in computer science, Information Technology, or related field, or equivalent experience. - 8 years of experience in SRE role with expertise in Disaster Recovery. - Proven experience in disaster recovery planning and implementation. - Strong knowledge of cloud platforms (e.g., AWS, Google Cloud). - Proficiency in scripting languages (e.g., Python, Bash). - Familiarity with CI/CD tools (e.g., Jenkins, GitHub). - Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes). - Strong understanding of networking, security, and infrastructure best practices. - Excellent problem-solving skills and the ability to work under pressure. Preferred Qualifications : - Certifications in cloud platforms (e.g., AWS Certified Solutions Architect). - Experience with infrastructure-as-code tools (e.g., Terraform, Ansible). - Experience with monitoring and logging tools (e.g., Dynatrace, AWS OpenSearch, Site24x7). - Knowledge of database management and DR solutions for databases. - Strong communication and collaboration skills. (ref:hirist.tech)
Location: in, IN
Posted Date: 11/27/2024
Location: in, IN
Posted Date: 11/27/2024
Contact Information
Contact | Human Resources Talent Trace Consultancy |
---|