TheThreeAcross

Azure Production Support Engineer - Incident Management

Job Location

bangalore, India

Job Description

Notice Period :- Must be 15 days official and 1 month official Job Description : Azure Production Support Engineer (L2/L3) Location : Bangalore Experience : 6 to 10 years Job Type : Full-Time Role Overview : We are seeking an experienced Azure Production Support Engineer (L2/L3) to ensure the stability, reliability, and performance of critical applications hosted on Azure Cloud. The ideal candidate will have strong expertise in Azure services, Linux environments, Terraform, scripting, Azure Service Bus, and GitHub while handling incident management, troubleshooting, and automation to improve system efficiency. Key Responsibilities : L2/L3 Production Support & Incident Management : - Provide L2 and L3 support for applications running on Azure. - Troubleshoot and resolve cloud infrastructure, application, and middleware issues. - Perform root cause analysis (RCA) for recurring incidents and implement permanent fixes. - Handle major incidents (P1/P2), escalations, and coordinate with cross-functional teams for resolution. - Ensure adherence to SLAs and SLOs for incident response and resolution times. - Participate in on-call rotation and provide support for critical production issues. Cloud & Infrastructure Management (Azure, Linux, Terraform) : - Manage and monitor Azure resources (VMs, Storage, Networking, Load Balancers, Service Bus). - Automate infrastructure provisioning using Terraform and Azure ARM templates. - Perform Azure cost optimization by analyzing resource utilization. - Implement high availability, failover, and backup strategies for production workloads. Scripting & Automation (Shell, PowerShell, Python, Terraform) : - Develop automation scripts for routine tasks, deployments, and monitoring using Shell, PowerShell, or Python. - Improve operational efficiency by automating manual processes (e.g., log analysis, alert handling, cloud resource provisioning). - Create Terraform modules for infrastructure as code (IaC) deployment. Monitoring, Logging & Performance Optimization : - Set up and configure monitoring tools (Azure Monitor, Application Insights, Prometheus, Grafana, Splunk, Kibana) to track application health. - Analyze logs, metrics, and alerts to proactively detect and resolve performance bottlenecks. - Conduct capacity planning, tuning, and optimization of Azure services. DevOps & CI/CD (GitHub, Azure DevOps, Jenkins) : - Support CI/CD pipelines using GitHub Actions, Azure DevOps, Jenkins. - Manage source control (GitHub), branching strategies, and deployments. - Work closely with development teams to ensure seamless deployments and rollback strategies. Service Bus & Middleware Support : - Manage Azure Service Bus for message queuing and event-driven architecture. - Troubleshoot issues related to service bus messaging, latency, and integration failures. - Ensure proper message processing between microservices, APIs, and event-driven workflows. Required Skills & Qualifications : Technical Expertise : - Azure Cloud Services (VMs, Networking, Storage, Load Balancers, Service Bus, Key Vault). - Linux OS Administration (troubleshooting, scripting, user management). - Terraform (IaC deployment, modules, state management). - Scripting (Shell, PowerShell, Python). - Azure Service Bus (message queues, event-driven processing). - GitHub (source control, branching, CI/CD pipelines). - Monitoring Tools (Azure Monitor, Prometheus, Grafana, Splunk, Kibana). - ITIL Process (incident, change, and problem management). Soft Skills : - Strong problem-solving and analytical thinking. - Ability to work under pressure in a 24x7 production support environment. - Good communication skills to coordinate with teams and stakeholders. - Experience in collaborating with DevOps and Development teams. Preferred Qualifications (Good to Have) : - Azure Certification (AZ-104: Azure Administrator, AZ-400: DevOps Engineer). - Experience with Docker/Kubernetes for container orchestration. - Exposure to ITSM tools (ServiceNow, Jira, Remedy). - Knowledge of other cloud platforms (AWS, GCP) is a plus. (ref:hirist.tech)

Location: bangalore, IN

Posted Date: 4/20/2025
View More TheThreeAcross Jobs

Contact Information

Contact Human Resources
TheThreeAcross

Posted

April 20, 2025
UID: 5101985262

AboutJobs.com does not guarantee the validity or accuracy of the job information posted in this database. It is the job seeker's responsibility to independently review all posting companies, contracts and job offers.