ZettaMine Labs Pvt Ltd
Server Management Administrator - Linux Infrastructure
Job Location
hyderabad, India
Job Description
Job Description : As a Server Management Administrator (Linux) at ZettaMine Labs, you will be responsible for the comprehensive management, maintenance, and support of Linux-based server infrastructure across a portfolio of client projects. You will utilize your extensive experience in Linux administration, virtualization technologies, networking fundamentals, cloud platforms, and automation tools to ensure the high availability, performance, and security of our clients' server environments. This role requires a strong technical foundation, excellent problem-solving abilities, and the capacity to manage multiple tasks effectively in a fast-paced environment. Responsibilities : - Provide expert-level administration, configuration, and troubleshooting for a variety of Linux distributions including RHEL, SUSE, Ubuntu, and CentOS. - Implement and manage user and group accounts, permissions, and security policies. - Perform advanced system tuning and optimization for performance and stability. - Manage and troubleshoot file systems, storage solutions (local and network), and logical volume management (LVM). - Implement and manage system security hardening based on industry best practices and client requirements. - Plan, execute, and document the installation, configuration, and deployment of new Linux servers (physical and virtual). - Perform routine server maintenance tasks, including patching, upgrades, and configuration changes. - - Monitor server health and performance using various tools and proactively address potential issues. - Manage the decommissioning and secure disposal of end-of-life servers. - Demonstrate extensive hands-on experience in managing and troubleshooting virtualized environments using VMware (vSphere, ESXi), XEN, or other relevant hypervisors. - Provision, clone, migrate, and manage virtual machines efficiently. - Monitor and optimize the performance of virtualized infrastructure. - Troubleshoot issues related to virtual networking and storage. - Possess a strong understanding of TCP/IP networking, DNS, DHCP, routing, VLANs, and load balancing concepts. - Configure and manage software firewalls (iptables, firewalld) and implement network security policies. - - Troubleshoot network connectivity issues at the server level and collaborate with network engineers as needed. - Implement and manage VPN connections and other secure communication protocols. - Demonstrate practical experience in deploying, managing, and troubleshooting Linux server infrastructure on major cloud platforms such as AWS, Azure, and/or GCP. - Utilize cloud-native services for server management, monitoring, and security. - Implement and manage infrastructure-as-code (IaC) using automation tools like Ansible, Chef, Terraform, SaltStack, or similar. - Develop and maintain automation scripts for provisioning, configuration management, and routine tasks. - - Exhibit advanced proficiency in scripting languages such as Bash, Perl, and/or Python for automating complex system administration tasks, creating sophisticated monitoring scripts, and managing infrastructure configurations at scale. - Develop custom tools and scripts to enhance server management capabilities. - Design, implement, and manage comprehensive monitoring solutions using tools like Prometheus, Nagios, Zabbix, Grafana, or similar. - - Configure alerts and notifications for critical system events and performance thresholds. - - Implement and manage centralized logging solutions using the ELK stack (Elasticsearch, Logstash, Kibana) or similar tools for log analysis and troubleshooting. - Utilize GitHub for version control of scripts and configurations. - Manage incidents, problems, and changes effectively using Jira and adhering to ITIL-based processes. - Participate in root cause analysis and contribute to knowledge base articles. - - Lead troubleshooting efforts for complex server-related incidents, identifying root causes and implementing effective and timely resolutions. - Manage incident escalations and communicate effectively with stakeholders. - Possess excellent verbal and written communication skills to interact effectively with technical and non-technical stakeholders, including application developers, database administrators, network engineers, project managers, and client representatives. - - Provide clear and concise technical documentation, including runbooks and standard operating procedures. Required Skills : - 7 Years of deep, hands-on experience administering various Linux distributions (RHEL, SUSE, Ubuntu, CentOS). - Proven ability to manage servers through their entire lifecycle. - Extensive hands-on knowledge of VMware, XEN, and physical server management, including troubleshooting complex virtualization issues. - Strong understanding and practical application of networking concepts, firewall management, and security protocols. - Significant experience with at least one major cloud platform (AWS/Azure/GCP) and deep proficiency in using automation tools (Ansible, Chef, Terraform, etc.). - Expert-level proficiency in Bash, Perl, or Python scripting for complex automation tasks. - Proven experience implementing and utilizing monitoring tools (Prometheus, Nagios, ELK stack) for proactive issue detection and analysis. - Familiarity and practical experience with GitHub, Jira, and ITIL-based incident/change management processes. (ref:hirist.tech)
Location: hyderabad, IN
Posted Date: 4/19/2025
Location: hyderabad, IN
Posted Date: 4/19/2025
Contact Information
Contact | Human Resources ZettaMine Labs Pvt Ltd |
---|