PVAR Services
Senior DevOps Engineer - HPC & Cloud Infrastructure
Job Location
chennai, India
Job Description
Designation : Senior DevOps Engineer HPC & Cloud Infrastructure Location : Chennai, Required : 4 to 9 years Industry : Semiconductor Manufacturing Key Responsibilities : - Enhance the efficiency and scalability of HPC workloads in containerized infrastructures. - Continuously explore developments in HPC and cloud-native ecosystems. - Collaborate across DevOps and engineering teams to ensure HPC integrations run smoothly. - Configure and fine-tune Linux OS environments tailored for HPC. - Deploy and manage Kubernetes clusters supporting high-performance workloads. - Evaluate, benchmark, and optimize open-source cloud stacks to meet demanding compute requirements. - Architect scalable, high-performance systems leveraging CPU/GPU resources, reliable storage, and high speed networking. Preferred Candidate Profile : Attitude & Communication : - Curious mindset with a passion for complex problem-solving. - Strong verbal and written communication skills. - Comfortable working in a team-focused environment with a proactive approach. Core Technical Qualifications : - Solid understanding of HPC frameworks and cloud platforms (e.g., gRPC, Kafka, Kubernetes, ZeroMQ, Redis, Ceph). - Expertise in Linux performance optimization across distributions (SuSE, RedHat, Rocky, Ubuntu). - Skilled in container management and Kubernetes orchestration. - Familiar with remote boot protocols like System-D, PXE, Linux HA. - Good grasp of networking fundamentals, including TCP/IP, DNS, DHCP. - Comfortable with storage management and Linux-based networking. - Experience in scripting (Ansible, Python, Bash) and low-level programming (C). - Proficient in CI/CD tools such as Jenkins, GitLab, etc. - Understanding of HPC job schedulers like Slurm or PBS. - Background in configuration management tools (SaltStack, Chef, Puppet). - Strong analytical skills and meticulous attention to detail. Desirable Additions : - Exposure to tuning performance of CPU and GPU workloads. - Bachelors or Masters degree in Computer/Electrical Engineering or a related field. - Prior experience with performance tuning in distributed compute systems. Soft Skills & Competencies : - Strong interpersonal abilities with a team-first approach. - Excellent time management and task prioritization. - Multitasking capabilities with a calm and focused mindset under pressure. - Agile and responsive to change in a fast-paced environment. - Reliable communicator, both written and verbal, with an eye for collaborative growth. (ref:hirist.tech)
Location: chennai, IN
Posted Date: 4/20/2025
Location: chennai, IN
Posted Date: 4/20/2025
Contact Information
Contact | Human Resources PVAR Services |
---|