Wenger & Watson Inc.

Machine Learning Compiler Engineer - Deep Learning Models

Click Here to Apply

Job Location

mumbai, India

Job Description

Overview : - Development and support of new AI/ML compiler features/technologies to accelerate deep learning models. - Work closely with AI hardware accelerator teams and add support for compiler features covering optimization algorithms, code generation, etc. to fully utilize the hardware features for maximum efficiency. - Be well acquainted with the latest trends in ML models and compiler technologies to build innovative solutions in our products. Roles and responsibilities : - Develop end-to-end ML compiler leveraging standard compiler infrastructures like TVM, MLIR, Torch Dynamo/Inductor taking advantage of both intra-operator parallelism and graph/pipeline/dataflow parallelism while mapping to hardware compute/processing elements of custom AI accelerator - Implement low level parallel programming model for development/deployment of high performance kernels fully utilizing the hardware capabilities - Adapt advance techniques/algorithms for placement/scheduling and parallelization of model graphs to improve the performance of ML applications optimizing execution speed and resource utilization - Code generation leveraging LLVM for the AI accelerator compute elements/cores of targeted ISA with custom instructions - Debugging and profiling of features of compiler to identify issues and performance hot spots - Own responsibility throughout the product lifecycle in solving functional/performance issues during execution phase - Work closely with hardware architecture team for efficient HW/SW Co-design of AI accelerator IP and overall AI server-class SoC - Keep up-to-date with the industry trend in compiler frameworks in terms of feature advancements, design methods and approaches Qualifications : - Should have minimum 12 years of relevant experience in AI/ML - Should have deep practical experience of developing end-to-end ML compiler with any AI accelerator architecture (GPU/ASIC/many-core heterogenous) - Should have good experience with handling complex hierarchies of compute elements and memories of hardware accelerator and their mapping in backend compiler for efficient power/performance - Should have very good knowledge in any one of compiler frameworks like TVM, MLIR, Torch Dynamo/Inductor or equivalent - Good knowledge on SOTA Generative AI model architectures like LLM and model optimization techniques will be helpful - Good knowledge on popular ML framework ecosystems (PyTorch/TensorFlow/ONNX) - High proficiency in C/C++, Python, domain-specific languages and parallel programming languages like OpenCL/CUDA (ref:hirist.tech)

Location: mumbai, IN

Posted Date: 11/24/2024
Click Here to Apply
View More Wenger & Watson Inc. Jobs

Contact Information

Contact Human Resources
Wenger & Watson Inc.

Posted

November 24, 2024
UID: 4894731606

AboutJobs.com does not guarantee the validity or accuracy of the job information posted in this database. It is the job seeker's responsibility to independently review all posting companies, contracts and job offers.