AI Software Solutions Engineer (AI Frameworks, Workloads)

Location: Bengaluru, Karnataka, India
Job ID: JR0263211
Job Category: Software Engineering
Work Mode: Hybrid
Experience Level: Experienced
Full/Part Time: Full Time

Job Description

We are looking for a dynamic software engineer to design, develop, and optimize AI frameworks for training and inference on Intel Habana deep learning accelerators. In this role, you will work with a cross-geo team on enabling and optimizing state-of-the-art deep learning models, with a specific focus on the PyTorch framework. Your roles and responsibilities may include the following:

Design and develop SW techniques for AI frameworks, both HW-agnostic and HW-aware.

Contribute to enhancing and extending the training and inference capabilities in the software stack.

Profile deep learning inference and training workloads and identify optimization opportunities in the software stack.


BTech, MS, or PhD in CS or a related field, with 10 to 15 years of overall experience

Programming skills in advanced C++, Python, and parallel programming

Previous exposure to Machine Learning (ML) frameworks such as PyTorch and TensorFlow

Detailed understanding of machine learning systems optimization and deployment techniques such as quantization

Understanding of optimization strategies for deployment of Large Language Models (LLMs)

Knowledge of transformers and of inference optimization techniques such as KV cache, prefill buffer, etc.

Working knowledge of operators in PyTorch or TensorFlow and understanding of low-level kernels

Ability to debug complex issues in multi-layered SW systems. Understanding of SW integration across open-source frameworks and internal bridge layers.

Understanding of computer architecture and HW-SW optimization techniques

Practical knowledge of DL topologies for different use cases

Knowledge of compiler algorithms for heterogeneous systems

Experience working on frameworks/platforms that have gone to production

Effective communication skills and experience with working in a cross-geo setup

Preferred: knowledge of open-source compiler infrastructure such as LLVM or GCC

Inside this Business Group

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCAI delivers the products and technologies, spanning software, processors, storage, I/O, and networking solutions, that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.


We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock, and bonuses, as well as benefit programs that include health, retirement, and vacation. Find more information about all of our Amazing Benefits here.

It has come to our notice that some people have received fake job interview letters ostensibly issued by Intel, inviting them to attend interviews in Intel’s offices for various positions and further requiring them to deposit money to be eligible for the interviews. We wish to bring to your notice that these letters are not issued by Intel or any of its authorized representatives. Hiring at Intel is based purely on merit and Intel does not ask or require candidates to deposit any money. We would urge people interested in working for Intel, to apply directly at and not fall prey to unscrupulous elements.

Working Model

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. In certain circumstances the work model may change to accommodate business needs.