Engineer, Senior - Machine Learning, Embedded, C++
Qualcomm
All India, Hyderabad • 1 month ago
Experience: 1 to 6 Yrs
Job Description
As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world-class products that meet and exceed customer needs. You will collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces.
**Responsibilities:**
- Spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs
- Push the limits of performance from large models as an AI inferencing expert
- Deploy large C/C++ software stacks using best practices
- Stay updated on GenAI advancements, understanding LLMs/Transformers and edge-based GenAI deployment nuances
- Drive AI's evolution at the edge with passion
**Requirements:**
- Master's or Bachelor's degree in computer science or equivalent
- 6+ years of relevant work experience in software development
- Strong understanding of Generative AI models (LLMs, LVMs, LMMs) and their building blocks (self-attention, cross-attention, KV caching, etc.)
- Knowledge of floating-point and fixed-point representations and of quantization concepts
- Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU)
- Proficiency in C/C++ programming, Design Patterns, and OS concepts
- Good scripting skills in Python
- Excellent analytical and debugging skills
- Good communication skills (verbal, presentation, written)
- Ability to collaborate across a globally diverse team and multiple interests
**Preferred Qualifications:**
- Understanding of SIMD processor architecture and system design
- Proficiency in object-oriented software development
- Familiarity with Linux and Windows environments
- Background in kernel development for SIMD architectures
- Knowledge of frameworks like llama.cpp, MLX, and MLC
- Familiarity with PyTorch, TFLite, and ONNX Runtime
- Experience with parallel computing systems and languages like OpenCL and CUDA
Join the exciting Generative AI team at Qualcomm focused on integrating cutting-edge GenAI models on Qualcomm chipsets. The team utilizes Qualcomm chips' extensive heterogeneous computing capabilities to enable inference of GenAI models on-device without the need for a connection to the cloud. The inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at high speeds while consuming minimal power. Utilize the power-efficient hardware and software stack to run Large Language Models (LLMs) and Large Vision Models (LVMs) at near-GPU speeds.
Skills Required
C
C++
Java
Python
LVM
Transformers
Design Patterns
analytical skills
communication skills
OpenCL
CUDA
Generative AI models
LLM
LLMs
self-attention
cross-attention
KV caching
Floating-point
Fixed-point representations
Quantization concepts
optimizing algorithms for AI hardware accelerators
OS concepts
Python scripting
debugging skills
SIMD processor architecture
object-oriented software development
Linux environment
Windows environment
kernel development for SIMD architectures
llama.cpp
MLX
MLC
PyTorch
TFLite
ONNX Runtime
parallel computing systems
Posted on: April 1, 2026