About the role
AI summarisedPrincipal Engineer, Machine Learning, SMAI role based on the published job description. Key responsibilities and requirements were extracted directly from the posting for quick review.
IDMFull-timeSmart MFG/AI
Key Responsibilities
- Qualified applicants will have experience in a variety of data and cloud technologies and have extensive practice modeling data, querying, and deploying scalable data pipelines to execute machine learning models and AI agents.
- You will collaborate with Data Scientists, Data Engineers, and expert users to build and deploy scalable AI/ML solutions that drive value and insight from Micron's manufacturing processes and systems.
- Architect and execute large-scale custom model training and fine-tuning jobs (SFT, RLHF) on multi-node, multi-GPU clusters.
- Design and develop autonomous AI Agents capable of multi-step reasoning, planning, and tool execution to automate complex manufacturing workflows.
- Build and maintain data/solution pipelines that feed machine learning models and GenAI applications.
Requirements
- Deep understanding of GPU architecture (memory hierarchy, tensor cores, interconnects like NVLink) and experience managing GPU resources in both cloud environments and on-prem.
- Hands-on experience with Distributed Data Parallel (DDP), Fully Sharded Data Parallel (FSDP), and model parallelism techniques.
- Proficiency in fine-tuning Large Language Models using PEFT techniques (LoRA, QLoRA) and optimizing inference engines (vLLM, TensorRT-LLM).
- Experience developing GenAI applications and AI Agents using frameworks like LangChain, LangGraph, LlamaIndex, or AutoGen.
- Proficiency with Large Language Models (LLMs), including prompt engineering, function calling/tool use, and Chain-of-Thought (CoT) reasoning.
- Experience in building and executing end-to-end ML systems automating training, testing and deploying Machine Learning models.
