Large Language Model Machine Learning Engineer – Apple

Apply Now

Job Description

We are looking for a strong ML applied scientists and engineers to build ground breaking AI infrastructures to power the infrastructures that Apple in-house ML experts use everyday to optimize models shipped on devices and servers for Apple Intelligence. We are part of a collaborative group of software developers and deep learning authorities working in the area of neural network optimization, on device inference, and model evaluation. You will work with world-class talents in visualization, LLM training, on-device optimization, ML tools/platforms. You will develop reliable and scalable web services for ML developers: e.g., model optimization pipeline, effective ML dev workflow, infrastructure to serve internal service.


  • Experience developing/optimizing/training large language models (LLMs), or large computer vision models, or generative AI models.
  • Software engineering skills in Python and general purpose system admin and infrastructure management abilities.
  • History of applied research in neural network model life cycle or training or a related area application.
  • Track record to drive scientific investigations and experiments and overcome obstacles and uncertainty in a research environment.
  • BS degree and 3+ years of proven experience.


  • Publication record at top AI/ML venues
  • Experience with LLM LoRA fine-tuning, neural network optimization (e.g. quantization and compression)
  • Experience with on-device/server scale deployment
  • Experience with languages like C/C++
  • Infrastructure management and debugging experience
  • Experimental rigor when training/evaluating LLMs for the purpose of benchmarking LLM optimization algorithms
  • Strong communication and accountability skills; hard-working, strong work ethic, and collaboration abilities
  • MS or PHD in related field