Job Description
We are looking for a strong ML applied scientists and engineers to build ground breaking AI infrastructures to power the infrastructures that Apple in-house ML experts use everyday to optimize models shipped on devices and servers for Apple Intelligence. We are part of a collaborative group of software developers and deep learning authorities working in the area of neural network optimization, on device inference, and model evaluation. You will work with world-class talents in visualization, LLM training, on-device optimization, ML tools/platforms. You will develop reliable and scalable web services for ML developers: e.g., model optimization pipeline, effective ML dev workflow, infrastructure to serve internal service.
- Experience developing/optimizing/training large language models (LLMs), or large computer vision models, or generative AI models.
- Software engineering skills in Python and general purpose system admin and infrastructure management abilities.
- History of applied research in neural network model life cycle or training or a related area application.
- Track record to drive scientific investigations and experiments and overcome obstacles and uncertainty in a research environment.
- BS degree and 3+ years of proven experience.
- Publication record at top AI/ML venues
- Experience with LLM LoRA fine-tuning, neural network optimization (e.g. quantization and compression)
- Experience with on-device/server scale deployment
- Experience with languages like C/C++
- Infrastructure management and debugging experience
- Experimental rigor when training/evaluating LLMs for the purpose of benchmarking LLM optimization algorithms
- Strong communication and accountability skills; hard-working, strong work ethic, and collaboration abilities
- MS or PHD in related field