Job Description
Nexus AI Labs is at the forefront of generative AI research, building scalable infrastructure for the next generation of intelligent agents. We are seeking a visionary Senior Machine Learning Engineer to join our core research and engineering team in San Francisco. You will bridge the gap between theoretical model research and production-grade implementation, impacting millions of users globally.
We offer a collaborative, fast-paced environment where innovation is the standard. If you are passionate about pushing the boundaries of what is possible in artificial intelligence, we want to meet you.
Responsibilities
- Architect and implement high-performance machine learning pipelines for large-scale training.
- Collaborate with research scientists to deploy novel architectures into production environments.
- Optimize neural network latency and throughput on cloud-based GPU clusters.
- Monitor and evaluate model metrics, conducting deep-dive analysis to resolve model drift and performance regressions.
- Mentor junior engineers and champion best practices in ML engineering and version control.
- Develop robust infrastructure for data preprocessing, model versioning, and automated testing.
Qualifications
- Master’s or Ph.D. in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of professional experience in deep learning, MLOps, or large-scale distributed systems.
- Expertise in Python and at least one high-performance language (e.g., C++ or Rust).
- Deep technical proficiency with at least one of PyTorch, TensorFlow, or JAX.
- Proven track record of deploying ML models in a high-traffic production environment.
- Strong understanding of GPU acceleration, distributed training techniques, and cloud infrastructure (AWS/GCP).
- Excellent communication skills with the ability to explain complex technical concepts to cross-functional stakeholders.