Job Description
Are you ready to push the boundaries of machine intelligence? NexusMind AI is seeking a high-caliber AI Research Engineer to join our core infrastructure team in San Francisco. You will be instrumental in designing and deploying generative models that solve complex real-world problems at scale. We offer a culture of deep technical rigor, radical transparency, and significant equity ownership.
Responsibilities
- Architect and train large-scale neural network models using PyTorch or JAX.
- Optimize transformer architectures for production-level inference efficiency.
- Collaborate with cross-functional teams to integrate generative AI capabilities into our core platform.
- Conduct research in reinforcement learning from human feedback (RLHF) and model fine-tuning.
- Maintain high code quality standards through rigorous peer review and architectural documentation.
- Mentor junior machine learning engineers on best practices for model optimization.
- Stay at the forefront of the AI research landscape by contributing to internal whitepapers and open-source projects.
Qualifications
- Master’s or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of professional experience in deep learning, specifically with Large Language Models (LLMs).
- Proficiency in Python and deep expertise in modern frameworks and libraries (PyTorch, Hugging Face, JAX).
- Solid understanding of distributed systems, GPU cluster management, and CUDA optimization.
- Proven track record of deploying scalable AI models in a production cloud environment (AWS/GCP).
- Strong mathematical foundation in linear algebra, probability, and optimization theory.
- Exceptional problem-solving skills and the ability to navigate ambiguity in a fast-paced environment.