Job Description
At NexusMind, we are pushing the boundaries of generative AI and autonomous agents. We are seeking a world-class AI Engineer to join our core research and development team in San Francisco. You will work on cutting-edge transformer architectures and scalable inference engines, helping to solve complex real-world problems for global enterprises. If you thrive in a fast-paced environment and have a passion for building models that scale, we want to meet you.
Responsibilities
- Architect and implement scalable machine learning models using PyTorch or TensorFlow.
- Optimize large-scale neural networks for inference efficiency and reduced latency.
- Collaborate with product and data teams to translate business requirements into sophisticated AI solutions.
- Design and maintain data pipelines for high-velocity training datasets.
- Lead code reviews and contribute to internal best practices for MLOps.
- Stay at the forefront of AI research by evaluating new methodologies and frameworks.
- Mentor junior engineers and foster a culture of technical excellence.
Qualifications
- M.S. or Ph.D. in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of industry experience deploying production-grade machine learning systems.
- Expert-level proficiency in Python and C++.
- Deep understanding of LLMs, attention mechanisms, and fine-tuning techniques.
- Experience with cloud infrastructure (AWS/GCP) and containerization tools like Docker/Kubernetes.
- Strong background in distributed computing and GPU optimization.
- Excellent analytical skills and ability to solve ambiguous, complex problems.