Job Description
At NexusMind AI Systems, we are pushing the boundaries of generative models and autonomous agent frameworks. We are seeking a visionary Senior AI Engineer to join our core research and development team in San Francisco. You will be instrumental in architecting scalable machine learning pipelines and deploying state-of-the-art LLMs into production environments that impact millions of users globally.
We offer a collaborative, fast-paced environment where innovation is the currency and technical excellence is the baseline. If you are passionate about building the future of intelligence, we want to hear from you.
Responsibilities
- Design, train, and fine-tune large-scale transformer-based models for enterprise applications.
- Collaborate with cross-functional teams to integrate AI models into high-availability cloud infrastructure.
- Optimize model inference latency and throughput for real-time performance.
- Develop robust evaluation frameworks to measure model accuracy, bias, and performance metrics.
- Mentor junior engineers and promote best practices in MLOps and scalable software architecture.
- Participate in end-to-end product lifecycle, from initial research to production deployment and monitoring.
- Contribute to internal research papers and present findings at top-tier industry conferences.
Qualifications
- Master’s or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of hands-on experience building and deploying machine learning models in production environments.
- Advanced proficiency in Python and deep learning frameworks like PyTorch or JAX.
- Deep understanding of LLM architectures, fine-tuning techniques (LoRA, P-Tuning), and RAG pipelines.
- Experience with distributed training paradigms and cloud-based AI infrastructure (AWS/GCP).
- Strong background in data structures, algorithms, and software engineering best practices.
- Excellent communication skills with the ability to translate complex AI concepts into actionable business strategies.