Job Description
Are you ready to architect the future of intelligence? At NexusMind AI Labs, we are pushing the boundaries of generative models and neural architecture search. We are seeking a visionary Senior AI Engineer to join our core research and development team in San Francisco. You will work on cutting-edge production-grade LLMs, optimizing inference performance, and integrating autonomous agents into enterprise workflows.
We offer a world-class environment, competitive equity packages, and the chance to solve problems that define the next generation of computing.
Responsibilities
- Design, train, and deploy large-scale transformer models for diverse enterprise applications.
- Optimize neural network architectures for low-latency, high-throughput inference environments.
- Collaborate with cross-functional teams to integrate generative AI features into existing core products.
- Implement MLOps best practices, including automated data pipelines, versioning, and monitoring.
- Conduct cutting-edge research into emergent AI behaviors to maintain our competitive edge.
- Mentor junior machine learning engineers through code reviews and collaborative research initiatives.
- Translate complex business requirements into scalable technical roadmaps for AI adoption.
Qualifications
- Master’s or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of hands-on experience in Deep Learning frameworks such as PyTorch or TensorFlow.
- Proven track record of deploying Large Language Models (LLMs) in a production environment.
- Expertise in Python, C++, and cloud infrastructure (AWS/GCP) for distributed training.
- Deep understanding of attention mechanisms, vector databases, and RAG architectures.
- Strong background in data structures, algorithms, and system design for distributed computing.
- Excellent communication skills with the ability to bridge the gap between technical and non-technical stakeholders.