Job Description
Are you ready to architect the future? NexusAI Research Labs is at the forefront of generative AI and machine learning innovation. We are seeking a visionary Senior AI Engineer to join our core research team in the heart of San Francisco. You will bridge the gap between theoretical research and scalable, production-ready AI systems, pushing the boundaries of what's possible with Large Language Models.
Responsibilities
- Design and implement scalable machine learning pipelines for massive datasets.
- Fine-tune Large Language Models (LLMs) to enhance performance for specialized enterprise tasks.
- Collaborate with cross-functional teams to integrate generative AI features into our core product stack.
- Conduct cutting-edge research in neural network architectures and optimization techniques.
- Mentor junior engineers and promote best practices in MLOps and software craftsmanship.
- Optimize inference performance to ensure sub-millisecond latency for real-time applications.
Qualifications
- Master's or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of professional experience in machine learning, NLP, or deep learning.
- Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.
- Hands-on experience with transformer architectures and LLM orchestration (e.g., LangChain, LlamaIndex).
- Deep understanding of cloud infrastructure (AWS/GCP) and containerization tools like Docker and Kubernetes.
- Strong background in distributed computing and vector databases (e.g., Pinecone, Milvus).
- Track record of published research or significant contributions to open-source AI projects.