Job Description
Join the vanguard of the intelligence revolution. Aetheris AI is seeking an elite Senior Machine Learning Engineer to architect the next generation of Large Language Model (LLM) infrastructures. You will work at the intersection of breakthrough research and scalable production, transforming complex neural architectures into world-class enterprise solutions.
As a key member of our core AI team, you will influence the technical roadmap of our generative platform. We value engineers who possess a deep intuition for model behavior, a passion for high-performance computing, and a relentless drive for optimization in a rapidly evolving landscape.
Responsibilities
- Design and implement state-of-the-art Generative AI models using advanced fine-tuning techniques such as LoRA and QLoRA.
- Architect scalable Retrieval-Augmented Generation (RAG) systems to enhance model accuracy and contextual relevance.
- Optimize inference latency and throughput for large-scale deployments on distributed GPU clusters.
- Lead the integration of proprietary AI capabilities into production-ready enterprise software products.
- Conduct rigorous A/B testing and evaluation of model performance across diverse industry benchmarks.
- Stay at the forefront of AI research, translating academic breakthroughs into commercially viable applications.
Qualifications
- Master’s or PhD in Computer Science, Mathematics, or a related quantitative field with a focus on Deep Learning.
- 5+ years of professional experience in Machine Learning, with a significant focus on Transformers and NLP.
- Expertise in Python and deep learning frameworks including PyTorch, JAX, or TensorFlow.
- Proven track record of deploying Large Language Models (LLMs) in high-traffic production environments.
- Hands-on experience with vector databases such as Pinecone, Weaviate, or Milvus.
- Strong background in distributed training and performance profiling on NVIDIA hardware.