Home Job Details
A
Information Technology 🏢 Full Time ⭐️ Verified

Senior Machine Learning Engineer (Generative AI)

Aetheris AI Solutions
San Francisco
Salary Estimate
USD 220.000 – USD 310.000
Latest
Live Update
1 Juni 2026
Deadline
1 Jun 2027

Job Description

Join the vanguard of the intelligence revolution. Aetheris AI is seeking an elite Senior Machine Learning Engineer to architect the next generation of Large Language Model (LLM) infrastructures. You will work at the intersection of breakthrough research and scalable production, transforming complex neural architectures into world-class enterprise solutions.

As a key member of our core AI team, you will influence the technical roadmap of our generative platform. We value engineers who possess a deep intuition for model behavior, a passion for high-performance computing, and a relentless drive for optimization in a rapidly evolving landscape.

Responsibilities

  • Design and implement state-of-the-art Generative AI models using advanced fine-tuning techniques such as LoRA and QLoRA.
  • Architect scalable Retrieval-Augmented Generation (RAG) systems to enhance model accuracy and contextual relevance.
  • Optimize inference latency and throughput for large-scale deployments on distributed GPU clusters.
  • Lead the integration of proprietary AI capabilities into production-ready enterprise software products.
  • Conduct rigorous A/B testing and evaluation of model performance across diverse industry benchmarks.
  • Stay at the forefront of AI research, translating academic breakthroughs into commercially viable applications.

Qualifications

  • Master’s or PhD in Computer Science, Mathematics, or a related quantitative field with a focus on Deep Learning.
  • 5+ years of professional experience in Machine Learning, with a significant focus on Transformers and NLP.
  • Expertise in Python and deep learning frameworks including PyTorch, JAX, or TensorFlow.
  • Proven track record of deploying Large Language Models (LLMs) in high-traffic production environments.
  • Hands-on experience with vector databases such as Pinecone, Weaviate, or Milvus.
  • Strong background in distributed training and performance profiling on NVIDIA hardware.

Required Skills

Generative AI LLMs PyTorch Python RAG Kubernetes CUDA Transformers

Ready to Take on This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Job Openings

Job recommendations similiar to you

View All