Job Description
Are you ready to redefine the boundaries of what machine intelligence can achieve? Aetheris AI Labs is seeking a visionary Senior AI Research Scientist to join our core R&D team in the heart of San Francisco. In this role, you won't just follow the roadmap; you will build it. We are looking for an exceptional mind to spearhead the development of next-generation Large Language Models (LLMs) and multi-modal generative systems that solve complex, real-world problems.
We provide a high-octane environment with access to massive compute clusters, proprietary datasets, and a culture that prioritizes breakthrough innovation over incremental gains. If you are passionate about neural architecture search, reinforcement learning from human feedback (RLHF), and scaling laws, Aetheris is your new home.
Responsibilities
- Lead the architectural design and end-to-end training of large-scale transformer models and generative agents.
- Optimize model performance for both high-throughput inference and low-latency edge deployment.
- Publish original research at top-tier conferences such as NeurIPS, ICML, and ICLR to maintain our industry leadership.
- Collaborate with infrastructure engineers to scale distributed training across thousands of GPUs using DeepSpeed and Megatron-LM.
- Develop novel techniques for fine-tuning, prompt engineering, and model alignment to ensure safety and reliability.
- Mentor junior researchers and contribute to a world-class engineering culture driven by technical excellence.
Qualifications
- Ph.D. in Computer Science, Artificial Intelligence, Mathematics, or another highly quantitative field.
- Minimum of 5 years of industry experience developing and deploying deep learning models at scale.
- Expertise in Python and deep learning frameworks, specifically PyTorch or JAX.
- Proven track record of success in LLM pre-training, instruction tuning, or reinforcement learning.
- Deep understanding of attention mechanisms, vector databases, and neural scaling laws.
- Strong publication record or a portfolio of significant open-source contributions to the AI community.
- Excellent communication skills with the ability to translate complex technical concepts into actionable product strategies.