Job Description
Are you ready to shape the future of machine intelligence? NeuralEdge Dynamics is at the forefront of generative AI and autonomous agent research. We are seeking a visionary Senior AI Research Engineer to join our core architecture team in the heart of San Francisco.
You will work on cutting-edge transformer models and high-throughput training pipelines, collaborating with top-tier researchers to push the boundaries of what is possible in large language model reasoning and multimodal synthesis.
Responsibilities
- Design and train state-of-the-art transformer architectures for large-scale multimodal applications.
- Optimize neural network training pipelines for distributed GPU clusters (NVIDIA H100/A100).
- Implement novel loss functions and optimization strategies to improve model convergence and reasoning capabilities.
- Collaborate with cross-functional teams to integrate AI models into production-grade consumer software.
- Conduct deep-dive research into model interpretability and alignment (RLHF/DPO).
- Publish high-impact research findings and represent the company at international AI conferences (NeurIPS, ICML).
- Mentor junior machine learning engineers on best practices for model deployment and evaluation.
Qualifications
- Ph.D. or Master's degree in Computer Science, Artificial Intelligence, or a related quantitative field.
- 5+ years of hands-on experience in training large-scale deep learning models.
- Fluency in PyTorch and JAX; deep understanding of GPU memory management.
- Strong background in algorithmic design, numerical optimization, and linear algebra.
- Experience with distributed training frameworks like DeepSpeed or Megatron-LM.
- Proven track record of publications in Tier-1 AI conferences.
- Excellent communication skills with the ability to explain complex concepts to non-technical stakeholders.