Job Description
We are seeking a visionary Senior AI/LLM Engineer to join our elite team in San Francisco. As we define the technology stack for 2026, you will be at the forefront of the Generative AI revolution, building scalable, intelligent systems that redefine user experiences.
In this high-impact role, you will architect and deploy state-of-the-art Large Language Models (LLMs), optimize neural network architectures, and integrate AI agents into complex enterprise solutions. You will work in a collaborative environment that encourages experimentation, innovation, and pushing the boundaries of artificial intelligence.
Responsibilities
- Architect & Deploy: Design and implement scalable machine learning pipelines and LLM inference services using Python, PyTorch, and cloud-native architectures (AWS/GCP).
- Model Optimization: Fine-tune and optimize pre-trained models (e.g., GPT-4, Llama 3) for specific domain applications to ensure high accuracy and low latency.
- Integration: Collaborate with frontend and backend engineers to seamlessly integrate AI capabilities into production web and mobile applications.
- R&D: Conduct research on emerging AI trends, including RAG (Retrieval-Augmented Generation) and multimodal models, to stay ahead of the 2026 technology curve.
- Maintenance: Monitor model performance, troubleshoot data pipelines, and ensure robust data governance and security standards are met.
Qualifications
- Education: Masterβs or PhD in Computer Science, Machine Learning, or a related quantitative field.
- Experience: 5+ years of professional experience in software engineering with a strong focus on AI/ML.
- Technical Proficiency: Deep knowledge of Python, C++, or Java; strong experience with PyTorch, TensorFlow, or JAX.
- AI Expertise: Proven track record of working with NLP, Transformers, Deep Learning, and vector databases.
- Communication: Ability to translate complex technical concepts into clear, actionable insights for non-technical stakeholders.