Job Description
We are looking for a visionary Senior Generative AI Engineer to lead our next-generation AI initiatives. As we look toward the technological landscape of 2026, we are building the infrastructure that will define the future of human-machine interaction. You will be at the forefront of deploying state-of-the-art Large Language Models (LLMs) and multimodal systems.
In this role, you won't just maintain existing systems; you will architect the future of our product suite, ensuring our AI solutions are scalable, ethical, and transformative.
Why Join Us?
- Work with the latest in Transformer architectures and reinforcement learning.
- Competitive equity package and remote-first flexibility.
- Collaborate with world-class researchers and product designers.
Responsibilities
- Architect and deploy scalable LLM inference pipelines using cloud-native technologies (Kubernetes, AWS/GCP).
- Optimize model performance for latency, throughput, and cost efficiency in production environments.
- Conduct research and implementation of fine-tuning techniques (LoRA, QLoRA) for domain-specific applications.
- Integrate Retrieval-Augmented Generation (RAG) frameworks to enhance model accuracy and reduce hallucinations.
- Collaborate with cross-functional teams to translate business requirements into technical AI solutions.
- Ensure compliance with AI safety guidelines and data privacy regulations.
Qualifications
- 5+ years of experience in Machine Learning, Deep Learning, or Natural Language Processing.
- Expert proficiency in Python and frameworks such as PyTorch, TensorFlow, or JAX.
- Deep understanding of Transformer models, BERT, GPT, and Llama architectures.
- Experience with vector databases (Pinecone, Milvus) and embedding techniques.
- Strong background in MLOps, CI/CD pipelines, and containerization (Docker, Docker Compose).
- Excellent problem-solving skills and the ability to work in a fast-paced, agile environment.