Job Description
We are on the cusp of a technological revolution. At Nexus Horizon, we are not just building software; we are architecting the intelligence that will define the year 2026 and beyond. We are seeking a visionary Senior AI Infrastructure Engineer to join our elite technical team in San Francisco.
In this pivotal role, you will bridge the gap between cutting-edge research and scalable production systems. You will be responsible for designing, deploying, and maintaining the robust infrastructure that powers our next-generation Large Language Models (LLMs) and autonomous agents. If you are passionate about the future of Artificial General Intelligence and want to build the backbone of tomorrow’s technology, we want to hear from you.
Why Join Us?
- Impact: Work on projects that will shape the industry for the next decade.
- Equity: Competitive stock options in a high-growth unicorn.
- Environment: Collaborate with world-class engineers and researchers in a state-of-the-art office.
Responsibilities
- Architect and maintain high-availability, low-latency inference pipelines for LLMs.
- Implement and optimize MLOps strategies using Kubernetes, Docker, and cloud-native services (AWS/GCP).
- Design data pipelines to support continuous training and fine-tuning workflows.
- Collaborate with ML researchers to translate experimental models into scalable production software.
- Ensure system security, data privacy, and compliance with industry standards.
- Monitor system performance and drive optimization initiatives to reduce inference costs.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Physics, or a related technical field.
- 5+ years of experience in software engineering, with a strong focus on backend infrastructure.
- Deep experience with Python, PyTorch, and TensorFlow.
- Expertise in containerization and orchestration (Docker, Kubernetes) and CI/CD pipelines.
- Experience with distributed systems, message queues, and high-scale data processing.
- Strong understanding of cloud architectures and serverless computing.
- Excellent problem-solving skills and the ability to thrive in a fast-paced, dynamic environment.