Infrastructure & Systems Lead
We're building a distributed system to train autonomous robots at unprecedented scale. We need someone to architect and own the entire pipeline: thousands of robots with embedded GPUs, communicating over Wi-Fi with inference clusters, streaming tens of petabytes of video data to training clusters, with new model weights deployed every few minutes, all operating within strict real-time latency budgets.
You might be a good fit if you've worked on:
- Designing and operating large-scale distributed systems from scratch
- Managing fleets of hundreds or thousands of computers
- GPU performance optimization and writing custom CUDA kernels
- Real-time embedded systems or robotics infrastructure
- Petabyte-scale storage and database systems
- High-performance networking and video encoding pipelines
This role requires someone who can hold an entire system in their head and optimize it end-to-end. You'll need familiarity with networking, storage, databases, embedded software, deployment systems, and GPU optimization. Rust and performance optimization experience are highly valued.
We care much more about what you've built than about your credentials. We're a small, fast-moving team working together in person in San Francisco. If you're excited about architecting novel systems at unprecedented scale, we'd love to talk. To apply, email hiring@pantograph.com with "Infrastructure & Systems Lead" in the subject line, and include your CV and a description of the most complex infrastructure system you've designed and operated.