Build the platform layer of our inference cloud — the OpenAI-compatible API surface, the request path, the streaming infrastructure, and the benchmarking harnesses that keep us honest. The runtime on our ASIC sits below you and is owned by our hardware partner today; your job is everything between an OpenRouter request landing on our edge and a token streaming back to the user.
This is a senior IC role on a small team. You'll own surfaces, not tickets. As we take more of the runtime in-house over the next 6-8 months, the work moves closer to the metal but the first year is platform and serving, not kernels.