We raised $15M to build the world's fastest neocloud.Read
Read the whitepaper

Inference at the speed of light

Up to 1,000 tokens/second by ditching the real bottleneck: GPUs

Up to 16.1x faster

OpenCode running the identical task on both sides. A real coding session at measured operating points.

OpenCodeGPU cloud today
110 tok/s/user
loop 1 of 32 · 0 tokensstreaming…
OpenCodeGeneral Compute
2,000 tok/s/user
loop 1 of 32 · 0 tokensstreaming…
Sign up with your agent

Your agent can sign up for you.

Hand this prompt to any autonomous coding agent. It handles the whole signup flow and comes back with a key — no dashboard, no forms.

CodexCodexClaude CodeClaude CodeOpenCodeOpenCodeCursorCursor
agent · signup

Sign me up for a General Compute API account and get an API key. Fetch instructions from https://docs.generalcompute.com/agent-signup and follow them.

Benchmarks

Verified benchmarks

GPT-OSS-120B on General Compute against the GPU cloud baseline — same model, same prompts, measured head to head.

See the full methodology and every model →
0.0x

Faster time to first token

0.0x

Higher output throughput

0.0x

Lower end-to-end latency

0 tok/s

Output on GPT-OSS-120B

Switch in 30 seconds.
No GPU required.

OpenAI-compatible API. Change your base URL, swap your key, and you're running on ASIC infrastructure. Your existing code doesn't change.

View Docs
main.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.generalcompute.com",
    api_key="your-api-key",
)

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)
$10 in free credit when you sign up

Stop paying the GPU tax.

Get your API key in seconds. OpenAI-compatible — just change your base URL. $10 free credit to see the difference yourself.

ModeHumanAgent
General Compute — World's Fastest AI Inference