← All posts
Vidai 11 January 2026

Hello World

Why we’re here

Most "Hello World" examples in the AI space focus on the prompt. Ours focuses on the proxy. We are building at the intersection of performance and privacy, focusing on four core pillars:

  • Rust for the Runtime: Memory safety shouldn't come at the cost of a garbage collector’s pause. We use Rust to ensure that our overhead is measured in microseconds, not milliseconds.

  • The LLM Gateway: Orchestrating models shouldn't be the bottleneck of your stack. We are building the "Air Traffic Control" for tokens routing, retrying, and load-balancing with zero-copy efficiency.

  • ONNX at the Edge: To solve for latency, we move the logic closer to the data. By running guardrails via ONNX in edge runtimes, we catch "jailbreaks" and PII leaks before the request ever leaves the secure perimeter.

  • Data Sovereignty: In the age of black-box APIs, knowing where your data lives and who sees it is non-negotiable. We believe you should own your inference pipeline from the load balancer to the weights.

A Taste of the Stack

To us, "Hello World" isn't just a string, it's a packet that needs to be validated, scrubbed of PII, and routed to the most cost-effective model without jitter.

// A glimpse into our middleware logic
pub async fn process_request(req: Request) -> Result<Response, Error> {
    // 1. Run local ONNX guardrails (Latency < 2ms)
    // 2. Encrypt/Scrub for Data Sovereignty 
    // 3. Proxy to model via LLM Gateway
    let response = gateway.route(req).await?;
    Ok(response)
}

See it on a real deployment.

A 20-minute technical walkthrough on infrastructure that looks like yours.