Sovereign, OpenAI-compatible inference — served from owned Apple-Silicon hardware.
Drop-in compatible with the OpenAI SDK. Pay per token in fiat or Lightning. No lock-in, no data resale, self-custodied.
/v1/chat/completions, served from Llama-3.2-3B-Instruct-4bit./v1/embeddings for search, RAG, clustering./v1/documents + /v1/search: private per-key
vector store with semantic retrieval. No infra to run./v1/briefing: live signals + an analyst narrative.curl https://api.vellaquant.com/v1/chat/completions \
-H "Authorization: Bearer $VELLA_KEY" \
-H "Content-Type: application/json" \
-d '{"messages":[{"role":"user","content":"Hello"}]}'
curl https://api.vellaquant.com/v1/embeddings \
-H "Authorization: Bearer $VELLA_KEY" \
-d '{"input":"text to embed"}'
Top up an API key (card via Stripe, or a Lightning invoice):
curl https://api.vellaquant.com/v1/checkout \
-H "Content-Type: application/json" \
-d '{"api_key":"vq-…","amount_usd":10}'