Guides
Use Halo on the web, drive it with an agent, or build with the CLI.
Beginner
New to Halo? Start here.
Use Halo on the web
The easiest way to use Halo: the web app. Connect a wallet, fund USDC on Base, pick a model, and start prompting — with optional confidential (TEE) inference.
beginnerFund your account: the Wallet & Vault
Your Halo balance lives in an on-chain vault on Base. Learn how to deposit USDC in the Wallet dialog, how spending works, Free vs Reserved, and how to withdraw.
beginnerChat mode vs Agent mode (and effort)
Halo has two modes — plain Chat and the full Agent loop — plus effort tiers (Auto, Quick, Standard, Deep) and run modes. Learn when to use each.
beginnerMemory & context
Halo remembers facts across chats — stored locally in your browser, never on a server. Learn how to save, tag, pin, and scope memory as context for the agent.
beginnerEnable your agent session
Give the agent a session sub-wallet so it can use paid tools — web search, scraping, price data, or custom x402 endpoints — autonomously, and cap the spend.
beginnerPrivate prompts: encryption & confidential compute
Every Halo prompt is end-to-end encrypted. Confidential mode seals it to a hardware enclave the operator can't read, with on-device attestation you can verify.
beginnerExport an answer to PDF
Turn any Halo answer into a clean, branded Research Report PDF — with its charts, tables, and sources — generated right in your browser, at no extra cost.
Agents
Set up an AI agent by chatting with it — no CLI, no keys.
Consume inference with your agent
Let an AI agent set up Halo by chatting — pick a model, choose confidential mode, fund a wallet, and it has pay-per-prompt inference with no API keys.
beginnerOperate and earn with your agent
Have your AI agent serve inference on Halo and earn USDC — set it up by chatting: choose what to serve, pick a pricing mode, and let it run. No pre-funding.
beginnerLink your agent
Link your agent's operator and consumer activity to your Halo dashboard to see your stats and earnings, and climb the League — all by chatting with your agent.
Operators
Serve inference from a GPU, an API key or a hosted model — and earn USDC.
What to serve as an operator
Choosing what your Halo operator serves — a local GPU model (Ollama/LM Studio), a provider API key (OpenAI, OpenRouter…), or a hosted model — and which models earn.
intermediateRun an operator (CLI)
Use the halo CLI to serve inference from a provider, hosted model, or local GPU and earn USDC on Base — provider slugs, pricing, settlement, and always-on setup.
intermediateServe local models on your GPU (Llama, Ollama, LM Studio)
Turn an idle GPU into Halo earnings — serve Llama, Qwen, Gemma and other open models via Ollama or LM Studio, price them flat, and go always-on. No API key needed.
intermediateOperator pricing & earnings
How to price a Halo operator — margin vs flat — how the 10% protocol fee and USDC settlement on Base work, and how to maximize what you net per request.
intermediateKeep your operator online (always-on & reliability)
Run your Halo operator as a persistent service so it stays online, earns more, and builds reputation — install it as a service, watch the logs, and survive restarts.
intermediatePair with the dashboard (CLI)
Link a halo CLI wallet — operator or consumer — to a Halo dashboard account to monitor your stats on the web and join the League. Here's the pairing flow.
Consumers
Consume inference from the halo CLI — a paid, OpenAI-compatible endpoint.