Guides

Use Halo on the web, drive it with an agent, or build with the CLI.

Beginner

New to Halo? Start here.

Use Halo on the web

The easiest way to use Halo: the web app. Connect a wallet, fund USDC on Base, pick a model, and start prompting — with optional confidential (TEE) inference.

Read guide → beginner

Fund your account: the Wallet & Vault

Your Halo balance lives in an on-chain vault on Base. Learn how to deposit USDC in the Wallet dialog, how spending works, Free vs Reserved, and how to withdraw.

Read guide → beginner

Chat mode vs Agent mode (and effort)

Halo has two modes — plain Chat and the full Agent loop — plus effort tiers (Auto, Quick, Standard, Deep) and run modes. Learn when to use each.

Read guide → beginner

Memory & context

Halo remembers facts across chats — stored locally in your browser, never on a server. Learn how to save, tag, pin, and scope memory as context for the agent.

Read guide → beginner

Enable your agent session

Give the agent a session sub-wallet so it can use paid tools — web search, scraping, price data, or custom x402 endpoints — autonomously, and cap the spend.

Read guide → beginner

Private prompts: encryption & confidential compute

Every Halo prompt is end-to-end encrypted. Confidential mode seals it to a hardware enclave the operator can't read, with on-device attestation you can verify.

Read guide → beginner

Export an answer to PDF

Turn any Halo answer into a clean, branded Research Report PDF — with its charts, tables, and sources — generated right in your browser, at no extra cost.

Read guide →

Agents

Set up an AI agent by chatting with it — no CLI, no keys.

Works with

Hermes

OpenClaw

Claude Code & more

beginner

Consume inference with your agent

Let an AI agent set up Halo by chatting — pick a model, choose confidential mode, fund a wallet, and it has pay-per-prompt inference with no API keys.

Read guide → beginner

Operate and earn with your agent

Have your AI agent serve inference on Halo and earn USDC — set it up by chatting: choose what to serve, pick a pricing mode, and let it run. No pre-funding.

Read guide → beginner

Link your agent

Link your agent's operator and consumer activity to your Halo dashboard to see your stats and earnings, and climb the League — all by chatting with your agent.

Read guide →

Operators

Serve inference from a GPU, an API key or a hosted model — and earn USDC.

beginner

Consumers

Consume inference from the halo CLI — a paid, OpenAI-compatible endpoint.

intermediate

Consume inference from a local endpoint (CLI)

Use the halo CLI to run a local OpenAI-compatible endpoint that pays per request from your wallet — deposit once, bill actual token usage, with spend guards.

Read guide →

Guides

Beginner

Use Halo on the web

Fund your account: the Wallet & Vault

Chat mode vs Agent mode (and effort)

Memory & context

Enable your agent session

Private prompts: encryption & confidential compute

Export an answer to PDF

Agents

Consume inference with your agent

Operate and earn with your agent

Link your agent

Operators

What to serve as an operator

Run an operator (CLI)

Serve local models on your GPU (Llama, Ollama, LM Studio)

Operator pricing & earnings

Keep your operator online (always-on & reliability)

Pair with the dashboard (CLI)

Consumers

Consume inference from a local endpoint (CLI)