Skip to main content

Core Concept: Providers

Providers are the underlying AI engines (Large Language Models) that process your text and generate responses. Rephlo is a universal interface — you can swap these "brains" in and out per task, and a single provider can expose several models (see Models & Inference).

Provider Types

Rephlo supports 8 Bring-Your-Own-Key (BYOK) provider types plus a Rephlo-managed Dedicated API:

Cloud Providers (Internet Required)

  1. OpenAI — GPT-class models. Industry standard for reasoning, coding, and creativity.
  2. Anthropic — Claude models; known for high-quality writing, large context windows, and prompt caching (on by default).
  3. Groq — Ultra-fast inference on custom LPU chips — near-instant responses.
  4. Google — Gemini models with strong multimodal capabilities.
  5. xAI (Grok) — Grok models with optional real-time Live Search.
  6. OpenRouter — A unified gateway to 100+ models from many vendors, with tracking via Site URL / Site Name.
  7. OpenAI-Compatible — Connect to any service implementing the OpenAI API spec (Azure OpenAI, vLLM, LocalAI, LiteLLM, LM Studio, custom proxies). Requires API Key, Base URL, and Model Name.

Local Provider (Complete Privacy)

  1. Ollama — Run open-source models (Llama, Mistral, Gemma, DeepSeek, Qwen) directly on your computer. Your data never leaves your machine. Configurable Base URL (default http://localhost:11434); an optional Ollama API key enables web search.

Azure OpenAI is connected through the OpenAI-Compatible type — choose OpenAI-Compatible and enter your Azure deployment endpoint as the Base URL (and your deployment name as the Model Name). It is not a separate entry in the add-provider list.

On-device models: Rephlo also runs GGUF models fully on-device (no internet, no Ollama) via a built-in engine. That's a related but separate inference path — see On-Device Models and Models & Inference.

Managed Provider

  • Dedicated API — Rephlo's managed cloud option using OAuth 2.0. The simplest setup for users who don't want to manage API keys; usage is tracked with credits. Its configuration is auto-created and hidden from the regular provider list.

On-Device vs BYOK

These two bring-your-own-model paths are easy to confuse:

  • On-Device — open models (GGUF) running locally on your machine via the built-in engine. No API key, no internet, fully private; nothing leaves your computer.
  • BYOK (Bring Your Own Key) — you supply your own API key for an external provider (a cloud vendor, or local Ollama). Requests go to that provider under your key, and the vendor bills you directly.

Both On-Device and BYOK are Pro-tier features — using them requires an active Pro plan (or higher) or a perpetual license. For on-device models that gate covers both downloading and running them; once unlocked, they run with no credits and fully offline. Set up keys under Providers.

Platform Dedicated API vs Enterprise BYOK

  • Dedicated API is Rephlo-managed hosted inference: sign in and use Rephlo's frontier models, billed in credits, with no keys to manage.
  • Enterprise BYOK (over the platform) is for organizations that centrally configure their own provider keys and allowed models at the org level, so every member uses the org's models and keys — for compliance and cost control.

In short: the Dedicated API trades keys for credits and zero setup, while Enterprise BYOK keeps key management central to the organization. Personal BYOK keys are configured under Providers.

Provider Mode

Each provider operates in one of two modes: BYOK (you supply the vendor key) or Dedicated (Rephlo-managed with credit tracking).

Connection Modes

Dedicated Mode (Managed)

  • Description: Rephlo manages the connection and billing via OAuth 2.0; usage draws from your credits.
  • Pros: Zero setup, managed billing.
  • Cons: Less low-level control over model parameters.

BYOK Mode (Bring Your Own Key)

  • Description: You use your own vendor API key directly.
  • Pros: Maximum privacy (data goes from your machine straight to the vendor, bypassing Rephlo servers), direct vendor pricing, and immediate access to new models.
  • Cons: Requires a vendor account.

What a Provider Configuration Stores

Each provider configuration includes:

  • An optional Display Name override.
  • A Mode (BYOK / Dedicated) and provider type.
  • Models — one or more models, with one marked as the default.
  • Configuration — provider-specific fields. API keys and tokens are stored encrypted:
    • OpenAI / OpenAI-Compatible: API Key, optional Base URL (and, for OpenAI-Compatible, a Model Name). Azure OpenAI uses this type — point the Base URL at your Azure deployment endpoint.
    • Anthropic: API Key, optional Base URL, prompt caching enabled by default.
    • Google: API Key, optional Base URL.
    • Groq: API Key.
    • Ollama: Base URL, optional API key (for web search).
    • xAI (Grok): API Key, optional Live Search toggle.
    • OpenRouter: API Key, optional Site URL / Site Name.
  • Capabilities — toggles for Vision, Function Calling (tools), Long Context, Web Search, Thinking, and more; validated against the model's real capabilities at runtime.
  • Model Parameters — per-provider/model request settings (temperature, max tokens, top-p, etc.).

Only one provider is active at a time. You can also override the provider per command for fine-grained control.

The Add Provider side panel — Provider Type, Custom Name, API Key, Base URL, Configured Models with Add Model, and Provider Capabilities

Getting API Keys (BYOK)

When using BYOK, you pay the vendor directly for API usage.

ProviderWhere to Get KeysNotes
Azure OpenAIportal.azure.comRequires an Azure subscription + a deployed OpenAI resource
OpenAIplatform.openai.comGPT-class models
Anthropicconsole.anthropic.comClaude models; supports prompt caching
Groqconsole.groq.comFree tier available
Googleaistudio.google.comGemini Pro / Flash
xAIconsole.x.aiGrok models; optional Live Search
OpenRouteropenrouter.aiOne key, 100+ models
OpenAI-CompatibleVaries by servicevLLM, LocalAI, LiteLLM, LM Studio
Ollamalocal install + ollama.com/settings/keysLocal models; API key only needed for web search

Switching Providers

You can switch providers globally (default for all commands) or per-command. For example, use Groq for quick spell-checks (fast and cheap) and switch to Claude for complex analysis (smart and nuanced).

Provider Comparison Guide

ProviderBest ForSpeedPrivacy
Azure OpenAIEnterprise, complianceFastCloud
OpenAIGeneral, coding, analysisFastCloud
Anthropic (Claude)Writing, long docs, cachingFastCloud
GroqQuick tasksUltra-fastCloud
Google (Gemini)Multimodal, researchFastCloud
xAI (Grok)Real-time knowledgeFastCloud
OpenRouterMulti-model accessVariesCloud
OpenAI-CompatibleSelf-hosted, customVariesSelf-hosted / cloud
OllamaPrivacy, offlineDepends on HWOn your machine

Specific model names and context-window sizes change frequently. To see exactly which models each provider exposes — and how to override the model per command or per conversation — read Models & Inference.


Next Concept: Explore the Dashboard & Navigation.