Cloud LLM Providers

DevoxxGenie supports a wide range of cloud-based LLM providers, giving you access to powerful state-of-the-art models without requiring high-end local hardware.

tip

For all cloud providers, available models are automatically fetched from the provider's API when you select the provider and enter your API key. You don't need to manually enter model names.

OpenAI

OpenAI provides some of the most widely used LLMs, including the GPT and O-series families.

Setup

Create an account at OpenAI
Generate an API key in your account dashboard
In DevoxxGenie settings, select "OpenAI" as the provider
Paste your API key
Select your preferred model from the auto-populated list

Configuration

API Key: Your OpenAI API key
Model: Select from available models (GPT-4o, GPT-4o mini, O-series reasoning models, etc.)
Parameters: Temperature, Top P, Maximum tokens

Advantages

State-of-the-art models with excellent code understanding
Multimodal capabilities (GPT-4V+)
O-series models with advanced reasoning
Reliable API with extensive documentation

Check OpenAI's pricing page for current rates. DevoxxGenie's built-in token cost calculator can estimate costs before sending prompts.

Anthropic

Anthropic provides the Claude family of models, known for their strong reasoning and large context windows.

Setup

Create an account at Anthropic
Generate an API key in your account dashboard
In DevoxxGenie settings, select "Anthropic" as the provider
Paste your API key
Select your preferred Claude model

Configuration

API Key: Your Anthropic API key
Model: Select from available Claude models (Claude 4 family, Claude 3.5 Sonnet, etc.)
Parameters: Temperature, Top P, Maximum tokens

Advantages

Excellent reasoning abilities
Very large context windows (up to 200K tokens)
Strong performance on code tasks
Clear, nuanced responses

Check Anthropic's pricing page for current rates.

Google

Google provides the Gemini family of models through Google AI Studio.

Setup

Create an account at Google AI Studio
Generate an API key
In DevoxxGenie settings, select "Google" as the provider
Paste your API key
Select your preferred Gemini model

Configuration

API Key: Your Google API key
Model: Select from available Gemini models (Gemini 2.x, Gemini 1.5 Pro, Flash, etc.)
Parameters: Temperature, Top P, Maximum output tokens

Advantages

Extremely large context window (up to 1M+ tokens)
Strong multimodal capabilities
Good performance on code tasks
Integration with Google ecosystem

Check Google's pricing page for current rates.

Grok

Grok is xAI's LLM, accessible via an OpenAI-compatible API.

Setup

Create an account at xAI
Generate an API key
In DevoxxGenie settings, select "Grok" as the provider
Paste your API key
Select your preferred model

Configuration

API Key: Your xAI API key
Model: Select from available Grok models
Parameters: Temperature, Top P, Maximum tokens

Advantages

Strong reasoning and code capabilities
OpenAI-compatible API (base URL: https://api.x.ai/v1)
Good performance on technical tasks

Check xAI's website for current pricing.

Mistral

Mistral AI offers efficient, powerful models with competitive performance.

Setup

Create an account at Mistral AI
Generate an API key
In DevoxxGenie settings, select "Mistral" as the provider
Paste your API key
Select your preferred Mistral model

Configuration

API Key: Your Mistral API key
Model: Select from available models (Mistral Large, Small, etc.)
Parameters: Temperature, Top P

Advantages

Efficient models with competitive performance
European-based company (GDPR compliance)
Strong open-source foundation
Good price-to-performance ratio

Check Mistral's pricing page for current rates.

Groq

Groq is known for extremely fast inference speeds.

Setup

Create an account at Groq
Generate an API key
In DevoxxGenie settings, select "Groq" as the provider
Paste your API key
Select your preferred model

Configuration

API Key: Your Groq API key
Model: Select from available models (Llama, Mixtral, Gemma, etc.)
Parameters: Temperature, Top P

Advantages

Extremely fast inference speeds
Competitive pricing
Good selection of optimized open models
Strong performance on code tasks

Check Groq's pricing page for current rates.

DeepInfra

DeepInfra provides a platform for running various open-source models with optimized inference.

Setup

Create an account at DeepInfra
Generate an API key
In DevoxxGenie settings, select "DeepInfra" as the provider
Paste your API key
Select your preferred model

Configuration

API Key: Your DeepInfra API key
Model: Select from available models (Llama, Mistral, CodeLlama, and many more)
Parameters: Temperature, Top P, Max tokens

Advantages

Access to many open-source models
Competitive pricing
Good selection of code-specialized models

Check DeepInfra's pricing page for current rates.

DeepSeek

DeepSeek specializes in models with strong coding and reasoning capabilities.

Setup

Create an account at DeepSeek
Generate an API key
In DevoxxGenie settings, select "DeepSeek" as the provider
Paste your API key
Select your preferred model

Configuration

API Key: Your DeepSeek API key
Model: Select from available models (DeepSeek-Coder, DeepSeek-V2, DeepSeek-R1, etc.)
Parameters: Temperature, Top P

Advantages

Excellent code generation capabilities
Advanced reasoning with R1 model
Strong understanding of programming concepts
Competitive pricing

Kimi

Kimi (Moonshot AI) provides powerful language models with long context windows and strong performance on code-related tasks.

Setup

Create an account at Moonshot AI
Generate an API key from your account dashboard
In DevoxxGenie settings, select "Kimi" as the provider
Paste your API key
Select your preferred Kimi model

Configuration

API Key: Your Kimi API key
Model: Select from available models (Moonshot v1 8K/32K/128K, Kimi K2 Turbo Preview, etc.)
Parameters: Temperature (0.0-1.0), Top P, Maximum tokens

Advantages

Long context windows (up to 128K tokens, 256K for K2)
Strong code understanding and generation
Competitive pricing
Fast response times
Good support for Chinese and English

Check Moonshot AI's pricing page for current rates.

GLM

GLM (Zhipu AI / Z.AI) provides the ChatGLM family of models, offering strong performance on code-related tasks with competitive pricing.

Setup

Create an account at Zhipu AI (Z.AI)
Generate an API key from your account dashboard
In DevoxxGenie settings, select "GLM" as the provider
Paste your API key
Select your preferred GLM model

Configuration

API Key: Your GLM API key
Model: Select from available models (GLM-4.7, GLM-4.7 Flash, GLM-4.5)
Parameters: Temperature, Top P, Maximum tokens

Available Models

Model	Context Window	Input Cost	Output Cost
GLM-4.7	200K tokens	$0.60/1M tokens	$2.20/1M tokens
GLM-4.7 Flash	200K tokens	$0.06/1M tokens	$0.40/1M tokens
GLM-4.5	128K tokens	$0.35/1M tokens	$1.55/1M tokens

Advantages

Large context windows (up to 200K tokens)
Very competitive pricing, especially the Flash variant
Strong code understanding and generation
OpenAI-compatible API for seamless integration
Good support for Chinese and English

Check Zhipu AI's pricing page for current rates.

OpenRouter

OpenRouter is a unified API that provides access to many different models from various providers.

Setup

Create an account at OpenRouter
Generate an API key
In DevoxxGenie settings, select "OpenRouter" as the provider
Paste your API key
Select your preferred model from the extensive list

Configuration

API Key: Your OpenRouter API key
Model: Select from available models (OpenAI, Anthropic, Meta, Mistral, and many more)
Parameters: Temperature, Top P

Advantages

Single API for many different models
Fallback options if a provider is unavailable
Easy model comparisons
Pay-as-you-go pricing

Check OpenRouter's pricing page for current rates.

Cloudflare AI Gateway

Cloudflare AI Gateway is not a model provider itself — it sits in front of your existing providers (OpenAI, Anthropic, Google, Workers AI, and dozens more) and gives you a single endpoint with caching, rate limiting, spend controls, and per-request analytics.

DevoxxGenie talks to the gateway's OpenAI-compatible /compat endpoint, so every model you've wired up behind the gateway becomes available in the plugin.

Setup

Create (or reuse) a gateway in the Cloudflare AI Gateway dashboard. A gateway named default is created automatically on first authenticated request.
Store the provider API keys you want to use in the Cloudflare dashboard (BYOK — Bring Your Own Keys).
Generate a Cloudflare API token at dash.cloudflare.com/profile/api-tokens.
In DevoxxGenie settings, select "Cloudflare" as the provider.
Enter your API Key, Account ID, and Gateway Name.
Select your preferred model from the auto-populated list.

Configuration

Cloudflare API Key: Your Cloudflare API token, sent as Authorization: Bearer
Cloudflare Account ID: Your account identifier
Cloudflare Gateway Name: default, or the name of a gateway you created
Model: Select from the auto-populated list, or enable the model name override to type a model id directly
Parameters: Temperature, Top P, Maximum tokens

The base URL is assembled for you from the account id and gateway name — you never type it:

https://gateway.ai.cloudflare.com/v1/<account-id>/<gateway>/compat

Your provider keys stay in Cloudflare

This provider uses Cloudflare's single-token (BYOK) model. You give DevoxxGenie only your Cloudflare API token; the downstream provider keys (OpenAI, Anthropic, ...) live in your Cloudflare dashboard, where the gateway injects them. You never paste an OpenAI or Anthropic key into DevoxxGenie for this provider.

Model name override

Model discovery uses a fast best-effort probe of the gateway's /models endpoint. If your gateway doesn't expose that endpoint, or you already know the exact model id you want, enable the model name override and type it in — this skips discovery entirely.

Advantages

One key, every provider — no per-provider setup inside the plugin
Caching of identical requests, so repeated prompts don't cost you twice
Rate limiting and spend controls you own, rather than the provider's
Analytics and logs for every request (latency, tokens, cost) in the Cloudflare dashboard
Centralized key management — provider keys live in Cloudflare, not scattered across tools

Context window and cost estimates

Cloudflare's /compat/models endpoint doesn't report context length or pricing, so DevoxxGenie assumes a 128K context window and shows no cost estimate for gateway models. Use the Cloudflare dashboard for authoritative per-request cost data.

Azure OpenAI

Azure OpenAI Service provides OpenAI models integrated with Microsoft Azure.

note

Azure OpenAI is an optional provider that must be manually enabled in DevoxxGenie settings due to its more complex setup requirements.

Setup

Create an Azure account
Set up Azure OpenAI Service
Create a deployment and get your API details
In DevoxxGenie settings, select "Azure OpenAI" as the provider
Enter your API Key, Endpoint URL, Deployment name, and API version

Configuration

API Key: Your Azure OpenAI key
Endpoint: Your Azure OpenAI endpoint
Deployment: Your specific model deployment
API Version: The Azure OpenAI API version
Parameters: Temperature, Top P

Advantages

Enterprise compliance and security
Service level agreements (SLAs)
Regional availability options
Integration with other Azure services

Amazon Bedrock

Amazon Bedrock provides access to foundation models from various providers through AWS.

note

Amazon Bedrock is an optional provider that must be manually enabled in DevoxxGenie settings due to its more complex setup requirements.

Setup

Create an AWS account
Set up Amazon Bedrock and configure access permissions
In DevoxxGenie settings, select "Amazon Bedrock" as the provider
Enter your Access Key, Secret Key, and Region

Configuration

AWS Credentials: Your access and secret keys
Region: AWS region for Bedrock (supports regional inference with us/eu/apac prefixes)
Model ID: Specific model identifier
Parameters: Temperature, Top P

Advantages

Enterprise-grade security and compliance
Integration with AWS ecosystem
Choice of multiple foundation models (Anthropic, Meta, Cohere, Amazon Titan, etc.)
Regional inference for data residency requirements

Choosing a Cloud Provider

When selecting a cloud provider, consider:

Complex reasoning: Anthropic Claude, OpenAI O-series, DeepSeek R1
Code generation: DeepSeek, OpenAI, Anthropic Claude
Speed priority: Groq, Google Gemini Flash
Large context: Google Gemini (1M+ tokens), Anthropic Claude (200K tokens)
Budget-friendly: Groq, Mistral Small, DeepSeek, Kimi, GLM Flash
Enterprise: Azure OpenAI, Amazon Bedrock
European data residency: Mistral
Many providers behind one key, with caching and analytics: Cloudflare AI Gateway

Best Practices

API Key Security

DevoxxGenie stores keys securely in IntelliJ's credential store
Regularly rotate keys for better security

Cost Management

Monitor token usage through provider dashboards
Use the built-in token cost calculator in DevoxxGenie
Set usage limits on your provider accounts
Consider using RAG to reduce context size

Performance

Choose the right model for your task
Use streaming for better user experience
Balance context window size with cost

Supported Cloud Providers​

OpenAI​

Setup​

Configuration​

Advantages​

Anthropic​

Setup​

Configuration​

Advantages​

Google​

Setup​

Configuration​

Advantages​

Grok​

Setup​

Configuration​

Advantages​

Mistral​

Setup​

Configuration​

Advantages​

Groq​

Setup​

Configuration​

Advantages​

DeepInfra​

Setup​

Configuration​

Advantages​

DeepSeek​

Setup​

Configuration​

Advantages​

Kimi​

Setup​

Configuration​

Advantages​

GLM​

Setup​

Configuration​

Available Models​

Advantages​

OpenRouter​

Setup​

Configuration​

Advantages​

Cloudflare AI Gateway​

Setup​

Configuration​

Advantages​

Azure OpenAI​

Setup​

Configuration​

Advantages​

Amazon Bedrock​

Setup​

Configuration​

Advantages​

Choosing a Cloud Provider​

Best Practices​

API Key Security​

Cost Management​

Performance​

Supported Cloud Providers

OpenAI

Setup

Configuration

Advantages

Anthropic

Setup

Configuration

Advantages

Google

Setup

Configuration

Advantages

Grok

Setup

Configuration

Advantages

Mistral

Setup

Configuration

Advantages

Groq

Setup

Configuration

Advantages

DeepInfra

Setup

Configuration

Advantages

DeepSeek

Setup

Configuration

Advantages

Kimi

Setup

Configuration

Advantages

GLM

Setup

Configuration

Available Models

Advantages

OpenRouter

Setup

Configuration

Advantages

Cloudflare AI Gateway

Setup

Configuration

Advantages

Azure OpenAI

Setup

Configuration

Advantages

Amazon Bedrock

Setup

Configuration

Advantages

Choosing a Cloud Provider

Best Practices

API Key Security

Cost Management

Performance