Skip to main content

Cloud LLM Providers

DevoxxGenie supports a wide range of cloud-based LLM providers, giving you access to powerful state-of-the-art models without requiring high-end local hardware.

tip

For all cloud providers, available models are automatically fetched from the provider's API when you select the provider and enter your API key. You don't need to manually enter model names.

Supported Cloud Providers

  1. OpenAI
  2. Anthropic
  3. Google
  4. Grok
  5. Mistral
  6. Groq
  7. DeepInfra
  8. DeepSeek
  9. Kimi
  10. GLM
  11. OpenRouter
  12. Azure OpenAI
  13. Amazon Bedrock

OpenAI

OpenAI provides some of the most widely used LLMs, including the GPT and O-series families.

Setup

  1. Create an account at OpenAI
  2. Generate an API key in your account dashboard
  3. In DevoxxGenie settings, select "OpenAI" as the provider
  4. Paste your API key
  5. Select your preferred model from the auto-populated list

Configuration

  • API Key: Your OpenAI API key
  • Model: Select from available models (GPT-4o, GPT-4o mini, O-series reasoning models, etc.)
  • Parameters: Temperature, Top P, Maximum tokens

Advantages

  • State-of-the-art models with excellent code understanding
  • Multimodal capabilities (GPT-4V+)
  • O-series models with advanced reasoning
  • Reliable API with extensive documentation

Check OpenAI's pricing page for current rates. DevoxxGenie's built-in token cost calculator can estimate costs before sending prompts.

Anthropic

Anthropic provides the Claude family of models, known for their strong reasoning and large context windows.

Setup

  1. Create an account at Anthropic
  2. Generate an API key in your account dashboard
  3. In DevoxxGenie settings, select "Anthropic" as the provider
  4. Paste your API key
  5. Select your preferred Claude model

Configuration

  • API Key: Your Anthropic API key
  • Model: Select from available Claude models (Claude 4 family, Claude 3.5 Sonnet, etc.)
  • Parameters: Temperature, Top P, Maximum tokens

Advantages

  • Excellent reasoning abilities
  • Very large context windows (up to 200K tokens)
  • Strong performance on code tasks
  • Clear, nuanced responses

Check Anthropic's pricing page for current rates.

Google

Google provides the Gemini family of models through Google AI Studio.

Setup

  1. Create an account at Google AI Studio
  2. Generate an API key
  3. In DevoxxGenie settings, select "Google" as the provider
  4. Paste your API key
  5. Select your preferred Gemini model

Configuration

  • API Key: Your Google API key
  • Model: Select from available Gemini models (Gemini 2.x, Gemini 1.5 Pro, Flash, etc.)
  • Parameters: Temperature, Top P, Maximum output tokens

Advantages

  • Extremely large context window (up to 1M+ tokens)
  • Strong multimodal capabilities
  • Good performance on code tasks
  • Integration with Google ecosystem

Check Google's pricing page for current rates.

Grok

Grok is xAI's LLM, accessible via an OpenAI-compatible API.

Setup

  1. Create an account at xAI
  2. Generate an API key
  3. In DevoxxGenie settings, select "Grok" as the provider
  4. Paste your API key
  5. Select your preferred model

Configuration

  • API Key: Your xAI API key
  • Model: Select from available Grok models
  • Parameters: Temperature, Top P, Maximum tokens

Advantages

  • Strong reasoning and code capabilities
  • OpenAI-compatible API (base URL: https://api.x.ai/v1)
  • Good performance on technical tasks

Check xAI's website for current pricing.

Mistral

Mistral AI offers efficient, powerful models with competitive performance.

Setup

  1. Create an account at Mistral AI
  2. Generate an API key
  3. In DevoxxGenie settings, select "Mistral" as the provider
  4. Paste your API key
  5. Select your preferred Mistral model

Configuration

  • API Key: Your Mistral API key
  • Model: Select from available models (Mistral Large, Small, etc.)
  • Parameters: Temperature, Top P

Advantages

  • Efficient models with competitive performance
  • European-based company (GDPR compliance)
  • Strong open-source foundation
  • Good price-to-performance ratio

Check Mistral's pricing page for current rates.

Groq

Groq is known for extremely fast inference speeds.

Setup

  1. Create an account at Groq
  2. Generate an API key
  3. In DevoxxGenie settings, select "Groq" as the provider
  4. Paste your API key
  5. Select your preferred model

Configuration

  • API Key: Your Groq API key
  • Model: Select from available models (Llama, Mixtral, Gemma, etc.)
  • Parameters: Temperature, Top P

Advantages

  • Extremely fast inference speeds
  • Competitive pricing
  • Good selection of optimized open models
  • Strong performance on code tasks

Check Groq's pricing page for current rates.

DeepInfra

DeepInfra provides a platform for running various open-source models with optimized inference.

Setup

  1. Create an account at DeepInfra
  2. Generate an API key
  3. In DevoxxGenie settings, select "DeepInfra" as the provider
  4. Paste your API key
  5. Select your preferred model

Configuration

  • API Key: Your DeepInfra API key
  • Model: Select from available models (Llama, Mistral, CodeLlama, and many more)
  • Parameters: Temperature, Top P, Max tokens

Advantages

  • Access to many open-source models
  • Competitive pricing
  • Good selection of code-specialized models

Check DeepInfra's pricing page for current rates.

DeepSeek

DeepSeek specializes in models with strong coding and reasoning capabilities.

Setup

  1. Create an account at DeepSeek
  2. Generate an API key
  3. In DevoxxGenie settings, select "DeepSeek" as the provider
  4. Paste your API key
  5. Select your preferred model

Configuration

  • API Key: Your DeepSeek API key
  • Model: Select from available models (DeepSeek-Coder, DeepSeek-V2, DeepSeek-R1, etc.)
  • Parameters: Temperature, Top P

Advantages

  • Excellent code generation capabilities
  • Advanced reasoning with R1 model
  • Strong understanding of programming concepts
  • Competitive pricing

Kimi

Kimi (Moonshot AI) provides powerful language models with long context windows and strong performance on code-related tasks.

Setup

  1. Create an account at Moonshot AI
  2. Generate an API key from your account dashboard
  3. In DevoxxGenie settings, select "Kimi" as the provider
  4. Paste your API key
  5. Select your preferred Kimi model

Configuration

  • API Key: Your Kimi API key
  • Model: Select from available models (Moonshot v1 8K/32K/128K, Kimi K2 Turbo Preview, etc.)
  • Parameters: Temperature (0.0-1.0), Top P, Maximum tokens

Advantages

  • Long context windows (up to 128K tokens, 256K for K2)
  • Strong code understanding and generation
  • Competitive pricing
  • Fast response times
  • Good support for Chinese and English

Check Moonshot AI's pricing page for current rates.

GLM

GLM (Zhipu AI / Z.AI) provides the ChatGLM family of models, offering strong performance on code-related tasks with competitive pricing.

Setup

  1. Create an account at Zhipu AI (Z.AI)
  2. Generate an API key from your account dashboard
  3. In DevoxxGenie settings, select "GLM" as the provider
  4. Paste your API key
  5. Select your preferred GLM model

Configuration

  • API Key: Your GLM API key
  • Model: Select from available models (GLM-4.7, GLM-4.7 Flash, GLM-4.5)
  • Parameters: Temperature, Top P, Maximum tokens

Available Models

ModelContext WindowInput CostOutput Cost
GLM-4.7200K tokens$0.60/1M tokens$2.20/1M tokens
GLM-4.7 Flash200K tokens$0.06/1M tokens$0.40/1M tokens
GLM-4.5128K tokens$0.35/1M tokens$1.55/1M tokens

Advantages

  • Large context windows (up to 200K tokens)
  • Very competitive pricing, especially the Flash variant
  • Strong code understanding and generation
  • OpenAI-compatible API for seamless integration
  • Good support for Chinese and English

Check Zhipu AI's pricing page for current rates.

OpenRouter

OpenRouter is a unified API that provides access to many different models from various providers.

Setup

  1. Create an account at OpenRouter
  2. Generate an API key
  3. In DevoxxGenie settings, select "OpenRouter" as the provider
  4. Paste your API key
  5. Select your preferred model from the extensive list

Configuration

  • API Key: Your OpenRouter API key
  • Model: Select from available models (OpenAI, Anthropic, Meta, Mistral, and many more)
  • Parameters: Temperature, Top P

Advantages

  • Single API for many different models
  • Fallback options if a provider is unavailable
  • Easy model comparisons
  • Pay-as-you-go pricing

Check OpenRouter's pricing page for current rates.

Azure OpenAI

Azure OpenAI Service provides OpenAI models integrated with Microsoft Azure.

note

Azure OpenAI is an optional provider that must be manually enabled in DevoxxGenie settings due to its more complex setup requirements.

Setup

  1. Create an Azure account
  2. Set up Azure OpenAI Service
  3. Create a deployment and get your API details
  4. In DevoxxGenie settings, select "Azure OpenAI" as the provider
  5. Enter your API Key, Endpoint URL, Deployment name, and API version

Configuration

  • API Key: Your Azure OpenAI key
  • Endpoint: Your Azure OpenAI endpoint
  • Deployment: Your specific model deployment
  • API Version: The Azure OpenAI API version
  • Parameters: Temperature, Top P

Advantages

  • Enterprise compliance and security
  • Service level agreements (SLAs)
  • Regional availability options
  • Integration with other Azure services

Amazon Bedrock

Amazon Bedrock provides access to foundation models from various providers through AWS.

note

Amazon Bedrock is an optional provider that must be manually enabled in DevoxxGenie settings due to its more complex setup requirements.

Setup

  1. Create an AWS account
  2. Set up Amazon Bedrock and configure access permissions
  3. In DevoxxGenie settings, select "Amazon Bedrock" as the provider
  4. Enter your Access Key, Secret Key, and Region

Configuration

  • AWS Credentials: Your access and secret keys
  • Region: AWS region for Bedrock (supports regional inference with us/eu/apac prefixes)
  • Model ID: Specific model identifier
  • Parameters: Temperature, Top P

Advantages

  • Enterprise-grade security and compliance
  • Integration with AWS ecosystem
  • Choice of multiple foundation models (Anthropic, Meta, Cohere, Amazon Titan, etc.)
  • Regional inference for data residency requirements

Choosing a Cloud Provider

When selecting a cloud provider, consider:

  • Complex reasoning: Anthropic Claude, OpenAI O-series, DeepSeek R1
  • Code generation: DeepSeek, OpenAI, Anthropic Claude
  • Speed priority: Groq, Google Gemini Flash
  • Large context: Google Gemini (1M+ tokens), Anthropic Claude (200K tokens)
  • Budget-friendly: Groq, Mistral Small, DeepSeek, Kimi, GLM Flash
  • Enterprise: Azure OpenAI, Amazon Bedrock
  • European data residency: Mistral

Best Practices

API Key Security

  • DevoxxGenie stores keys securely in IntelliJ's credential store
  • Regularly rotate keys for better security

Cost Management

  • Monitor token usage through provider dashboards
  • Use the built-in token cost calculator in DevoxxGenie
  • Set usage limits on your provider accounts
  • Consider using RAG to reduce context size

Performance

  • Choose the right model for your task
  • Use streaming for better user experience
  • Balance context window size with cost