Introduction

The connection to an LLM is based on

An API protocol (OpenAI or specific)
A URL endpoint, as shown below
An API token the customer must enter when creating the configuration.

For example, to obtain your key from ChatGPT, please refer to the following page:

https://help.openai.com/en/articles/4936850-where-do-i-find-my-openai-api-key

Example of LLM Types and Models

Example LLM API Providers
LLM	API Endpoint URL	Model Examples
ChatGPT (OpenAI)	`https://api.openai.com/v1/chat/completions`	gpt-5.5, gpt-5.4-mini, o3
Claude (Claude)	`https://api.anthropic.com/v1/messages`	claude-opus-4-8, claude-sonnet-4-6, claude-haiku-4-5
DeepSeek (OpenAI)	`https://api.deepseek.com/v1/chat/completions`	deepseek-v4-pro, deepseek-v4-flash, deepseek-r1
Fireworks (OpenAI)	`https://api.fireworks.ai/inference/v1/chat/completions`	accounts/fireworks/models/llama-v3p3-70b-instruct, accounts/fireworks/models/deepseek-v3
Gemini (Gemini)	`https://generativelanguage.googleapis.com/v1beta/models/`	gemini-3.1-pro, gemini-3.5-flash, gemini-2.5-flash-lite
Groq (OpenAI)	`https://api.groq.com/openai/v1/chat/completions`	llama-3.3-70b-versatile, gpt-oss-120b
Mistral (OpenAI)	`https://api.mistral.ai/v1/chat/completions`	mistral-large-3, mistral-small-4, codestral
OpenRouter (OpenAI)	`https://openrouter.ai/api/v1/chat/completions`	openai/gpt-5.5, anthropic/claude-sonnet-4-6, google/gemini-3.1-pro
Perplexity (OpenAI)	`https://api.perplexity.ai/chat/completions`	sonar-pro, sonar
Together AI (OpenAI)	`https://api.together.xyz/v1/chat/completions`	meta-llama/Llama-3.3-70B-Instruct-Turbo, deepseek-ai/DeepSeek-V4-Pro
xAI Grok (OpenAI)	`https://api.x.ai/v1/chat/completions`	grok-4, grok-4-fast, grok-3-mini

Note: European customers may lean towards Mistral.

The LLM models change often. The above are examples as of June 2026.

LLM Creation Information

Below is a view of the Create page.

Pick OpenAI for all LLM whose API is compatible, that's all the LLM but Claude and Gemini. In the example below, Mistral uses the OpenAI protocol; Please note the Mistral URL.

Token Count

When viewing the LLM connection, the number of tokens used up to that point will be displayed.

When an LLM is used for translation, the number of tokens will be shown in the log file for that translation. The total tokens used for the project will be tracked on the Localyzer page.

Prompting

For best results with an LLM, prompting is recommended. Localyzer allows for three levels of prompting:

See: Prompting

Context Prompting

If the LLM connection is configured with an InContext server, then when a context has been associated with a string (See String Manager / Context), the context will be passed to ChatGPT to provide the necessary information to better translate that string.

Make sure the LLM model takes advantage of the InContext image. Not all models do.

LLM

Contents

Introduction

Example of LLM Types and Models

LLM Creation Information

Token Count

Prompting

Context Prompting

Navigation menu

LLM

Introduction

Example of LLM Types and Models

LLM Creation Information

Token Count

Prompting

Context Prompting

Navigation menu

Search