For mobile and web apps, the Firebase AI Logic SDKs let you interact with the supported Gemini models and Imagen models directly from your app.
Gemini models are considered multimodal because they're capable of processing and even generating multiple modalities, including text, code, PDFs, images, video, and audio. Imagen models can be prompted with text to generate images.
Also, review our FAQ about all the models that Firebase AI Logic supports and does not support.
Featured models
Gemini 3 Pro
gemini-3-pro-preview
Our best model for multimodal understanding, and our most powerful agentic and vibe-coding model yet, delivering richer visuals and deeper interactivity, all built on a foundation of state-of-the-art reasoning. (billing required)
Gemini 2.5 Pro
gemini-2.5-pro
Our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.
Gemini 2.5 Flash
gemini-2.5-flash
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Gemini 2.5 Flash-Lite
gemini-2.5-flash-lite
Our fastest flash model optimized for cost-efficiency and high throughput.
Image generating models
You can generate images with either Gemini or Imagen models.
Gemini
Gemini 2.5 Flash Image (aka nano banana)
gemini-2.5-flash-image
Our standard Flash model upgraded for rapid creative workflows with image generation and conversational, multi-turn editing capabilities. (billing required)
Imagen
Imagen 4
imagen-4.0-generate-001
Generates realistic, high-quality images from natural language text prompts. (billing required)
Imagen 4 Fast
imagen-4.0-fast-generate-001
Generates images for prototyping or low-latency use cases. (billing required)
Imagen 4 Ultra
imagen-4.0-ultra-generate-001
Generates realistic, high-quality images from natural language text prompts. (billing required)
Previous generation models
The following are active, but previous generation models. We recommend using one of the latest models instead when possible.
Gemini models
Gemini 2.0 Flash
gemini-2.0-flash-001
Our second generation workhorse model, with a 1 million token context window. Delivers next-gen features and improved capabilities, including superior speed and native tool use.
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite-001
Our second generation small workhorse model, with a 1 million token context window. Optimized for cost efficiency and low latency.
The remainder of this page provides detailed information about the models supported by Firebase AI Logic.
-
- Supported input and output
- High-level comparison of the supported capabilities
- Specifications and limitations, for example max input tokens or max length of input video
Description of how models are versioned, specifically their stable, auto-updated, preview, and experimental versions
Lists of available model names to include in your code during initialization
Lists of supported languages for the models
At the bottom of this page, you can view detailed information about previous generation models.
Compare models
Each model has different capabilities to support various use cases. Note that each of tables in this section describe each model when used with Firebase AI Logic. Each model might have additional capabilities that aren't available when using our SDKs.
If you can't find the information you're looking for in the following sub-sections, you can find even more information in your chosen API provider documentation:
Gemini Developer API: Gemini models and Imagen models
Vertex AI Gemini API: Gemini models and Imagen models
Supported input and output
These are the supported input and output types when using each model with Firebase AI Logic:
|
Gemini 3 Pro |
Gemini 2.5 Pro, Flash, Flash-Lite |
Gemini 2.5 Flash Image |
Gemini 2.0 Flash- Live |
Imagen (generate) | Imagen (capability) | |
|---|---|---|---|---|---|---|
| Input types | ||||||
| Text | ||||||
| Text (streaming) | ||||||
| Code | ||||||
| Documents (PDFs or plain-text) |
||||||
| Images | ||||||
| Video | ||||||
| Audio | ||||||
| Audio (streaming) | ||||||
| Output types | ||||||
| Text | ||||||
| Text (streaming) | ||||||
| Code | ||||||
| Structured output (like JSON) |
||||||
| Images | ||||||
| Audio | ||||||
| Audio (streaming) | ||||||
To learn about supported file types, see Supported input files and requirements.
Supported capabilities and features
These are the supported capabilities and features when using each model with Firebase AI Logic:
|
Gemini 3 Pro |
Gemini 2.5 Pro, Flash, Flash-Lite |
Gemini 2.5 Flash Image |
Gemini 2.0 Flash- Live |
Imagen (generate) | Imagen (capability) | |
|---|---|---|---|---|---|---|
| Thinking | ||||||
| Generate text from text-only or multimodal inputs | interleaved or as part of image | streaming only | ||||
|
Generate images (Gemini or Imagen) |
||||||
|
Edit images (Gemini or Imagen) |
||||||
| Generate audio | streaming only | |||||
|
Generate
structured output
(like JSON) |
||||||
|
Analyze documents
(PDFs or plain-text) |
||||||
| Analyze images (vision) | ||||||
| Analyze video (vision) | ||||||
| Analyze audio | streaming only | |||||
| Multi-turn chat | ||||||
| Bidirectional multimodal streaming | ||||||
| Function calling | ||||||
| Code execution | ||||||
| Grounding with Google Search | ||||||
| System instructions | ||||||
| Count tokens |
Specifications and limitations
These are the specifications and limitations when using each model with Firebase AI Logic:
| Property |
Gemini 3 Pro |
Gemini 2.5 Pro, Flash, Flash-Lite |
Gemini 2.5 Flash Image |
Gemini 2.0 Flash- Live |
Imagen (generate) | Imagen (capability) |
|---|---|---|---|---|---|---|
| Context window * Total token limit (combined input+output) |
1M tokens | 1,048,576 tokens | 32,768 tokens | 32,768 tokens | 480 tokens | 480 tokens |
| Output token limit * | 64K tokens | 65,536 tokens | 8,192 tokens | 8,192 tokens | --- | --- |
| Knowledge cutoff date | January 2025 | January 2025 | June 2025 | August 2024 | --- | --- |
| PDFs (per request) | ||||||
| Max number of input PDF files ** |
900 files | 3,000 files | 3 files | --- | --- | --- |
| Max number of pages per input PDF file ** |
900 pages | 1,000 pages | 3 pages | --- | --- | --- |
| Max size per input PDF file |
50 MB | 50 MB | 50 MB | --- | --- | --- |
| Images (per request) | ||||||
| Max number of input images |
1,000 images | 3,000 images | 3 images | --- | --- | 4 images |
| Max number of output images |
--- | --- | 10 images | --- | 4 images | 4 images |
| Max size per input base64-encoded image |
7 MB | 7 MB | 7 MB | --- | --- | --- |
| Video (per request) | ||||||
| Max number of input video files |
10 files | 10 files | --- | --- | --- | --- |
| Max length of all input video (frames only) |
~60 minutes | ~60 minutes | --- | --- | --- | --- |
| Max length of all input video (frames+audio) |
~45 minutes | ~45 minutes | --- | --- | --- | --- |
| Audio (per request) | ||||||
| Max number of input audio files |
1 file | 1 file | --- | --- | --- | --- |
| Max number of output audio files |
--- | --- | --- | --- | --- | --- |
| Max length of all input audio |
~8.4 hours | ~8.4 hours | ~8.4 hours | --- | --- | --- |
| Max length of all output audio |
--- | --- | --- | --- | --- | --- |
*
For all Gemini models, a token is equivalent to about 4 characters,
so 100 tokens are about 60-80 English words. For Gemini models, you can
determine the total count of tokens in your requests using
countTokens.
** PDFs are treated as images, so a single page of a PDF is treated as one image. The number of pages allowed in a request is limited to the number of images the model can support.
Find additional detailed information
Quotas and pricing are different for each model. Pricing also depends on input and output.
Learn about supported input file types, how to specify MIME type, and how to make sure that your input files and multimodal requests meet the requirements and follow best practices in Supported input files and requirements.
Model versioning and naming patterns
Models are offered in stable, preview, and experimental versions. For convenience, aliases without explicit version values are supported.
To find specific model names to use in your code, see the "available model names" section later on this page.
| Version type / Release stage |
Description | Model name pattern | |
|---|---|---|---|
| Stable |
Stable versions are available and supported
for production use starting on the release date.
|
|
|
| Auto-updated stable alias (Gemini 2.0 models only) |
Auto-updated stable aliases always point to
the latest stable version of that model.
|
Gemini 2.0 models only
Example: |
|
| Preview |
Preview versions have new capabilities and
are considered not stable.
|
Model names of preview versions are appended with
Examples: |
|
| Experimental |
Experimental versions have new capabilities
and are considered not stable.
|
Model names of experimental versions are appended with
Example: |
|
| Retired |
Retired versions are past their retirement
date and have been permanently deactivated.
|
--- |
|
Available model names
Model names are the explicit values that you include in your code during initialization of the model.
Jump to Gemini model names Jump to Imagen model names
Programmatically list all available models
You can list all available models names using the REST API:
Gemini Developer API: Call the
models.listendpointVertex AI Gemini API: Call the
publishers.models.listendpoint
Note that this returned list will include all models supported by the
API providers, but Firebase AI Logic only supports the
Gemini and Imagen models described on this page.
Also note that auto-updated aliases (for example, gemini-2.0-flash) aren't
listed because they're a convenience alias for the base model.
Gemini model names
For initialization examples for your platform, see the getting started guide.
For details about the release stages (especially for use cases, billing, and retirement), see model versioning and naming patterns.
Gemini 3 Pro model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-3-pro-preview |
Preview version of Gemini 3 Pro | Preview | 2025-11-18 | To be determined |
Gemini 2.5 Pro model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.5-pro |
Stable version of Gemini 2.5 Pro | Stable | 2025-06-17 | No earlier than 2026-06-17 |
Gemini 2.5 Flash model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.5-flash |
Stable version of Gemini 2.5 Flash | Stable | 2025-06-17 | No earlier than 2026-06-17 |
Gemini 2.5 Flash‑Lite model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.5-flash-lite |
Stable version of Gemini 2.5 Flash‑Lite | Stable | 2025-07-22 | No earlier than 2026-07-22 |
Gemini 2.5 Flash Image model names (aka "nano banana")
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.5-flash-image |
Stable version for Gemini 2.5 Flash Image (aka "nano banana") | Stable | 2025-10-02 | No earlier than 2026-10-02 |
Gemini 2.5 Flash Live model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-live-2.5-flash |
Stable version for Gemini 2.5 Flash Live | Stable (private GA) 1 |
2025-06-01 | No earlier than 2026-06-01 |
gemini-live-2.5-flash-preview 2 |
Preview version for Gemini 2.5 Flash Live | Preview | 2025-06-01 | To be determined |
Firebase AI Logic does not yet support the native audio
models (like gemini-2.5-flash-native-audio-preview-09-2025).
1 Reach out to your Google Cloud account team representative to request access. 2 Not supported by the Vertex AI Gemini API no matter how you access it.
Gemini 2.0 Flash Live model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.0-flash-live-001 2 |
Stable version for Gemini 2.0 Flash Live | Stable | 2025-04-01 | No earlier than 2026-04-01 |
gemini-2.0-flash-live-preview-04-09 |
Preview version for Gemini 2.0 Flash Live | Preview | 2025-04-09 | To be determined |
2 Not supported by the Vertex AI Gemini API no matter how you access it.
Imagen model names
For initialization examples for your platform, see the generate images with Imagen guide.
For details about the release stages (especially for use cases, billing, and retirement), see model versioning and naming patterns.
Imagen 4 model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-4.0-generate-001 |
Stable version of Imagen 4 | Stable | 2025-08-14 | No earlier than 2026-08-14 |
Imagen 4 Fast model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-4.0-fast-generate-001 |
Stable version of Imagen 4 Fast | Stable | 2025-08-14 | No earlier than 2026-08-14 |
Imagen 4 Ultra model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-4.0-ultra-generate-001 |
Stable version of Imagen 4 Ultra | Stable | 2025-08-14 | No earlier than 2026-08-14 |
Imagen 3 Capability model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-3.0-capability-001 3 |
Initial stable version of Imagen 3 Capability | Stable | 2024-12-10 | No earlier than 2025-12-10 |
3 Not supported by the Gemini Developer API no matter how you access it.
Supported languages
Gemini
All the Gemini models can understand and respond in the following languages:
Arabic (ar), Bengali (bn), Bulgarian (bg), Chinese simplified and traditional (zh), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Finnish (fi), French (fr), German (de), Greek (el), Hebrew (iw), Hindi (hi), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Latvian (lv), Lithuanian (lt), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Serbian (sr), Slovak (sk), Slovenian (sl), Spanish (es), Swahili (sw), Swedish (sv), Thai (th), Turkish (tr), Ukrainian (uk), Vietnamese (vi)
Gemini 2.0 Flash, Gemini 1.5 Pro and Gemini 1.5 Flash models can understand and respond in the following additional languages:
Afrikaans (af), Amharic (am), Assamese (as), Azerbaijani (az), Belarusian (be), Bosnian (bs), Catalan (ca), Cebuano (ceb), Corsican (co), Welsh (cy), Dhivehi (dv), Esperanto (eo), Basque (eu), Persian (fa), Filipino (Tagalog) (fil), Frisian (fy), Irish (ga), Scots Gaelic (gd), Galician (gl), Gujarati (gu), Hausa (ha), Hawaiian (haw), Hmong (hmn), Haitian Creole (ht), Armenian (hy), Igbo (ig), Icelandic (is), Javanese (jv), Georgian (ka), Kazakh (kk), Khmer (km), Kannada (kn), Krio (kri), Kurdish (ku), Kyrgyz (ky), Latin (la), Luxembourgish (lb), Lao (lo), Malagasy (mg), Maori (mi), Macedonian (mk), Malayalam (ml), Mongolian (mn), Meiteilon (Manipuri) (mni-Mtei), Marathi (mr), Malay (ms), Maltese (mt), Myanmar (Burmese) (my), Nepali (ne), Nyanja (Chichewa) (ny), Odia (Oriya) (or), Punjabi (pa), Pashto (ps), Sindhi (sd), Sinhala (Sinhalese) (si), Samoan (sm), Shona (sn), Somali (so), Albanian (sq), Sesotho (st), Sundanese (su), Tamil (ta), Telugu (te), Tajik (tg), Uyghur (ug), Urdu (ur), Uzbek (uz), Xhosa (xh), Yiddish (yi), Yoruba (yo), Zulu (zu)
Imagen
General availability: English
Preview: Chinese (simplified), Chinese (traditional), Hindi, Japanese, Korean, Portuguese, Spanish
Information about previous models
The following are active, but previous generation models. We recommend using one of the latest models instead when possible.
Gemini models
| Model | Input | Output | Description |
|---|---|---|---|
Gemini 2.0 Flashgemini-2.0-flash-001
|
text, code, PDFs, images, video, audio | text, code, JSON | Our multimodal model with next-gen features and improved capabilities, including superior speed, built-in tool use, and a 1M token context window |
Gemini 2.0 Flash‑Litegemini-2.0-flash-lite-001
|
text, code, PDFs, images, video, audio | text, code, JSON | Our fastest and most cost efficient Flash model. It's an upgrade path for 1.5 Flash users who want better quality for the same price and speed. |
Imagen models
| Model | Input | Output | Description |
|---|---|---|---|
Imagen 3imagen-3.0-generate-002 |
text | images | Generates realistic, high-quality images from natural language text prompts |
Imagen 3 Fast 2imagen-3.0-fast-generate-001
|
text | images | Generates images for prototyping or low-latency use cases |
2 Not supported by the Gemini Developer API no matter how you access it.
These are the input and output types when using each model with Firebase AI Logic:
|
Gemini 2.0 Flash |
Gemini 2.0 Flash- Lite |
|
|---|---|---|
| Input types | ||
| Text | ||
| Text (streaming) | ||
| Code | ||
| Documents (PDFs or plain-text) |
||
| Images | ||
| Video | ||
| Audio | ||
| Audio (streaming) | ||
| Output types | ||
| Text | ||
| Text (streaming) | ||
| Code | ||
| Structured output (like JSON) |
||
| Images | ||
| Audio | ||
| Audio (streaming) | ||
These are the capabilities and features when using each model with Firebase AI Logic:
|
Gemini 2.0 Flash |
Gemini 2.0 Flash- Lite |
|
|---|---|---|
| Thinking | ||
| Generate text from text-only or multimodal inputs | ||
|
Generate images (Gemini or Imagen) |
||
|
Edit images (Gemini or Imagen) |
||
| Generate audio | ||
|
Generate
structured output
(like JSON) |
||
|
Analyze documents
(PDFs or plain-text) |
||
| Analyze images (vision) | ||
| Analyze video (vision) | ||
| Analyze audio | ||
| Multi-turn chat | ||
| Bidirectional multimodal streaming | ||
| Function calling | ||
| Code execution | ||
| Grounding with Google Search | ||
| System instructions | ||
| Count tokens |
These are the specifications and limitations when using each model with Firebase AI Logic:
| Property |
Gemini 2.0 Flash |
Gemini 2.0 Flash- Lite |
|---|---|---|
| Context window * Total token limit (combined input+output) |
1,048,576 tokens | 1,048,576 tokens |
| Output token limit * | 8,192 tokens | 8,192 tokens |
| Knowledge cutoff date | June 2024 | June 2024 |
| PDFs (per request) | ||
| Max number of input PDF files ** |
3,000 files | 3,000 files |
| Max number of pages per input PDF file ** |
1,000 pages | 1,000 pages |
| Max size per input PDF file |
50 MB | 50 MB |
| Images (per request) | ||
| Max number of input images |
3,000 images | 3,000 images |
| Max number of output images |
--- | --- |
| Max size per input base64-encoded image |
7 MB | 7 MB |
| Video (per request) | ||
| Max number of input video files |
10 files | 10 files |
| Max length of all input video (frames only) |
~60 minutes | ~60 minutes |
| Max length of all input video (frames+audio) |
~45 minutes | ~45 minutes |
| Audio (per request) | ||
| Max number of input audio files |
1 file | 1 file |
| Max number of output audio files |
--- | --- |
| Max length of all input audio |
~8.4 hours | ~8.4 hours |
| Max length of all output audio |
--- | --- |
*
For all Gemini models, a token is equivalent to about 4 characters,
so 100 tokens are about 60-80 English words. For Gemini models, you can
determine the total count of tokens in your requests using
countTokens.
** PDFs are treated as images, so a single page of a PDF is treated as one image. The number of pages allowed in a request is limited to the number of images the model can support.
Model names are the explicit values that you include in your code during initialization of the model.
Gemini models
Gemini 2.0 Flash model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.0-flash-001 |
Latest stable version of Gemini 2.0 Flash | Stable | 2025-02-05 | No earlier than 2026-02-05 |
gemini-2.0-flash |
Auto-updated alias pointing to the latest stable version of
Gemini 2.0 Flash (currently gemini-2.0-flash-001 |
Stable | 2025-02-10 | --- |
Gemini 2.0 Flash‑Lite model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
gemini-2.0-flash-lite-001 |
Latest stable version of Gemini 2.0 Flash‑Lite | Stable | 2025-02-25 | No earlier than 2026-02-25 |
gemini-2.0-flash-lite |
Auto-updated alias pointing to the latest stable version of
Gemini 2.0 Flash‑Lite (currently gemini-2.0-flash-lite-001 |
Stable | 2025-02-25 | --- |
Imagen models
Imagen 3 model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-3.0-generate-002 |
Latest stable version of Imagen 3 | Stable | 2025-01-23 | No earlier than 2026-01-23 |
imagen-3.0-generate-001 3 |
Initial stable version of Imagen 3 | Stable | 2024-07-31 | No earlier than 2025-07-31 |
Imagen 3 Fast model names
| Model name | Description | Release stage | Release date | Retirement date |
|---|---|---|---|---|
imagen-3.0-fast-generate-001 3 |
Initial stable version of Imagen 3 Fast | Stable | 2024-07-31 | No earlier than 2025-07-31 |
3 Not supported by the Gemini Developer API no matter how you access it.
Next steps
Try out the capabilities of the Gemini API
- Build multi-turn conversations (chat).
- Generate text from text-only prompts.
- Generate text by prompting with various file types, like images, PDFs, video, and audio.
- Generate structured output (like JSON) from both text and multimodal prompts.
- Generate images from text prompts (Gemini or Imagen).
- Stream input and output (including audio) using the Gemini Live API.
- Use tools (like function calling and grounding with Google Search) to connect a Gemini model to other parts of your app and external systems and information.