Learn about supported models

For mobile and web apps, the Firebase AI Logic SDKs let you interact with the supported Gemini models and Imagen models directly from your app.

Gemini models are considered multimodal because they're capable of processing and even generating multiple modalities, including text, code, PDFs, images, video, and audio. Imagen models can be prompted with text to generate images.

Also, review our FAQ about all the models that Firebase AI Logic supports and does not support.

Jump to model comparisons

OUR MOST INTELLIGENT MODEL

gemini-3-pro-preview

Our best model for multimodal understanding, and our most powerful agentic and vibe-coding model yet, delivering richer visuals and deeper interactivity, all built on a foundation of state-of-the-art reasoning. (billing required)


OUR ADVANCED THINKING MODEL

gemini-2.5-pro

Our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.

FAST AND INTELLIGENT

gemini-2.5-flash

Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large-scale processing; low-latency, high-volume tasks that require thinking; and agentic use cases.

ULTRA FAST

gemini-2.5-flash-lite

Our fastest Flash model, optimized for cost-efficiency and high throughput.

Image generating models

You can generate images with either Gemini or Imagen models.


Gemini

gemini-2.5-flash-image

Our standard Flash model upgraded for rapid creative workflows with image generation and conversational, multi-turn editing capabilities. (billing required)

Imagen

imagen-4.0-generate-001

Generates realistic, high-quality images from natural language text prompts. (billing required)

imagen-4.0-fast-generate-001

Generates images for prototyping or low-latency use cases. (billing required)

imagen-4.0-ultra-generate-001

Generates realistic, high-quality images from natural language text prompts. (billing required)

Previous generation models

The following models are still active but belong to a previous generation. We recommend using one of the latest models instead when possible.


Gemini models

OUR SECOND GENERATION WORKHORSE MODEL

gemini-2.0-flash-001

Our second generation workhorse model, with a 1 million token context window. Delivers next-gen features and improved capabilities, including superior speed and native tool use.

OUR SECOND GENERATION FAST MODEL

gemini-2.0-flash-lite-001

Our second generation small workhorse model, with a 1 million token context window. Optimized for cost efficiency and low latency.


The remainder of this page provides detailed information about the models supported by Firebase AI Logic.

  • Compare models:

    • Supported input and output
    • High-level comparison of the supported capabilities
    • Specifications and limitations, for example max input tokens or max length of input video
  • Description of how models are versioned, specifically their stable, auto-updated, preview, and experimental versions

  • Lists of available model names to include in your code during initialization

  • Lists of supported languages for the models

At the bottom of this page, you can view detailed information about previous generation models.



Compare models

Each model has different capabilities to support various use cases. Note that each table in this section describes the models as used with Firebase AI Logic. A model might have additional capabilities that aren't available when using our SDKs.

If you can't find the information you're looking for in the following subsections, see your chosen API provider's documentation for more details.

Supported input and output

These are the supported input and output types when using each model with Firebase AI Logic:

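Models compared (table columns): Gemini 3 Pro; Gemini 2.5 Pro, Flash, Flash-Lite; Gemini 2.5 Flash Image; Gemini 2.0 Flash-Live; Imagen (generate); Imagen (capability)

Input types (table rows):
  • Text
  • Text (streaming)
  • Code
  • Documents (PDFs or plain-text)
  • Images
  • Video
  • Audio
  • Audio (streaming)

Output types (table rows):
  • Text
  • Text (streaming)
  • Code
  • Structured output (like JSON)
  • Images
  • Audio
  • Audio (streaming)

[Table: the per-model support marks are not preserved in this text version.]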

To learn about supported file types, see Supported input files and requirements.

Supported capabilities and features

These are the supported capabilities and features when using each model with Firebase AI Logic:

Models compared (table columns): Gemini 3 Pro; Gemini 2.5 Pro, Flash, Flash-Lite; Gemini 2.5 Flash Image; Gemini 2.0 Flash-Live; Imagen (generate); Imagen (capability)

Capabilities and features (table rows):
  • Thinking
  • Generate text from text-only or multimodal inputs (cell notes for some models: "interleaved or as part of image"; "streaming only")
  • Generate images (Gemini or Imagen)
  • Edit images (Gemini or Imagen)
  • Generate audio (cell note for some models: "streaming only")
  • Generate structured output (like JSON)
  • Analyze documents (PDFs or plain-text)
  • Analyze images (vision)
  • Analyze video (vision)
  • Analyze audio (cell note for some models: "streaming only")
  • Multi-turn chat
  • Bidirectional multimodal streaming
  • Function calling
  • Code execution
  • Grounding with Google Search
  • System instructions
  • Count tokens

[Table: the per-model support marks are not preserved in this text version.]

Specifications and limitations

These are the specifications and limitations when using each model with Firebase AI Logic:

| Property | Gemini 3 Pro | Gemini 2.5 Pro, Flash, Flash-Lite | Gemini 2.5 Flash Image | Gemini 2.0 Flash-Live | Imagen (generate) | Imagen (capability) |
| --- | --- | --- | --- | --- | --- | --- |
| Context window *: total token limit (combined input+output) | 1M tokens | 1,048,576 tokens | 32,768 tokens | 32,768 tokens | 480 tokens | 480 tokens |
| Output token limit * | 64K tokens | 65,536 tokens | 8,192 tokens | 8,192 tokens | --- | --- |
| Knowledge cutoff date | January 2025 | January 2025 | June 2025 | August 2024 | --- | --- |
| PDFs (per request): max number of input PDF files ** | 900 files | 3,000 files | 3 files | --- | --- | --- |
| PDFs (per request): max number of pages per input PDF file ** | 900 pages | 1,000 pages | 3 pages | --- | --- | --- |
| PDFs (per request): max size per input PDF file | 50 MB | 50 MB | 50 MB | --- | --- | --- |
| Images (per request): max number of input images | 1,000 images | 3,000 images | 3 images | --- | --- | 4 images |
| Images (per request): max number of output images | --- | --- | 10 images | --- | 4 images | 4 images |
| Images (per request): max size per input base64-encoded image | 7 MB | 7 MB | 7 MB | --- | --- | --- |
| Video (per request): max number of input video files | 10 files | 10 files | --- | --- | --- | --- |
| Video (per request): max length of all input video (frames only) | ~60 minutes | ~60 minutes | --- | --- | --- | --- |
| Video (per request): max length of all input video (frames+audio) | ~45 minutes | ~45 minutes | --- | --- | --- | --- |
| Audio (per request): max number of input audio files | 1 file | 1 file | --- | --- | --- | --- |
| Audio (per request): max number of output audio files | --- | --- | --- | --- | --- | --- |
| Audio (per request): max length of all input audio | ~8.4 hours | ~8.4 hours | ~8.4 hours | --- | --- | --- |
| Audio (per request): max length of all output audio | --- | --- | --- | --- | --- | --- |

* For all Gemini models, a token is equivalent to about 4 characters, so 100 tokens are about 60-80 English words. For Gemini models, you can determine the total count of tokens in your requests using countTokens.

** PDFs are treated as images, so a single page of a PDF is treated as one image. The number of pages allowed in a request is limited to the number of images the model can support.
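
The two footnotes above lend themselves to a quick pre-flight check before sending a request. The following is a minimal sketch in Python, not part of the Firebase AI Logic SDKs: the function names are hypothetical, the 4-characters-per-token figure is only the rule of thumb stated above, and the limits are passed in by the caller (for example, from the specifications table). For exact counts with Gemini models, use countTokens.

```python
import math
from typing import Sequence

# Rule of thumb from the footnote: one token is about 4 characters.
# This is an approximation, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Roughly estimate the token count of a text prompt."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_context_window(text: str, total_token_limit: int,
                        reserved_output_tokens: int = 0) -> bool:
    """Check an estimated prompt size against a combined input+output limit."""
    return estimate_tokens(text) + reserved_output_tokens <= total_token_limit


def pdf_pages_within_image_limit(pages_per_pdf: Sequence[int],
                                 max_images: int) -> bool:
    """Each PDF page counts as one image, so the total number of pages
    in a request must not exceed the model's image limit."""
    return sum(pages_per_pdf) <= max_images


if __name__ == "__main__":
    prompt = "a" * 400
    print(estimate_tokens(prompt))                       # 100
    print(fits_context_window(prompt, 480))              # True
    print(pdf_pages_within_image_limit([2, 2], 3))       # False
```

Because the estimate is coarse, it is best treated as an early warning rather than a guarantee that a request will be accepted.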




Model versioning and naming patterns

Models are offered in stable, preview, and experimental versions. For convenience, aliases without explicit version values are supported.

To find specific model names to use in your code, see the "available model names" section later on this page.

Each entry below lists a version type (release stage), its description, and its model name pattern.

Stable
  Description: Stable versions are available and supported for production use starting on the release date.
    • A stable model version is typically released with a retirement date, which indicates the last day that the model is available. After this date, the model is no longer accessible or supported by Google.
  Model name pattern:
    • Gemini 2.5 models: Model names of stable versions have no suffix. Example: gemini-2.5-pro
    • Gemini 2.0 and Imagen models: Model names of stable versions are appended with a specific three-digit version number. Examples: gemini-2.0-flash-001, imagen-3.0-generate-002

Auto-updated stable alias (Gemini 2.0 models only)
  Description: Auto-updated stable aliases always point to the latest stable version of that model.
    • If a new stable version is released, the auto-updated alias automatically starts pointing to that new stable version.
  Model name pattern (Gemini 2.0 models only): Model names of aliases have no suffix. Example: gemini-2.0-flash

Preview
  Description: Preview versions have new capabilities and are considered not stable.
    • These models are not recommended for production use, come with more restrictive rate limits, and may have billing requirements.
    • These models are retired within a few weeks or months after their associated stable version is released.
    • For the Vertex AI Gemini API, preview models released after June 2025 usually require you to set the model's location to global.
  Model name pattern: Model names of preview versions are appended with -preview and often the model's release date (-MM-DD for older models or -MM-YYYY for newer models). Examples:
    • gemini-2.5-flash-preview-04-17 (released on April 17, 2025)
    • gemini-2.5-flash-preview-09-2025 (released in September 2025)
    • gemini-3-pro-preview (released in November 2025)

Experimental
  Description: Experimental versions have new capabilities and are considered not stable.
    • These models are not recommended for production use and come with more restrictive rate limits. Experimental models are intended for gathering feedback and to enable experimentation with our latest features.
    • These models are retired within a few weeks or months after their associated stable version is released.
    • For the Vertex AI Gemini API, experimental models released after June 2025 require you to set the model's location to global.
  Model name pattern: Model names of experimental versions are appended with -exp along with the model's release date (-MM-DD). Example: gemini-2.5-pro-exp-03-25 (released on March 25, 2025)

Retired
  Description: Retired versions are past their retirement date and have been permanently deactivated.
    • Retired models are no longer accessible or supported by Google, and a request using a retired model name returns a 404 error.
  Model name pattern: ---
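
The naming patterns above can be turned into a small classifier, which may help when auditing model names hard-coded in an app. This is a minimal sketch in Python based only on the patterns described in this section. The function name `classify_model_name` is hypothetical, and the heuristic cannot tell whether a model has been retired, since retirement is not encoded in the name.

```python
import re


def classify_model_name(name: str) -> str:
    """Guess the release stage of a model name from its suffix."""
    # Experimental: "-exp" followed by a release date in -MM-DD form.
    if re.search(r"-exp-\d{2}-\d{2}$", name):
        return "experimental"
    # Preview: "-preview", optionally followed by -MM-DD or -MM-YYYY.
    if re.search(r"-preview(-\d{2}-\d{2}|-\d{2}-\d{4})?$", name):
        return "preview"
    # Gemini 2.0 and Imagen stable versions: three-digit version suffix.
    if re.search(r"-\d{3}$", name):
        return "stable"
    # Gemini 2.0 names with no suffix are auto-updated stable aliases.
    if name.startswith("gemini-2.0-"):
        return "auto-updated stable alias"
    # Remaining names with no suffix (such as Gemini 2.5) are stable versions.
    return "stable"


if __name__ == "__main__":
    for model in [
        "gemini-2.5-pro",
        "gemini-2.0-flash-001",
        "gemini-2.0-flash",
        "gemini-2.5-flash-preview-09-2025",
        "gemini-2.5-pro-exp-03-25",
    ]:
        print(model, "->", classify_model_name(model))
```

The checks run from most specific to least specific suffix, so a dated preview name is never mistaken for a versioned stable name.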



Available model names

Model names are the explicit values that you include in your code during initialization of the model.


Programmatically list all available models

You can list all available model names using the REST API.

Note that this returned list will include all models supported by the API providers, but Firebase AI Logic only supports the Gemini and Imagen models described on this page. Also note that auto-updated aliases (for example, gemini-2.0-flash) aren't listed because they're a convenience alias for the base model.
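
Because the returned list covers every model offered by the API provider, it can be useful to filter it down to the names documented on this page. The sketch below, in Python, makes no network call and is only illustrative. It assumes the response is a JSON object with a `models` array whose entries carry a `name` field, possibly prefixed with a resource path such as `models/`. That shape, the function name, and the sample payload are assumptions, so check your API provider's reference for the actual format.

```python
# Model names copied from this page. Update this set as the page changes.
SUPPORTED_MODEL_NAMES = {
    "gemini-3-pro-preview",
    "gemini-2.5-pro",
    "gemini-2.5-flash",
    "gemini-2.5-flash-lite",
    "gemini-2.5-flash-image",
    "gemini-live-2.5-flash",
    "gemini-live-2.5-flash-preview",
    "gemini-2.0-flash-live-001",
    "gemini-2.0-flash-live-preview-04-09",
    "gemini-2.0-flash-001",
    "gemini-2.0-flash-lite-001",
    "imagen-4.0-generate-001",
    "imagen-4.0-fast-generate-001",
    "imagen-4.0-ultra-generate-001",
    "imagen-3.0-capability-001",
    "imagen-3.0-generate-002",
    "imagen-3.0-fast-generate-001",
}


def supported_model_names(list_response: dict) -> list:
    """Return the model names in a list response that appear on this page.

    Assumes entries look like {"name": "models/<model-name>"}; only the
    last path segment is compared against the supported set.
    """
    names = []
    for entry in list_response.get("models", []):
        short_name = entry.get("name", "").rsplit("/", 1)[-1]
        if short_name in SUPPORTED_MODEL_NAMES:
            names.append(short_name)
    return names


if __name__ == "__main__":
    # Hypothetical payload for illustration only.
    sample = {
        "models": [
            {"name": "models/gemini-2.5-flash"},
            {"name": "models/some-other-model"},
            {"name": "models/imagen-4.0-generate-001"},
        ]
    }
    print(supported_model_names(sample))
```

An explicit allowlist is used instead of a prefix match on `gemini-` or `imagen-` because the provider may list Gemini or Imagen variants that Firebase AI Logic does not support.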

Gemini model names

For initialization examples for your platform, see the getting started guide.

For details about the release stages (especially for use cases, billing, and retirement), see model versioning and naming patterns.

Gemini 3 Pro model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-3-pro-preview | Preview version of Gemini 3 Pro | Preview | 2025-11-18 | To be determined |

Gemini 2.5 Pro model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-2.5-pro | Stable version of Gemini 2.5 Pro | Stable | 2025-06-17 | No earlier than 2026-06-17 |

Gemini 2.5 Flash model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-2.5-flash | Stable version of Gemini 2.5 Flash | Stable | 2025-06-17 | No earlier than 2026-06-17 |

Gemini 2.5 Flash‑Lite model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-2.5-flash-lite | Stable version of Gemini 2.5 Flash‑Lite | Stable | 2025-07-22 | No earlier than 2026-07-22 |

Gemini 2.5 Flash Image model names (aka "nano banana")

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-2.5-flash-image | Stable version for Gemini 2.5 Flash Image (aka "nano banana") | Stable | 2025-10-02 | No earlier than 2026-10-02 |

Gemini 2.5 Flash Live model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-live-2.5-flash | Stable version for Gemini 2.5 Flash Live | Stable (private GA) [1] | 2025-06-01 | No earlier than 2026-06-01 |
| gemini-live-2.5-flash-preview [2] | Preview version for Gemini 2.5 Flash Live | Preview | 2025-06-01 | To be determined |

Firebase AI Logic does not yet support the native audio models (like gemini-2.5-flash-native-audio-preview-09-2025).

[1] Reach out to your Google Cloud account team representative to request access.
[2] Not supported by the Vertex AI Gemini API no matter how you access it.

Gemini 2.0 Flash Live model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| gemini-2.0-flash-live-001 [2] | Stable version for Gemini 2.0 Flash Live | Stable | 2025-04-01 | No earlier than 2026-04-01 |
| gemini-2.0-flash-live-preview-04-09 | Preview version for Gemini 2.0 Flash Live | Preview | 2025-04-09 | To be determined |

[2] Not supported by the Vertex AI Gemini API no matter how you access it.

Imagen model names

For initialization examples for your platform, see the generate images with Imagen guide.

For details about the release stages (especially for use cases, billing, and retirement), see model versioning and naming patterns.

Imagen 4 model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| imagen-4.0-generate-001 | Stable version of Imagen 4 | Stable | 2025-08-14 | No earlier than 2026-08-14 |

Imagen 4 Fast model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| imagen-4.0-fast-generate-001 | Stable version of Imagen 4 Fast | Stable | 2025-08-14 | No earlier than 2026-08-14 |

Imagen 4 Ultra model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| imagen-4.0-ultra-generate-001 | Stable version of Imagen 4 Ultra | Stable | 2025-08-14 | No earlier than 2026-08-14 |

Imagen 3 Capability model names

| Model name | Description | Release stage | Release date | Retirement date |
| --- | --- | --- | --- | --- |
| imagen-3.0-capability-001 [3] | Initial stable version of Imagen 3 Capability | Stable | 2024-12-10 | No earlier than 2025-12-10 |

[3] Not supported by the Gemini Developer API no matter how you access it.



Supported languages

Gemini

  • All the Gemini models can understand and respond in the following languages:

    Arabic (ar), Bengali (bn), Bulgarian (bg), Chinese simplified and traditional (zh), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Finnish (fi), French (fr), German (de), Greek (el), Hebrew (iw), Hindi (hi), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Latvian (lv), Lithuanian (lt), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Serbian (sr), Slovak (sk), Slovenian (sl), Spanish (es), Swahili (sw), Swedish (sv), Thai (th), Turkish (tr), Ukrainian (uk), Vietnamese (vi)

  • Gemini 2.0 Flash, Gemini 1.5 Pro and Gemini 1.5 Flash models can understand and respond in the following additional languages:

    Afrikaans (af), Amharic (am), Assamese (as), Azerbaijani (az), Belarusian (be), Bosnian (bs), Catalan (ca), Cebuano (ceb), Corsican (co), Welsh (cy), Dhivehi (dv), Esperanto (eo), Basque (eu), Persian (fa), Filipino (Tagalog) (fil), Frisian (fy), Irish (ga), Scots Gaelic (gd), Galician (gl), Gujarati (gu), Hausa (ha), Hawaiian (haw), Hmong (hmn), Haitian Creole (ht), Armenian (hy), Igbo (ig), Icelandic (is), Javanese (jv), Georgian (ka), Kazakh (kk), Khmer (km), Kannada (kn), Krio (kri), Kurdish (ku), Kyrgyz (ky), Latin (la), Luxembourgish (lb), Lao (lo), Malagasy (mg), Maori (mi), Macedonian (mk), Malayalam (ml), Mongolian (mn), Meiteilon (Manipuri) (mni-Mtei), Marathi (mr), Malay (ms), Maltese (mt), Myanmar (Burmese) (my), Nepali (ne), Nyanja (Chichewa) (ny), Odia (Oriya) (or), Punjabi (pa), Pashto (ps), Sindhi (sd), Sinhala (Sinhalese) (si), Samoan (sm), Shona (sn), Somali (so), Albanian (sq), Sesotho (st), Sundanese (su), Tamil (ta), Telugu (te), Tajik (tg), Uyghur (ug), Urdu (ur), Uzbek (uz), Xhosa (xh), Yiddish (yi), Yoruba (yo), Zulu (zu)

Imagen

  • General availability: English

  • Preview: Chinese (simplified), Chinese (traditional), Hindi, Japanese, Korean, Portuguese, Spanish



Information about previous models

The following models are still active but belong to a previous generation. We recommend using one of the latest models instead when possible.

Gemini models

| Model | Model name | Input | Output | Description |
| --- | --- | --- | --- | --- |
| Gemini 2.0 Flash | gemini-2.0-flash-001 | text, code, PDFs, images, video, audio | text, code, JSON | Our multimodal model with next-gen features and improved capabilities, including superior speed, built-in tool use, and a 1M token context window. |
| Gemini 2.0 Flash‑Lite | gemini-2.0-flash-lite-001 | text, code, PDFs, images, video, audio | text, code, JSON | Our fastest and most cost-efficient Flash model. It's an upgrade path for 1.5 Flash users who want better quality for the same price and speed. |

Imagen models

| Model | Model name | Input | Output | Description |
| --- | --- | --- | --- | --- |
| Imagen 3 | imagen-3.0-generate-002 | text | images | Generates realistic, high-quality images from natural language text prompts. |
| Imagen 3 Fast [2] | imagen-3.0-fast-generate-001 | text | images | Generates images for prototyping or low-latency use cases. |

[2] Not supported by the Gemini Developer API no matter how you access it.



Next steps

Try out the capabilities of the Gemini API