Firebase AI Logic supports Gemini 3 Pro and Gemini 3 Pro Image (nano banana pro) for use on all platforms (in preview).

Bu sayfa, Cloud Translation API ile çevrilmiştir.

Live API'nin yapılandırma seçenekleri

Live API için temel uygulama ile bile kullanıcılarınız için ilgi çekici ve güçlü etkileşimler oluşturabilirsiniz. Aşağıdaki yapılandırma seçeneklerini kullanarak deneyimi daha da özelleştirebilirsiniz:

Yanıt sesi ve dili
Ses girişi ve çıkışı için transkripsiyonlar
Ses etkinliği algılama (VAD)
Oturum yönetimi

Yanıt sesi ve dili

Modelin belirli bir sesle yanıt vermesini sağlayabilir ve modeli farklı dillerde yanıt vermeye yönlendirebilirsiniz.

Yanıt sesini belirtme

Bu sayfada sağlayıcıya özel içerikleri ve kodu görüntülemek için Gemini API sağlayıcınızı tıklayın.

Live API, HD seslerde sentezlenmiş konuşma yanıtlarını desteklemek için Chirp 3'ü kullanır.

Yanıt sesi belirtmezseniz varsayılan olarak Puck kullanılır.

Yanıt ses seçeneklerinin listesini görüntüleme

Her sesin nasıl duyulduğuna dair demolar için Chirp 3: HD sesler başlıklı makaleyi inceleyin.

Zephyr -- Parlak
Kore -- Sert
Orus -- Sert
Autonoe -- Parlak
Umbriel -- Rahat
Erinome -- Net
Laomedeia -- Neşeli
Schedar -- Dengeli
Achird -- Samimi
Sadachbia -- Canlı Puck -- Neşeli
Fenrir -- Heyecanlı
Aoede -- Sakin
Enceladus -- Fısıltılı
Algieba -- Akıcı
Algenib -- Gıcırtılı
Achernar -- Yumuşak
Gacrux -- Olgun
Zubenelgenubi -- Günlük
Sadaltager -- Bilgili Charon -- Bilgilendirici
Leda -- Genç
Callirrhoe -- Rahat
Iapetus -- Net
Despina -- Akıcı
Rasalgethi -- Bilgilendirici
Alnilam -- Kararlı
Pulcherrima -- İleriye dönük
Vindemiatrix -- Nazik
Sulafat -- Sıcakkanlı

Yanıt sesi belirtmek için speechConfig nesnesinde ses adını model yapılandırmasının bir parçası olarak ayarlayın.

Swift


// ...

let liveModel = FirebaseAI.firebaseAI(backend: .googleAI()).liveModel(
  modelName: "gemini-2.5-flash-native-audio-preview-09-2025",
  // Configure the model to use a specific voice for its audio response
  generationConfig: LiveGenerationConfig(
    responseModalities: [.audio],
    speech: SpeechConfig(voiceName: "VOICE_NAME")
  )
)

// ...

Kotlin


// ...

val model = Firebase.ai(backend = GenerativeBackend.googleAI()).liveModel(
    modelName = "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to use a specific voice for its audio response
    generationConfig = liveGenerationConfig {
        responseModality = ResponseModality.AUDIO
        speechConfig = SpeechConfig(voice = Voice("VOICE_NAME"))
    }
)

// ...

Java


// ...

LiveGenerativeModel lm = FirebaseAI.getInstance(GenerativeBackend.googleAI()).liveModel(
    "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to use a specific voice for its audio response
    new LiveGenerationConfig.Builder()
        .setResponseModality(ResponseModality.AUDIO)
        .setSpeechConfig(new SpeechConfig(new Voice("VOICE_NAME")))
        .build()
);

// ...

Web


// ...

const ai = getAI(firebaseApp, { backend: new GoogleAIBackend() });

const liveModel = getLiveGenerativeModel(ai, {
  model: "gemini-2.5-flash-native-audio-preview-09-2025",
  // Configure the model to use a specific voice for its audio response
  generationConfig: {
    responseModalities: [ResponseModality.AUDIO],
    speechConfig: {
      voiceConfig: {
        prebuiltVoiceConfig: { voiceName: "VOICE_NAME" },
      },
    },
  },
});

// ...

Dart


// ...

final _liveModel = FirebaseAI.googleAI().liveGenerativeModel(
  model: 'gemini-2.5-flash-native-audio-preview-09-2025',
  // Configure the model to use a specific voice for its audio response
  liveGenerationConfig: LiveGenerationConfig(
    responseModalities: [ResponseModalities.audio],
    speechConfig: SpeechConfig(voiceName: 'VOICE_NAME'),
  ),
);

// ...

Unity


// ...

var liveModel = FirebaseAI.GetInstance(FirebaseAI.Backend.GoogleAI()).GetLiveModel(
    modelName: "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to use a specific voice for its audio response
    liveGenerationConfig: new LiveGenerationConfig(
        responseModalities: new[] { ResponseModality.Audio },
        speechConfig: SpeechConfig.UsePrebuiltVoice("VOICE_NAME")
    )
);

// ...

Yanıt dilini etkileme

Live API modelleri, yanıtları için uygun dili otomatik olarak seçer.

Desteklenen dillerin listesini görüntüleme

Dil	BCP-47 kodu	Dil	BCP-47 kodu
Arapça (Mısır)	ar-EG	Almanca (Almanya)	de-DE
İngilizce (ABD)	en-US	İspanyolca (ABD)	es-US
Fransızca (Fransa)	fr-FR	Hintçe (Hindistan)	hi-IN
Endonezce (Endonezya)	id-ID	İtalyanca (İtalya)	it-IT
Japonca (Japonya)	ja-JP	Korece (Kore)	ko-KR
Portekizce (Brezilya)	pt-BR	Rusça (Rusya)	ru-RU
Felemenkçe (Hollanda)	nl-NL	Lehçe (Polonya)	pl-PL
Tayca (Tayland)	th-TH	Türkçe (Türkiye)	tr-TR
Vietnamca (Vietnam)	vi-VN	Rumence (Romanya)	ro-RO
Ukraynaca (Ukrayna)	uk-UA	Bengalce (Bangladeş)	bn-BD
İngilizce (Hindistan)	en-IN ve hi-IN paketi	Marathi dili (Hindistan)	mr-IN
Tamilce (Hindistan)	ta-IN	Telugu dili (Hindistan)	te-IN

Modelin İngilizce olmayan bir dilde veya her zaman belirli bir dilde yanıt vermesini istiyorsanız aşağıdaki örneklerdeki gibi sistem talimatları kullanarak modelin yanıtlarını etkileyebilirsiniz:

Modele, İngilizce olmayan bir dilin uygun olabileceğini belirtin.

Listen to the speaker carefully. If you detect a non-English language, respond
in the language you hear from the speaker. You must respond unmistakably in the
speaker's language.

Modele her zaman belirli bir dilde yanıt vermesini söyleme

RESPOND IN LANGUAGE. YOU MUST RESPOND UNMISTAKABLY IN LANGUAGE.

Ses girişi ve çıkışı için transkriptler

Bu sayfada sağlayıcıya özel içerikleri ve kodu görüntülemek için Gemini API sağlayıcınızı tıklayın.

Modelin yanıtı kapsamında, ses girişinin ve modelin sesli yanıtının transkriptlerini alabilirsiniz. Bu yapılandırmayı model yapılandırması kapsamında ayarlarsınız.

Ses girişinin transkripsiyonu için inputAudioTranscription ekleyin.
Modelin sesli yanıtının transkripsiyonu için outputAudioTranscription simgesini ekleyin.

Aşağıdakileri göz önünde bulundurun:

Modeli hem girişin hem de çıkışın transkriptlerini döndürecek şekilde (aşağıdaki örnekte gösterildiği gibi) veya yalnızca birini ya da diğerini döndürecek şekilde yapılandırabilirsiniz.
Transkriptler sesle birlikte yayınlanır. Bu nedenle, her dönüşte metin bölümlerini topladığınız gibi transkriptleri de toplamanız önerilir.
Metne dönüştürme dili, ses girişinden ve modelin sesli yanıtından çıkarılır.

Swift


// ...

let liveModel = FirebaseAI.firebaseAI(backend: .googleAI()).liveModel(
  modelName: "gemini-2.5-flash-native-audio-preview-09-2025",
  // Configure the model to return transcriptions of the audio input and output
  generationConfig: LiveGenerationConfig(
    responseModalities: [.audio],
    inputAudioTranscription: AudioTranscriptionConfig(),
    outputAudioTranscription: AudioTranscriptionConfig()
  )
)

var inputTranscript: String = ""
var outputTranscript: String = ""

do {
  let session = try await liveModel.connect()
  for try await response in session.responses {
    if case let .content(content) = response.payload {
      if let inputText = content.inputAudioTranscription?.text {
        // Handle transcription text of the audio input
        inputTranscript += inputText
      }

      if let outputText = content.outputAudioTranscription?.text {
        // Handle transcription text of the audio output
        outputTranscript += outputText
      }

      if content.isTurnComplete {
        // Log the transcripts after the current turn is complete
        print("Input audio: \(inputTranscript)")
        print("Output audio: \(outputTranscript)")

        // Reset the transcripts for the next turn
        inputTranscript = ""
        outputTranscript = ""
      }
    }
  }


} catch {
  // Handle error
}

// ...

Kotlin


// ...

val liveModel = Firebase.ai(backend = GenerativeBackend.googleAI()).liveModel(
    modelName = "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to return transcriptions of the audio input and output
    generationConfig = liveGenerationConfig {
        responseModality = ResponseModality.AUDIO
        inputAudioTranscription = AudioTranscriptionConfig()
        outputAudioTranscription = AudioTranscriptionConfig()
   }
)

val liveSession = liveModel.connect()

fun handleTranscription(input: Transcription?, output: Transcription?) {
    input?.text?.let { text ->
        // Handle transcription text of the audio input
        println("Input Transcription: $text")
    }
    output?.text?.let { text ->
        // Handle transcription text of the audio output
        println("Output Transcription: $text")
    }
}

liveSession.startAudioConversation(null, ::handleTranscription)

// ...

Java


// ...

ExecutorService executor = Executors.newFixedThreadPool(1);

LiveGenerativeModel lm = FirebaseAI.getInstance(GenerativeBackend.googleAI()).liveModel(
    "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to return transcriptions of the audio input and output
    new LiveGenerationConfig.Builder()
            .setResponseModality(ResponseModality.AUDIO)
            .setInputAudioTranscription(new AudioTranscriptionConfig())
            .setOutputAudioTranscription(new AudioTranscriptionConfig())
            .build()
    );

LiveModelFutures liveModel = LiveModelFutures.from(lm);
ListenableFuture sessionFuture = liveModel.connect();

Futures.addCallback(sessionFuture, new FutureCallback() {
    @Override
    public void onSuccess(LiveSessionFutures ses) {
        LiveSessionFutures session = ses;
        session.startAudioConversation((Transcription input, Transcription output) -> {
            if (input != null) {
                // Handle transcription text of the audio input
                System.out.println("Input Transcription: " + input.getText());
            }
            if (output != null) {
                // Handle transcription text of the audio output
                System.out.println("Output Transcription: " + output.getText());
            }
            return null;
        });
    }

    @Override
    public void onFailure(Throwable t) {
        // Handle exceptions
        t.printStackTrace();
    }
}, executor);

// ...

Web


// ...

const ai = getAI(firebaseApp, { backend: new GoogleAIBackend() });

const liveModel = getLiveGenerativeModel(ai, {
  model: 'gemini-2.5-flash-native-audio-preview-09-2025',
  // Configure the model to return transcriptions of the audio input and output
  generationConfig: {
    responseModalities: [ResponseModality.AUDIO],
    inputAudioTranscription: {},
    outputAudioTranscription: {},
  },
});

const liveSession = await liveModel.connect();

liveSession.sendAudioRealtime({ data, mimeType: "audio/pcm" });

const messages = liveSession.receive();
for await (const message of messages) {
  switch (message.type) {
    case 'serverContent':
      if (message.inputTranscription) {
        // Handle transcription text of the audio input
        console.log(`Input transcription: ${message.inputTranscription.text}`);
      }
      if (message.outputTranscription) {
        // Handle transcription text of the audio output
        console.log(`Output transcription: ${message.outputTranscription.text}`);
      } else {
      	 // Handle other message types (modelTurn, turnComplete, interruption)
      }
    default:
      // Handle other message types (toolCall, toolCallCancellation)
  }
}

// ...

Dart


// ...

final _liveModel = FirebaseAI.googleAI().liveGenerativeModel(
  model: 'gemini-2.5-flash-native-audio-preview-09-2025',
  // Configure the model to return transcriptions of the audio input and output
  liveGenerationConfig: LiveGenerationConfig(
    responseModalities: [ResponseModalities.audio],
    inputAudioTranscription: AudioTranscriptionConfig(),
    outputAudioTranscription: AudioTranscriptionConfig(),
  ),
);

final LiveSession _session = _liveModel.connect();

await for (final response in _session.receive()) {
  LiveServerContent message = response.message;
  if (message.inputTranscription?.text case final inputText?) {
    // Handle transcription text of the audio input
    print('Input: $inputText');
  }

  if (message.outputTranscription?.text case final outputText?) {
    // Handle transcription text of the audio output
    print('Output: $outputText');
  }
}

// ...

Unity


// ...

var liveModel = FirebaseAI.GetInstance(FirebaseAI.Backend.GoogleAI()).GetLiveModel(
    modelName: "gemini-2.5-flash-native-audio-preview-09-2025",
    // Configure the model to return transcriptions of the audio input and output
    liveGenerationConfig: new LiveGenerationConfig(
        responseModalities: new[] { ResponseModality.Audio },
        inputAudioTranscription: new AudioTranscriptionConfig(),
        outputAudioTranscription: new AudioTranscriptionConfig()
    )
);

try
{
    var session = await liveModel.ConnectAsync();
    var stream = session.ReceiveAsync();
    await foreach (var response in stream) {
        if (response.Message is LiveSessionContent sessionContent) {
            if (!string.IsNullOrEmpty(sessionContent.InputTranscription?.Text)) {
              // handle transcription text of input audio
            }

            if (!string.IsNullOrEmpty(sessionContent.OutputTranscription?.Text)) {
              // handle transcription text of output audio
            }
        }
    }
}
catch (Exception e)
{
    // Handle error
}

// ...

Ses etkinliği algılama (VAD)

Model, sürekli bir ses girişi akışında otomatik olarak ses etkinliği algılama (VAD) gerçekleştirir. VAD varsayılan olarak etkindir.

Oturum yönetimi

Oturumlarla ilgili aşağıdaki konular hakkında bilgi edinin:
- Aşağıdakiler gibi gelişmiş özellikler:
  - Oturum ortasında sistem talimatlarını güncelleme
  - Artımlı içerik güncellemeleri ekleme
- Bağlantı ve oturum süresi sınırları, oturum bağlam penceresi sınırları ve sıklık sınırları dahil olmak üzere oturumla ilgili sınırlar.
Firebase AI Logic, oturum yönetimi için aşağıdaki özellikleri henüz desteklemiyor. Bir süre sonra tekrar kontrol edin.
- Kesintileri ele alma
- Oturum süresini uzatma
- Oturuma devam etme
- Oturumlar ve istekler arasında bağlamı koruma
- Bağlam penceresini sıkıştırma

Live API'nin yapılandırma seçenekleri Koleksiyonlar ile düzeninizi koruyun İçeriği tercihlerinize göre kaydedin ve kategorilere ayırın.

Yanıt sesi ve dili

Yanıt sesini belirtme

Swift

Kotlin

Java

Web

Dart

Unity

Yanıt dilini etkileme

Ses girişi ve çıkışı için transkriptler

Swift

Kotlin

Java

Web

Dart

Unity

Ses etkinliği algılama (VAD)

Oturum yönetimi

Live API'nin yapılandırma seçenekleri