Language Identification

With ML Kit's on-device language identification API, you can determine the language of a string of text.

Language identification can be useful when working with user-provided text, which often doesn't come with any language information.

Key capabilities

Broad language support: Identifies over a hundred different languages. See the complete list.
Romanized text support: Identifies Arabic, Bulgarian, Greek, Hindi, Japanese, Russian, and Chinese text in both native and romanized script.
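
Romanized results are reported with a script subtag, as in the zh-Latn code shown under Example results. The following is a minimal sketch in plain Python, not ML Kit code. It assumes result codes follow the BCP-47 pattern of a language subtag optionally followed by a four-letter script subtag, and describe_tag is a hypothetical helper name.

```python
def describe_tag(tag: str) -> dict:
    """Split a result code such as "zh-Latn" into its parts.

    Hypothetical helper for illustration only; it is not part of ML Kit.
    """
    parts = tag.split("-")
    language = parts[0].lower()
    # Assumption: a four-letter alphabetic subtag is a script (e.g. "Latn").
    script = next(
        (p.title() for p in parts[1:] if len(p) == 4 and p.isalpha()),
        None,
    )
    return {
        "language": language,
        "script": script,
        "romanized": script == "Latn",       # text written in Latin script
        "undetermined": language == "und",   # no language could be identified
    }


print(describe_tag("zh-Latn"))
print(describe_tag("und"))
```

A caller could use the romanized flag to decide, for example, whether text needs transliteration before further processing.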

Example results

Simple language identification
"My hovercraft is full of eels." -> en (English)
"Dao shan xue hai" -> zh-Latn (Latinized Chinese)
"ph'nglui mglw'nafh TensorFlow Google wgah'nagl fhtagn" -> und (undetermined)

Confidence distribution
"an amicable coup d'etat" -> en (0.52), fr (0.44), ca (0.03)
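
One way to read a confidence distribution like the one above is to take the highest-scoring candidate and fall back to und when nothing is confident enough. The sketch below is plain Python rather than ML Kit code. The function name pick_language and the 0.5 default threshold are illustrative assumptions, not values taken from this page.

```python
def pick_language(candidates, threshold=0.5):
    """Return the most likely language tag, or "und" if none clears the threshold.

    candidates: list of (tag, confidence) pairs.
    threshold: assumed cut-off for illustration; tune it for your application.
    """
    if not candidates:
        return "und"
    # Choose the pair with the highest confidence value.
    tag, confidence = max(candidates, key=lambda pair: pair[1])
    return tag if confidence >= threshold else "und"


# The distribution from the example above.
candidates = [("en", 0.52), ("fr", 0.44), ("ca", 0.03)]

print(pick_language(candidates))                 # en
print(pick_language(candidates, threshold=0.6))  # und
```

A stricter threshold trades coverage for precision: with 0.6, the mixed English and French phrase is treated as undetermined instead of English.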