Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
-
Updated
Jun 22, 2026 - C
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
⚡️ 80x faster Fasttext language detection out of the box | Split text by language
Simple embedding based text classifier inspired by fastText, implemented in tensorflow
Textpipe: clean and extract metadata from text
Vietnamese NLP Toolkit for Node
[EMNLP 2023] 💬 Language Identification with Support for More Than 2000 Labels
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
A TensorFlow-based spoken language identification
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux
Add a description, image, and links to the language-identification topic page so that developers can more easily learn about it.
To associate your repository with the language-identification topic, visit your repo's landing page and select "manage topics."