language-identification

Here are 174 public repositories matching this topic...

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Updated Jun 22, 2026
C

googlesamples / mlkit

Star

A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS

google translation barcode text-recognition face-detection object-detection barcode-scanner mlkit language-identification image-labeling ml-kit smart-reply mlkit-android mlkit-genai mlkit-genai-summarization mlkit-genai-image-description mlkit-genai-proofreading mlkit-genai-rewriting

Updated Jan 28, 2026
Java

modelscope / 3D-Speaker

Star

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn

Updated Dec 8, 2025
Python

pemistahl / lingua-py

Star

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

nlp natural-language-processing python-library language-detection language-recognition language-identification language-classification

Updated Jun 18, 2026
Python

pemistahl / lingua-go

Star

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

nlp go natural-language-processing language-detection language-modeling golang-library text-processing nlp-machine-learning language-recognition language-processing language-identification language-classification

Updated Feb 6, 2025
Go

pemistahl / lingua-rs

Star

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

nlp rust natural-language-processing language-detection rust-library nlp-machine-learning language-recognition language-processing rust-crate language-identification language-classification

Updated Mar 26, 2026
Rust

pemistahl / lingua

Star

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

nlp natural-language-processing natural-language kotlin-library language-detection android-library java-library nlp-library nlp-machine-learning language-recognition language-processing language-identification language-classification

Updated Mar 21, 2025
Kotlin

A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.

open-source speech-recognition vad automatic-speech-recognition asr lid language-identification sota voice-activity-detection asr-pipeline punctuation-restoration audio-event-classification llm punctuation-prediction industrial-grade multimodal-llm speechllm audio-event-detection

Updated Jun 2, 2026
Python

echogarden-project / echogarden

Star

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.