speech-recognition

Here are 4,654 public repositories matching this topic...

HenestrosaDev / audiotext

A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.

python speech-recognition speech-to-text transcriber video-to-text audio-to-text speech-to-text-api subtitles-generator customtkinter whisperx

Updated Jun 12, 2024
Python

flozi00 / atra

An open-source NLP-as-a-service project focused on making state-of-the-art systems easy to use. Training and inference run through simple Docker commands.

chatbot speech transformers inference speech-recognition asr llm stable-diffusion

Updated Jun 12, 2024
Jupyter Notebook

KevKibe / African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jun 12, 2024
Python

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Updated Jun 12, 2024
Python

openvinotoolkit / openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated Jun 12, 2024
C++

ShoukoChan / Voice-to-Text

Voice to Text Model using OpenAI's Whisper

python torch speech-recognition cuda-toolkit asr nlp-deep-learning openai-whisper

Updated Jun 12, 2024
Jupyter Notebook

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Jun 12, 2024
C

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 12, 2024
Python

ae9is / subtitle-chan

Live speech transcription and translation in your browser

react translator typescript translation stream live subtitles speech-recognition google-translate vite subtitles-generator

Updated Jun 12, 2024
TypeScript

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated Jun 12, 2024
Python

semperai / amica

Amica is an open-source interface for interactive communication with 3D characters, with voice synthesis and speech recognition.

ai computer-vision tts speech-recognition assistant-chat-bots llm

Updated Jun 12, 2024
TypeScript

akscf / mod_whisper_asr

FreeSWITCH ASR module for working with whisper.cpp

speech-recognition freeswitch speech-to-text whisper-cpp

Updated Jun 12, 2024
C

huuquyet / PhoWhisper-next

Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js

vietnamese nextjs speech-recognition whisper vinai onnx-models phowhisper transformersjs

Updated Jun 12, 2024
TypeScript

deepgram / deepgram-python-sdk

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated Jun 12, 2024
Python

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!