
ElevenLabs
ElevenLabs is an AI voice platform for text-to-speech, voice cloning, dubbing, speech-to-text, voice agents, and generative audio APIs.

ElevenLabs is an AI voice platform for text-to-speech, voice cloning, dubbing, speech-to-text, voice agents, and generative audio APIs.

A deep learning toolkit for Text-to-Speech, battle-tested in research and production

Hailuo AI TTS, also tied to MiniMax Audio, is a text-to-speech and voice-generation product for multilingual AI voices, voice cloning, and audio content workflows.

The best and most realistic voice tools currently available

IndexTTS is Bilibili’s open-source industrial-grade controllable and efficient zero-shot text-to-speech system. It is best for speech researchers and developers who need controllable TTS experiments, not for casual users looking for a polished web voice app.

ML-powered speech recognition directly in your browser. Built with Transformers.js.

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

WhisperDesktop is a Windows desktop app and high-performance GPGPU port for running OpenAI Whisper speech recognition locally, with file and microphone transcription workflows.

Buzz is a free, open-source desktop app for offline audio transcription and translation powered by OpenAI Whisper. It imports audio and video, exports TXT/SRT/VTT/CSV subtitles, supports microphones, Whisper.cpp, Faster Whisper, Hugging Face models, OpenAI API, CLI workflows, speaker identification, and speech separation.

Port of OpenAI's Whisper model in C/C++

OpenAPI open source robust speech recognition model through large-scale weak supervision

Open-source Android real-time translator that can run locally for offline, privacy-conscious speech and text translation.

Open source project. Crossword translation browser plugin and cross-platform desktop application based on ChatGPT API

Immersive Translate is a bilingual translation extension and reading tool for web pages, PDFs, subtitles, ebooks, images, and input boxes. It is strongest when users need to understand foreign-language content while keeping the original text visible.

DeepL is a high-quality AI translation and writing platform for text, full documents, terminology control, writing refinement, and developer translation APIs. It is best for individuals and teams that need accurate multilingual communication with glossary and privacy controls.

Google's service, offered free of charge, instantly translates words, phrases, and web pages between English and over 100 other languages.

Add-in for Microsoft Word that seamlessly integrates essential AI tools, including text generation, proofreading, and more, directly into the user interface.

Edit and correct your grammar, spelling, punctuation, and more with your personal writing assistant, grammar checker, and editor.

DeepL Write is a tool that helps you perfect your writing. Write clearly, precisely, with ease, and without errors. Try for free now!

Build custom agents, search across all your apps, and automate busywork. The AI workspace where teams get more done, faster.

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

This repo includes ChatGPT prompt curation to use ChatGPT better.

The AI Acceleration Cloud. Train, fine-tune and run inference on AI models blazing fast, at low cost, and at production scale.

Use Viidx AI, the ultimate, free AI video generator, to create cinematic videos from text prompts, images, or videos in seconds — no editing experience required. Bring your ideas to life with high-quality, fast AI-powered video creation.