Trance
Digital Nirvana’s pioneering and advanced speech-to-text engines enable content creators to generate highly accurate audio and video content transcripts. The powerful Trance UI allows users to easily navigate, edit and export caption files in all industry-recognized formats. Built-in AI along with custom preset capabilities ensure caption conformance with style guidelines from various delivery platforms.Trance is designed to use machine learning capabilities to enhance the process of generating transcripts, closed captions, and subtitling for media content. Further, Trance also boasts an industry-first tool, Natural Language Processing capabilities. Our NLP technology enables transcript splitting based on grammar rules and styles for individual streaming platforms. Auto-generate captions to conform with multiple style guidelines and file types - all in the shortest time frame possible.
Learn more
Subanana
Subanana is an AI speech-to-text web app that turns audio and video into subtitles, transcripts, and meeting summaries in 80+ languages, with standout accuracy on Asian and mixed-language speech (Cantonese, Mandarin, Japanese, Korean, and code-switching) that English-first tools handle poorly.
Subtitles: import a file or a YouTube/Instagram/Facebook link, edit with a glossary and AI auto-correct, and export SRT, VTT, TXT, DOCX, bilingual subtitles, or burned-in video.
Transcripts: speaker labels, filler-word removal, automatic punctuation and paragraphs.
Meeting summaries: templates, decisions and action items, plus a Google Meet and Microsoft Teams recording bot that processes the meeting after it ends.
Live captions: real-time captioning with translation for events.
Learn more
EKHOS AI
EKHOS AI is a secure offline transcription software developed for professionals who work with sensitive audio data. It performs accurate speech-to-text conversion without relying on cloud services, ensuring that all files remain local and private. Designed with legal, medical, academic, and research use cases in mind, EKHOS AI supports common audio formats and offers features such as timestamped transcriptions, multi-speaker diarization, segment tagging, and export to multiple text formats. An intuitive editor is included to review and refine transcripts directly within the app. The software also supports real-time audio recording and playback. EKHOS AI is built to perform reliably on a wide range of Windows systems, offering practical functionality for users who prioritize data control, security, and data privacy.
Learn more
Rev
Rev provides premium on-demand, manual and automated transcription, closed caption, and foreign subtitling services. With 170,000+ customers, Rev's clients span from global enterprises to freelance journalists. Rev processes more audio and video than any other provider and has the ability to scale to fit any customer's needs. Pricing is simple starting at just $0.25 per audio/video minute for automated speech-to-text services and $1.25/min for manual with 99% accuracy. Rev also offers Rev.ai which is a speech recognition engine that's available to companies that want it.
Learn more