
linto.ai

Your Open Source end-to-end platform for voice-operated solutions

LinTO AI

Open Source Ecosystem for Transcription, Collaborative Media Management, Annotation, Live Subtitling, and Summarization

Overview

LinTO AI provides a suite of open-source tools for transcription, collaborative media editing, annotation, live subtitling, and summarization with large language models (LLMs).

Hosted by LINAGORA

Quick Start

  • LinTO Studio: 🎤 A media management platform offering advanced tools for transcription and collaborative media editing. Key features include:
    • Speaker identification/diarization: Automatically segment and identify speakers.
    • Automatic timestamp alignment: Synchronize transcripts with media.
    • Collaborative editing: Edit media annotations and transcriptions together in real time.
    • Summarization: Generate concise summaries of media content using LLMs.
    • Building and syncing subtitles: Create and synchronize subtitles for video content with ease.
    • Live transcription from the browser: Record and transcribe audio directly from your browser.
    • AI Agent for videoconferences: A bot that joins videoconferences and captures their live audio streams for transcription and subtitling. LinTO Studio can then act as an assistant during meetings, using videoconference platforms as live audio sources.

LinTO Studio leverages our other technologies, including:

  • LinTO-STT for speech-to-text conversion.
  • LinTO-Diarization for speaker segmentation and identification.
  • LLM-Gateway for advanced summarization.
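
How the outputs of these services might combine can be sketched in a few lines. The example below is a minimal illustration, not LinTO Studio's actual code: it assumes word-level timestamps (as a speech-to-text service could return) and speaker turns (as a diarization service could return), both in hypothetical dictionary shapes, and labels each word with the speaker whose turn overlaps it the most.

```python
# Hypothetical data shapes, chosen for illustration only:
#   word: {"text": str, "start": float, "end": float}
#   turn: {"speaker": str, "start": float, "end": float}

def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))

def assign_speakers(words: list, turns: list) -> list:
    """Return copies of the words, each tagged with the best-overlapping speaker."""
    labeled = []
    for word in words:
        best_speaker = None
        best_overlap = 0.0
        for turn in turns:
            shared = overlap(word["start"], word["end"], turn["start"], turn["end"])
            if shared > best_overlap:
                best_speaker, best_overlap = turn["speaker"], shared
        # A word that overlaps no turn keeps speaker=None.
        labeled.append({**word, "speaker": best_speaker})
    return labeled

# Sample data (invented for the example).
words = [
    {"text": "Hello", "start": 0.0, "end": 0.4},
    {"text": "there", "start": 0.5, "end": 0.9},
    {"text": "Hi", "start": 1.2, "end": 1.5},
]
turns = [
    {"speaker": "spk1", "start": 0.0, "end": 1.0},
    {"speaker": "spk2", "start": 1.0, "end": 2.0},
]
labeled = assign_speakers(words, turns)
```

With the sample data, the first two words are attributed to `spk1` and the third to `spk2`; a word that overlaps no turn keeps `speaker` set to `None`.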

To deploy LinTO Studio and its associated services, use the LinTO Deployment Tool, which simplifies the setup process.

Key Projects

  • LinTO-STT: 🗣️ An automatic speech recognition API supporting both offline and real-time transcription. It accommodates models such as Kaldi and Whisper and can operate as a standalone service or within a microservices infrastructure.

  • Whisper-Timestamped: ⏱️ A multilingual automatic speech recognition tool providing word-level timestamps and confidence scores. It enhances OpenAI's Whisper models to deliver more precise transcriptions with detailed timing information.

  • LLM-Gateway: 📝 A service dedicated to rolling summarization using large language models (LLMs), enabling efficient summarization of long texts.

  • LinTO-Diarization: 🔊 A speaker diarization service that partitions audio streams into segments that each contain a single speaker, and can identify those speakers when audio samples of known speakers are provided.

  • WebVoiceSDK: 🌐 A JavaScript library offering lightweight and optimized building blocks for always-listening voice-enabled applications directly in the browser. It manages various aspects of voice input, including hardware microphone handling, voice activity detection, and wake word detection.
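
The idea of rolling summarization mentioned for LLM-Gateway can be illustrated with a short sketch. This is an illustration under stated assumptions, not LLM-Gateway's implementation or API: the text is split into fixed-size chunks of words, and each chunk is summarized together with the summary carried over from the earlier chunks. The `summarize` callable stands in for an LLM call and is hypothetical; the stand-in used here, `keep_last_words`, only truncates.

```python
from typing import Callable, List

def split_into_chunks(words: List[str], chunk_size: int) -> List[List[str]]:
    """Split a list of words into consecutive chunks of at most chunk_size words."""
    return [words[i:i + chunk_size] for i in range(0, len(words), chunk_size)]

def rolling_summary(text: str, summarize: Callable[[str], str], chunk_size: int = 200) -> str:
    """Summarize a long text chunk by chunk, carrying the running summary forward."""
    summary = ""
    for chunk in split_into_chunks(text.split(), chunk_size):
        # Each step sees the previous summary plus the next chunk of the text.
        context = (summary + " " + " ".join(chunk)).strip()
        summary = summarize(context)
    return summary

def keep_last_words(text: str, limit: int = 5) -> str:
    """Hypothetical stand-in for an LLM call: keeps only the last `limit` words."""
    return " ".join(text.split()[-limit:])

result = rolling_summary(
    "one two three four five six seven eight nine ten",
    keep_last_words,
    chunk_size=4,
)
```

Because the summarizer only ever receives the running summary plus one chunk, the size of each call stays bounded regardless of the length of the input. In practice `summarize` would wrap a real model call.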

Get Involved

LinTO AI is committed to open-source development, ensuring our tools are accessible and adaptable, fostering innovation in business-aware media transcription and summarization. For more information or to contribute, contact us at [email protected].

Pinned

  1. linto-stt Public

    An automatic speech recognition API

    Python · 48 stars · 14 forks

  2. linto-studio Public

    Transcription and annotation interface for recorded audio or video files

    JavaScript · 27 stars · 1 fork

  3. whisper-timestamped Public

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Python · 2.1k stars · 162 forks

Repositories

Showing 10 of 50 repositories
  • linto-punctuation Public

    LinTO Platform punctuation service.

    Python · 5 stars · AGPL-3.0 · 1 fork · 0 issues · 0 pull requests · Updated Dec 21, 2024
  • linto-transcription Public

    Transcription service for LinTO stack.

    Python · 3 stars · AGPL-3.0 · 0 forks · 1 issue · 0 pull requests · Updated Dec 21, 2024
  • linto-diarization Public

    Speaker diarization service

    Python · 20 stars · AGPL-3.0 · 0 forks · 1 issue · 2 pull requests · Updated Dec 21, 2024
  • linto-stt Public

    An automatic speech recognition API

    Python · 48 stars · AGPL-3.0 · 14 forks · 4 issues · 3 pull requests · Updated Dec 21, 2024
  • linto-studio Public

    Transcription and annotation interface for recorded audio or video files

    JavaScript · 27 stars · AGPL-3.0 · 1 fork · 5 issues · 2 pull requests · Updated Dec 20, 2024
  • llm-gateway Public

    Rolling summarization using LLM

    Python · 1 star · 0 forks · 2 issues · 0 pull requests · Updated Dec 19, 2024
  • linto Public

    Start here!

    Jsonnet · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Dec 18, 2024
  • linto-studio-plugins Public

    Live WebSocket, RTMP, and SRT streaming plugins for LinTO Studio

    JavaScript · 1 star · EUPL-1.2 · 0 forks · 0 issues · 0 pull requests · Updated Dec 18, 2024
  • .github Public
    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Dec 12, 2024
  • whisper-timestamped Public

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

    Python · 2,137 stars · AGPL-3.0 · 162 forks · 36 issues (1 issue needs help) · 1 pull request · Updated Dec 6, 2024