Welcome to Speakr¶
Speakr is a powerful self-hosted transcription platform that helps you capture, transcribe, and understand your audio content. Whether you're recording meetings, interviews, lectures, or personal notes, Speakr transforms spoken words into valuable, searchable knowledge.
Latest Release: v0.9.0-alpha — Multi-platform recording, Stats tab, mobile rebuild, design-system unification
The first non-patch release in the v0.8 line. Three big user-facing themes: capturing audio is now multi-platform with a per-OS help guide and a Web Audio mixing path for capturing both sides of a meeting; a new Stats tab shows per-recording metrics; the mobile detail view is a first-class member of the design system. Upload modal redesigned, inquire mode polished, dark-mode select dropdowns finally legible.
See the full release notes for the complete list. Backwards compatible with v0.8.x; database migrations run automatically.
Quick Navigation¶
Core Features¶
Speakr takes a recording from raw audio to organized, searchable, shareable knowledge. The pipeline:
Capture
- Mic, system/tab audio, or both mixed
- Hours-long server-side recording sessions
- Drag-and-drop upload and black-hole auto-import
Transcribe
- Bring your own engine: WhisperX, OpenAI, Mistral, custom ASR
- Speaker diarization and voice profiles (WhisperX backend)
- Auto-detect plus 11 common languages
- Custom vocabulary and hotwords (most effective with WhisperX)
Understand
- Customizable AI summaries
- Event extraction and per-recording chat
- Inquire Mode: semantic search across everything
Organize
- Smart tags with custom prompts, stackable
- Folders and bulk operations
- Retention policies and auto-deletion
Collaborate
- Granular internal sharing and public links
- Groups with auto-share group tags
- Multi-user with Single Sign-On (OIDC)
Automate
- REST API v1 with Swagger UI
- Signed webhooks on lifecycle events
- n8n, Zapier, Make integration
Interactive Audio Synchronization¶
Experience seamless bidirectional synchronization between your audio and transcript. Click any part of the transcript to jump directly to that moment in the audio, or watch as the system automatically highlights the currently spoken text as the audio plays. Enable auto-scroll follow mode to keep the active segment centered in view, creating an effortless reading experience for even the longest recordings.
Real-time transcript highlighting synchronized with audio playback, with auto-scroll follow mode
Learn more about audio synchronization features in the user guide.
Transform Your Recordings with Custom Tag Prompts
Tags aren't just for organization - they transform content. Create a "Recipe" tag to convert cooking narration into formatted recipes. Use "Study Notes" tags to turn lecture recordings into organized outlines. Stack tags like "Client Meeting" + "Legal Review" for combined analysis. Learn more in the Custom Prompts guide.
Latest Updates¶
Version 0.9.0-alpha - Multi-platform recording, Stats tab, mobile rebuild, design-system unification
The first non-patch release in the v0.8 line. Three big user-facing themes: capturing audio is now multi-platform, the mobile app is a first-class member of the design system, and the upload modal stops feeling like a desktop card pasted onto a phone. Backwards compatible with v0.8.x; database migrations run automatically.
- System Audio & Multi-Input Recording - Platform detection with a per-OS help guide (macOS BlackHole + Multi-Output Device, Windows "Share system audio", Linux pavucontrol +
pactl module-virtual-source). New Input devices picker mixes a primary mic plus an optional secondary device via Web Audio into one track, with a toggle to disable Chrome's echo cancellation / noise suppression / auto-gain and virtual-audio-device discovery. - Stats Tab - New per-recording tab: total length, speaker count, turns, and words as headline cards; per-speaker time / % / turns / words / WPM breakdown; silence row. Available on desktop and mobile.
- Upload Modal Redesign - Real modal overlay (not a full-screen takeover), progressive disclosure of Options behind a chip summary, inline file preview with duration probe, sticky-footer Upload action, last-used tag / folder / language auto-restore, and a mobile bottom-sheet with drag-to-dismiss.
- Mobile UI Rebuild - 56 px bottom navigation, contextual icons in the chevron row, edge-to-edge content, sticky speaker pills, sticky editor Cancel / Save footer, and audio-player polish.
- PWA Web Share Target - Pick Speakr from your phone's native share sheet to send a recording straight in.
- Webhooks - HMAC-SHA256-signed outbound notifications on recording lifecycle events, with SSRF guard and exponential-backoff retries, managed per-user from Account settings → Webhooks.
- Server-side recording sessions - Long recordings stream chunks to the server during capture; the size cap is replaced by a configurable hours-based ceiling with resume-on-reload.
- Design-system unification - 22 modals on shared
.modal-*primitives,.btn+.fieldeverywhere, dark-mode<select>theming, header consolidation, sidebar redesign, floating dockable chat panel. - Inquire mode - "+ New Recording" opens the upload modal directly via
?upload=1. Also:GET /api/v1/users/me, an audio-player position preference, and a localization refresh across all seven languages.
See the full release notes for the complete list.
Version 0.8.21-alpha - Security: CSRF bypass and SSO account takeover
Security patch release on top of v0.8.20-alpha. Tracked as a GitHub Security Advisory; reported by @Irench1k.
- Fixed a CSRF bypass where the
csrf_exempt_for_api_tokensbefore_request hook permanently disabled CSRF protection on the targeted view as soon as any request carried a?token=query parameter (CWE-287). The hook is gone; CSRF skipping is now a per-request decision driven byload_user_from_token_headers_only(). change_passwordno longer silently sets a password on an SSO-only account, closing the chained account-takeover path.
Earlier releases
The full version history (the rest of the v0.8.x line and the v0.5 to v0.7 releases) is on the GitHub Releases page.
Getting Help¶
Need assistance? We're here to help:
Documentation
You're already here! Browse our comprehensive guides:
Ready to transform your audio into actionable insights? Get started now →