Skip to Content
👋 Welcome to HowToUseOpenClaw Quick Start
Skills CatalogSpeech & Transcription

Speech & Transcription

Skills in this category help with speech & transcription. Install any skill using npx clawdhub@latest install <skill-slug>.

Skill List

assemblyai-transcribe

Description: Transcribe audio/video with AssemblyAI (local upload.

Install:

npx clawdhub@latest install assemblyai-transcribe

audio-gen

Description: Generate audiobooks, podcasts, or educational audio content on demand.

Install:

npx clawdhub@latest install audio-gen

audio-reply

Description: Generate audio replies using TTS. Trigger with “read it to me [URL]” to fetch.

Install:

npx clawdhub@latest install audio-reply-skill

edge-tts

Description: See ClawdHub page

Install:

npx clawdhub@latest install edge-tts

gettr-transcribe-summarize

Description: Download audio from a GETTR post (via HTML og:video), transcribe it locally.

Install:

npx clawdhub@latest install gettr-transcribe-summarize

llmwhisperer

Description: Extract text and layout from images and PDFs using LLMWhisperer API.

Install:

npx clawdhub@latest install llmwhisperer

local-whisper

Description: Local speech-to-text using OpenAI Whisper. Runs fully offline after model download.

Install:

npx clawdhub@latest install local-whisper

mlx-whisper

Description: Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).

Install:

npx clawdhub@latest install mlx-whisper

openai-whisper

Description: Local speech-to-text with the Whisper CLI (no API key).

Install:

npx clawdhub@latest install openai-whisper

openai-whisper-api

Description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

Install:

npx clawdhub@latest install openai-whisper-api

parakeet-mlx

Description: Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

Install:

npx clawdhub@latest install parakeet-mlx

parakeet-stt

Description: >-.

Install:

npx clawdhub@latest install parakeet-stt

pocket-transcripts

Description: Read transcripts and summaries from Pocket AI (heypocket.com) recording devices.

Install:

npx clawdhub@latest install heypocket-reader

pocket-tts

Description: pocket-tts

Install:

npx clawdhub@latest install pocket-tts

tts-whatsapp

Description: Send high-quality text-to-speech voice messages on WhatsApp in 40+ languages with automatic delivery.

Install:

npx clawdhub@latest install tts-whatsapp

video-subtitles

Description: Generate SRT subtitles from video/audio with translation support.

Install:

npx clawdhub@latest install video-subtitles

voice-transcribe

Description: Transcribe audio files using OpenAI’s gpt-4o-mini-transcribe model with vocabulary hints.

Install:

npx clawdhub@latest install voice-transcribe

elevenlabs-voices

Description: ElevenLabs voice synthesis: 18 personas, 32 languages, sound effects.

Install:

npx clawdhub@latest install elevenlabs-voices

elevenlabs-media

Description: ElevenLabs music generation and speech-to-text (Scribe v2).

Install:

npx clawdhub@latest install clawdbotborges

elevenlabs-agents

Description: Create and manage ElevenLabs conversational AI agents.

Install:

npx clawdhub@latest install elevenlabs-agents

tts

Description: Text-to-speech using Hume AI or OpenAI API.

Install:

npx clawdhub@latest install tts

Resources

Last updated on: