starters by deepgram

Clone a working demo and start building. Every starter is a minimal, runnable app you can extend.

npx skills add https://github.com/deepgram/skills --skill starters

Deepgram Starter Apps

1. Pick Your Feature

What do you want to build?

  • Transcribe a file → transcription: send audio/video, get text back (REST, Nova)
  • Transcribe a live stream → live-transcription: real-time speech-to-text (WebSocket, Nova)
  • Generate speech → text-to-speech: send text, get audio back (REST)
  • Stream speech → live-text-to-speech: real-time text-to-audio (WebSocket)
  • Analyze text or audio → text-intelligence: sentiment, topics, intents, summaries (REST)
  • Build a voice agent → voice-agent: conversational AI agent (WebSocket, agent.deepgram.com)
  • Conversational STT with turn detection → flux: Deepgram Flux for voice agents and interactive assistants (WebSocket, /v2/listen)

Nova vs Flux for speech-to-text: use transcription or live-transcription (Nova, /v1/listen) for general-purpose transcription, captions, and batch workloads. Use flux (Flux, /v2/listen) when you need built-in turn detection for conversational audio. See the api skill for a full comparison.
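
The Nova vs Flux rule of thumb can be written as a small decision helper. This is only a sketch: the function name `pick_stt_starter` and its two boolean parameters are hypothetical, and the mapping restates the guidance in this section and nothing more.

```python
def pick_stt_starter(live: bool, needs_turn_detection: bool) -> str:
    """Return the starter feature slug for a speech-to-text use case.

    Hypothetical helper that encodes the Nova vs Flux guidance above.
    """
    if needs_turn_detection:
        # Conversational audio with built-in turn detection: Flux (/v2/listen).
        # Flux is listed as WebSocket only, so `live` is not consulted here.
        return "flux"
    # General-purpose transcription, captions, batch workloads: Nova (/v1/listen).
    return "live-transcription" if live else "transcription"


if __name__ == "__main__":
    print(pick_stt_starter(live=True, needs_turn_detection=False))
```

The returned slug is the {feature} part of the starter repository name used in step 3.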

2. Pick Your Stack

Language     Frameworks
JavaScript   node
TypeScript   bun, deno
Python       fastapi, flask, django
Go           go
Java         java
C#           csharp
Rust         rust
Ruby         ruby
PHP          php
C++          cpp

3. Clone and Run

Every starter lives at https://github.com/deepgram-starters/{framework}-{feature}:

git clone https://github.com/deepgram-starters/{framework}-{feature}.git
cd {framework}-{feature}
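
If you script the clone step, the {framework}-{feature} pattern can be filled in programmatically. The sketch below assumes the framework and feature slugs listed on this page are the complete set; the names `FRAMEWORKS`, `FEATURES`, and `starter_clone_url` are hypothetical, and the function does not check that a repository actually exists on GitHub.

```python
# Slugs copied from the "Pick Your Stack" and "Pick Your Feature" sections.
FRAMEWORKS = {
    "node", "bun", "deno", "fastapi", "flask", "django", "go",
    "java", "csharp", "rust", "ruby", "php", "cpp",
}
FEATURES = {
    "transcription", "live-transcription", "text-to-speech",
    "live-text-to-speech", "text-intelligence", "voice-agent", "flux",
}


def starter_clone_url(framework: str, feature: str) -> str:
    """Build the git clone URL for one starter (hypothetical helper)."""
    if framework not in FRAMEWORKS:
        raise ValueError(f"unknown framework: {framework!r}")
    if feature not in FEATURES:
        raise ValueError(f"unknown feature: {feature!r}")
    return f"https://github.com/deepgram-starters/{framework}-{feature}.git"


if __name__ == "__main__":
    print(starter_clone_url("fastapi", "voice-agent"))
```

Rejecting unknown slugs up front turns a typo into a clear error instead of a failed git clone.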

Set your API key and follow the README:

export DEEPGRAM_API_KEY=your_key_here

Get an API key at https://console.deepgram.com.
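
A starter will typically fail at its first request if the key is missing, so it can help to check the environment before running anything. The following is a minimal sketch, not code taken from any starter: `require_api_key` is a hypothetical helper, and treating the literal placeholder `your_key_here` as unset is an assumption made for illustration.

```python
import os
from typing import Mapping, Optional


def require_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return DEEPGRAM_API_KEY from the environment, or raise if it is unset.

    Hypothetical helper; `env` defaults to os.environ and exists so the
    check can be exercised without touching the real environment.
    """
    env = os.environ if env is None else env
    key = env.get("DEEPGRAM_API_KEY", "").strip()
    # Assumption: the placeholder from the export example counts as "not set".
    if not key or key == "your_key_here":
        raise RuntimeError(
            "DEEPGRAM_API_KEY is not set; get a key at https://console.deepgram.com"
        )
    return key
```

Calling `require_api_key()` at startup fails early with a pointer to the console instead of surfacing an authentication error later.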

Examples

"I want to build a voice agent in Python" → git clone https://github.com/deepgram-starters/fastapi-voice-agent.git

"I need live transcription in my Node app" → git clone https://github.com/deepgram-starters/node-live-transcription.git

"I want to add text-to-speech to my Go service" → git clone https://github.com/deepgram-starters/go-text-to-speech.git

"I want to analyze audio for sentiment in C#" → git clone https://github.com/deepgram-starters/csharp-text-intelligence.git

All Starters

Each "repo" entry refers to the repository at https://github.com/deepgram-starters/{framework}-{feature}.

Framework  transcription  live-transcription  text-to-speech  live-text-to-speech  text-intelligence  voice-agent  flux
node       repo           repo                repo            repo                 repo               repo         repo
bun        repo           repo                repo            repo                 repo               repo         repo
deno       repo           repo                repo            repo                 repo               repo         repo
fastapi    repo           repo                repo            repo                 repo               repo         repo
flask      repo           repo                repo            repo                 repo               repo         repo
django     repo           repo                repo            repo                 repo               repo         repo
go         repo           repo                repo            repo                 repo               repo         repo
java       repo           repo                repo            repo                 repo               repo         repo
csharp     repo           repo                repo            repo                 repo               repo         repo
rust       repo           repo                repo            repo                 repo               repo         repo
ruby       repo           repo                repo            repo                 repo               repo         repo
php        repo           repo                repo            repo                 repo               repo         repo
cpp        repo           repo                repo            repo                 repo               repo         repo
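
Because every cell in the table follows the naming pattern from step 3, the whole matrix can be generated instead of typed out. This sketch assumes each framework and feature pair in the table has a repository, which it does not verify against GitHub; `all_starter_repos` and the two tuples are hypothetical names.

```python
from itertools import product
from typing import Dict, Tuple

# Row and column labels of the "All Starters" table, in table order.
FRAMEWORKS = (
    "node", "bun", "deno", "fastapi", "flask", "django", "go",
    "java", "csharp", "rust", "ruby", "php", "cpp",
)
FEATURES = (
    "transcription", "live-transcription", "text-to-speech",
    "live-text-to-speech", "text-intelligence", "voice-agent", "flux",
)


def all_starter_repos() -> Dict[Tuple[str, str], str]:
    """Map each (framework, feature) cell to its repository URL."""
    return {
        (fw, feat): f"https://github.com/deepgram-starters/{fw}-{feat}"
        for fw, feat in product(FRAMEWORKS, FEATURES)
    }


if __name__ == "__main__":
    for (fw, feat), url in all_starter_repos().items():
        print(f"{fw:<8} {feat:<20} {url}")
```

With 13 frameworks and 7 features, the mapping has 91 entries, one per table cell.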

Need something more specific?

  • Focused feature snippets (one feature, one language, < 50 lines) → recipes skill → https://github.com/deepgram/recipes
  • Third-party integrations (Twilio, LiveKit, LangChain, Vercel AI SDK, Discord, etc.) → examples skill → https://github.com/deepgram/examples
  • SDK-specific code skills (idiomatic imports, async patterns, gotchas) → npx skills add deepgram/deepgram-{lang}-sdk — see the api skill for the full list of 9 SDKs.

Related Deepgram skills

  • api — consolidated REST + WebSocket API reference
  • recipes — minimal runnable feature snippets per language
  • examples — full integration examples with third-party platforms
  • docs — documentation finder
  • setup-mcp — Deepgram MCP server installation
