Skip to content

server : add Voice Activity Detection (VAD) support #3246

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Jun 13, 2025

Conversation

danbev
Copy link
Member

@danbev danbev commented Jun 13, 2025

This commit adds support for Voice Activity Detection (VAD) in the server example.

The motivation for this is to enable VAD processing when using whisper-server.

Resolves: #3089

danbev added 2 commits June 13, 2025 09:57
This commit adds support for Voice Activity Detection (VAD) in the
server example.

The motivation for this is to enable VAD processing when using
whisper-server.

Resolves: ggml-org#3089
This commit also adds a few missing parameters.
danbev added a commit to danbev/whisper.cpp that referenced this pull request Jun 13, 2025
This commit fixes a short name conflict whisper-cli for
`--vad-min-speech-duration-ms` and `--vad-min-silence-duration-ms` which
currently have the same short name `-vsd`.

Refs: ggml-org#3246 (review)
danbev added a commit that referenced this pull request Jun 13, 2025
This commit fixes a short name conflict whisper-cli for
`--vad-min-speech-duration-ms` and `--vad-min-silence-duration-ms` which
currently have the same short name `-vsd`.

Refs: #3246 (review)
@danbev danbev merged commit 0a4d85c into ggml-org:master Jun 13, 2025
bygreencn added a commit to bygreencn/whisper.cpp that referenced this pull request Jun 29, 2025
* ggerganov/master: (335 commits)
  server : add Voice Activity Detection (VAD) support (ggml-org#3246)
  cli : fix short name conflict for vad options [no ci] (ggml-org#3247)
  ruby : add .gitignore entries for ext directory (ggml-org#3245)
  ci : update windows runner to windows-2022 (ggml-org#3242)
  ruby : add cleaning of library names in dependencies (ggml-org#3241)
  ggml : fix weak alias win32 (#0)
  android : fix builds (#0)
  sync : ggml
  files : remove old sources (part 2)
  sync : ggml
  files : remove old sources
  talk-llama : sync llama.cpp
  sync : ggml
  metal : use less stack memory in FA kernel (llama/14088)
  ggml-cpu : split arch-specific implementations (llama/13892)
  cuda : fix device sync on buffer clear (llama/14033)
  CANN: Simplify the environment variable setting(#13104)
  sycl: Add reorder to Q6_K mmvq implementation (llama/13885)
  cuda : fix buffer type check with integrated GPUs (llama/14069)
  SYCL: Implement few same quantized type copy kernels (llama/13739)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

server : add VAD support
2 participants