Kokoro usage improvements by michalkulakowski · Pull Request #4357 · openvinotoolkit/model_server

michalkulakowski · 2026-07-03T13:56:19Z

🛠 Summary

JIRA/Issue if applicable.
Describe the changes.

🧪 Checklist

Unit tests added.
The documentation updated.
Change follows security best practices.

Copilot

Pull request overview

This PR updates Kokoro text-to-speech (TTS) integration by shifting voice embedding discovery to runtime (from the model directory) and moving espeak-ng from an OVMS binary dependency to a separately built artifact in Docker builds.

Changes:

- Load Kokoro voice embeddings from `<models_path>/voices/*.bin` when the graph doesn’t explicitly specify voices.
- Remove Bazel `--//:espeak=on/off` flag plumbing and build espeak-ng as an optional standalone Docker step (`ARG ESPEAK=1/0`).
- Adjust tests and export tooling to align with the new Kokoro usage expectations.

Reviewed changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 4 comments.


File	Description
windows_build.bat	Removes Bazel espeak flag wiring from Windows build invocation.
third_party/BUILD	Makes espeak-ng aliases unconditional (no build-flag select).
src/test/graph_export_test.cpp	Removes Kokoro voices list expectations and voices/ directory precreation.
src/test/audio/text2speech_test.cpp	Adds config validation test for missing `voices` in graph.
src/graph_export/graph_export.cpp	Stops enumerating Kokoro `voices/*.bin` into generated graph templates.
src/BUILD	Removes espeak-ng deps from the OVMS binary build.
src/audio/text_to_speech/t2s_servable.cpp	Implements fallback loading of embeddings from `<models_path>/voices` when graph voices are omitted.
src/audio/text_to_speech/t2s_calculator.cc	Minor comment cleanup in voice-selection error handling.
Makefile	Removes Bazel espeak flag usage; passes ESPEAK as a Docker build arg.
Dockerfile.ubuntu	Adds optional standalone Bazel build step for espeak-ng targets controlled by `ARG ESPEAK`.
Dockerfile.redhat	Same as Ubuntu Dockerfile: optional standalone espeak-ng build step.
distro.bzl	Removes the Bazel espeak build flag definition/config settings.
demos/common/export_models/export_model.py	Changes TTS exporter behavior, including default `--model_type`.
demos/audio/README.md	Adds documentation for ASR leaderboard-based transcription evaluation.
common_settings.bzl	Stops loading/creating the removed espeak flag config settings.

 #include <fstream>
 #include <sstream>
 #include <limits>
+#include <vector>


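+static std::vector<std::filesystem::path> getVoiceEmbeddingPaths(const std::filesystem::path& voicesDir) {
+    std::vector<std::filesystem::path> voicePaths;
+    std::error_code ec;
+    // If the directory cannot be opened, the iterator equals end() and the loop body
+    // never runs, so the error code has to be checked before iterating.
+    std::filesystem::directory_iterator it(voicesDir, ec);
+    if (ec) {
+        throw std::runtime_error("Failed to iterate voices directory: " + voicesDir.string());
+    }
+    for (const auto& entry : it) {
+        // Skip directories and entries whose status cannot be read.
+        if (!entry.is_regular_file(ec) || ec) {
+            ec.clear();
+            continue;
+        }
+        if (entry.path().extension() == ".bin") {
+            voicePaths.emplace_back(entry.path());
+        }
+    }
+    // Directory iteration order is unspecified; sort for a deterministic result.
+    std::sort(voicePaths.begin(), voicePaths.end());
+    return voicePaths;
+}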
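
As a rough illustration of how the sorted paths might be consumed, the sketch below keys each embedding file by a voice name taken from its file stem. The helper name `indexVoicesByName`, the stem-based naming rule, and the example file names are all assumptions; the hunk above does not show how the servable names voices.

```cpp
#include <filesystem>
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper: index embedding files by voice name, taking the name from the
// file stem ("voices/af_heart.bin" -> "af_heart"). The naming rule is an assumption.
std::map<std::string, std::filesystem::path> indexVoicesByName(
    const std::vector<std::filesystem::path>& voicePaths) {
    std::map<std::string, std::filesystem::path> voices;
    for (const auto& path : voicePaths) {
        const std::string name = path.stem().string();
        if (name.empty()) {
            throw std::invalid_argument("Voice file has no name: " + path.string());
        }
        // emplace() leaves the map untouched when the key already exists.
        if (!voices.emplace(name, path).second) {
            throw std::invalid_argument("Duplicate voice name: " + name);
        }
    }
    return voices;
}
```

Rejecting duplicate names up front avoids silently picking one of two files that would otherwise map to the same voice.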


 add_common_arguments(parser_text2speech)
 parser_text2speech.add_argument('--num_streams', default=0, type=int, help='The number of parallel execution streams to use for the models in the pipeline.', dest='num_streams')
-parser_text2speech.add_argument('--model_type', default='speecht5', choices=['speecht5', 'kokoro'], help='Type of the source TTS model. speecht5 uses optimum-cli; kokoro uses a dedicated PyTorch->OpenVINO conversion path.', dest='model_type')
+parser_text2speech.add_argument('--model_type', default='kokoro', choices=['speecht5', 'kokoro'], help='Type of the source TTS model. speecht5 uses optimum-cli; kokoro uses a dedicated PyTorch->OpenVINO conversion path.', dest='model_type')


+        [type.googleapis.com / mediapipe.T2sCalculatorOptions]: {
+            models_path: "/ovms/models_audio/Kokoro-82M"
+            plugin_config: '{"NUM_STREAMS": "1" }',
+            target_device: "CPU"
+        }
+        }


Copilot AI review requested due to automatic review settings July 3, 2026 13:56

Copilot started reviewing on behalf of michalkulakowski July 3, 2026 13:56

Kokoro usage improvements

80f3156

michalkulakowski force-pushed the mkulakow/kokoro_improvements branch from 666f708 to 80f3156 on July 3, 2026 13:59

Copilot AI reviewed Jul 3, 2026

michalkulakowski wants to merge 1 commit into main from mkulakow/kokoro_improvements
