[Sentis] Fix ONNX Model Import: Kokoro-82M TTS Failures

Solution

audio system, administration, neural networks, AI

Unity 2023.2.x - Unity 6.3.x

Published Sun, Mar 8

Reviewed by _architect (Chief Editor)

Issue

Unity Sentis fails to import the Kokoro-82M Text-to-Speech (TTS) ONNX model and reports that several ONNX operators the model depends on are unsupported: ConcatFromSequence, Loop, SequenceEmpty, and SplitToSequence.

Explanation

Newer Sentis releases add the loop and sequence-based ONNX operators that Kokoro-82M relies on, so the model imports once the package is updated.

Open the Package Manager and update Sentis to version 2.1.0 or later to ensure compatibility with Loop and Sequence operations.
Import your ONNX file into the Project window to allow the Sentis importer to generate a ModelAsset automatically.
Convert raw text into phonemes and then into the token IDs the model expects, using a text-processing (grapheme-to-phoneme) library, before passing them to the Sentis runtime.
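Create a Worker for the loaded model on a GPU backend such as BackendType.GPUCompute for best performance. In Sentis 2.x the worker is constructed directly (new Worker(model, BackendType.GPUCompute)); WorkerFactory and the ComputePrecompiled backend belong to the older Sentis 1.x and Barracuda APIs.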
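Run the model, read the output waveform back as a float array, copy it into an AudioClip with AudioClip.SetData, and play that clip through an AudioSource. In Sentis 2.x the output type is Tensor<float> (formerly TensorFloat), and inference is started with Schedule rather than Execute.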
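The steps above can be sketched as a single component. This is a minimal illustration written against the Unity.Sentis namespace of Sentis 2.1, not a definitive implementation; if your package version uses a different namespace, adjust the using directive. Several details are assumptions that the original post does not state and that you should check against your own export of the model: the input names "input_ids", "style", and "speed", the presence of a style (voice) vector at all, and the 24 kHz mono output rate. The class name KokoroSpeaker is hypothetical.

```csharp
using Unity.Sentis;
using UnityEngine;

// Minimal sketch: run a Kokoro-style TTS model with Sentis 2.x and play the result.
// ASSUMPTIONS (not from the article): input names, style vector input, 24 kHz mono output.
[RequireComponent(typeof(AudioSource))]
public class KokoroSpeaker : MonoBehaviour
{
    // ModelAsset generated by the Sentis importer from the .onnx file.
    [SerializeField] ModelAsset modelAsset;

    // Assumed output sample rate; verify it for your model export.
    const int SampleRate = 24000;

    Worker worker;

    void Awake()
    {
        // Load the runtime model and create a worker on the GPU compute backend.
        Model model = ModelLoader.Load(modelAsset);
        worker = new Worker(model, BackendType.GPUCompute);
    }

    // tokenIds: phoneme token IDs produced by your text-processing library.
    // styleVector: voice embedding (assumed input; remove if your export has none).
    public void Speak(int[] tokenIds, float[] styleVector, float speed = 1f)
    {
        // Input tensors are disposed when this method returns, after the blocking readback.
        using var ids = new Tensor<int>(new TensorShape(1, tokenIds.Length), tokenIds);
        using var style = new Tensor<float>(new TensorShape(1, styleVector.Length), styleVector);
        using var rate = new Tensor<float>(new TensorShape(1), new[] { speed });

        // Assumed input names; inspect the imported model to confirm them.
        worker.SetInput("input_ids", ids);
        worker.SetInput("style", style);
        worker.SetInput("speed", rate);
        worker.Schedule();

        // PeekOutput returns a tensor owned by the worker, so it is not disposed here.
        var waveform = worker.PeekOutput() as Tensor<float>;
        if (waveform == null)
        {
            Debug.LogError("Model output is not a float tensor.");
            return;
        }

        // Blocking copy of the output samples to the CPU.
        float[] samples = waveform.DownloadToArray();
        if (samples.Length == 0)
        {
            Debug.LogWarning("Model produced no audio samples.");
            return;
        }

        // SetData lives on AudioClip; the AudioSource only plays the clip.
        AudioClip clip = AudioClip.Create("KokoroSpeech", samples.Length, 1, SampleRate, false);
        clip.SetData(samples, 0);

        var source = GetComponent<AudioSource>();
        source.clip = clip;
        source.Play();
    }

    void OnDestroy()
    {
        // Workers hold native and GPU resources and must be released explicitly.
        worker?.Dispose();
    }
}
```

The readback here is synchronous to keep the sketch short, which can stall the main thread on longer sentences; an asynchronous readback may be preferable in production.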

Additional Tips

Use the Profiler to monitor the memory footprint of the Kokoro model at runtime, since sequence operations can be memory-intensive.
Verify that your ONNX file uses Opset 13 or higher for optimal compatibility with the Sentis importer.
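For a quick check outside the Profiler window, a rough sketch like the following logs how much Unity-tracked memory changes around an inference call. The helper name InferenceMemoryProbe is hypothetical, and the figure is only indicative: it reflects memory tracked by Unity's allocators and may not capture every GPU buffer the backend uses.

```csharp
using System;
using UnityEngine;
using UnityEngine.Profiling;

// Hypothetical helper: logs the change in Unity-tracked allocated memory around an action.
public static class InferenceMemoryProbe
{
    public static void Measure(string label, Action action)
    {
        long before = Profiler.GetTotalAllocatedMemoryLong();
        action();
        long after = Profiler.GetTotalAllocatedMemoryLong();

        // Convert the byte delta to mebibytes for readability.
        float deltaMiB = (after - before) / (1024f * 1024f);
        Debug.Log($"{label}: {deltaMiB:F1} MiB change in allocated memory");
    }
}
```

Wrap whatever code schedules your worker in the action, for example InferenceMemoryProbe.Measure("Kokoro inference", () => { /* run your worker here */ });, and compare the logged values across short and long input sentences.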

TL;DR

Sentis 2.1.0 and later can import the Kokoro-82M Text-to-Speech (TTS) ONNX model because they support the Loop and sequence operators it uses, which enables neural speech synthesis inside the engine.

Content inspired by a Unity discussion post.
