by blank
Our model uses a custom-trained architecture optimized for on-device inference. By combining quantization with ONNX Runtime, we achieve state-of-the-art performance across a wide range of hardware configurations.
Training involved a diverse dataset of high-fidelity speech, processed to ensure robustness and natural prosody. The result is a lightweight, low-latency synthesis engine that operates entirely locally, preserving user privacy without sacrificing quality.
This hybrid approach combines the flexibility of deep learning with the efficiency of edge computing, enabling real-time voice generation even on consumer-grade CPUs and mobile accelerators.
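To make the quantization idea above more concrete, here is a minimal sketch of symmetric per-tensor int8 quantization. It is a generic illustration only: this page does not say which quantization scheme Voices uses, and the names `QuantizedTensor`, `quantizeInt8`, and `dequantizeInt8` are hypothetical, not part of the repository or of ONNX Runtime.

```typescript
// Symmetric per-tensor int8 quantization (generic illustration).
// Assumption: this is NOT confirmed to be the scheme used by Voices.

interface QuantizedTensor {
  values: Int8Array; // quantized weights, clamped to [-127, 127]
  scale: number;     // float value represented by one integer step
}

function quantizeInt8(weights: Float32Array): QuantizedTensor {
  // The largest magnitude sets the scale so that it maps to 127.
  let maxAbs = 0;
  for (let i = 0; i < weights.length; i++) {
    maxAbs = Math.max(maxAbs, Math.abs(weights[i]));
  }
  // Guard against an all-zero tensor to avoid dividing by zero.
  const scale = maxAbs === 0 ? 1 : maxAbs / 127;

  const values = new Int8Array(weights.length);
  for (let i = 0; i < weights.length; i++) {
    const q = Math.round(weights[i] / scale);
    values[i] = Math.max(-127, Math.min(127, q));
  }
  return { values, scale };
}

function dequantizeInt8(t: QuantizedTensor): Float32Array {
  // Recover approximate float weights from the integers and the scale.
  const out = new Float32Array(t.values.length);
  for (let i = 0; i < t.values.length; i++) {
    out[i] = t.values[i] * t.scale;
  }
  return out;
}
```

Each weight is stored in one byte instead of four, and the round trip through `dequantizeInt8` differs from the original by at most about half of `scale` per weight. That trade of a small precision loss for a smaller, faster model is what makes quantized models attractive on consumer-grade CPUs.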
Integrate our TTS engine directly.
Deployment Notice
We are currently working to resolve ONNX Runtime deployment issues on Vercel Serverless. For the best experience and guaranteed performance, we recommend cloning the repository and running it locally.
Repository
github.com/kiritocode1/voices
Voices is a lightning-fast, on-device text-to-speech system designed for extreme performance with minimal computational overhead. Powered by ONNX Runtime, it runs entirely on your device—no cloud, no API calls, no privacy concerns.
MIT License (Code) / OpenRAIL-M (Weights)
© 2025 Blank Technologies Inc.
All rights reserved.
Made with 🖤
Powered by ▲ + ONNX and SuperTonic