Before, my target audience was Filipinos, and the Gemini audio was fine for them. Now that I have an American audience, the comments keep saying they're irritated by the Gemini AI voice. I think my competitor is using an ElevenLabs voice.
OmniVoice is the one with an external service that also has good Tagalog and applies expressions automatically. I tested theirs a few months back, and it has zero-shot cloning. It picks up the accent too (Manileño, Caviteño, Bisaya, Ilocano, Ilonggo...) as long as you have a 10-second clip. That option just isn't there in ttsomni. It supports 600 languages. Keep an eye out for TTS updates, because the dev trend is adding a voice-clip cloning feature to create a multi-language replica. It's the same idea as adding a LoRA to an LLM for customized or specific use, so the user doesn't have to train it.
What I regularly use for reading aloud is Supertonic TTS, paired with Gemini Nano (LLM) for light tasks, all local AI. I switch to Gemma4-2b-Q4 when there's tool-calling for simple automation. Even a 3rd-gen PC can handle that.
OmniVoice with voice cloning can run in CPU mode, but I haven't tried that yet. For voice cloning, just skip the ASR step (via Whisper) to reduce the load on the CPU, and transcribe the audio yourself. VibeVoice also works fine in CPU mode. Try local AI first rather than relying on online services, which are very limited.
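A rough sketch of the "skip ASR, transcribe it yourself" step, assuming the reference clip is a WAV file. It only pairs the clip with a transcript you typed by hand and checks that the clip is around the 10-second mark. The names `ReferenceClip` and `load_reference` and the 5 to 15 second bounds are made up for illustration; they are not part of OmniVoice or any other TTS package, and handing the result to a cloning model is left out because that call differs per tool.

```python
import wave
from dataclasses import dataclass
from pathlib import Path


@dataclass
class ReferenceClip:
    """A cloning reference: the audio file plus a transcript typed by hand."""
    audio_path: Path
    transcript: str
    duration_sec: float


def load_reference(audio_path, transcript, min_sec=5.0, max_sec=15.0):
    """Pair a WAV clip with a manual transcript, so no ASR pass is needed.

    min_sec and max_sec are illustrative bounds around a ~10-second clip,
    not limits taken from any particular model.
    """
    text = " ".join(transcript.split())  # collapse stray whitespace
    if not text:
        raise ValueError("transcript is empty; type out what the clip says")

    # Read the clip length from the WAV header (no audio decoding needed).
    with wave.open(str(audio_path), "rb") as wav:
        duration = wav.getnframes() / wav.getframerate()

    if not (min_sec <= duration <= max_sec):
        raise ValueError(
            f"clip is {duration:.1f}s; expected {min_sec}-{max_sec}s"
        )
    return ReferenceClip(Path(audio_path), text, duration)
```

Usage would look like `ref = load_reference("my_voice.wav", "Kumusta, ito ang boses ko.")`, after which `ref.audio_path` and `ref.transcript` go to whatever cloning tool you run locally.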
Look for OmniVoice or a clone of it, like Ming-omni-tts, on Hugging Face Spaces to test it.