i currently have a tts which is built using festival and festvox. i need to convert these voices and build a TTS in flite. apparently you can do the conversion using festvox (the festvox and flite websites say so but no proper steps on how to do it). can some one please help me out with it as i am new to this area?
thanx in advance ..
Just in-case anyone else was wondering the same i found the steps mentioned in this document useful and also subscribe to the mailing lists and feel free to ask question.
although i must mention i never implemented to TTS using "flite". i went ahead with "espeak"
Related
Would you please help me to find out the best mobile "text-to-speech" and "speech-to-text" tool for both android and IOS?
I searched the web and found some tools such as:
For Speech to Text(STT):
1.react-native-voice
2.RNSpeakChat
3.Using Google Cloud
4.SpeechRecognizer
5.react-native-watson
6.react-speech-recognition
7.react-native-speech-recognition
and for text to speech (TTS):
1.react-native-tts
2.react-native-watson
3.react-native-speech
But I couldn't find which one is the best choice. Would you please help me in this regard?
Thanks in advance
For Speech to text you can refer this article as it explains precisely and also it uses react-native-voice as its an easy library to get started with . react-native-speech-to-text
For text-to-speech functionality, as per documentation of react-native-tts looks simple. you could implement that.
Hope it helps. feel free for doubts
I am developing a PoC using Watson text to speech and Watson conversation.
Sometimes, the chatbot needs to ask a question, so I'd like text to speech to synthesize the voice using an interrogation intonation.
Is it possible to be done?
Watson Text to Speech supports SSML, and has expressive SSML tags.
The one you want to use is Uncertainty. As it is defined as "conveys an uncertain, interrogative message".
Example:
<express-as type="Uncertainty">
Could she still be in the office? She told me that she might leave early.
</express-as>
More details on it's usage is here:
https://console.bluemix.net/docs/services/text-to-speech/SSML-expressive.html#the-express-as-element
Yes, you can certainly use text-to-speech (TTS) for output and speech-to-text (STT) for input. You would need to use a middleware or app layer to drive the conversation and route the input/output to the other services (see "how to use" in the docs).
I have used the following TJBot recipe as a simple and good started for some projects: https://github.com/damiancummins/tell_the_time
Unfortunately Concatenative TTS may have problems to create correct intonation in questions. If you think it happens consistently or too often please open a bug.
If you have a specific question which gets incorrect intonation try to rephrase it a little bit if possible. A useful trick for this voice could be to use double question mark '??'
I'm trying to give a little more clarity to TTS sentences by indicating emphasis, etc. I'm using the Chrome TTS API, which indicates that it accepts SSML-formatted documents in addition to raw text.
After many attempts, and a reading a few comments on the web, it doesn't look like this is actually supported, or possibly that this is up to individual voices for implementation.
Does anyone know:
Has SSML been abandoned under Chrome?
If not, is there any indication whether they expect to support it via native voice, or they're hoping that someone else will implement?
Do any Chrome voices currently exist that support this?
Thanks!
I'm a Chrome engineer. SSML support has not been implemented yet, but it's planned. Obviously not all engines would support it, but when we implement SSML support we'll also implement support for stripping SSML from engines that don't support it.
Sorry the documentation is misleading here.
Star this bug to express interest and get notified when it's fixed: https://code.google.com/p/chromium/issues/detail?id=88072
If anyone's looking at this later, you can control prosody on Mac Chrome using Apple's native command syntax, at least for the default voices:
the square root of [[pbas +4]] 2 [[char LTRL]]a[[char NORM]] to the [[pbas +4]] 14 [[char LTRL]]x[[char NORM]]
Documented here.
Does anyone know or recommend a method for a simple way to convert a site into a different language. I just need the site to change from spanish to english and vice versa, but the site will load in spanish first. Perhaps a plugin is available? Most of the content is dynamic and the site is being developed with Concrete5 CMS. Any ideas would be appreciated. Thank you.
I think you need to install an add-on for this. Check out Internationalization, it's free: http://www.concrete5.org/marketplace/addons/internationalization/
Here is a YouTube video showing it in action, so you can quickly see if it's what you had in mind: https://www.youtube.com/watch?v=Hd936iaDLqw&feature=player_embedded
You need some automatic translation tool (since you said your content is dynamic).There are many in internet, you just have to search for "Automatic translation API".
I recommend you using the Google Translate one.
I have a problem with Google suggest API when using Chinese locale. I am picking Chinese hieroglyphs at random and use the REST API to retrieve suggestions. Unfortunately, Google always return an empty list of suggestions (I am completely sure, that I convert my request in utf-8, and it is working fine with other languages, ex. Russian):
Sample request:
http://suggestqueries.google.com/complete/search?qu=%E9%80%9F
Google answer:
window.google.ac.h(["速",[]])
Does anyone know how to retrieve suggestions for Chinese locale? Maybe I am missing some flags or something? Maybe there is an official document from Google, saying that Chinese is not supported? And are there any people from China, using Google toolbar? Does it really work? I'd appreciate any help!
If it matters, I am writing a simple WinAPI application, using C++...
http://googleblog.blogspot.com/2009/03/local-flavor-for-google-suggest.html says, at the bottom, that google suggest now supports "155 domains in 51 languages." This implies a possible solution, which is that you need to use the relevant domain.
http://suggestqueries.google.cn/complete/search?qu=%E9%80%9F
gives me
window.google.ac.h(["速",[["速腾","2,020,000 结果","0"] ...
A bit of experimentation shows that I'm able to get results for japanese only from google.co.jp, but not from google.com. I'm in Japan, YMMV.
Hmm I don't know it seems to work for me. e.g
http://suggestqueries.google.com/complete/search?qu=%e6%9c%a8
yields
window.google.ac.h(["木",[["木村了","297,026 results","0z"]]])