Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 2 years ago.
Improve this question
How can i create TTS voice Packages?
I have a small app converting text to speach, right now i am using microsoft's built in tts packages to do the text to speach, however, i want to get a proffesional voice artist to create a new text to speach voice package. problem being, i do not know how to create that voics package(TTS)
Creating a new voice for a text-to-speech engine is a complex process. It is not just a matter of getting a voice artist to record audio and simply creating a voice from that. There is a lot of work that goes into this (segmenting the audio into phonemes; building the voice data; building the dictionary; getting the prosody and audio joining/synthesizing rules correct).
For a voice engine like the Microsoft Text-to-Speech engine, you are also facing the problem that the voice format is proprietary and so you cannot create new voices in that format. You are also limited by the capabilities of the engine.
Your best bet at the moment is either:
switching to using the eSpeak text-to-speech engine and using espeakedit to create your own voice (contacting the developer for help with this) -- this engine uses a synthesis method that makes it sound similar to the Microsoft's and the voice Stephen Hawking is using, but they are very clear and the pronunciation is on the whole very good;
using a different text-to-speech engine like Cepstral that use voice recordings (these tend to sound more human-like, but I have found that the prosody is not very good, ruining the resulting audio);
using the service from Cepstral to create a voice specific for your needs (which is likely to be expensive).
I am looking at using the audio data from librivox.org to generate text-to-speech voices from. This is likely 3-4 years away though, before I have anything close to being functional.
Related
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 4 years ago.
Improve this question
I'm working on a webshop. And would like to add the feature to be able to post product images to the shops Instagram account when a new product is added.
The thing is that the Instagram API does not seems to allow to post images. Either does the official Instagram Website. And a Google Search gives me multiple SO QA which says the same.
But at the same time, there seems to be a ton of websites, applications, and services and other non-emulators who let you post to Instagram (to mention a few: http://techwiser.com/effectivly-use-instagram-on-pc-upload-pics-bluestacks/ and http://dskgrm.com/ is another one, there also seems to be some WordPress plugins who does it ex. https://github.com/roladn/wordpress-to-instagram but I'm not too good with eirhet PHP or WordPress to be able to tell how they have done it.).
And before anyone reports it as a duplicate. I do not consider this a duplicate since all other QA simply states that it is not possible via the API, but obviously, it is possible somehow, and I'm wondering how.
So I simply wondering how people are successfully implementing this feature when either the API or the official website allows it?
No official way to post media to Instagram doesn't mean that there is no way to do this.
If you are a developer you could simply reverse engineering the Instagram API and build some software to use that, otherwise you could just use some tools already developed.
PHP
mgp25/Instagram-API
Python
b3nab/instapy-cli
And if this don't fit your needs you can go to github and search some other softwares already developed.
Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 10 years ago.
Improve this question
Inspired by a recent Kickstarter campaign: http://www.kickstarter.com/projects/dominikmazur/camfind-a-mobile-visual-search-app?ref=category
The app uses the mobile camera to take a picture and identify virtually any object. Snapping a photo of a movie poster will recognize the movie and pull up results on the web for you about it, taking a picture of a product will show you websites that product is available for sale on.
My question is, is this realistic? I find it very intriguing, but it object detection really that simple? I'm interested in some feedback regarding resources to help someone get started in learning about this topic.
Computer vision and Pattern Recognition is not easy at all. It's an entire field related to Artificial Intelligence. It is, however, relatively straightforward to understand at a high level though. There is NO WAY they are doing this all on the client. The phones just aren't fast enough, and do not have even close to enough storage space.
What they are most likely doing is sending the image to their servers, then use some kind of nearest neighbour approximation on the image, and run the result through a decision tree look-up in a massive database on images which all have some hash. This will give a close match to an image they have (assuming they have A LOT of images in there database), even if only part of the image matches. Then, using the hash, they look up some other information about that image to send to the device.
Hope that Helps!
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 8 years ago.
Improve this question
Lately I have been reading about web crawling, indexing and serving. I have found some information on the Google Web Masters Tool - Google Basics about the process that Google does to crawl the Web and serve the searches.
What I am wondering is how they save all those indexs? I mean, that's a lot to store right? How do they do it?
Thanks
I'm answering myself because I found some interesting stuff that talks about Google index:
In Google Webmasters YouTube Channel, Matt Cutts give us some references about the architecture behind Google Index: Google Webmaster YouTube Channel
One of those references, and from my point of view a worth reading, is this one: The Anatomy of a Large-Scale Hypertextual Web Search Engine
This helped me to understand it better, and I hope it help you too!
They use a variety of different types of data stores depending on the type of information. Generally, they don't use SQL because it has too much overhead and isn't very compatible with large-scale distribution of information.
Google actually developed their own data store that they use for large read-mostly applications such as Google Earth and the search engine's cache. This supports distributing information over a very large number of computers with each piece of information stored on three or four different computers. This allows them to use cheap hardware -- if one computer fails, the others immediately begin restoring all the data it held to the appropriate number of copies
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
How did they implement text to speech (TTS)? Is there an open, free API for TTS synthesis? I know about Google Translate, but the license is not clear to me (another issue is that they block a request if it contains a referrer). Any idea?
Majdron,
I'm a lead developer at Quizlet. We're using a combination of our own technology and licensing/purchasing TTS software from several different companies.
There are some open source TTS engines/voices:
http://www.cstr.ed.ac.uk/projects/festival/
http://www.babelfish.org/tts-free.htm
http://espeak.sourceforge.net/
http://freetts.sourceforge.net/docs/index.php
http://mary.dfki.de/
Good luck!
The voices sound exactly the same as http://www.neospeech.com. Also, their list of languages match exactly.
It's not free, you have to license it.
Google has just introduced browser-based access to its speech engine through HTML5.
http://slides.html5rocks.com/#speech-input
To get this page to work, I launched the Chromium browser as follows in Ubuntu:
$ chromium-browser --enable-speech-input
I'm not sure if this works in other operating systems.
Another interesting project is WAMI from MIT:
http://wami.csail.mit.edu
I don't know which specific engine Quizlet are using, but assuming they are using a free service then it might be TTS-API (http://tts-api.com/) which was recently featured on Hacker News.
From what I know is the only "free-to-use" TTS web-API out there. Please comment below if I'm wrong - I'd love to find similar free services. There are a lot of pay only services out there but very very few truly free ones.
Since finding out about TTS-API on HN I've successfully used it in a recent app project. Since the TTS is only a HTTP fetch away I was able to quickly integrate it in both the iOS and Android versions of my app. The service appears to be very quick, so no complaints so far :-)
Nobody gave the right answer. They have their own TTS engine that is connected to a single file located at http://quizlet.com/tts/en.mp3 the file takes arguments with it so the url http://quizlet.com/tts/en.mp3?v=14&b=QXJlYSBvZiBwYXJhbGxlbG9ncmFt&s=m5dx52Q. says "Area of parallelogram" thanks the first base64 string labeled b. I have not discovered what v or s are used for but I know they are essential for making the file speak. I will do more research and get back to this answer.
jj b is correct. The core engine of Quizlet's speech features is Neospeech, and uses Neospeech's VTML (VoiceText [TM] Markup Language) exactly, as far as I can tell.
Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 11 years ago.
Improve this question
I have a Logitech webcam (QuickCam Pro for Notebooks), and I'd like to take a snapshot using the cam from a command line.
Does it have any command line interface?
If not, is there an API that will allow me to build a command line tool like this?
I know this is an old discussion, but maybe you arrived here like I did recently looking for a simple command line app to capture a snapshot image from a webcam in Windows. If so, have a look at the following post on my blog:
http://batchloaf.wordpress.com/2011/04/06/snapz-a-command-line-cam-image-grabber/
It's about a little program called snapz that I've just written to take a snapshot from a USB camera in a batch file. It's really basic - when you run it, it snaps a 640x480 image from the default video capture device (e.g. webcam) and saves it to the current directory. It's free and easy to use.
You could use OpenCV to build such a command line application yourself. Basically you would use captureFromCam() to initialize the camera and then call QueryFrame() to capture a picture. Should be straightforward.
The question itself is not programming related, just a request of some hardwares bundled softwares documentation.
I will answer the question as it WAS a programming question.
If you talking about webcams on a windows OS you can use DirectShow as in this example:
http://www.codeproject.com/KB/audio-video/WebcamUsingDirectShowNET.aspx
Just create a console application in visual studio that takes some commandline parameters and let it use the code in the example and you are on your way.