I making gift to my friend, magic wand with voice recognition. in dictionary i have some spells like Lumos and Nox, but problem in that PocketSphinx recognize any word and sound as Lumos or Nox. How can i fix it? Can i check confidence % and filter, or something like that to stop recognize every sound like word.
Ex. i walk around mic and speaking with some one, and on display is something like
LUMOS NOX NOX LUMOS LUMOS LUMOS NOX
I doing it on Raspberry Pi 2
Use keyword spotting mode, you can configure threshold for each word and they will be properly detected.
Related
Is there any way to create and display music notation inside of a codenameone app?
For Java generally there are some libraries like for example JFugue that let you write music inside a program. Maybe also display it, i didn't try that out.
There is lilypond, which would work in a desktop environment if you were able to run it to make the pdf after generating the file itself.
I wrote a small app in Android Studio and had to write my own music notation logic and drew it with the help of png files on a Canvas. That worked okay for small musical examples of a clef and around 2-7 notes.
Now i want to do something similar in Codenameone and display at least a clef and some notes inside the app (maybe as an image) - they have to be generated with some random element while the program runs.
It would also be great to be able to write and show more than a few notes, displaying it somehow and maybe also with the ability to have it as a pdf file later.
Is it possible to use something that already exists?
Thanks a lot!
I just started learning how to create a website and am watching tutorial videos. When they write $-sign it turns into green and you can see that it recognizes it, but in my case it does nothing, like it would be a random character (it remains white all the time and even if I write the same lines as it is in the video, I can not get the same results).
If you would like to see here it is:
$dayofweek = date("w");
echo "$dayofweek";
I have ticketing system that I use for students who do consultation with their teachers. The system is working right now, the teacher need to input ticket number which is generated by students.
I want to change the way, I want the ticket number is shown as QRCode, and what the teacher need to do just scan the QRCode thru the camera of their laptop. So it's like replace the way they input the ticket number, from typing the ticket number to just scan the QRCode.
The simplest way to do that is with old-fashioned barcodes.
If the ticket number is short enough (up to 14 digits), you can format it as a EAN13 barcode (13 digits + control).
The number conversion is trivial to make (just Google "generate EAN13" + your programming language)
EAN13 barcodes can be rendered as text with this TTF font: http://www.fontpalace.com/font-details/EAN-13/
Almost every QR reader understand linear barcodes
You can get a Barcode Scanner Gun for 25$. They are plugged via USB, detected as USB keyboards, they input scanned codes as if you typed them and they are fast as hell!
I think you are looking for a scanner soft keyboard. It scans the barcode / qr and set the result in input the same like you typed it.
In example this keyboard
https://play.google.com/store/apps/details?id=com.nikosoft.nikokeyboard
It has the config option to scan and automatically send an Enter keyboard event.
I successfully use it on a telegram bot to similar functionality and I think is a very simple solution.
when i was creating text chepoing in qtp the recording results getting odd recording results
the scanned input is
alt text http://img22.imageshack.us/img22/8255/capture1pa.png
and i get output like this insted of secs it display sees
alt text http://img22.imageshack.us/img22/5273/capture2c.png
and i made change in text like this sees to secs
alt text http://img801.imageshack.us/img801/7492/capture4f.png
but when i run the test the test result will be
alt text http://img94.imageshack.us/img94/2575/capture3z.png
I captured many times but it display same as above. I don't understand how display like this, please help me.
The problem is that QTP recognizes the word secs in your application as sees, which causes the checkpoint to fail. Most probably, this happens because the text recognition is performed using OCR, which can't guarantee 100% accuracy. You can try changing the text recognition mechanism to Windows API and see if that helps. To do this, go to Tools > Options > General > Text Recognition and set the Use text recognition mechanisms in this order option to either Use Only Windows API or First Windows API then OCR.
If this doesn't help, you can replace the word secs in the checkpoint's Checked Text with a regular expression that matches both secs and sees:
[0-9]+ days, [0-9]+ hours, [0-9]+ mins, [0-9]+ se[ce]s
Also, make sure that the Regular Expression option is turned on for the Checked Text.
Here's a tough one:
I need to be able to find a word's position and size (its frame) on the screen (its first occurence is enough, from there I should be able to get the next ones).
For example, I would like to be able to detect word positions in (but not limited to) Word, Excel and PowerPoint for Mac, as well as Safari and others.
The solution should be as fast as possible; I should be able to find at least 5-6 words per second and use as little CPU time as possible.
Here's what I thought of so far:
OCR in a window's screenshot / graphics context (any good Open Source framework that works on Mac OS X 10.4 and that can be used in a commercial product?). Evernote is very good at spotting words in images. I don't know if it uses a custom in-house engine or an Open Source / commercial one but that would be the kind of engine I would like to use if this is a "valid" solution. Ideally I would detect the word's frame in the active application's window (how to get the frame of another application?).
Getting some kind of "hook" on Quartz drawing of text and intercepting the location of the word when it's drawn (does not seem very feasible at first glance!).
AppleScript, but it depends a lot on what API the application offers (I don't think you can get a word's coordinates in a Word document from what I've seen) and it's slow.
... out of ideas ...
My goal is to get all the word's frames in a paragraph in the right order based on a string containing the text of the paragraph.
Thanks in advance for any hints!
As a starting place, you may want to take a look at QuickCursor's code. It retrieves text from many different applications through the AX Accessibility APIs. Now, it won't grab the pixel placement of the word, but it will at least return the NSString associated with the text in that UI element. Of course this means that the app in question has to support these APIs; I don't know if the MS Office suite would. In addition, it only supports editable elements, so an un-editable webpage in Safari won't work either. But it may give you a starting point for some ideas.
Take a look at the QCUIElement.{m,h}, and then the implementation in the QCAppDelegate.m (beginQuickCursorEdit:)... the implementation of his abstracted QCUIElement seems to be as simple as:
QCUIElement *focusedElement = [QCUIElement focusedElement];
id value = focusedElement.value;
Edit: Aha! Check out the Accessibility Inspector Sample code: UIElementInspector. It can actually get the AXPosition of elements on a page. Now, it's not word-by-word, but we're getting closer. It'll tell you the x,y placement of a textblock, as well as the words contained in the textblock.
This is possible, but very hard to get working reliably. You can play with Spell Catcher's Direct Connect feature to see an example.