Python handwriting recognition software? - python

Is there a Python handwriting recognition library? What are the inputs to hand writing recognition packages, .jpg images? .pdf images?

Zinnia is a C/C++ library with SWIG generated wrappers for Perl/Python/Ruby. It has a BSD license and converts user pen strokes provided as coordinates into character best matches. It also has a training module.
It looks like it performs single character recognition, so you might need to build something on top of it to improve the results.
PenCommander from PhatWare is a commercial, non-Python, Windows-only SDK. If you can live with all of those limitations, PhatWare products are the best handwriting recognition products that I've found so far, although I haven't been looking that hard since the Microsoft's digital ink for the Tablet PC came out. I'm still saving for the Tablet PC though :-(

Related

python voice signature identification?

I'm working on a system that locks several parts in my computer and opens them ONLY using my voice saying specific
words (in python). I've already made the system that locks parts in my computer until you give it password but I want to change it to voice.
I did find some voice processing on the web but its really complicated and without explanation
in python.
I know python might not be the right language to do so, but I want to try!
thanks for any help!
You can start from pre-built services, like Azure Speaker Recognition. The process is quite straightforward, you provide audio training sample for each single speaker (yourself for example). This will create an enrollment profile based on the unique characteristics of your voice. If your are looking to build a solution from scratch, then Fast Fourier Transform would be a great start.
Python is a good point to start!
Search Python Speech recognition in google in you will have a lot of examples for doing that.
Enjoy!

Use Google Vision Python client from a FileMaker Pro (NOT ON A SERVER) solution built for historical research?

I'm what Cornelis van Lit from Digital Orientalist calls a "centaur," or a scholar who devotes time to developing software solutions to humanities research problems. I've run into a problem that been NOT BEEN able to solve by searching stackoverflow or other online resources.
I developed a FileMaker solution to manage 150,000 digital surrogates of original sources similar to way Reddit user restricteddata suggested a few years ago.
I want to extend my solution with Google's Vision API. In particular, I want to use Vision perform OCR on these digital surrogates. I saw a Youtube video which does exactly what I'm asking but with Amazon's Textract API. I've tried Textract on my digital surrogates and found unsatifactory results. My surrogates are in Spanish and a signifcant number of them are handwritten. Google's Vision API, in my case, has produced better results. Also, Vision has a Python client library that I'm very familiar with.
So my problem and question deals with Python integration with Filemaker Pro Advanced (NOT HOSTED ON FILEMAKER SERVER)
Is there a way to pass a PDF from a container field to Python? And after Python does its thing--splitting the PDF, processing individual images, sending said images to Vision, pasring results, and recombining them--send the output string back to FileMaker in a new field?
The trigger would be from the FileMaker side so using available python libs or making the solution an ODBC source would not be useful. There are some FileMaker pluggins that can run a Python script, I think, the way Python would run a subprocess from FileMaker, but there's no clear direction on how to do that. I'm a graduate student so paying for consulting is out of the question. Is there anyone out there that can help?

How do I train the Python SpeechRecognition 2.1.1 Library

I am just getting started in speech recognition and was wondering what the general process was for training the SpeechRecognition library from Python:
https://pypi.python.org/pypi/SpeechRecognition/
I know basic machine learning techniques and basic text analytics, but I am not sure how to apply this to train sound data. (my end result would resemble the typical speech typing from phones where if you change the speech analyzer result often enough, it will "remember" the user preference).
Thanks!
That speech recognition library is using Google's speech recognition engine so there is no particular provision for training at the user end. Your sound data goes to Google (in digest form). If you get a dedicated API (as that documentation page suggests) it is possible Google will be building a user-specific profile on your voice and will gain statistical quality over time based on this, but that is not something that would be stored or written at your end.
Any further questions or unaddressed elements of your question, please let me know.

Working With Audio in PyProcessing

I am learning Pyprocessing. It comes with the regular processing platform that originally was written in Java. Many of the example projects that come bundled with processing have also been written in Python but not any of the audio libraries/examples.
I tried searching google but haven't found anything as of yet.
Does anyone know of a good resource where I can learn to do basic things with the audio library in pyprocessing such as playing audio and filtering audio?
I've used pyaudio and SWMixer for basic audio needs on a project.
Other python-audio resources I found useful at the time:
Scott W Harden's blog post on FFT analysis in Python (lots of neat things there)
PyAudioMixer
python-sounddevice
I hadn't used these exhaustively though to be able to advise on which one is most stable and easy to use.

How can I represent avi video as set of matrices using Python?

I have video files written in avi format and I would like to analyze these videos using Python. For that I would like to represent every frame of the video as a 2D matrix.
How can I do that? Google search gives me PyMedia as a way to go? Is it really the best choice or there some other approaches that I should to considered?
If the PyMedia is a good choice, could anybody pleas to give me a link where I can get exe files to install the module on Windows from binaries?
By the way, is it a good idea, in general, to use Python for these purposes? I like Python very much because of its simplicity and I prefer to use it, but if it is really not suitable for analysis of video, I am ready to use something else.
ADDED
Some people claim that PyMedia is "dead". Is it true?
Yeah, the latest news on the PyMedia web site is dated 01 Feb 2006. That's a pretty bad sign.
The most active and up-to-date open project for manipulating video is ffmpeg. Apparently there is a recently updated python wrapper for it: http://code.google.com/p/pyffmpeg/
In general Python is much too slow for doing any sort of pixel analysis of video. Therefore there will be practically zero libraries of any reasonable level of quality and support for helping at the pixel level of granularity. There are well supported libraries for working at an image level of granularity though. PIL seems to be a popular choice: http://www.pythonware.com/products/pil/

Categories