How to use Support Vector Machine for Speech recognition
SAMEER ahamed
on 2 Jan 2014
Commented: Image Analyst
on 8 Jan 2014
Hi,
I want to recognize silently spoken words based on the centroid key points of the lip contour.
Example: I have created a MySQL database table with a column named "Words" that contains 1. Hello, 2. How are You, 3. Nice to Meet You, etc., and I have connected to the MySQL database from MATLAB. Now I want to use an SVM (Support Vector Machine) to do isolated-word pattern matching on the silent lip movements. Here is the reference I am following: http://www.ee.oulu.fi/~gyzhao/Download/Databases/OuluVS/lipreading-final-DC.pdf If you have any solution, please let me know.
Thanks
7 Comments
Accepted Answer
Image Analyst
on 7 Jan 2014
I'm sorry, but to me this code looks totally random and disorganized, like you just threw together a bunch of unrelated chunks of code and hoped they would somehow magically work together. Taking a (supposed) hue image and converting it to YIQ color space? Huh? Taking both the Y and the blue channels and automatically thresholding them? Huh? Then taking the red channel and manually thresholding it? What? Calling imadjust on the Y channel? Totally unnecessary. Then thresholding that? What? Measuring the filled binary image but then never doing anything with the results? Why? Then doing a bunch of morphological stuff on a binary image that came originally from the red channel and burning it into the original image. Why not just use bwboundaries and plot to put the outline into the overlay? Okay, that's just a jumble of code with no organized algorithm behind it.
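To illustrate the bwboundaries-and-plot suggestion, here is a minimal sketch. It assumes the Image Processing Toolbox is installed. The gray image and the elliptical mask are synthetic stand-ins (rgbImage and lipMask are hypothetical names), so substitute your own frame and your own segmented lip mask.

```matlab
% Synthetic stand-ins for a real video frame and a real lip mask (assumptions).
rgbImage = repmat(uint8(128), [120 160 3]);                     % plain gray RGB image
[cols, rows] = meshgrid(1:160, 1:120);
lipMask = ((cols - 80) / 40).^2 + ((rows - 60) / 15).^2 <= 1;   % filled ellipse

% Trace the outer boundary of every blob in the mask.
% Each cell holds an N-by-2 list of [row, column] coordinates.
boundaries = bwboundaries(lipMask, 'noholes');

% Draw the outline in the overlay instead of burning it into the pixels.
imshow(rgbImage);
hold on;
for k = 1 : numel(boundaries)
    thisBoundary = boundaries{k};
    % plot() wants x (column) first and y (row) second.
    plot(thisBoundary(:, 2), thisBoundary(:, 1), 'g-', 'LineWidth', 2);
end
hold off;
```

Because the outline lives in the graphics overlay, the original image data stays untouched and can still be measured afterwards.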
Sorry to be so harsh but there is so much wrong with this algorithm that it isn't worth trying to fix. Like Walter said, we don't do algorithm development in this forum. I suggest you try to follow the algorithm in the paper rather than the one you listed here, which looks nothing like that.
One tip I can give is that for skin, the red channel has the least contrast between lips and surrounding skin, so you probably shouldn't be using the red channel, at least not by itself.
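One way to check the channel tip on your own frames is to compare the mean gray level inside a rough lip mask against the mean outside it, one color channel at a time. The sketch below is only an illustration: the pixel values are invented placeholders so that the script runs without an image file, and rgbImage, lipMask, and lipColor are hypothetical names. The numbers it prints say nothing about real skin until you replace them with a real frame and mask.

```matlab
% Placeholder frame: these colors are invented so the script is self-contained.
% Replace rgbImage and lipMask with a real frame and a rough lip mask.
rgbImage = zeros(120, 160, 3, 'uint8');
rgbImage(:, :, 1) = 200;                      % invented "skin" red
rgbImage(:, :, 2) = 150;                      % invented "skin" green
rgbImage(:, :, 3) = 130;                      % invented "skin" blue
[cols, rows] = meshgrid(1:160, 1:120);
lipMask = ((cols - 80) / 40).^2 + ((rows - 60) / 15).^2 <= 1;   % hypothetical lip region
lipColor = [190 90 100];                      % invented "lip" color
for c = 1 : 3
    channel = rgbImage(:, :, c);
    channel(lipMask) = lipColor(c);           % paint the lip region
    rgbImage(:, :, c) = channel;
end

% Contrast per channel = |mean inside the mask - mean outside the mask|.
channelNames = {'Red', 'Green', 'Blue'};
contrast = zeros(1, 3);
for c = 1 : 3
    channel = double(rgbImage(:, :, c));      % avoid uint8 saturation in the math
    contrast(c) = abs(mean(channel(lipMask)) - mean(channel(~lipMask)));
    fprintf('%-5s channel contrast: %.1f gray levels\n', channelNames{c}, contrast(c));
end
```

Whichever channel, or combination of channels, gives the largest difference on your real images is the better candidate for thresholding.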
4 Comments
Image Analyst
on 8 Jan 2014
More light is the best solution. Lack of light is giving no contrast to these lips.