How to use ocr function for pdf files?
Show older comments
The pdf format is not recognized in MATLAB. I want to perform ocr on the pdf files. Also, I am not being able to convert the pdf into any image format in MATLAB. Please help. I'm using MATLAB on Windows.
Answers (2)
Chinnu Rockz
on 17 Mar 2019
1 vote
You can use "extractFileText()" to extract text from pdf,docs or word.But you have to download Text Analytics Toolbox for this function to use.
To know more visit
Walter Roberson
on 27 May 2016
0 votes
You will need to call an external program such as ghostscript to render the PDF into images.
ghostscript was included as part of the MATLAB release until recently (somewhere around R2014b or R2015b), but if you have a new release you will need to find and install the program yourself.
Categories
Find more on Convert Image Type in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!