|
I got the title of the graduation project "OCR Character Recognition System Design" some time ago, and my mind was at a loss~~~
It is required to realize the conversion of graphics and text, which mainly refers to the extraction of text information in common images (.bmp|.jpg|.gif|.pdf) into common documents (.txt|.doc|), and the recognition efficiency is required to be 80%. Optional development tools.
I've never touched this stuff before, so I don't know how to start.
Hope that all the experts will not hesitate to impart relevant knowledge and techniques.
************************************************** ******************************
System design: layout analysis-text segmentation-single character normalization-image preprocessing (including binarization, denoising, tilt correction, etc.)-feature extraction-classification and recognition
Any step can be used as a research topic. Your teacher should designate a small piece for you to achieve the following, and you can do a doctoral thesis for such a big topic. |
|