OCR Past Paper Recognising Tool

I am trying to make a program that searches through past papers, some of which are readable PDFs whilst others are just scans. I was thinking of using OCR to solve the readability issue. I want to make an interface (i.e. a website) where a user can ask a question, the question gets compared to a database of the post-processed past papers, and the program returns a corresponding answer or the past paper(s) it comes from.
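To make the matching step concrete, here is a rough sketch in Python of what I have in mind, using only the standard library. The sample data, the names `PAPERS` and `find_papers`, and the choice of word-count cosine similarity are all assumptions for illustration, not a finished design. The OCR step is left out; the text is assumed to have been extracted already.

```python
import math
import re
from collections import Counter

# Hypothetical sample data: in the real tool this text would come from the
# past papers (read directly from readable PDFs, or via OCR for scans).
PAPERS = {
    "physics_2019_paper1": "Calculate the acceleration of a trolley rolling down a ramp.",
    "chemistry_2020_paper2": "Describe how to carry out a titration using an acid and an alkali.",
    "biology_2018_paper1": "Explain how enzymes speed up reactions in the digestive system.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two word-count Counters (0.0 if either is empty)."""
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_papers(question, papers, top_n=3):
    """Return up to top_n (paper_name, score) pairs, best match first."""
    question_counts = Counter(tokenize(question))
    scored = []
    for name, text in papers.items():
        score = cosine_similarity(question_counts, Counter(tokenize(text)))
        if score > 0:  # skip papers that share no words with the question
            scored.append((score, name))
    scored.sort(reverse=True)
    return [(name, score) for score, name in scored[:top_n]]


if __name__ == "__main__":
    for name, score in find_papers("How do I carry out a titration?", PAPERS):
        print(f"{name}: {score:.2f}")
```

This only compares raw word counts, so common words like "a" and "how" still contribute to the score. Something like TF-IDF weighting or a proper search index would probably be needed for a real collection of papers.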

Any help would be much appreciated. I am quite a novice in the programming world, but I am striving to improve and to create useful programs in the process.

Yours sincerely,
Evan
