tesseract (.NET) + searchable PDF. How to apply?

msantiago picture msantiago · Nov 29, 2013 · Viewed 7.8k times · Source

I have the need to develop a system that turns an image into a searchable PDF. How is a school work i need something with open source After much research I found tessnet2 (tesseract) and I can remove a text the image in tiff format. But how to convert this information into a PDF? Attention : I need to keep the file structure.

I need a direction to proceed with my research. Someone help me please.

thank you

Answer

nguyenq picture nguyenq · Nov 29, 2013

There is a couple of .NET hOCR-to-PDF libraries that you may want to check out at Tesseract 3rdParty page.