Optical Character Recognition for .NET

OCR.NET
OCR.NET Demo
Use OCR.NET component to retrieve text from image, for example from scanned paper document.
  • uses Tesseract OCR engine and Leptonica image processing library
  • available for .NET 2 - 4.6
  • source code included in full version
  • royalty free distribution in applications

Download and order

Download Tesseract language data and place to tessdata folder.
Order OCR.NET component $100,- USD (single developer license)
Order OCR.NET multilicense $300,- USD (license for all developers in company)
Order OCR.NET year upgrades $50,- USD (registered users only)
Order OCR.NET year upgrades multilicense $150,- USD (registered multilicense users only)

FAQ

An unhandled exception of type 'System.BadImageFormatException' occurred in Winsoft.Ocr.dll
It's caused by using incorrect ocr.dll library, i.e. 32bit instead of 64bit or vice versa.
Select x86 or x64 platform in Visual Studio accordingly to the ocr.dll library and rebuild your application.
Or place proper ocr.dll to the folder where is your application exe file located.
32-bit ocr.dll is located in folder C:\Program Files (x86)\Winsoft\OCR.NET\DLL\32bit
64-bit ocr.dll is located in folder C:\Program Files (x86)\Winsoft\OCR.NET\DLL\64bit

How can I improve OCR output?
Icon Improving the quality of the output

How can I solve "Cannot initialize Tesseract library" error?
Set NOcr.DataPath property to the folder containing tessdata folder with Tesseract language data (i.e. not to the tessdata folder itself).

Related links