SMI/tesseract

 
 


A .NET wrapper for tesseract-ocr 5.2.0.

This has been updated to work on Ubuntu 22.04 and other similar systems, and re-published under the name HIC.Tesseract.

Dependencies

Visual Studio 2019 x86 and x64 Runtimes

Since the tesseract and leptonica binaries are compiled with Visual Studio 2019, you'll need to ensure you have the Visual Studio 2019 Runtime installed.

Tesseract language data

You will also need to download the language data files for tesseract 4.0.0 or above from tesseract-tessdata.
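The language data files can only be found at run time if they end up alongside the built application. As a rough sketch, assuming an SDK-style project file and a folder named tessdata at the project root (both are assumptions for illustration, not requirements stated by this project), a fragment like the following copies the files to the output directory on every build:

```xml
<!-- Hypothetical layout: language data files live in a "tessdata" folder
     next to the .csproj. Adjust the glob if yours is stored elsewhere. -->
<ItemGroup>
  <None Update="tessdata/**/*">
    <!-- Equivalent to setting 'Copy to output directory' to Always in the IDE -->
    <CopyToOutputDirectory>Always</CopyToOutputDirectory>
  </None>
</ItemGroup>
```

Setting the same property through the file's Properties window in Visual Studio should have the same effect.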

Docs

See ./docs/ReadMe.md

Getting started quickly

  1. Add the Tesseract NuGet Package by running Install-Package Tesseract from the Package Manager Console.
  2. (Optional) Add the Tesseract.Drawing NuGet package to support interop with System.Drawing in .NET Core, for instance to allow passing a Bitmap to Tesseract.
  3. Ensure you have Visual Studio 2019 x86 & x64 runtimes installed (see note above).
  4. Download language data files for tesseract 4.0.0 or above from the tessdata repository and add them to your project. Ensure 'Copy to output directory' is set to Always.
  5. Check out the Samples solution ~/src/Tesseract.Samples.sln in the tesseract-samples repository for a working example.

If you run into any issues, please check out this wiki page, which details a number of common issues and some potential solutions.
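Once the steps above are complete, a first OCR call might look like the sketch below. It assumes this package keeps the Tesseract namespace and the TesseractEngine, Pix and Page types of the wrapper it is based on. It also assumes that English data (eng.traineddata) sits in a ./tessdata folder next to the executable. The folder name and the sample.png default are hypothetical placeholders.

```csharp
using System;
using Tesseract;

internal static class Program
{
    private static int Main(string[] args)
    {
        // Hypothetical defaults: adjust both paths to match your project.
        var imagePath = args.Length > 0 ? args[0] : "sample.png";
        const string tessdataPath = "./tessdata";

        try
        {
            // The engine loads the language data; the image and page wrap
            // native resources, so all three are disposed when done.
            using (var engine = new TesseractEngine(tessdataPath, "eng", EngineMode.Default))
            using (var image = Pix.LoadFromFile(imagePath))
            using (var page = engine.Process(image))
            {
                Console.WriteLine("Mean confidence: {0}", page.GetMeanConfidence());
                Console.WriteLine(page.GetText());
            }
            return 0;
        }
        catch (Exception ex)
        {
            // Typical causes: missing language data or missing native runtimes.
            Console.Error.WriteLine("OCR failed: " + ex.Message);
            return 1;
        }
    }
}
```

If the engine fails to initialise, the usual suspects are the data path and the Visual Studio runtimes listed under Dependencies. The Samples solution mentioned in step 5 remains the authoritative working example.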

Support

Please only file issues for bugs.

If you have any questions or feature/improvement ideas please ask them on our forum.

Note for contributors

Please create your pull requests to target the "Master" branch.

License

Copyright 2012-2022 Charles Weld.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License. You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

InteropDotNet

Copyright 2014 Andrey Akinshin
Project URL: https://github.com/AndreyAkinshin/InteropDotNet
Distributed under the MIT License: http://opensource.org/licenses/MIT

Core Team

Contributors

A big thanks to GitHub and all of Tesseract's contributors:

Also thanks to the following projects and resources, without which this project would not exist in its current form:
