Pure Javascript OCR for more than 100 Languages 📖🎉🖥

UNDER CONSTRUCTION

Due for Release on Monday, Oct 3, 2016

tesseract.js

Tesseract.js is a pure JavaScript version of the Tesseract OCR Engine that can recognize English, Chinese, Russian, and 60 other languages.

Installation

Tesseract.js works with a <script> tag, loaded from a local copy or a CDN, or with npm (if you're using webpack or browserify).

<script/>

First, grab copies of tesseract.js and tesseract.worker.js from the dist folder. Then include tesseract.js on your page like this:

<script src='/path/to/tesseract.js'></script>

<script>
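// Start a worker from your local copy of tesseract.worker.js
var worker = createTesseractWorker('/path/to/tesseract.worker.js')

// Recognize the image referenced by '#my-image', logging progress updates and the final result
worker.recognize('#my-image')
    .progress(function (p) { console.log('progress', p) })
    .then(function (result) { console.log('result', result) })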
</script>
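
The snippet above chains .progress(...) and .then(...) on the value returned by recognize. To make that chaining pattern concrete, here is a minimal stand-alone sketch of an object that behaves the same way. It is not Tesseract.js code and makes no claim about how the library is implemented: createMockJob, its steps and finalResult parameters, and the timer-driven updates are all hypothetical and exist only for illustration.

```javascript
// Hypothetical stand-in for a recognition job; NOT part of tesseract.js.
// It reports fractional progress `steps` times, then resolves with `finalResult`.
function createMockJob(steps, finalResult) {
  var progressHandlers = [];

  var promise = new Promise(function (resolve) {
    var done = 0;
    function tick() {
      done += 1;
      // Notify every registered progress callback with a value in (0, 1].
      progressHandlers.forEach(function (fn) { fn(done / steps); });
      if (done < steps) {
        setTimeout(tick, 0);
      } else {
        resolve(finalResult);
      }
    }
    // Defer the first update so callers can attach handlers first.
    setTimeout(tick, 0);
  });

  var job = {
    // Register a progress callback and return the job so calls can be chained.
    progress: function (fn) {
      progressHandlers.push(fn);
      return job;
    },
    // Delegate to the underlying promise so the job is usable like a promise.
    then: function (onFulfilled, onRejected) {
      return promise.then(onFulfilled, onRejected);
    }
  };
  return job;
}

// Usage mirrors the shape of the snippet above.
createMockJob(4, { text: 'hello' })
  .progress(function (p) { console.log('progress', p); })
  .then(function (result) { console.log('result', result); });
```

Because progress returns the job itself, any number of progress callbacks can be registered before then is called. Once then is called, the chain continues as an ordinary promise.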

After that, you should

npm

npm install tesseract.js