14

How to convert hOCR to HTML for visualization?

If you open the raw hOCR file its only rendered as plain text (the elements are not positioned)

Daniel Cassidy
  • 24,676
  • 5
  • 41
  • 54
clarkk
  • 27,151
  • 72
  • 200
  • 340

3 Answers3

19

There are different solutions for this task and I know these three:

All of these repos seem to consist mainly of some JavaScript and CSS files. The first two repos have both a link to some demo page where I have taken the pictures from.

The first one provides a Greasemonkey/Tampermonkey script which allows to inject this overlay on any suitable hocr website online and local (some configuration may be possible for that). I don't know how difficult it is to use the other solutins for your own hocr files, but it should be doable.

zuphilip
  • 520
  • 3
  • 12
13

To add the interface to a plain hOCR file, add this line just before the closing </body> tag:

<script src="https://unpkg.com/hocrjs"></script>

Then open the html (hOCR) file in your browser.

Source

ATP
  • 2,939
  • 4
  • 13
  • 34
Philip
  • 3,135
  • 2
  • 29
  • 43
-7

hOCR is HTML. You can view it in a web browser.

nguyenq
  • 8,212
  • 1
  • 16
  • 16