Concept information
Término preferido
hOCR
Definición
- hOCR is an open standard for representing document layout analysis and OCR results as a subset of HTML. The goal is to reuse as much existing technology as possible, and to arrive at a representation that makes it easy to store, share, process and display OCR results. This specification defines many features that can represent a variety of OCR-related information. However, being built on top of HTML, hOCR is designed to make it easy to start simple and gradually use more complex constructs when necessary. Consider you have an HTML document that encodes a book: Wrapping page elements in <div class="ocr_page"> tags will convey the page boundaries to hOCR-capable agents and turn the HTML document into an hOCR document.
URI
https://vocabs.sshopencloud.eu/vocabularies/standard/hocr
{{label}}
{{#each values }} {{! loop through ConceptPropertyValue objects }}
{{#if prefLabel }}
{{/if}}
{{/each}}
{{#if notation }}{{ notation }} {{/if}}{{ prefLabel }}
{{#ifDifferentLabelLang lang }} ({{ lang }}){{/ifDifferentLabelLang}}
{{#if vocabName }}
{{ vocabName }}
{{/if}}