diff --git a/1.2/index.bs b/1.2/index.bs
index 8e9d11f..9170920 100644
--- a/1.2/index.bs
+++ b/1.2/index.bs
@@ -10,7 +10,7 @@ Editor: Konstantin Baierer, UB Mannheim http://github.com/UB-Mannheim, konstanti
 Former Editor: Thomas Breuel, http://www.9x9.com/
 Previous Version: https://github.com/kba/hocr-spec/blob/master/1.1/spec.md
 Abstract: A subset of HTML for marking up OCR results
-Markup Shorthands: markdown on, biblio on
+Markup Shorthands: markdown on, biblio on, markup on
 </pre>
 <pre class="biblio">
 {
@@ -42,7 +42,7 @@ arrive at a representation that makes it easy to reuse OCR results.
 
 This document describes many tags and a lot of information that can be output.
 However, getting started with hOCR is easy: you only need to output the tags
-and information you actually want to.  For example, just outputting `ocr_line`
+and information you actually want to.  For example, just outputting <{ocr_line}>
 tags with bounding boxes is already very useful for many applications.  Just
 start simple and add more output information as the need arises.
 
@@ -97,7 +97,7 @@ multiple properties are separated by semicolons.
 
 The following properties can apply to most elements (where it makes sense):
 
-### `bbox`
+### <dfn property>bbox</dfn>
 
 `bbox x0 y0 x1 y1`
 
@@ -108,8 +108,8 @@ the lower-right corner (x1, y1).
   * the values are with reference to the the top-left corner of the document image
     and measured in pixels
   * the order of the values are `x0 y0 x1 y1` = "left top right bottom"
-  * use `x_bboxes` below for character bounding boxes
-  * do not use `bbox` unless the bounding box of the layout component is, in
+  * use 'x_bboxes' below for character bounding boxes
+  * do not use 'bbox' unless the bounding box of the layout component is, in
     fact, rectangular
   * some non-rectangular layout components may have rectangular bounding boxes
     if the non-rectangularity is caused by floating elements around which text flows
@@ -135,7 +135,7 @@ the document image which border is drawn in black.
 
 </div>
 
-### `textangle`
+### <dfn property>textangle</dfn>
 
 `textangle alpha`
 
@@ -150,7 +150,7 @@ which should be indicated using standard HTML properties
 The following properties can apply to most elements but should not be used
 unless there is no alternative:
 
-### `poly`
+### <dfn property>poly</dfn>
 
 `poly x0 y0 x1 y1 ...`
 
@@ -163,11 +163,11 @@ A closed polygon for elements with non-rectangular bounds
   * note that the natural and correct representation of many non-rectangular
     layouts is in terms of rectangular content areas and rectangular floats
   * documents using polygonal borders anywhere must indicate this by adding
-    [[#ocrp_poly]] to the list of `ocr-capabilities` in the
-    [[#required-meta-information]]
-  * documents should attempt to provide a reasonable bbox equivalent as well
+    ''ocr-capabilities/ocrp_poly'' to the list of 'ocr-capabilities' (see
+    [[#required-meta-information]])
+  * documents should attempt to provide a reasonable 'bbox' equivalent as well
 
-### `order`
+### <dfn property>order</dfn>
 
 `order n`
 
@@ -177,27 +177,27 @@ The reading order of the element (an integer)
     the reading order of the page by element ordering within the page, since
     many tools will not be able to deal with content that is not in reading order
 
-### `presence`
+### <dfn property>presence</dfn>
 
 Issue: [Use of property presence](https://github.com/kba/hocr-spec/issues/10)
 
-`presence` presence must be declared in the document meta data
+The presence of the 'presence' property must be declared in the document meta data
 
-### `cflow`
+### <dfn property>cflow</dfn>
 
 `cflow s`
 
-This property relates the flow between multiple [[#ocr_carea]] elements,
-and between [[#ocr_carea]] and [[#ocr_linear]] elements.
+This property relates the flow between multiple <{ocr_carea}> elements,
+and between <{ocr_carea}> and <{ocr_linear}> elements.
 
 The content flow on the page that this element is a part of
 
   * s must be a unique string for each content flow
-  * must be present on [[#ocr_carea]] and [[#ocrx_block]] tags when reading
+  * must be present on <{ocr_carea}> and <{ocrx_block}> tags when reading
     order is attempted and multiple content flows are present
   * presence must be declared in the document meta data
 
-### `baseline`
+### <dfn property>baseline</dfn>
 
 `baseline pn pn-1 ... p0`
 
@@ -220,7 +220,7 @@ contains the following information:
     title="bbox 105 66 823 113; baseline 0.015 -18">...</span>
 ```
 
-bbox is the bounding box of the line in image coordinates (blue). The two
+'bbox' is the bounding box of the line in image coordinates (blue). The two
 numbers for the baseline are the slope (1st number) and constant term (2nd
 number) of a linear equation describing the baseline relative to the bottom
 left corner of the bounding box (red). The baseline crosses the y-axis at `-18`
@@ -237,30 +237,30 @@ and its slope angle is `arctan(0.015) = 0.86°`.
 
 We recognize the following logical structuring elements:
 
-  * `ocr_document`
-    * `ocr_linear`
-      * `ocr_title`
-      * `ocr_author`
-      * `ocr_abstract`
-      * `ocr_part` [`<h1>`]
-        * `ocr_chapter` [`<h1>`]
-          * `ocr_section` [`<h2>`]
+  * <{ocr_document}>
+    * <{ocr_linear}>
+      * <{ocr_title}>
+      * <{ocr_author}>
+      * <{ocr_abstract}>
+      * <{ocr_part}> [`<h1>`]
+        * <{ocr_chapter}> [`<h1>`]
+          * <{ocr_section}> [`<h2>`]
             * `ocr_sub*section` [`<h3>`,`<h4>`]
-              * `ocr_display` 
-              * `ocr_blockquote` [`<blockquote>`]
-              * `ocr_par` [`<p>`]
-
-## `ocr_document`
-## `ocr_title`
-## `ocr_author`
-## `ocr_abstract`
-## `ocr_part`
-## `ocr_chapter`
-## `ocr_section`
-## `ocr_subsubsection`
-## `ocr_display`
-## `ocr_blockquote`
-## `ocr_par`
+              * <{ocr_display}> 
+              * <{ocr_blockquote}> [`<blockquote>`]
+              * <{ocr_par}> [`<p>`]
+
+## <dfn element>ocr_document</dfn>
+## <dfn element>ocr_title</dfn>
+## <dfn element>ocr_author</dfn>
+## <dfn element>ocr_abstract</dfn>
+## <dfn element>ocr_part</dfn>
+## <dfn element>ocr_chapter</dfn>
+## <dfn element>ocr_section</dfn>
+## <dfn element>ocr_subsubsection</dfn>
+## <dfn element>ocr_display</dfn>
+## <dfn element>ocr_blockquote</dfn>
+## <dfn element>ocr_par</dfn>
 
 These logical tags have their standard meaning as used in the publishing
 industry and tools like LaTeX, MS Word, and others.
@@ -270,15 +270,15 @@ with those logical structuring elements, but it may not be possible or
 desirable to actually chose those tags (e.g., when adding hOCR information to
 an existing HTML output routine).
 
-## `ocr_linear`
+### <dfn element>ocr_linear</dfn>
 
-For all of these elements except `ocr_linear`, there exists a natural linear
-ordering defined by reading order (`ocr_linear` indicates that the elements
-contained in it have a linear ordering). At the level of `ocr_linear`, there
-may not be a single distinguished order. A common example of `ocr_linear` is a
+For all of these elements except <{ocr_linear}>, there exists a natural linear
+ordering defined by reading order (<{ocr_linear}> indicates that the elements
+contained in it have a linear ordering). At the level of <{ocr_linear}>, there
+may not be a single distinguished order. A common example of <{ocr_linear}> is a
 newspaper, in which a single newspaper may contain many linear, but there is no
 unique reading order for the different linear. OCR evaluation tools should
-therefore be sensitive to the order of all elements other than `ocr_linear`.
+therefore be sensitive to the order of all elements other than <{ocr_linear}>.
 
 Tags must be nested as indicated by nesting above, but not all tags within the
 hierarchy need to be present.
@@ -289,11 +289,11 @@ text inside the containing element.
 Documents whose logical structure does not map naturally onto these logical
 structuring elemetns must not use them for other purpose.
 
-## `ocr_caption`
+## <dfn element>ocr_caption</dfn>
 
-Image captions may be indicated using the `ocr_caption` element; such an
+Image captions may be indicated using the <{ocr_caption}> element; such an
 element refers to the image(s) contained within the same float, or the
-immediately adjacent image if both the image and the `ocr_caption` element are
+immediately adjacent image if both the image and the <{ocr_caption}> element are
 in running text.
 
 
@@ -332,57 +332,57 @@ properties for floating elements; properties need to be defined for this.
 The following classes, as well as [floats](#classes-for-floats) are used for type-setting
 elements.
 
-### `ocr_page`
+### <dfn element>ocr_page</dfn>
 
-The `ocr_page` element must be present in all hOCR documents.
+The <{ocr_page}> element must be present in all hOCR documents.
 
-### `ocr_column`
+### <dfn element>ocr_column</dfn>
 
 <div class="annoying-warning">
 **OBSOLETE**
 
-Please use [[#ocr_carea]] instead
+Please use <{ocr_carea}> instead
 </div>
 
-### `ocr_carea`
+### <dfn element>ocr_carea</dfn>
 
 "ocr content area" or "body area"
 
 Used to be called <del>ocr_column</del>
 
-The `ocr_carea` elements should appear in reading order unless this is impossible
+The <{ocr_carea}> elements should appear in reading order unless this is impossible
 because of some other structuring requirement. If the document contains multiple
-`ocr_linear` streams, then each `ocr_carea` must indicate which stream it belongs
+<{ocr_linear}> streams, then each <{ocr_carea}> must indicate which stream it belongs
 to.
 
 Note that for many documents, the actual ground truth careas are well-defined
 by the document style of the original document before printing and scanning.
 From a single page, the `careas` of the original document style cannot be
-recovered exactly. However, the partition of a document by `ocr_carea` for an
+recovered exactly. However, the partition of a document by <{ocr_carea}> for an
 individual page shall be considered correct relative to ground truth if
 
   1. all the text contained in a ground truth carea is fully contained within a
-    single `ocr_carea`,
+    single <{ocr_carea}>,
   2. no text outside a ground truth `carea` is contained within an
-    `ocr_carea`, and 
-  3. the `ocr_careas` appear in the same order as the text flow
+    <{ocr_carea}>, and 
+  3. the <{ocr_carea}> elements appear in the same order as the text flow
     relationships between the ground truth careas.
 
-### `ocr_line`
+### <dfn element>ocr_line</dfn>
 
 In typesetting systems, content areas are filled with “blocks”, but most of
 those blocks are not recoverable or semantically meaningful. However, one type
 of block is visible and very important for OCR engines: the line. Lines are
 typesetting blocks that only contain glyphs (“inlines” in XSL terminology).
-They are represented by the `ocr_line` area.
+They are represented by the <{ocr_line}> area.
 
-`ocr_line` should be in a `<span>`
+<{ocr_line}> should be in a `<span>`
 
-### `ocr_separator`
+### <dfn element>ocr_separator</dfn>
 
 Any separator or similar element
 
-### `ocr_noise`
+### <dfn element>ocr_noise</dfn>
 
 Any noise element that isn't part of typesetting
 
@@ -395,7 +395,7 @@ The following properties should be present:
 The bounding box of the page; for pages, the top left corner must be at
 `(0,0)`, so a typical page bounding box will look like `bbox 0 0 2300 3200`
 
-### `image`
+### <dfn property>image</dfn>
 
 `image imagefile`
 
@@ -407,14 +407,14 @@ The bounding box of the page; for pages, the top left corner must be at
   * if the hOCR file is present in a directory hierarchy or file archive, should
     resolve to the corresponding image file
 
-### `imagemd5`
+### <dfn property>imagemd5</dfn>
 
 `imagemd5 checksum`
 
   * MD5 fingerprint of the image file that this page was derived from
   * allows re-associating pages with source images
 
-### `ppageno`
+### <dfn property>ppageno</dfn>
 
 `ppageno n`
 
@@ -424,7 +424,7 @@ The bounding box of the page; for pages, the top left corner must be at
   * must not be present unless the pages in the document have a physical ordering
   * must not be present unless it is well defined and unique
 
-### `lpageno`
+### <dfn property>lpageno</dfn>
 
 `lpageno string`
 
@@ -437,19 +437,19 @@ The bounding box of the page; for pages, the top left corner must be at
 
 The following properties MAY be present:
 
-### `scan_res`
+### <dfn property>scan_res</dfn>
 
 `scan_res x_res y_res`
 
   * scanning resolution in DPI
 
-### `x_scanner`
+### <dfn property>x_scanner</dfn>
 
 `x_scanner string`
 
   * a representation of the scanner
 
-### `x_source`
+### <dfn property>x_source</dfn>
 
 `x_source string`
 
@@ -462,9 +462,9 @@ The following properties MAY be present:
     * `x_source http://pageserver/012345678911&page=17`
 
 In addition to the standard
-properties, the `ocr_line` area supports the following additional properties:
+properties, the <{ocr_line}> area supports the following additional properties:
 
-### `hardbreak`
+### <dfn property>hardbreak</dfn>
 
 `hardbreak n`
 
@@ -473,7 +473,7 @@ properties, the `ocr_line` area supports the following additional properties:
   * a one indicates that the line is a hard (explicit) line break
 
 Any special characters representing the desired end-of-line processing must be
-present inside the `ocr_line` element. Examples of such special characters are a
+present inside the <{ocr_line}> element. Examples of such special characters are a
 soft hyphen ("­", `U+00AD`), a hard line break (`<br>`), or whitespace (` `) for soft
 line breaks.
 
@@ -483,48 +483,48 @@ Floats should not be nested.
 
 The following floats are defined:
 
-### `ocr_float`
+### <dfn element>ocr_float</dfn>
 
 `ocr_float`
 
-### `ocr_separator`
+### <dfn element>ocr_separator</dfn>
 
-`ocr_separator`
+`ocr_separator` in the context of float classes.
 
-### `ocr_textfloat`
+### <dfn element>ocr_textfloat</dfn>
 
 `ocr_textfloat`
 
-### `ocr_textimage`
+### <dfn element>ocr_textimage</dfn>
 
 `ocr_textimage`
 
-### `ocr_image`
+### <dfn element>ocr_image</dfn>
 
 `ocr_image`
 
-### `ocr_linedrawing`
+### <dfn element>ocr_linedrawing</dfn>
 
 Something that could be represented well and naturally in a vector graphics
 format like SVG (even if it is actually represented as PNG)
 
-### `ocr_photo`
+### <dfn element>ocr_photo</dfn>
 
 Something that requires JPEG or PNG to be represented well
 
-### `ocr_header`
+### <dfn element>ocr_header</dfn>
 
 `ocr_header`
 
-### `ocr_footer`
+### <dfn element>ocr_footer</dfn>
 
 `ocr_footer`
 
-### `ocr_pageno`
+### <dfn element>ocr_pageno</dfn>
 
 `ocr_pageno`
 
-### `ocr_table`
+### <dfn element>ocr_table</dfn>
 
 `ocr_table`
 
@@ -534,44 +534,44 @@ There is some content that should behave and flow like text
 
 ## Classes for Inline Representation
 
-### `ocr_glyph`
+### <dfn element>ocr_glyph</dfn>
 
 An individual glyph represented as an image (e.g., an unrecognized character)
 
 Must contain a single `<img>` tag, or be present on one
 
-### `ocr_glyphs`
+### <dfn element>ocr_glyphs</dfn>
 
 Multiple glyphs represented as an image (e.g., an unrecognized word)
 
 Must contain a single `<img>` tag, or be present on one
 
-### `ocr_dropcap`
+### <dfn element>ocr_dropcap</dfn>
 
 An individual glyph representing a dropcap
 
 May contain text or an `<img>` tag; the `alt` of the image tag should contain
 the corresponding text
 
-### `ocr_chem`
+### <dfn element>ocr_chem</dfn>
 
 A chemical formula
 
 Must contain either a single `<img>` tag or [[CML]] markup, or be present on
 one
 
-### `ocr_math`
+### <dfn element>ocr_math</dfn>
 
 A mathematical formula
 
 Must contain either a single `<img>` tag or [[MathML]] markup, or be present on
 one
 
-Mathematical and chemical formulas that float must be put into an `ocr_float`
+Mathematical and chemical formulas that float must be put into an <{ocr_float}>
 section.
 
 Mathematical and chemical formulas that are “display” mode should be put into
-an `ocr_display` section.
+an <{ocr_display}> section.
 
 ### Non-breaking space
 
@@ -586,8 +586,9 @@ Different space widths should be indicated using HTML and `&ensp;`, `&emsp`,
 
 Soft hyphens must be represented using the HTML `&shy;` entity.
 
-The HTML `&lrm;` and `&rlm;` entities (indicating writing direction) must not
-be used; all writing direction changes must be indicated with tags.
+The HTML <a href="https://www.w3.org/TR/REC-html40/struct/dirlang.html#h-8.2.5">`&lrm;` and
+`&rlm;` entities</a> (indicating writing direction) must not be used; all
+writing direction changes must be indicated with tags.
 
 ### Superscript and Subscript
 
@@ -606,20 +607,20 @@ must be represented using their correct Unicode encoding.
 Character-level information may be put on any element that contains only a
 single "line" of text.
 
-### `ocr_cinfo`
+### <dfn element>ocr_cinfo</dfn>
 
-If no other layout element applies, the `ocr_cinfo` element may be used.
+If no other layout element applies, the <{ocr_cinfo}> element may be used.
 
 ## Properties for Character Information
 
-### `cuts`
+### <dfn property>cuts</dfn>
 
 `cuts c1 c2 c3 ...`
 
   * character segmentation cuts (see below)
-  * there must be a bbox property relative to which the cuts can be interpreted
+  * there must be a 'bbox' property relative to which the 'cuts' can be interpreted
 
-### `nlp`
+### <dfn property>nlp</dfn>
 
 `nlp c1 c2 c3 ...`
 
@@ -670,21 +671,21 @@ Common suggested engine-specific markup are:
 
 ## Classes for engine specific markup
 
-### `ocrx_block`
+### <dfn element>ocrx_block</dfn>
 
 Issue: [ocr_carea vs ocrx_block](https://github.com/kba/hocr-spec/issues/28)
 
   * any kind of "block" returned by an OCR system
   * engine-specific because the definition of a "block" depends on the engine
 
-### `ocrx_line`
+### <dfn element>ocrx_line</dfn>
 
 Issue: [ocr_line vs ocrx_line](https://github.com/kba/hocr-spec/issues/19)
 
-  * any kind of "line" returned by an OCR system that differs from the standard ocr_line above
+  * any kind of "line" returned by an OCR system that differs from the standard <{ocr_line}> above
   * might be some kind of "logical" line
 
-### `ocrx_word`
+### <dfn element>ocrx_word</dfn>
 
   * any kind of "word" returned by an OCR system
   * engine specific because the definition of a "word" depends on the engine
@@ -692,42 +693,44 @@ Issue: [ocr_line vs ocrx_line](https://github.com/kba/hocr-spec/issues/19)
 The meaning of these tags is OCR engine specific. However, generators should
 attempt to ensure the following properties:
 
-* an `ocrx_block` should not contain content from multiple ocr_careas
-* the union of all `ocrx_blocks` should approximately cover all `ocr_careas`
-* an `ocrx_block` should contain either a float or body text, but not both
-* an `ocrx_block` should contain either an image or text, but not both
-* an `ocrx_line` should correspond as closely as possible to an `ocr_line`
-* `ocrx_cinfo` should nest inside `ocrx_line`
-* `ocrx_cinfo` should contain only `x_conf`, `x_bboxes`, and `cuts` attributes
+* an <{ocrx_block}> should not contain content from multiple <{ocr_carea}> elements
+* the union of all <{ocrx_block|ocrx_blocks}> should approximately cover all <{ocr_carea}> elements
+* an <{ocrx_block}> should contain either a float or body text, but not both
+* an <{ocrx_block}> should contain either an image or text, but not both
+* an <{ocrx_line}> should correspond as closely as possible to an <{ocr_line}>
+* <{ocrx_cinfo}> should nest inside <{ocrx_line}>
+* <{ocrx_cinfo}> should contain only 'x_confs', 'x_bboxes', and 'cuts' properties
+
+Issue: Is ocrx_cinfo a distinct class, or is <{ocr_cinfo}> meant here?
 
 ## Properties for engine-specific markup
 
 The following properties are defined:
 
-### `x_font`
+### <dfn property>x_font</dfn>
 
 `x_font s`
 
   * OCR-engine specific font names
 
-### `x_fsize`
+### <dfn property>x_fsize</dfn>
 
 `x_fsize n`
 
   * OCR-engine specific font size
 
-### `x_bboxes`
+### <dfn property>x_bboxes</dfn>
 
 `x_bboxes b1x0 b1y0 b1x1 b1y1 b2x0 b2y0 b2x1 b2y1 ...`
 
   * OCR-engine specific boxes associated with each codepoint contained in the
     element
-  * note that the bbox property is a property for the bounding box of a layout
+  * note that the 'bbox' property is a property for the bounding box of a layout
     element, not of individual characters
   * in particular, use `<span class="ocr_cinfo" title="x_bboxes ....">`, not
     `<span class="ocr_cinfo" title="bbox ...">`
 
-### `x_confs`
+### <dfn property>x_confs</dfn>
 
 `x_confs c1 c2 c3 ...`
 
@@ -737,7 +740,7 @@ The following properties are defined:
   * if possible, convert character confidences to values between 0 and 100 and
     have them approximate posterior probabilities (expressed in %)
 
-### `x_wconf`
+### <dfn property>x_wconf</dfn>
 
 `x_wconf n`
 
@@ -777,7 +780,7 @@ Alternative segmentations and readings are indicated by a `<span>` with
 `class="alternatives"`. It must contains `<ins>` and `<del>` elements. The first
 contained element should be `<ins>` and represent the most probable interpretation,
 the subsequent ones `<del>`. Each `<ins>` and `<del>` element should have `class="alt"` and a
-property of either `nlp` or `x_cost`. These `<span>`, `<ins>`, and `<del>` tags can nest
+property of either 'nlp' or 'x_cost'. These `<span>`, `<ins>`, and `<del>` tags can nest
 arbitrarily.
 
 <div class="example">
@@ -798,7 +801,7 @@ when viewed in a browser.
 
 The different levels of layout information (logical, physical, engine-specific)
 each form hierarchies, but those hierarchies may not be mutually compatible;
-for example, a single `ocr_page` may contain information from multiple sections
+for example, a single <{ocr_page}> may contain information from multiple sections
 or chapters. To represent both hierarchies within a single document, elements
 may be grouped together.  That is, two elements with the same class may be
 treated as one element by adding a "groupid identifier" property to them and
@@ -816,8 +819,8 @@ removing tags that are not of interest for the subsequent processing step, and
 then collapsing grouped elements into single elements.  For example, output
 that contains both logical and physical layout information, where the logical
 layout information uses grouped elements, can be transformed by removing all
-the physical layout information, and then collapsing all split `ocr_chapter`
-elements into single `ocr_chapter` elements based on the groupid.  The result is
+the physical layout information, and then collapsing all split <{ocr_chapter}>
+elements into single <{ocr_chapter}> elements based on the groupid.  The result is
 a simple DOM tree.  This transformation can be provided generically as a
 pre-processor or Javascript.
 
@@ -838,23 +841,23 @@ document.
 The capability to generate specific properties is given by the prefix `ocrp_...`;
 the important properties are:
 
-## `ocrp_lang`
+## <dfn value for="ocr-capabilities">ocrp_lang</dfn>
 
 Capable of generating `lang=` attributes
 
-## `ocrp_dir`
+## <dfn value for="ocr-capabilities">ocrp_dir</dfn>
 
 Capable of generating `dir=` attributes
 
-## `ocrp_poly`
+## <dfn value for="ocr-capabilities">ocrp_poly</dfn>
 
 Capable of generating [polygonal bounds](#poly)
 
-## `ocrp_font`
+## <dfn value for="ocr-capabilities">ocrp_font</dfn>
 
 Capable of generating font information (standard font information)
 
-## `ocrp_nlp`
+## <dfn value for="ocr-capabilities">ocrp_nlp</dfn>
 
 Capable of generating [nlp confidences](#nlp)
 
@@ -880,16 +883,31 @@ corresponding element or attribute must not be present in the document.
 
 The OCR system is required to indicate the following using meta tags in the header:
 
+### <dfn property>ocr-system</dfn>
+
   * `<meta name="ocr-system" content="name version"/>`
+
+### <dfn property>ocr-capabilities</dfn>
+
   * `<meta name="ocr-capabilities" content="capabilities"/>`
     * see [[#capabilities]]
 
+## Recommended Meta Information
+
 The OCR system should indicate the following information
 
+### <dfn property>ocr-number-of-pages</dfn>
+
   * `<meta name="ocr-number-of-pages" content="number-of-pages"/>`
+
+### <dfn property>ocr-langs</dfn>
+
   * `<meta name="ocr-langs" content="languages-considered-by-ocr"/>`
     * use [ISO 639-1](https://www.loc.gov/standards/iso639-2/php/code_list.php) codes
     * value may be `unknown`
+
+### <dfn property>ocr-scripts</dfn>
+
   * `<meta name="ocr-scripts" content="scripts-considered-by-ocr"/>`
     * use [ISO 15924](http://www.unicode.org/iso15924/codelists.html) letter codes
     * value may be `unknown`
@@ -930,17 +948,17 @@ Other possible profiles might be defined for specific engines or specific
 document classes:
 
   * common commercial OCR output (e.g., Abbyy)
-    * ocr_page
-    * ocrx_block, ocrx_line, ocrx_word
-    * ocrp_lang
-    * ocrp_font
+    * <{ocr_page}>
+    * <{ocrx_block}>, <{ocrx_line}>, <{ocrx_word}>
+    * ''ocr-capabilities/ocrp_lang''
+    * ''ocr-capabilities/ocrp_font''
   * book target
-    * all logical structuring elements (as applicable), except ocr_linear
-    * ocr_page
+    * all logical structuring elements (as applicable), except <{ocr_linear}>
+    * <{ocr_page}>
   * newspaper target
     * all logical structuring elements (as applicable)
-    * articles map on ocr_linear
-    * ocr_page
+    * articles map on <{ocr_linear}>
+    * <{ocr_page}>
 
 # HTML Markup
 
@@ -1200,3 +1218,7 @@ Issue: [correct MIME type for hOCR?](https://github.com/kba/hocr-spec/issues/27)
   : Applications which use this media type:
   : File extension(s):
   :: `*.html`, `*.hocr`
+
+
+
+<!-- vim: set textwidth=120: -->
diff --git a/1.2/index.html b/1.2/index.html
index a27e208..8bde9db 100644
--- a/1.2/index.html
+++ b/1.2/index.html
@@ -1185,93 +1185,6 @@
             [data-md] > :last-child {
                 margin-bottom: 0;
             }</style>
-<style>/* style-counters */
-
-            body {
-                counter-reset: example figure issue;
-            }
-            .issue {
-                counter-increment: issue;
-            }
-            .issue:not(.no-marker)::before {
-                content: "Issue " counter(issue);
-            }
-
-            .example {
-                counter-increment: example;
-            }
-            .example:not(.no-marker)::before {
-                content: "Example " counter(example);
-            }
-            .invalid.example:not(.no-marker)::before,
-            .illegal.example:not(.no-marker)::before {
-                content: "Invalid Example" counter(example);
-            }
-
-            figcaption {
-                counter-increment: figure;
-            }
-            figcaption:not(.no-marker)::before {
-                content: "Figure " counter(figure) " ";
-            }</style>
-<style>/* style-syntax-highlighting */
-
-        .highlight:not(.idl) { background: hsl(24, 20%, 95%); }
-        code.highlight { padding: .1em; border-radius: .3em; }
-        pre.highlight, pre > code.highlight { display: block; padding: 1em; margin: .5em 0; overflow: auto; border-radius: 0; }
-        .highlight .c { color: #708090 } /* Comment */
-        .highlight .k { color: #990055 } /* Keyword */
-        .highlight .l { color: #000000 } /* Literal */
-        .highlight .n { color: #0077aa } /* Name */
-        .highlight .o { color: #999999 } /* Operator */
-        .highlight .p { color: #999999 } /* Punctuation */
-        .highlight .cm { color: #708090 } /* Comment.Multiline */
-        .highlight .cp { color: #708090 } /* Comment.Preproc */
-        .highlight .c1 { color: #708090 } /* Comment.Single */
-        .highlight .cs { color: #708090 } /* Comment.Special */
-        .highlight .kc { color: #990055 } /* Keyword.Constant */
-        .highlight .kd { color: #990055 } /* Keyword.Declaration */
-        .highlight .kn { color: #990055 } /* Keyword.Namespace */
-        .highlight .kp { color: #990055 } /* Keyword.Pseudo */
-        .highlight .kr { color: #990055 } /* Keyword.Reserved */
-        .highlight .kt { color: #990055 } /* Keyword.Type */
-        .highlight .ld { color: #000000 } /* Literal.Date */
-        .highlight .m { color: #000000 } /* Literal.Number */
-        .highlight .s { color: #a67f59 } /* Literal.String */
-        .highlight .na { color: #0077aa } /* Name.Attribute */
-        .highlight .nc { color: #0077aa } /* Name.Class */
-        .highlight .no { color: #0077aa } /* Name.Constant */
-        .highlight .nd { color: #0077aa } /* Name.Decorator */
-        .highlight .ni { color: #0077aa } /* Name.Entity */
-        .highlight .ne { color: #0077aa } /* Name.Exception */
-        .highlight .nf { color: #0077aa } /* Name.Function */
-        .highlight .nl { color: #0077aa } /* Name.Label */
-        .highlight .nn { color: #0077aa } /* Name.Namespace */
-        .highlight .py { color: #0077aa } /* Name.Property */
-        .highlight .nt { color: #669900 } /* Name.Tag */
-        .highlight .nv { color: #222222 } /* Name.Variable */
-        .highlight .ow { color: #999999 } /* Operator.Word */
-        .highlight .mb { color: #000000 } /* Literal.Number.Bin */
-        .highlight .mf { color: #000000 } /* Literal.Number.Float */
-        .highlight .mh { color: #000000 } /* Literal.Number.Hex */
-        .highlight .mi { color: #000000 } /* Literal.Number.Integer */
-        .highlight .mo { color: #000000 } /* Literal.Number.Oct */
-        .highlight .sb { color: #a67f59 } /* Literal.String.Backtick */
-        .highlight .sc { color: #a67f59 } /* Literal.String.Char */
-        .highlight .sd { color: #a67f59 } /* Literal.String.Doc */
-        .highlight .s2 { color: #a67f59 } /* Literal.String.Double */
-        .highlight .se { color: #a67f59 } /* Literal.String.Escape */
-        .highlight .sh { color: #a67f59 } /* Literal.String.Heredoc */
-        .highlight .si { color: #a67f59 } /* Literal.String.Interpol */
-        .highlight .sx { color: #a67f59 } /* Literal.String.Other */
-        .highlight .sr { color: #a67f59 } /* Literal.String.Regex */
-        .highlight .s1 { color: #a67f59 } /* Literal.String.Single */
-        .highlight .ss { color: #a67f59 } /* Literal.String.Symbol */
-        .highlight .vc { color: #0077aa } /* Name.Variable.Class */
-        .highlight .vg { color: #0077aa } /* Name.Variable.Global */
-        .highlight .vi { color: #0077aa } /* Name.Variable.Instance */
-        .highlight .il { color: #000000 } /* Literal.Number.Integer.Long */
-        </style>
 <style>/* style-selflinks */
 
             .heading, .issue, .note, .example, li, dt {
@@ -1318,6 +1231,35 @@
             a.self-link::before            { content: "¶"; }
             .heading > a.self-link::before { content: "§"; }
             dfn > a.self-link::before      { content: "#"; }</style>
+<style>/* style-counters */
+
+            body {
+                counter-reset: example figure issue;
+            }
+            .issue {
+                counter-increment: issue;
+            }
+            .issue:not(.no-marker)::before {
+                content: "Issue " counter(issue);
+            }
+
+            .example {
+                counter-increment: example;
+            }
+            .example:not(.no-marker)::before {
+                content: "Example " counter(example);
+            }
+            .invalid.example:not(.no-marker)::before,
+            .illegal.example:not(.no-marker)::before {
+                content: "Invalid Example" counter(example);
+            }
+
+            figcaption {
+                counter-increment: figure;
+            }
+            figcaption:not(.no-marker)::before {
+                content: "Figure " counter(figure) " ";
+            }</style>
 <style>/* style-autolinks */
 
             .css.css, .property.property, .descriptor.descriptor {
@@ -1380,11 +1322,106 @@
             [data-link-type=biblio] {
                 white-space: pre;
             }</style>
+<style>/* style-dfn-panel */
+
+        .dfn-panel {
+            position: absolute;
+            z-index: 35;
+            height: auto;
+            width: -webkit-fit-content;
+            width: fit-content;
+            max-width: 300px;
+            max-height: 500px;
+            overflow: auto;
+            padding: 0.5em 0.75em;
+            font: small Helvetica Neue, sans-serif, Droid Sans Fallback;
+            background: #DDDDDD;
+            color: black;
+            border: outset 0.2em;
+        }
+        .dfn-panel:not(.on) { display: none; }
+        .dfn-panel * { margin: 0; padding: 0; text-indent: 0; }
+        .dfn-panel > b { display: block; }
+        .dfn-panel a { color: black; }
+        .dfn-panel a:not(:hover) { text-decoration: none !important; border-bottom: none !important; }
+        .dfn-panel > b + b { margin-top: 0.25em; }
+        .dfn-panel ul { padding: 0; }
+        .dfn-panel li { list-style: inside; }
+        .dfn-panel.activated {
+            display: inline-block;
+            position: fixed;
+            left: .5em;
+            bottom: 2em;
+            margin: 0 auto;
+            max-width: calc(100vw - 1.5em - .4em - .5em);
+            max-height: 30vh;
+        }
+
+        .dfn-paneled { cursor: pointer; }
+        </style>
+<style>/* style-syntax-highlighting */
+
+        .highlight:not(.idl) { background: hsl(24, 20%, 95%); }
+        code.highlight { padding: .1em; border-radius: .3em; }
+        pre.highlight, pre > code.highlight { display: block; padding: 1em; margin: .5em 0; overflow: auto; border-radius: 0; }
+        .highlight .c { color: #708090 } /* Comment */
+        .highlight .k { color: #990055 } /* Keyword */
+        .highlight .l { color: #000000 } /* Literal */
+        .highlight .n { color: #0077aa } /* Name */
+        .highlight .o { color: #999999 } /* Operator */
+        .highlight .p { color: #999999 } /* Punctuation */
+        .highlight .cm { color: #708090 } /* Comment.Multiline */
+        .highlight .cp { color: #708090 } /* Comment.Preproc */
+        .highlight .c1 { color: #708090 } /* Comment.Single */
+        .highlight .cs { color: #708090 } /* Comment.Special */
+        .highlight .kc { color: #990055 } /* Keyword.Constant */
+        .highlight .kd { color: #990055 } /* Keyword.Declaration */
+        .highlight .kn { color: #990055 } /* Keyword.Namespace */
+        .highlight .kp { color: #990055 } /* Keyword.Pseudo */
+        .highlight .kr { color: #990055 } /* Keyword.Reserved */
+        .highlight .kt { color: #990055 } /* Keyword.Type */
+        .highlight .ld { color: #000000 } /* Literal.Date */
+        .highlight .m { color: #000000 } /* Literal.Number */
+        .highlight .s { color: #a67f59 } /* Literal.String */
+        .highlight .na { color: #0077aa } /* Name.Attribute */
+        .highlight .nc { color: #0077aa } /* Name.Class */
+        .highlight .no { color: #0077aa } /* Name.Constant */
+        .highlight .nd { color: #0077aa } /* Name.Decorator */
+        .highlight .ni { color: #0077aa } /* Name.Entity */
+        .highlight .ne { color: #0077aa } /* Name.Exception */
+        .highlight .nf { color: #0077aa } /* Name.Function */
+        .highlight .nl { color: #0077aa } /* Name.Label */
+        .highlight .nn { color: #0077aa } /* Name.Namespace */
+        .highlight .py { color: #0077aa } /* Name.Property */
+        .highlight .nt { color: #669900 } /* Name.Tag */
+        .highlight .nv { color: #222222 } /* Name.Variable */
+        .highlight .ow { color: #999999 } /* Operator.Word */
+        .highlight .mb { color: #000000 } /* Literal.Number.Bin */
+        .highlight .mf { color: #000000 } /* Literal.Number.Float */
+        .highlight .mh { color: #000000 } /* Literal.Number.Hex */
+        .highlight .mi { color: #000000 } /* Literal.Number.Integer */
+        .highlight .mo { color: #000000 } /* Literal.Number.Oct */
+        .highlight .sb { color: #a67f59 } /* Literal.String.Backtick */
+        .highlight .sc { color: #a67f59 } /* Literal.String.Char */
+        .highlight .sd { color: #a67f59 } /* Literal.String.Doc */
+        .highlight .s2 { color: #a67f59 } /* Literal.String.Double */
+        .highlight .se { color: #a67f59 } /* Literal.String.Escape */
+        .highlight .sh { color: #a67f59 } /* Literal.String.Heredoc */
+        .highlight .si { color: #a67f59 } /* Literal.String.Interpol */
+        .highlight .sx { color: #a67f59 } /* Literal.String.Other */
+        .highlight .sr { color: #a67f59 } /* Literal.String.Regex */
+        .highlight .s1 { color: #a67f59 } /* Literal.String.Single */
+        .highlight .ss { color: #a67f59 } /* Literal.String.Symbol */
+        .highlight .vc { color: #0077aa } /* Name.Variable.Class */
+        .highlight .vg { color: #0077aa } /* Name.Variable.Global */
+        .highlight .vi { color: #0077aa } /* Name.Variable.Instance */
+        .highlight .il { color: #000000 } /* Literal.Number.Integer.Long */
+        </style>
  <body class="h-entry">
   <div class="head">
    <p data-fill-with="logo"></p>
    <h1 class="p-name no-ref" id="title">hOCR - OCR Workflow and Output embedded in HTML</h1>
-   <h2 class="no-num no-toc no-ref heading settled" id="subtitle"><span class="content">Living Standard, <time class="dt-updated" datetime="2016-10-17">17 October 2016</time></span></h2>
+   <h2 class="no-num no-toc no-ref heading settled" id="subtitle"><span class="content">Living Standard, <time class="dt-updated" datetime="2016-10-18">18 October 2016</time></span></h2>
    <div data-fill-with="spec-metadata">
     <dl>
      <dt>This version:
@@ -1403,7 +1440,7 @@ <h2 class="no-num no-toc no-ref heading settled" id="subtitle"><span class="cont
    <div data-fill-with="warning"></div>
    <p class="copyright" data-fill-with="copyright"><a href="http://creativecommons.org/publicdomain/zero/1.0/" rel="license"><img alt="CC0" src="https://licensebuttons.net/p/zero/1.0/80x15.png"></a> To the extent possible under law, the editors have waived all copyright
 and related or neighboring rights to this work.
-In addition, as of 17 October 2016,
+In addition, as of 18 October 2016,
 the editors have made this specification available under the <a href="http://www.openwebfoundation.org/legal/the-owf-1-0-agreements/owfa-1-0" rel="license">Open Web Foundation Agreement Version 1.0</a>,
 which is available at http://www.openwebfoundation.org/legal/the-owf-1-0-agreements/owfa-1-0.
 Parts of this work may be from another specification document.  If so, those parts are instead covered by the license of that specification document. </p>
@@ -1425,35 +1462,38 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li>
        <a href="#general-properties"><span class="secno">3.1</span> <span class="content">General Properties</span></a>
        <ol class="toc">
-        <li><a href="#bbox"><span class="secno">3.1.1</span> <span class="content"><code>bbox</code></span></a>
-        <li><a href="#textangle"><span class="secno">3.1.2</span> <span class="content"><code>textangle</code></span></a>
+        <li><a href="#bbox"><span class="secno">3.1.1</span> <span class="content"><span>bbox</span></span></a>
+        <li><a href="#textangle"><span class="secno">3.1.2</span> <span class="content"><span>textangle</span></span></a>
        </ol>
       <li>
        <a href="#non-recommended-general-properties"><span class="secno">3.2</span> <span class="content">Non-recommended general properties</span></a>
        <ol class="toc">
-        <li><a href="#poly"><span class="secno">3.2.1</span> <span class="content"><code>poly</code></span></a>
-        <li><a href="#order"><span class="secno">3.2.2</span> <span class="content"><code>order</code></span></a>
-        <li><a href="#presence"><span class="secno">3.2.3</span> <span class="content"><code>presence</code></span></a>
-        <li><a href="#cflow"><span class="secno">3.2.4</span> <span class="content"><code>cflow</code></span></a>
-        <li><a href="#baseline"><span class="secno">3.2.5</span> <span class="content"><code>baseline</code></span></a>
+        <li><a href="#poly"><span class="secno">3.2.1</span> <span class="content"><span>poly</span></span></a>
+        <li><a href="#order"><span class="secno">3.2.2</span> <span class="content"><span>order</span></span></a>
+        <li><a href="#presence"><span class="secno">3.2.3</span> <span class="content"><span>presence</span></span></a>
+        <li><a href="#cflow"><span class="secno">3.2.4</span> <span class="content"><span>cflow</span></span></a>
+        <li><a href="#baseline"><span class="secno">3.2.5</span> <span class="content"><span>baseline</span></span></a>
        </ol>
      </ol>
     <li>
      <a href="#logical-structuring-elements"><span class="secno">4</span> <span class="content">Logical Structuring Elements</span></a>
      <ol class="toc">
-      <li><a href="#ocr_document"><span class="secno">4.1</span> <span class="content"><code>ocr_document</code></span></a>
-      <li><a href="#ocr_title"><span class="secno">4.2</span> <span class="content"><code>ocr_title</code></span></a>
-      <li><a href="#ocr_author"><span class="secno">4.3</span> <span class="content"><code>ocr_author</code></span></a>
-      <li><a href="#ocr_abstract"><span class="secno">4.4</span> <span class="content"><code>ocr_abstract</code></span></a>
-      <li><a href="#ocr_part"><span class="secno">4.5</span> <span class="content"><code>ocr_part</code></span></a>
-      <li><a href="#ocr_chapter"><span class="secno">4.6</span> <span class="content"><code>ocr_chapter</code></span></a>
-      <li><a href="#ocr_section"><span class="secno">4.7</span> <span class="content"><code>ocr_section</code></span></a>
-      <li><a href="#ocr_subsubsection"><span class="secno">4.8</span> <span class="content"><code>ocr_subsubsection</code></span></a>
-      <li><a href="#ocr_display"><span class="secno">4.9</span> <span class="content"><code>ocr_display</code></span></a>
-      <li><a href="#ocr_blockquote"><span class="secno">4.10</span> <span class="content"><code>ocr_blockquote</code></span></a>
-      <li><a href="#ocr_par"><span class="secno">4.11</span> <span class="content"><code>ocr_par</code></span></a>
-      <li><a href="#ocr_linear"><span class="secno">4.12</span> <span class="content"><code>ocr_linear</code></span></a>
-      <li><a href="#ocr_caption"><span class="secno">4.13</span> <span class="content"><code>ocr_caption</code></span></a>
+      <li><a href="#ocr_document"><span class="secno">4.1</span> <span class="content"><span>ocr_document</span></span></a>
+      <li><a href="#ocr_title"><span class="secno">4.2</span> <span class="content"><span>ocr_title</span></span></a>
+      <li><a href="#ocr_author"><span class="secno">4.3</span> <span class="content"><span>ocr_author</span></span></a>
+      <li><a href="#ocr_abstract"><span class="secno">4.4</span> <span class="content"><span>ocr_abstract</span></span></a>
+      <li><a href="#ocr_part"><span class="secno">4.5</span> <span class="content"><span>ocr_part</span></span></a>
+      <li><a href="#ocr_chapter"><span class="secno">4.6</span> <span class="content"><span>ocr_chapter</span></span></a>
+      <li><a href="#ocr_section"><span class="secno">4.7</span> <span class="content"><span>ocr_section</span></span></a>
+      <li><a href="#ocr_subsubsection"><span class="secno">4.8</span> <span class="content"><span>ocr_subsubsection</span></span></a>
+      <li><a href="#ocr_display"><span class="secno">4.9</span> <span class="content"><span>ocr_display</span></span></a>
+      <li><a href="#ocr_blockquote"><span class="secno">4.10</span> <span class="content"><span>ocr_blockquote</span></span></a>
+      <li>
+       <a href="#ocr_par"><span class="secno">4.11</span> <span class="content"><span>ocr_par</span></span></a>
+       <ol class="toc">
+        <li><a href="#ocr_linear"><span class="secno">4.11.1</span> <span class="content"><span>ocr_linear</span></span></a>
+       </ol>
+      <li><a href="#ocr_caption"><span class="secno">4.12</span> <span class="content"><span>ocr_caption</span></span></a>
      </ol>
     <li>
      <a href="#typesetting-related-elements"><span class="secno">5</span> <span class="content">Typesetting Related Elements</span></a>
@@ -1461,44 +1501,44 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li>
        <a href="#classes-for-typesetting-elements"><span class="secno">5.1</span> <span class="content">Classes for typesetting elements</span></a>
        <ol class="toc">
-        <li><a href="#ocr_page"><span class="secno">5.1.1</span> <span class="content"><code>ocr_page</code></span></a>
-        <li><a href="#ocr_column"><span class="secno">5.1.2</span> <span class="content"><code>ocr_column</code></span></a>
-        <li><a href="#ocr_carea"><span class="secno">5.1.3</span> <span class="content"><code>ocr_carea</code></span></a>
-        <li><a href="#ocr_line"><span class="secno">5.1.4</span> <span class="content"><code>ocr_line</code></span></a>
-        <li><a href="#ocr_separator"><span class="secno">5.1.5</span> <span class="content"><code>ocr_separator</code></span></a>
-        <li><a href="#ocr_noise"><span class="secno">5.1.6</span> <span class="content"><code>ocr_noise</code></span></a>
+        <li><a href="#ocr_page"><span class="secno">5.1.1</span> <span class="content"><span>ocr_page</span></span></a>
+        <li><a href="#ocr_column"><span class="secno">5.1.2</span> <span class="content"><span>ocr_column</span></span></a>
+        <li><a href="#ocr_carea"><span class="secno">5.1.3</span> <span class="content"><span>ocr_carea</span></span></a>
+        <li><a href="#ocr_line"><span class="secno">5.1.4</span> <span class="content"><span>ocr_line</span></span></a>
+        <li><a href="#ocr_separator"><span class="secno">5.1.5</span> <span class="content"><span>ocr_separator</span></span></a>
+        <li><a href="#ocr_noise"><span class="secno">5.1.6</span> <span class="content"><span>ocr_noise</span></span></a>
        </ol>
       <li>
        <a href="#recommended-properties-for-typesetting-elements"><span class="secno">5.2</span> <span class="content">Recommended Properties for typesetting elements</span></a>
        <ol class="toc">
         <li><a href="#bbox-typesetting"><span class="secno">5.2.1</span> <span class="content"><code>bbox (typesetting)</code></span></a>
-        <li><a href="#image"><span class="secno">5.2.2</span> <span class="content"><code>image</code></span></a>
-        <li><a href="#imagemd5"><span class="secno">5.2.3</span> <span class="content"><code>imagemd5</code></span></a>
-        <li><a href="#ppageno"><span class="secno">5.2.4</span> <span class="content"><code>ppageno</code></span></a>
-        <li><a href="#lpageno"><span class="secno">5.2.5</span> <span class="content"><code>lpageno</code></span></a>
+        <li><a href="#image"><span class="secno">5.2.2</span> <span class="content"><span>image</span></span></a>
+        <li><a href="#imagemd5"><span class="secno">5.2.3</span> <span class="content"><span>imagemd5</span></span></a>
+        <li><a href="#ppageno"><span class="secno">5.2.4</span> <span class="content"><span>ppageno</span></span></a>
+        <li><a href="#lpageno"><span class="secno">5.2.5</span> <span class="content"><span>lpageno</span></span></a>
        </ol>
       <li>
        <a href="#optional-properties-for-typesetting-elements"><span class="secno">5.3</span> <span class="content">Optional Properties for typesetting elements</span></a>
        <ol class="toc">
-        <li><a href="#scan_res"><span class="secno">5.3.1</span> <span class="content"><code>scan_res</code></span></a>
-        <li><a href="#x_scanner"><span class="secno">5.3.2</span> <span class="content"><code>x_scanner</code></span></a>
-        <li><a href="#x_source"><span class="secno">5.3.3</span> <span class="content"><code>x_source</code></span></a>
-        <li><a href="#hardbreak"><span class="secno">5.3.4</span> <span class="content"><code>hardbreak</code></span></a>
+        <li><a href="#scan_res"><span class="secno">5.3.1</span> <span class="content"><span>scan_res</span></span></a>
+        <li><a href="#x_scanner"><span class="secno">5.3.2</span> <span class="content"><span>x_scanner</span></span></a>
+        <li><a href="#x_source"><span class="secno">5.3.3</span> <span class="content"><span>x_source</span></span></a>
+        <li><a href="#hardbreak"><span class="secno">5.3.4</span> <span class="content"><span>hardbreak</span></span></a>
        </ol>
       <li>
        <a href="#classes-for-floats"><span class="secno">5.4</span> <span class="content">Classes for floats</span></a>
        <ol class="toc">
-        <li><a href="#ocr_float"><span class="secno">5.4.1</span> <span class="content"><code>ocr_float</code></span></a>
-        <li><a href="#ocr_separator0"><span class="secno">5.4.2</span> <span class="content"><code>ocr_separator</code></span></a>
-        <li><a href="#ocr_textfloat"><span class="secno">5.4.3</span> <span class="content"><code>ocr_textfloat</code></span></a>
-        <li><a href="#ocr_textimage"><span class="secno">5.4.4</span> <span class="content"><code>ocr_textimage</code></span></a>
-        <li><a href="#ocr_image"><span class="secno">5.4.5</span> <span class="content"><code>ocr_image</code></span></a>
-        <li><a href="#ocr_linedrawing"><span class="secno">5.4.6</span> <span class="content"><code>ocr_linedrawing</code></span></a>
-        <li><a href="#ocr_photo"><span class="secno">5.4.7</span> <span class="content"><code>ocr_photo</code></span></a>
-        <li><a href="#ocr_header"><span class="secno">5.4.8</span> <span class="content"><code>ocr_header</code></span></a>
-        <li><a href="#ocr_footer"><span class="secno">5.4.9</span> <span class="content"><code>ocr_footer</code></span></a>
-        <li><a href="#ocr_pageno"><span class="secno">5.4.10</span> <span class="content"><code>ocr_pageno</code></span></a>
-        <li><a href="#ocr_table"><span class="secno">5.4.11</span> <span class="content"><code>ocr_table</code></span></a>
+        <li><a href="#ocr_float"><span class="secno">5.4.1</span> <span class="content"><span>ocr_float</span></span></a>
+        <li><a href="#ocr_separator0"><span class="secno">5.4.2</span> <span class="content"><span>ocr_separator</span></span></a>
+        <li><a href="#ocr_textfloat"><span class="secno">5.4.3</span> <span class="content"><span>ocr_textfloat</span></span></a>
+        <li><a href="#ocr_textimage"><span class="secno">5.4.4</span> <span class="content"><span>ocr_textimage</span></span></a>
+        <li><a href="#ocr_image"><span class="secno">5.4.5</span> <span class="content"><span>ocr_image</span></span></a>
+        <li><a href="#ocr_linedrawing"><span class="secno">5.4.6</span> <span class="content"><span>ocr_linedrawing</span></span></a>
+        <li><a href="#ocr_photo"><span class="secno">5.4.7</span> <span class="content"><span>ocr_photo</span></span></a>
+        <li><a href="#ocr_header"><span class="secno">5.4.8</span> <span class="content"><span>ocr_header</span></span></a>
+        <li><a href="#ocr_footer"><span class="secno">5.4.9</span> <span class="content"><span>ocr_footer</span></span></a>
+        <li><a href="#ocr_pageno"><span class="secno">5.4.10</span> <span class="content"><span>ocr_pageno</span></span></a>
+        <li><a href="#ocr_table"><span class="secno">5.4.11</span> <span class="content"><span>ocr_table</span></span></a>
        </ol>
      </ol>
     <li>
@@ -1507,11 +1547,11 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li>
        <a href="#classes-for-inline-representation"><span class="secno">6.1</span> <span class="content">Classes for Inline Representation</span></a>
        <ol class="toc">
-        <li><a href="#ocr_glyph"><span class="secno">6.1.1</span> <span class="content"><code>ocr_glyph</code></span></a>
-        <li><a href="#ocr_glyphs"><span class="secno">6.1.2</span> <span class="content"><code>ocr_glyphs</code></span></a>
-        <li><a href="#ocr_dropcap"><span class="secno">6.1.3</span> <span class="content"><code>ocr_dropcap</code></span></a>
-        <li><a href="#ocr_chem"><span class="secno">6.1.4</span> <span class="content"><code>ocr_chem</code></span></a>
-        <li><a href="#ocr_math"><span class="secno">6.1.5</span> <span class="content"><code>ocr_math</code></span></a>
+        <li><a href="#ocr_glyph"><span class="secno">6.1.1</span> <span class="content"><span>ocr_glyph</span></span></a>
+        <li><a href="#ocr_glyphs"><span class="secno">6.1.2</span> <span class="content"><span>ocr_glyphs</span></span></a>
+        <li><a href="#ocr_dropcap"><span class="secno">6.1.3</span> <span class="content"><span>ocr_dropcap</span></span></a>
+        <li><a href="#ocr_chem"><span class="secno">6.1.4</span> <span class="content"><span>ocr_chem</span></span></a>
+        <li><a href="#ocr_math"><span class="secno">6.1.5</span> <span class="content"><span>ocr_math</span></span></a>
         <li><a href="#non-breaking-space"><span class="secno">6.1.6</span> <span class="content">Non-breaking space</span></a>
         <li><a href="#non-default-spaces"><span class="secno">6.1.7</span> <span class="content">Non-default spaces</span></a>
         <li><a href="#hyphenation"><span class="secno">6.1.8</span> <span class="content">Hyphenation</span></a>
@@ -1525,13 +1565,13 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li>
        <a href="#classes-for-character-information"><span class="secno">7.1</span> <span class="content">Classes for Character Information</span></a>
        <ol class="toc">
-        <li><a href="#ocr_cinfo"><span class="secno">7.1.1</span> <span class="content"><code>ocr_cinfo</code></span></a>
+        <li><a href="#ocr_cinfo"><span class="secno">7.1.1</span> <span class="content"><span>ocr_cinfo</span></span></a>
        </ol>
       <li>
        <a href="#properties-for-character-information"><span class="secno">7.2</span> <span class="content">Properties for Character Information</span></a>
        <ol class="toc">
-        <li><a href="#cuts"><span class="secno">7.2.1</span> <span class="content"><code>cuts</code></span></a>
-        <li><a href="#nlp"><span class="secno">7.2.2</span> <span class="content"><code>nlp</code></span></a>
+        <li><a href="#cuts"><span class="secno">7.2.1</span> <span class="content"><span>cuts</span></span></a>
+        <li><a href="#nlp"><span class="secno">7.2.2</span> <span class="content"><span>nlp</span></span></a>
        </ol>
      </ol>
     <li>
@@ -1540,18 +1580,18 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li>
        <a href="#classes-for-engine-specific-markup"><span class="secno">8.1</span> <span class="content">Classes for engine specific markup</span></a>
        <ol class="toc">
-        <li><a href="#ocrx_block"><span class="secno">8.1.1</span> <span class="content"><code>ocrx_block</code></span></a>
-        <li><a href="#ocrx_line"><span class="secno">8.1.2</span> <span class="content"><code>ocrx_line</code></span></a>
-        <li><a href="#ocrx_word"><span class="secno">8.1.3</span> <span class="content"><code>ocrx_word</code></span></a>
+        <li><a href="#ocrx_block"><span class="secno">8.1.1</span> <span class="content"><span>ocrx_block</span></span></a>
+        <li><a href="#ocrx_line"><span class="secno">8.1.2</span> <span class="content"><span>ocrx_line</span></span></a>
+        <li><a href="#ocrx_word"><span class="secno">8.1.3</span> <span class="content"><span>ocrx_word</span></span></a>
        </ol>
       <li>
        <a href="#properties-for-engine-specific-markup"><span class="secno">8.2</span> <span class="content">Properties for engine-specific markup</span></a>
        <ol class="toc">
-        <li><a href="#x_font"><span class="secno">8.2.1</span> <span class="content"><code>x_font</code></span></a>
-        <li><a href="#x_fsize"><span class="secno">8.2.2</span> <span class="content"><code>x_fsize</code></span></a>
-        <li><a href="#x_bboxes"><span class="secno">8.2.3</span> <span class="content"><code>x_bboxes</code></span></a>
-        <li><a href="#x_confs"><span class="secno">8.2.4</span> <span class="content"><code>x_confs</code></span></a>
-        <li><a href="#x_wconf"><span class="secno">8.2.5</span> <span class="content"><code>x_wconf</code></span></a>
+        <li><a href="#x_font"><span class="secno">8.2.1</span> <span class="content"><span>x_font</span></span></a>
+        <li><a href="#x_fsize"><span class="secno">8.2.2</span> <span class="content"><span>x_fsize</span></span></a>
+        <li><a href="#x_bboxes"><span class="secno">8.2.3</span> <span class="content"><span>x_bboxes</span></span></a>
+        <li><a href="#x_confs"><span class="secno">8.2.4</span> <span class="content"><span>x_confs</span></span></a>
+        <li><a href="#x_wconf"><span class="secno">8.2.5</span> <span class="content"><span>x_wconf</span></span></a>
        </ol>
      </ol>
     <li><a href="#font-text-color-language-direction"><span class="secno">9</span> <span class="content">Font, Text Color, Language, Direction</span></a>
@@ -1560,19 +1600,31 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
     <li>
      <a href="#capabilities"><span class="secno">12</span> <span class="content">Capabilities</span></a>
      <ol class="toc">
-      <li><a href="#ocrp_lang"><span class="secno">12.1</span> <span class="content"><code>ocrp_lang</code></span></a>
-      <li><a href="#ocrp_dir"><span class="secno">12.2</span> <span class="content"><code>ocrp_dir</code></span></a>
-      <li><a href="#ocrp_poly"><span class="secno">12.3</span> <span class="content"><code>ocrp_poly</code></span></a>
-      <li><a href="#ocrp_font"><span class="secno">12.4</span> <span class="content"><code>ocrp_font</code></span></a>
-      <li><a href="#ocrp_nlp"><span class="secno">12.5</span> <span class="content"><code>ocrp_nlp</code></span></a>
+      <li><a href="#ocrp_lang"><span class="secno">12.1</span> <span class="content"><span>ocrp_lang</span></span></a>
+      <li><a href="#ocrp_dir"><span class="secno">12.2</span> <span class="content"><span>ocrp_dir</span></span></a>
+      <li><a href="#ocrp_poly"><span class="secno">12.3</span> <span class="content"><span>ocrp_poly</span></span></a>
+      <li><a href="#ocrp_font"><span class="secno">12.4</span> <span class="content"><span>ocrp_font</span></span></a>
+      <li><a href="#ocrp_nlp"><span class="secno">12.5</span> <span class="content"><span>ocrp_nlp</span></span></a>
       <li><a href="#ocr_embeddedformat_formatname"><span class="secno">12.6</span> <span class="content"><code>ocr_embeddedformat_&lt;formatname></code></span></a>
       <li><a href="#ocr_tag_unordered"><span class="secno">12.7</span> <span class="content"><code>ocr_&lt;tag>_unordered</code></span></a>
      </ol>
     <li>
      <a href="#metadata"><span class="secno">13</span> <span class="content">Metadata</span></a>
      <ol class="toc">
-      <li><a href="#required-meta-information"><span class="secno">13.1</span> <span class="content">Required Meta Information</span></a>
-      <li><a href="#document-metadata"><span class="secno">13.2</span> <span class="content">Document metadata</span></a>
+      <li>
+       <a href="#required-meta-information"><span class="secno">13.1</span> <span class="content">Required Meta Information</span></a>
+       <ol class="toc">
+        <li><a href="#ocr-system"><span class="secno">13.1.1</span> <span class="content"><span>ocr-system</span></span></a>
+        <li><a href="#ocr-capabilities"><span class="secno">13.1.2</span> <span class="content"><span>ocr-capabilities</span></span></a>
+       </ol>
+      <li>
+       <a href="#recommended-meta-information"><span class="secno">13.2</span> <span class="content">Recommended Meta Information</span></a>
+       <ol class="toc">
+        <li><a href="#ocr-number-of-pages"><span class="secno">13.2.1</span> <span class="content"><span>ocr-number-of-pages</span></span></a>
+        <li><a href="#ocr-langs"><span class="secno">13.2.2</span> <span class="content"><span>ocr-langs</span></span></a>
+        <li><a href="#ocr-scripts"><span class="secno">13.2.3</span> <span class="content"><span>ocr-scripts</span></span></a>
+       </ol>
+      <li><a href="#document-metadata"><span class="secno">13.3</span> <span class="content">Document metadata</span></a>
      </ol>
     <li><a href="#profiles"><span class="secno">14</span> <span class="content">Profiles</span></a>
     <li>
@@ -1610,6 +1662,11 @@ <h2 class="no-num no-toc no-ref" id="contents">Table of Contents</h2>
       <li><a href="#media-type"><span class="secno">17.1</span> <span class="content">Media Type</span></a>
      </ol>
     <li><a href="#conformance"><span class="secno"></span> <span class="content"> Conformance</span></a>
+    <li>
+     <a href="#index"><span class="secno"></span> <span class="content">Index</span></a>
+     <ol class="toc">
+      <li><a href="#index-defined-here"><span class="secno"></span> <span class="content">Terms defined by this specification</span></a>
+     </ol>
     <li>
      <a href="#references"><span class="secno"></span> <span class="content">References</span></a>
      <ol class="toc">
@@ -1630,7 +1687,7 @@ <h2 class="heading settled" data-level="2" id="introduction"><span class="secno"
 arrive at a representation that makes it easy to reuse OCR results.</p>
    <p>This document describes many tags and a lot of information that can be output.
 However, getting started with hOCR is easy: you only need to output the tags
-and information you actually want to.  For example, just outputting <code>ocr_line</code> tags with bounding boxes is already very useful for many applications.  Just
+and information you actually want to.  For example, just outputting <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-1">ocr_line</a></code> tags with bounding boxes is already very useful for many applications.  Just
 start simple and add more output information as the need arises.</p>
    <h2 class="heading settled" data-level="3" id="terminology-and-representation"><span class="secno">3. </span><span class="content">Terminology and Representation</span><a class="self-link" href="#terminology-and-representation"></a></h2>
    <p>This document describes a representation of various aspects of OCR output in an
@@ -1670,17 +1727,17 @@ <h2 class="heading settled" data-level="3" id="terminology-and-representation"><
 multiple properties are separated by semicolons.</p>
    <div class="example" id="example-b38b5c14">
     <a class="self-link" href="#example-b38b5c14"></a> 
-<pre class="language-html highlight"><span class="nt">&lt;div</span> <span class="na">class=</span><span class="s">"ocr_page"</span> <span class="na">id=</span><span class="s">"page_1"</span><span class="nt">></span>
-  <span class="nt">&lt;div</span> <span class="na">class=</span><span class="s">"ocr_carea"</span> <span class="na">id=</span><span class="s">"column_2"</span> <span class="na">title=</span><span class="s">"bbox 313 324 733 1922"</span><span class="nt">></span>
-    <span class="nt">&lt;div</span> <span class="na">class=</span><span class="s">"ocr_par"</span> <span class="na">id=</span><span class="s">"par_7"</span><span class="nt">></span> ... <span class="nt">&lt;/div></span>
-    <span class="nt">&lt;div</span> <span class="na">class=</span><span class="s">"ocr_par"</span> <span class="na">id=</span><span class="s">"par_19"</span><span class="nt">></span> ... <span class="nt">&lt;/div></span>
-  <span class="nt">&lt;/div></span>
-<span class="nt">&lt;/div></span>
+<pre class="language-html highlight"><span class="p">&lt;</span><span class="nt">div</span> <span class="na">class</span><span class="o">=</span><span class="s">"ocr_page"</span> <span class="na">id</span><span class="o">=</span><span class="s">"page_1"</span><span class="p">></span>
+  <span class="p">&lt;</span><span class="nt">div</span> <span class="na">class</span><span class="o">=</span><span class="s">"ocr_carea"</span> <span class="na">id</span><span class="o">=</span><span class="s">"column_2"</span> <span class="na">title</span><span class="o">=</span><span class="s">"bbox 313 324 733 1922"</span><span class="p">></span>
+    <span class="p">&lt;</span><span class="nt">div</span> <span class="na">class</span><span class="o">=</span><span class="s">"ocr_par"</span> <span class="na">id</span><span class="o">=</span><span class="s">"par_7"</span><span class="p">></span> ... <span class="p">&lt;</span><span class="p">/</span><span class="nt">div</span><span class="p">></span>
+    <span class="p">&lt;</span><span class="nt">div</span> <span class="na">class</span><span class="o">=</span><span class="s">"ocr_par"</span> <span class="na">id</span><span class="o">=</span><span class="s">"par_19"</span><span class="p">></span> ... <span class="p">&lt;</span><span class="p">/</span><span class="nt">div</span><span class="p">></span>
+  <span class="p">&lt;</span><span class="p">/</span><span class="nt">div</span><span class="p">></span>
+<span class="p">&lt;</span><span class="p">/</span><span class="nt">div</span><span class="p">></span>
 </pre>
    </div>
    <h3 class="heading settled" data-level="3.1" id="general-properties"><span class="secno">3.1. </span><span class="content">General Properties</span><a class="self-link" href="#general-properties"></a></h3>
    <p>The following properties can apply to most elements (where it makes sense):</p>
-   <h4 class="heading settled" data-level="3.1.1" id="bbox"><span class="secno">3.1.1. </span><span class="content"><code>bbox</code></span><a class="self-link" href="#bbox"></a></h4>
+   <h4 class="heading settled" data-level="3.1.1" id="bbox"><span class="secno">3.1.1. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-bbox">bbox</dfn></span><a class="self-link" href="#bbox"></a></h4>
    <p><code>bbox x0 y0 x1 y1</code></p>
    <p>The <code>bbox</code> - short for "bounding box" - of an element is a rectangular box
 around this element, which is defined by the upper-left corner (x0, y0) and
@@ -1692,9 +1749,9 @@ <h4 class="heading settled" data-level="3.1.1" id="bbox"><span class="secno">3.1
     <li data-md="">
      <p>the order of the values is <code>x0 y0 x1 y1</code> = "left top right bottom"</p>
     <li data-md="">
-     <p>use <code>x_bboxes</code> below for character bounding boxes</p>
+     <p>use <a class="property" data-link-type="propdesc" href="#propdef-x_bboxes" id="ref-for-propdef-x_bboxes-1">x_bboxes</a> below for character bounding boxes</p>
     <li data-md="">
-     <p>do not use <code>bbox</code> unless the bounding box of the layout component is, in
+     <p>do not use <a class="property" data-link-type="propdesc" href="#propdef-bbox" id="ref-for-propdef-bbox-1">bbox</a> unless the bounding box of the layout component is, in
 fact, rectangular</p>
     <li data-md="">
      <p>some non-rectangular layout components may have rectangular bounding boxes
@@ -1703,8 +1760,8 @@ <h4 class="heading settled" data-level="3.1.1" id="bbox"><span class="secno">3.1
    <p>See also the section <a href="#bbox-typesetting">§5.2.1 bbox (typesetting)</a>.</p>
    <div class="example" id="example-d34e6dbe">
     <a class="self-link" href="#example-d34e6dbe"></a> 
-<pre class="language-html highlight"><span class="nt">&lt;span</span> <span class="na">class=</span><span class="s">'ocr_line'</span> <span class="na">id=</span><span class="s">'line_1'</span>
-    <span class="na">title=</span><span class="s">"bbox 10 20 160 30"</span><span class="nt">></span>...<span class="nt">&lt;/span></span>
+<pre class="language-html highlight"><span class="p">&lt;</span><span class="nt">span</span> <span class="na">class</span><span class="o">=</span><span class="s">'ocr_line'</span> <span class="na">id</span><span class="o">=</span><span class="s">'line_1'</span>
+    <span class="na">title</span><span class="o">=</span><span class="s">"bbox 10 20 160 30"</span><span class="p">></span>...<span class="p">&lt;</span><span class="p">/</span><span class="nt">span</span><span class="p">></span>
 </pre>
     <p>The bounding box <code>bbox</code> of this line is shown in blue and is spanned
 by the upper-left corner (10, 20) and the lower-right corner (160, 30).
@@ -1712,7 +1769,7 @@ <h4 class="heading settled" data-level="3.1.1" id="bbox"><span class="secno">3.1
 the document image, whose border is drawn in black.</p>
     <figure><img alt="bbox explained" src="../images/bbox-crop.png"> </figure>
    </div>
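<p>As a rough illustration of how a consumer might read this property, the sketch below splits a <code>title</code> attribute on semicolons and extracts the four <code>bbox</code> values. The function name <code>parse_bbox</code> is hypothetical and not part of hOCR; the sketch assumes integer pixel coordinates, as in the examples above.</p>

```python
def parse_bbox(title):
    """Return (x0, y0, x1, y1) from an hOCR title string, or None if absent.

    Hypothetical helper: assumes properties are separated by semicolons
    and that bbox carries exactly four integer values.
    """
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            x0, y0, x1, y1 = (int(v) for v in parts[1:])
            return x0, y0, x1, y1
    return None


# Values taken from the ocr_line example above.
x0, y0, x1, y1 = parse_bbox("bbox 10 20 160 30")
width, height = x1 - x0, y1 - y0  # left/top/right/bottom -> size in pixels
```

<p>Because the values are ordered "left top right bottom", the width and height follow directly from the differences of the corner coordinates.</p>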
-   <h4 class="heading settled" data-level="3.1.2" id="textangle"><span class="secno">3.1.2. </span><span class="content"><code>textangle</code></span><a class="self-link" href="#textangle"></a></h4>
+   <h4 class="heading settled" data-level="3.1.2" id="textangle"><span class="secno">3.1.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-textangle">textangle<a class="self-link" href="#propdef-textangle"></a></dfn></span><a class="self-link" href="#textangle"></a></h4>
    <p><code>textangle alpha</code></p>
    <p>The angle in degrees by which textual content has been rotated relative to the
 rest of the page (if not present, the angle is assumed to be zero); rotations
@@ -1722,7 +1779,7 @@ <h4 class="heading settled" data-level="3.1.2" id="textangle"><span class="secno
    <h3 class="heading settled" data-level="3.2" id="non-recommended-general-properties"><span class="secno">3.2. </span><span class="content">Non-recommended general properties</span><a class="self-link" href="#non-recommended-general-properties"></a></h3>
    <p>The following properties can apply to most elements but should not be used
 unless there is no alternative:</p>
-   <h4 class="heading settled" data-level="3.2.1" id="poly"><span class="secno">3.2.1. </span><span class="content"><code>poly</code></span><a class="self-link" href="#poly"></a></h4>
+   <h4 class="heading settled" data-level="3.2.1" id="poly"><span class="secno">3.2.1. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-poly">poly<a class="self-link" href="#propdef-poly"></a></dfn></span><a class="self-link" href="#poly"></a></h4>
    <p><code>poly x0 y0 x1 y1 ...</code></p>
    <p>A closed polygon for elements with non-rectangular bounds</p>
    <ul>
@@ -1735,11 +1792,11 @@ <h4 class="heading settled" data-level="3.2.1" id="poly"><span class="secno">3.2
      <p>note that the natural and correct representation of many non-rectangular
 layouts is in terms of rectangular content areas and rectangular floats</p>
     <li data-md="">
-     <p>documents using polygonal borders anywhere must indicate this by adding <a href="#ocrp_poly">§12.3 ocrp_poly</a> to the list of <code>ocr-capabilities</code> in the <a href="#required-meta-information">§13.1 Required Meta Information</a></p>
+     <p>documents using polygonal borders anywhere must indicate this by adding <a class="css" data-link-type="maybe" href="#valdef-ocr-capabilities-ocrp_poly" id="ref-for-valdef-ocr-capabilities-ocrp_poly-1">ocrp_poly</a> to the list of <a class="property" data-link-type="propdesc" href="#propdef-ocr-capabilities" id="ref-for-propdef-ocr-capabilities-1">ocr-capabilities</a> (see <a href="#required-meta-information">§13.1 Required Meta Information</a>)</p>
     <li data-md="">
-     <p>documents should attempt to provide a reasonable bbox equivalent as well</p>
+     <p>documents should attempt to provide a reasonable <a class="property" data-link-type="propdesc" href="#propdef-bbox" id="ref-for-propdef-bbox-2">bbox</a> equivalent as well</p>
    </ul>
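<p>One possible way to derive the recommended bbox equivalent from a <code>poly</code> value is to take the extremes of the polygon's vertices. The sketch below is only an illustration: the helper name <code>poly_to_bbox</code> and the sample polygon are made up, and integer pixel coordinates are assumed.</p>

```python
def poly_to_bbox(prop):
    """Compute an enclosing bbox (x0, y0, x1, y1) for a 'poly x0 y0 x1 y1 ...' property.

    Hypothetical helper: requires at least three vertices and an even
    number of coordinate values.
    """
    parts = prop.split()
    if not parts or parts[0] != "poly":
        raise ValueError("not a poly property")
    coords = [int(v) for v in parts[1:]]
    if len(coords) < 6 or len(coords) % 2 != 0:
        raise ValueError("poly needs at least three x/y pairs")
    xs, ys = coords[0::2], coords[1::2]
    return min(xs), min(ys), max(xs), max(ys)


# An L-shaped region (illustrative values only).
bbox = poly_to_bbox("poly 10 20 160 20 160 60 80 60 80 30 10 30")
```

<p>The result is the smallest axis-aligned rectangle containing every vertex, which is a reasonable fallback for tools that only understand rectangular bounds.</p>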
-   <h4 class="heading settled" data-level="3.2.2" id="order"><span class="secno">3.2.2. </span><span class="content"><code>order</code></span><a class="self-link" href="#order"></a></h4>
+   <h4 class="heading settled" data-level="3.2.2" id="order"><span class="secno">3.2.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-order">order<a class="self-link" href="#propdef-order"></a></dfn></span><a class="self-link" href="#order"></a></h4>
    <p><code>order n</code></p>
    <p>The reading order of the element (an integer)</p>
    <ul>
@@ -1748,36 +1805,36 @@ <h4 class="heading settled" data-level="3.2.2" id="order"><span class="secno">3.
 the reading order of the page by element ordering within the page, since
 many tools will not be able to deal with content that is not in reading order</p>
    </ul>
-   <h4 class="heading settled" data-level="3.2.3" id="presence"><span class="secno">3.2.3. </span><span class="content"><code>presence</code></span><a class="self-link" href="#presence"></a></h4>
+   <h4 class="heading settled" data-level="3.2.3" id="presence"><span class="secno">3.2.3. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-presence">presence</dfn></span><a class="self-link" href="#presence"></a></h4>
    <p class="issue" id="issue-4c7527e8"><a class="self-link" href="#issue-4c7527e8"></a> <a href="https://github.com/kba/hocr-spec/issues/10">Use of property presence</a></p>
-   <p><code>presence</code> presence must be declared in the document meta data</p>
-   <h4 class="heading settled" data-level="3.2.4" id="cflow"><span class="secno">3.2.4. </span><span class="content"><code>cflow</code></span><a class="self-link" href="#cflow"></a></h4>
+   <p><a class="property" data-link-type="propdesc" href="#propdef-presence" id="ref-for-propdef-presence-1">presence</a> presence must be declared in the document meta data</p>
+   <h4 class="heading settled" data-level="3.2.4" id="cflow"><span class="secno">3.2.4. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-cflow">cflow<a class="self-link" href="#propdef-cflow"></a></dfn></span><a class="self-link" href="#cflow"></a></h4>
    <p><code>cflow s</code></p>
-   <p>This property relates the flow between multiple <a href="#ocr_carea">§5.1.3 ocr_carea</a> elements,
-and between <a href="#ocr_carea">§5.1.3 ocr_carea</a> and <a href="#ocr_linear">§4.12 ocr_linear</a> elements.</p>
+   <p>This property relates the flow between multiple <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-1">ocr_carea</a></code> elements,
+and between <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-2">ocr_carea</a></code> and <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-1">ocr_linear</a></code> elements.</p>
    <p>The content flow on the page that this element is a part of</p>
    <ul>
     <li data-md="">
      <p>s must be a unique string for each content flow</p>
     <li data-md="">
-     <p>must be present on <a href="#ocr_carea">§5.1.3 ocr_carea</a> and <a href="#ocrx_block">§8.1.1 ocrx_block</a> tags when reading
+     <p>must be present on <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-3">ocr_carea</a></code> and <code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-1">ocrx_block</a></code> tags when reading
 order is attempted and multiple content flows are present</p>
     <li data-md="">
      <p>presence must be declared in the document meta data</p>
    </ul>
-   <h4 class="heading settled" data-level="3.2.5" id="baseline"><span class="secno">3.2.5. </span><span class="content"><code>baseline</code></span><a class="self-link" href="#baseline"></a></h4>
+   <h4 class="heading settled" data-level="3.2.5" id="baseline"><span class="secno">3.2.5. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-baseline">baseline<a class="self-link" href="#propdef-baseline"></a></dfn></span><a class="self-link" href="#baseline"></a></h4>
    <p><code>baseline pn pn-1 ... p0</code></p>
    <p>This property applies primarily to textlines.</p>
    <p>The baseline is described by a polynomial of order <code>n</code> with the coefficients <code>pn ... p0</code> with <code>n = 1</code> for a linear (i.e. straight) line.</p>
    <p>The polynomial is in the coordinate system of the line, with the bottom left of
 the bounding box as the origin.</p>
-   <div class="example" id="example-11ed160f">
-    <a class="self-link" href="#example-11ed160f"></a> 
+   <div class="example" id="example-f7fe2e88">
+    <a class="self-link" href="#example-f7fe2e88"></a> 
     <p>The hOCR output for the first line of <a href="https://github.com/tesseract-ocr/tesseract/blob/master/testing/eurotext.tif">eurotext.tif</a> contains the following information:</p>
-<pre class="language-html highlight"><span class="nt">&lt;span</span> <span class="na">class=</span><span class="s">'ocr_line'</span> <span class="na">id=</span><span class="s">'line_1_1'</span>
-    <span class="na">title=</span><span class="s">"bbox 105 66 823 113; baseline 0.015 -18"</span><span class="nt">></span>...<span class="nt">&lt;/span></span>
+<pre class="language-html highlight"><span class="p">&lt;</span><span class="nt">span</span> <span class="na">class</span><span class="o">=</span><span class="s">'ocr_line'</span> <span class="na">id</span><span class="o">=</span><span class="s">'line_1_1'</span>
+    <span class="na">title</span><span class="o">=</span><span class="s">"bbox 105 66 823 113; baseline 0.015 -18"</span><span class="p">></span>...<span class="p">&lt;</span><span class="p">/</span><span class="nt">span</span><span class="p">></span>
 </pre>
-    <p>bbox is the bounding box of the line in image coordinates (blue). The two
+    <p><a class="property" data-link-type="propdesc" href="#propdef-bbox" id="ref-for-propdef-bbox-3">bbox</a> is the bounding box of the line in image coordinates (blue). The two
 numbers for the baseline are the slope (1st number) and constant term (2nd
 number) of a linear equation describing the baseline relative to the bottom
 left corner of the bounding box (red). The baseline crosses the y-axis at <code>-18</code> and its slope angle is <code>arctan(0.015) = 0.86°</code>.</p>
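<p>The arithmetic in this example can be reproduced with a short sketch. The helper names <code>parse_baseline</code> and <code>baseline_y</code> are hypothetical; the code evaluates the polynomial in the line's own coordinate system only and makes no assumption about how to map the result back to image coordinates.</p>

```python
import math


def parse_baseline(title):
    """Return the baseline coefficients [pn, ..., p0] from a title string, or None.

    Hypothetical helper: expects at least two coefficients (a linear baseline).
    """
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) >= 3 and parts[0] == "baseline":
            return [float(v) for v in parts[1:]]
    return None


def baseline_y(coeffs, x):
    """Evaluate the baseline polynomial at x using Horner's scheme."""
    y = 0.0
    for c in coeffs:
        y = y * x + c
    return y


coeffs = parse_baseline("bbox 105 66 823 113; baseline 0.015 -18")
intercept = baseline_y(coeffs, 0)                 # constant term p0
slope_deg = math.degrees(math.atan(coeffs[-2]))   # angle of the linear term p1
```

<p>For the values above, the intercept is <code>-18</code> and the slope angle rounds to <code>0.86</code> degrees, matching the figures quoted in the example.</p>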
@@ -1787,71 +1844,71 @@ <h2 class="heading settled" data-level="4" id="logical-structuring-elements"><sp
    <p>We recognize the following logical structuring elements:</p>
    <ul>
     <li data-md="">
-     <p><code>ocr_document</code></p>
+     <p><code><a data-link-type="element" href="#elementdef-ocr_document" id="ref-for-elementdef-ocr_document-1">ocr_document</a></code></p>
      <ul>
       <li data-md="">
-       <p><code>ocr_linear</code></p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-2">ocr_linear</a></code></p>
       <li data-md="">
-       <p><code>ocr_title</code></p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_title" id="ref-for-elementdef-ocr_title-1">ocr_title</a></code></p>
       <li data-md="">
-       <p><code>ocr_author</code></p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_author" id="ref-for-elementdef-ocr_author-1">ocr_author</a></code></p>
       <li data-md="">
-       <p><code>ocr_abstract</code></p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_abstract" id="ref-for-elementdef-ocr_abstract-1">ocr_abstract</a></code></p>
       <li data-md="">
-       <p><code>ocr_part</code> [<code>&lt;h1></code>]</p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_part" id="ref-for-elementdef-ocr_part-1">ocr_part</a></code> [<code>&lt;h1></code>]</p>
        <ul>
         <li data-md="">
-         <p><code>ocr_chapter</code> [<code>&lt;h1></code>]</p>
+         <p><code><a data-link-type="element" href="#elementdef-ocr_chapter" id="ref-for-elementdef-ocr_chapter-1">ocr_chapter</a></code> [<code>&lt;h1></code>]</p>
         <li data-md="">
-         <p><code>ocr_section</code> [<code>&lt;h2></code>]</p>
+         <p><code><a data-link-type="element" href="#elementdef-ocr_section" id="ref-for-elementdef-ocr_section-1">ocr_section</a></code> [<code>&lt;h2></code>]</p>
          <ul>
           <li data-md="">
            <p><code>ocr_sub*section</code> [<code>&lt;h3></code>,<code>&lt;h4></code>]</p>
           <li data-md="">
-           <p><code>ocr_display</code></p>
+           <p><code><a data-link-type="element" href="#elementdef-ocr_display" id="ref-for-elementdef-ocr_display-1">ocr_display</a></code></p>
           <li data-md="">
-           <p><code>ocr_blockquote</code> [<code>&lt;blockquote></code>]</p>
+           <p><code><a data-link-type="element" href="#elementdef-ocr_blockquote" id="ref-for-elementdef-ocr_blockquote-1">ocr_blockquote</a></code> [<code>&lt;blockquote></code>]</p>
           <li data-md="">
-           <p><code>ocr_par</code> [<code>&lt;p></code>]</p>
+           <p><code><a data-link-type="element" href="#elementdef-ocr_par" id="ref-for-elementdef-ocr_par-1">ocr_par</a></code> [<code>&lt;p></code>]</p>
          </ul>
        </ul>
      </ul>
    </ul>
-   <h3 class="heading settled" data-level="4.1" id="ocr_document"><span class="secno">4.1. </span><span class="content"><code>ocr_document</code></span><a class="self-link" href="#ocr_document"></a></h3>
-   <h3 class="heading settled" data-level="4.2" id="ocr_title"><span class="secno">4.2. </span><span class="content"><code>ocr_title</code></span><a class="self-link" href="#ocr_title"></a></h3>
-   <h3 class="heading settled" data-level="4.3" id="ocr_author"><span class="secno">4.3. </span><span class="content"><code>ocr_author</code></span><a class="self-link" href="#ocr_author"></a></h3>
-   <h3 class="heading settled" data-level="4.4" id="ocr_abstract"><span class="secno">4.4. </span><span class="content"><code>ocr_abstract</code></span><a class="self-link" href="#ocr_abstract"></a></h3>
-   <h3 class="heading settled" data-level="4.5" id="ocr_part"><span class="secno">4.5. </span><span class="content"><code>ocr_part</code></span><a class="self-link" href="#ocr_part"></a></h3>
-   <h3 class="heading settled" data-level="4.6" id="ocr_chapter"><span class="secno">4.6. </span><span class="content"><code>ocr_chapter</code></span><a class="self-link" href="#ocr_chapter"></a></h3>
-   <h3 class="heading settled" data-level="4.7" id="ocr_section"><span class="secno">4.7. </span><span class="content"><code>ocr_section</code></span><a class="self-link" href="#ocr_section"></a></h3>
-   <h3 class="heading settled" data-level="4.8" id="ocr_subsubsection"><span class="secno">4.8. </span><span class="content"><code>ocr_subsubsection</code></span><a class="self-link" href="#ocr_subsubsection"></a></h3>
-   <h3 class="heading settled" data-level="4.9" id="ocr_display"><span class="secno">4.9. </span><span class="content"><code>ocr_display</code></span><a class="self-link" href="#ocr_display"></a></h3>
-   <h3 class="heading settled" data-level="4.10" id="ocr_blockquote"><span class="secno">4.10. </span><span class="content"><code>ocr_blockquote</code></span><a class="self-link" href="#ocr_blockquote"></a></h3>
-   <h3 class="heading settled" data-level="4.11" id="ocr_par"><span class="secno">4.11. </span><span class="content"><code>ocr_par</code></span><a class="self-link" href="#ocr_par"></a></h3>
+   <h3 class="heading settled" data-level="4.1" id="ocr_document"><span class="secno">4.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_document">ocr_document</dfn></span><a class="self-link" href="#ocr_document"></a></h3>
+   <h3 class="heading settled" data-level="4.2" id="ocr_title"><span class="secno">4.2. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_title">ocr_title</dfn></span><a class="self-link" href="#ocr_title"></a></h3>
+   <h3 class="heading settled" data-level="4.3" id="ocr_author"><span class="secno">4.3. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_author">ocr_author</dfn></span><a class="self-link" href="#ocr_author"></a></h3>
+   <h3 class="heading settled" data-level="4.4" id="ocr_abstract"><span class="secno">4.4. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_abstract">ocr_abstract</dfn></span><a class="self-link" href="#ocr_abstract"></a></h3>
+   <h3 class="heading settled" data-level="4.5" id="ocr_part"><span class="secno">4.5. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_part">ocr_part</dfn></span><a class="self-link" href="#ocr_part"></a></h3>
+   <h3 class="heading settled" data-level="4.6" id="ocr_chapter"><span class="secno">4.6. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_chapter">ocr_chapter</dfn></span><a class="self-link" href="#ocr_chapter"></a></h3>
+   <h3 class="heading settled" data-level="4.7" id="ocr_section"><span class="secno">4.7. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_section">ocr_section</dfn></span><a class="self-link" href="#ocr_section"></a></h3>
+   <h3 class="heading settled" data-level="4.8" id="ocr_subsubsection"><span class="secno">4.8. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_subsubsection">ocr_subsubsection<a class="self-link" href="#elementdef-ocr_subsubsection"></a></dfn></span><a class="self-link" href="#ocr_subsubsection"></a></h3>
+   <h3 class="heading settled" data-level="4.9" id="ocr_display"><span class="secno">4.9. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_display">ocr_display</dfn></span><a class="self-link" href="#ocr_display"></a></h3>
+   <h3 class="heading settled" data-level="4.10" id="ocr_blockquote"><span class="secno">4.10. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_blockquote">ocr_blockquote</dfn></span><a class="self-link" href="#ocr_blockquote"></a></h3>
+   <h3 class="heading settled" data-level="4.11" id="ocr_par"><span class="secno">4.11. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_par">ocr_par</dfn></span><a class="self-link" href="#ocr_par"></a></h3>
    <p>These logical tags have their standard meaning as used in the publishing
 industry and tools like LaTeX, MS Word, and others.</p>
    <p>The standard HTML tags given in brackets specify the preferred HTML tags to use
 with those logical structuring elements, but it may not be possible or
 desirable to actually choose those tags (e.g., when adding hOCR information to
 an existing HTML output routine).</p>
-   <h3 class="heading settled" data-level="4.12" id="ocr_linear"><span class="secno">4.12. </span><span class="content"><code>ocr_linear</code></span><a class="self-link" href="#ocr_linear"></a></h3>
-   <p>For all of these elements except <code>ocr_linear</code>, there exists a natural linear
-ordering defined by reading order (<code>ocr_linear</code> indicates that the elements
-contained in it have a linear ordering). At the level of <code>ocr_linear</code>, there
-may not be a single distinguished order. A common example of <code>ocr_linear</code> is a
+   <h4 class="heading settled" data-level="4.11.1" id="ocr_linear"><span class="secno">4.11.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_linear">ocr_linear</dfn></span><a class="self-link" href="#ocr_linear"></a></h4>
+   <p>For all of these elements except <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-3">ocr_linear</a></code>, there exists a natural linear
+ordering defined by reading order (<code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-4">ocr_linear</a></code> indicates that the elements
+contained in it have a linear ordering). At the level of <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-5">ocr_linear</a></code>, there
+may not be a single distinguished order. A common example of <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-6">ocr_linear</a></code> is a
 newspaper, in which a single issue may contain many linear streams, but there is no
 unique reading order among those streams. OCR evaluation tools should
-therefore be sensitive to the order of all elements other than <code>ocr_linear</code>.</p>
+therefore be sensitive to the order of all elements other than <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-7">ocr_linear</a></code>.</p>
    <p>Tags must be nested as indicated by nesting above, but not all tags within the
 hierarchy need to be present.</p>
    <p>Textual information like section numbers and bullets must be represented as
 text inside the containing element.</p>
    <p>Documents whose logical structure does not map naturally onto these logical
 structuring elements must not use them for other purposes.</p>
-   <h3 class="heading settled" data-level="4.13" id="ocr_caption"><span class="secno">4.13. </span><span class="content"><code>ocr_caption</code></span><a class="self-link" href="#ocr_caption"></a></h3>
-   <p>Image captions may be indicated using the <code>ocr_caption</code> element; such an
+   <h3 class="heading settled" data-level="4.12" id="ocr_caption"><span class="secno">4.12. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_caption">ocr_caption</dfn></span><a class="self-link" href="#ocr_caption"></a></h3>
+   <p>Image captions may be indicated using the <code><a data-link-type="element" href="#elementdef-ocr_caption" id="ref-for-elementdef-ocr_caption-1">ocr_caption</a></code> element; such an
 element refers to the image(s) contained within the same float, or the
-immediately adjacent image if both the image and the <code>ocr_caption</code> element are
+immediately adjacent image if both the image and the <code><a data-link-type="element" href="#elementdef-ocr_caption" id="ref-for-elementdef-ocr_caption-2">ocr_caption</a></code> element are
 in running text.</p>
    <h2 class="heading settled" data-level="5" id="typesetting-related-elements"><span class="secno">5. </span><span class="content">Typesetting Related Elements</span><a class="self-link" href="#typesetting-related-elements"></a></h2>
    <p>The following typesetting related elements are based on a typesetting model as
@@ -1878,53 +1935,53 @@ <h2 class="heading settled" data-level="5" id="typesetting-related-elements"><sp
    <h3 class="heading settled" data-level="5.1" id="classes-for-typesetting-elements"><span class="secno">5.1. </span><span class="content">Classes for typesetting elements</span><a class="self-link" href="#classes-for-typesetting-elements"></a></h3>
    <p>The following classes, as well as <a href="#classes-for-floats">floats</a> are used for type-setting
 elements.</p>
-   <h4 class="heading settled" data-level="5.1.1" id="ocr_page"><span class="secno">5.1.1. </span><span class="content"><code>ocr_page</code></span><a class="self-link" href="#ocr_page"></a></h4>
-   <p>The <code>ocr_page</code> element must be present in all hOCR documents.</p>
-   <h4 class="heading settled" data-level="5.1.2" id="ocr_column"><span class="secno">5.1.2. </span><span class="content"><code>ocr_column</code></span><a class="self-link" href="#ocr_column"></a></h4>
+   <h4 class="heading settled" data-level="5.1.1" id="ocr_page"><span class="secno">5.1.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_page">ocr_page</dfn></span><a class="self-link" href="#ocr_page"></a></h4>
+   <p>The <code><a data-link-type="element" href="#elementdef-ocr_page" id="ref-for-elementdef-ocr_page-1">ocr_page</a></code> element must be present in all hOCR documents.</p>
+   <h4 class="heading settled" data-level="5.1.2" id="ocr_column"><span class="secno">5.1.2. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_column">ocr_column<a class="self-link" href="#elementdef-ocr_column"></a></dfn></span><a class="self-link" href="#ocr_column"></a></h4>
    <div class="annoying-warning">
      <strong>OBSOLETE</strong> 
-    <p>Please use <a href="#ocr_carea">§5.1.3 ocr_carea</a> instead</p>
+    <p>Please use <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-4">ocr_carea</a></code> instead</p>
    </div>
-   <h4 class="heading settled" data-level="5.1.3" id="ocr_carea"><span class="secno">5.1.3. </span><span class="content"><code>ocr_carea</code></span><a class="self-link" href="#ocr_carea"></a></h4>
+   <h4 class="heading settled" data-level="5.1.3" id="ocr_carea"><span class="secno">5.1.3. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_carea">ocr_carea</dfn></span><a class="self-link" href="#ocr_carea"></a></h4>
    <p>"ocr content area" or "body area"</p>
    <p>
     Used to be called 
     <del>ocr_column</del>
    </p>
-   <p>The <code>ocr_carea</code> elements should appear in reading order unless this is impossible
-because of some other structuring requirement. If the document contains multiple <code>ocr_linear</code> streams, then each <code>ocr_carea</code> must indicate which stream it belongs
+   <p>The <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-5">ocr_carea</a></code> elements should appear in reading order unless this is impossible
+because of some other structuring requirement. If the document contains multiple <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-8">ocr_linear</a></code> streams, then each <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-6">ocr_carea</a></code> must indicate which stream it belongs
 to.</p>
    <p>Note that for many documents, the actual ground truth careas are well-defined
 by the document style of the original document before printing and scanning.
 From a single page, the <code>careas</code> of the original document style cannot be
-recovered exactly. However, the partition of a document by <code>ocr_carea</code> for an
+recovered exactly. However, the partition of a document by <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-7">ocr_carea</a></code> for an
 individual page shall be considered correct relative to ground truth if</p>
    <ol>
     <li data-md="">
      <p>all the text contained in a ground truth carea is fully contained within a
-single <code>ocr_carea</code>,</p>
+single <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-8">ocr_carea</a></code>,</p>
     <li data-md="">
-     <p>no text outside a ground truth <code>carea</code> is contained within an <code>ocr_carea</code>, and</p>
+     <p>no text outside a ground truth <code>carea</code> is contained within an <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-9">ocr_carea</a></code>, and</p>
     <li data-md="">
-     <p>the <code>ocr_careas</code> appear in the same order as the text flow
+     <p>the <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-10">ocr_carea</a></code> elements appear in the same order as the text flow
 relationships between the ground truth careas.</p>
    </ol>
-   <h4 class="heading settled" data-level="5.1.4" id="ocr_line"><span class="secno">5.1.4. </span><span class="content"><code>ocr_line</code></span><a class="self-link" href="#ocr_line"></a></h4>
+   <h4 class="heading settled" data-level="5.1.4" id="ocr_line"><span class="secno">5.1.4. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_line">ocr_line</dfn></span><a class="self-link" href="#ocr_line"></a></h4>
    <p>In typesetting systems, content areas are filled with “blocks”, but most of
 those blocks are not recoverable or semantically meaningful. However, one type
 of block is visible and very important for OCR engines: the line. Lines are
 typesetting blocks that only contain glyphs (“inlines” in XSL terminology).
-They are represented by the <code>ocr_line</code> area.</p>
-   <p><code>ocr_line</code> should be in a <code>&lt;span></code></p>
-   <h4 class="heading settled" data-level="5.1.5" id="ocr_separator"><span class="secno">5.1.5. </span><span class="content"><code>ocr_separator</code></span><a class="self-link" href="#ocr_separator"></a></h4>
+They are represented by the <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-2">ocr_line</a></code> area.</p>
+   <p><code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-3">ocr_line</a></code> should be in a <code>&lt;span></code></p>
+   <h4 class="heading settled" data-level="5.1.5" id="ocr_separator"><span class="secno">5.1.5. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_separator">ocr_separator<a class="self-link" href="#elementdef-ocr_separator"></a></dfn></span><a class="self-link" href="#ocr_separator"></a></h4>
    <p>Any separator or similar element</p>
-   <h4 class="heading settled" data-level="5.1.6" id="ocr_noise"><span class="secno">5.1.6. </span><span class="content"><code>ocr_noise</code></span><a class="self-link" href="#ocr_noise"></a></h4>
+   <h4 class="heading settled" data-level="5.1.6" id="ocr_noise"><span class="secno">5.1.6. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_noise">ocr_noise<a class="self-link" href="#elementdef-ocr_noise"></a></dfn></span><a class="self-link" href="#ocr_noise"></a></h4>
    <p>Any noise element that isn’t part of typesetting</p>
    <h3 class="heading settled" data-level="5.2" id="recommended-properties-for-typesetting-elements"><span class="secno">5.2. </span><span class="content">Recommended Properties for typesetting elements</span><a class="self-link" href="#recommended-properties-for-typesetting-elements"></a></h3>
    <p>The following properties should be present:</p>
    <h4 class="heading settled" data-level="5.2.1" id="bbox-typesetting"><span class="secno">5.2.1. </span><span class="content"><code>bbox (typesetting)</code></span><a class="self-link" href="#bbox-typesetting"></a></h4>
    <p>The bounding box of the page; for pages, the top left corner must be at <code>(0,0)</code>, so a typical page bounding box will look like <code>bbox 0 0 2300 3200</code></p>
-   <h4 class="heading settled" data-level="5.2.2" id="image"><span class="secno">5.2.2. </span><span class="content"><code>image</code></span><a class="self-link" href="#image"></a></h4>
+   <h4 class="heading settled" data-level="5.2.2" id="image"><span class="secno">5.2.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-image">image<a class="self-link" href="#propdef-image"></a></dfn></span><a class="self-link" href="#image"></a></h4>
    <p><code>image imagefile</code></p>
    <ul>
     <li data-md="">
@@ -1940,7 +1997,7 @@ <h4 class="heading settled" data-level="5.2.2" id="image"><span class="secno">5.
      <p>if the hOCR file is present in a directory hierarchy or file archive, should
 resolve to the corresponding image file</p>
    </ul>
-   <h4 class="heading settled" data-level="5.2.3" id="imagemd5"><span class="secno">5.2.3. </span><span class="content"><code>imagemd5</code></span><a class="self-link" href="#imagemd5"></a></h4>
+   <h4 class="heading settled" data-level="5.2.3" id="imagemd5"><span class="secno">5.2.3. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-imagemd5">imagemd5<a class="self-link" href="#propdef-imagemd5"></a></dfn></span><a class="self-link" href="#imagemd5"></a></h4>
    <p><code>imagemd5 checksum</code></p>
    <ul>
     <li data-md="">
@@ -1948,7 +2005,7 @@ <h4 class="heading settled" data-level="5.2.3" id="imagemd5"><span class="secno"
     <li data-md="">
      <p>allows re-associating pages with source images</p>
    </ul>
-   <h4 class="heading settled" data-level="5.2.4" id="ppageno"><span class="secno">5.2.4. </span><span class="content"><code>ppageno</code></span><a class="self-link" href="#ppageno"></a></h4>
+   <h4 class="heading settled" data-level="5.2.4" id="ppageno"><span class="secno">5.2.4. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-ppageno">ppageno<a class="self-link" href="#propdef-ppageno"></a></dfn></span><a class="self-link" href="#ppageno"></a></h4>
    <p><code>ppageno n</code></p>
    <ul>
     <li data-md="">
@@ -1962,7 +2019,7 @@ <h4 class="heading settled" data-level="5.2.4" id="ppageno"><span class="secno">
     <li data-md="">
      <p>must not be present unless it is well defined and unique</p>
    </ul>
-   <h4 class="heading settled" data-level="5.2.5" id="lpageno"><span class="secno">5.2.5. </span><span class="content"><code>lpageno</code></span><a class="self-link" href="#lpageno"></a></h4>
+   <h4 class="heading settled" data-level="5.2.5" id="lpageno"><span class="secno">5.2.5. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-lpageno">lpageno<a class="self-link" href="#propdef-lpageno"></a></dfn></span><a class="self-link" href="#lpageno"></a></h4>
    <p><code>lpageno string</code></p>
    <ul>
     <li data-md="">
@@ -1976,19 +2033,19 @@ <h4 class="heading settled" data-level="5.2.5" id="lpageno"><span class="secno">
    </ul>
    <h3 class="heading settled" data-level="5.3" id="optional-properties-for-typesetting-elements"><span class="secno">5.3. </span><span class="content">Optional Properties for typesetting elements</span><a class="self-link" href="#optional-properties-for-typesetting-elements"></a></h3>
    <p>The following properties MAY be present:</p>
-   <h4 class="heading settled" data-level="5.3.1" id="scan_res"><span class="secno">5.3.1. </span><span class="content"><code>scan_res</code></span><a class="self-link" href="#scan_res"></a></h4>
+   <h4 class="heading settled" data-level="5.3.1" id="scan_res"><span class="secno">5.3.1. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-scan_res">scan_res<a class="self-link" href="#propdef-scan_res"></a></dfn></span><a class="self-link" href="#scan_res"></a></h4>
    <p><code>scan_res x_res y_res</code></p>
    <ul>
     <li data-md="">
      <p>scanning resolution in DPI</p>
    </ul>
-   <h4 class="heading settled" data-level="5.3.2" id="x_scanner"><span class="secno">5.3.2. </span><span class="content"><code>x_scanner</code></span><a class="self-link" href="#x_scanner"></a></h4>
+   <h4 class="heading settled" data-level="5.3.2" id="x_scanner"><span class="secno">5.3.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-x_scanner">x_scanner<a class="self-link" href="#propdef-x_scanner"></a></dfn></span><a class="self-link" href="#x_scanner"></a></h4>
    <p><code>x_scanner string</code></p>
    <ul>
     <li data-md="">
      <p>a representation of the scanner</p>
    </ul>
-   <h4 class="heading settled" data-level="5.3.3" id="x_source"><span class="secno">5.3.3. </span><span class="content"><code>x_source</code></span><a class="self-link" href="#x_source"></a></h4>
+   <h4 class="heading settled" data-level="5.3.3" id="x_source"><span class="secno">5.3.3. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-x_source">x_source<a class="self-link" href="#propdef-x_source"></a></dfn></span><a class="self-link" href="#x_source"></a></h4>
    <p><code>x_source string</code></p>
    <ul>
     <li data-md="">
@@ -2008,8 +2065,8 @@ <h4 class="heading settled" data-level="5.3.3" id="x_source"><span class="secno"
      </ul>
    </ul>
    <p>In addition to the standard
-properties, the <code>ocr_line</code> area supports the following additional properties:</p>
-   <h4 class="heading settled" data-level="5.3.4" id="hardbreak"><span class="secno">5.3.4. </span><span class="content"><code>hardbreak</code></span><a class="self-link" href="#hardbreak"></a></h4>
+properties, the <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-4">ocr_line</a></code> area supports the following additional properties:</p>
+   <h4 class="heading settled" data-level="5.3.4" id="hardbreak"><span class="secno">5.3.4. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-hardbreak">hardbreak<a class="self-link" href="#propdef-hardbreak"></a></dfn></span><a class="self-link" href="#hardbreak"></a></h4>
    <p><code>hardbreak n</code></p>
    <ul>
     <li data-md="">
@@ -2019,67 +2076,67 @@ <h4 class="heading settled" data-level="5.3.4" id="hardbreak"><span class="secno
      <p>a one indicates that the line is a hard (explicit) line break</p>
    </ul>
    <p>Any special characters representing the desired end-of-line processing must be
-present inside the <code>ocr_line</code> element. Examples of such special characters are a
+present inside the <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-5">ocr_line</a></code> element. Examples of such special characters are a
 soft hyphen ("­", <code>U+00AD</code>), a hard line break (<code>&lt;br></code>), or whitespace (<code></code>) for soft
 line breaks.</p>
    <h3 class="heading settled" data-level="5.4" id="classes-for-floats"><span class="secno">5.4. </span><span class="content">Classes for floats</span><a class="self-link" href="#classes-for-floats"></a></h3>
    <p>Floats should not be nested.</p>
    <p>The following floats are defined:</p>
-   <h4 class="heading settled" data-level="5.4.1" id="ocr_float"><span class="secno">5.4.1. </span><span class="content"><code>ocr_float</code></span><a class="self-link" href="#ocr_float"></a></h4>
+   <h4 class="heading settled" data-level="5.4.1" id="ocr_float"><span class="secno">5.4.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_float">ocr_float</dfn></span><a class="self-link" href="#ocr_float"></a></h4>
    <p><code>ocr_float</code></p>
-   <h4 class="heading settled" data-level="5.4.2" id="ocr_separator0"><span class="secno">5.4.2. </span><span class="content"><code>ocr_separator</code></span><a class="self-link" href="#ocr_separator0"></a></h4>
-   <p><code>ocr_separator</code></p>
-   <h4 class="heading settled" data-level="5.4.3" id="ocr_textfloat"><span class="secno">5.4.3. </span><span class="content"><code>ocr_textfloat</code></span><a class="self-link" href="#ocr_textfloat"></a></h4>
+   <h4 class="heading settled" data-level="5.4.2" id="ocr_separator0"><span class="secno">5.4.2. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_separator0">ocr_separator<a class="self-link" href="#elementdef-ocr_separator0"></a></dfn></span><a class="self-link" href="#ocr_separator0"></a></h4>
+   <p><code>ocr_separator</code> in the context of float classes.</p>
+   <h4 class="heading settled" data-level="5.4.3" id="ocr_textfloat"><span class="secno">5.4.3. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_textfloat">ocr_textfloat<a class="self-link" href="#elementdef-ocr_textfloat"></a></dfn></span><a class="self-link" href="#ocr_textfloat"></a></h4>
    <p><code>ocr_textfloat</code></p>
-   <h4 class="heading settled" data-level="5.4.4" id="ocr_textimage"><span class="secno">5.4.4. </span><span class="content"><code>ocr_textimage</code></span><a class="self-link" href="#ocr_textimage"></a></h4>
+   <h4 class="heading settled" data-level="5.4.4" id="ocr_textimage"><span class="secno">5.4.4. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_textimage">ocr_textimage<a class="self-link" href="#elementdef-ocr_textimage"></a></dfn></span><a class="self-link" href="#ocr_textimage"></a></h4>
    <p><code>ocr_textimage</code></p>
-   <h4 class="heading settled" data-level="5.4.5" id="ocr_image"><span class="secno">5.4.5. </span><span class="content"><code>ocr_image</code></span><a class="self-link" href="#ocr_image"></a></h4>
+   <h4 class="heading settled" data-level="5.4.5" id="ocr_image"><span class="secno">5.4.5. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_image">ocr_image<a class="self-link" href="#elementdef-ocr_image"></a></dfn></span><a class="self-link" href="#ocr_image"></a></h4>
    <p><code>ocr_image</code></p>
-   <h4 class="heading settled" data-level="5.4.6" id="ocr_linedrawing"><span class="secno">5.4.6. </span><span class="content"><code>ocr_linedrawing</code></span><a class="self-link" href="#ocr_linedrawing"></a></h4>
+   <h4 class="heading settled" data-level="5.4.6" id="ocr_linedrawing"><span class="secno">5.4.6. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_linedrawing">ocr_linedrawing<a class="self-link" href="#elementdef-ocr_linedrawing"></a></dfn></span><a class="self-link" href="#ocr_linedrawing"></a></h4>
    <p>Something that could be represented well and naturally in a vector graphics
 format like SVG (even if it is actually represented as PNG)</p>
-   <h4 class="heading settled" data-level="5.4.7" id="ocr_photo"><span class="secno">5.4.7. </span><span class="content"><code>ocr_photo</code></span><a class="self-link" href="#ocr_photo"></a></h4>
+   <h4 class="heading settled" data-level="5.4.7" id="ocr_photo"><span class="secno">5.4.7. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_photo">ocr_photo<a class="self-link" href="#elementdef-ocr_photo"></a></dfn></span><a class="self-link" href="#ocr_photo"></a></h4>
    <p>Something that requires JPEG or PNG to be represented well</p>
-   <h4 class="heading settled" data-level="5.4.8" id="ocr_header"><span class="secno">5.4.8. </span><span class="content"><code>ocr_header</code></span><a class="self-link" href="#ocr_header"></a></h4>
+   <h4 class="heading settled" data-level="5.4.8" id="ocr_header"><span class="secno">5.4.8. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_header">ocr_header<a class="self-link" href="#elementdef-ocr_header"></a></dfn></span><a class="self-link" href="#ocr_header"></a></h4>
    <p><code>ocr_header</code></p>
-   <h4 class="heading settled" data-level="5.4.9" id="ocr_footer"><span class="secno">5.4.9. </span><span class="content"><code>ocr_footer</code></span><a class="self-link" href="#ocr_footer"></a></h4>
+   <h4 class="heading settled" data-level="5.4.9" id="ocr_footer"><span class="secno">5.4.9. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_footer">ocr_footer<a class="self-link" href="#elementdef-ocr_footer"></a></dfn></span><a class="self-link" href="#ocr_footer"></a></h4>
    <p><code>ocr_footer</code></p>
-   <h4 class="heading settled" data-level="5.4.10" id="ocr_pageno"><span class="secno">5.4.10. </span><span class="content"><code>ocr_pageno</code></span><a class="self-link" href="#ocr_pageno"></a></h4>
+   <h4 class="heading settled" data-level="5.4.10" id="ocr_pageno"><span class="secno">5.4.10. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_pageno">ocr_pageno<a class="self-link" href="#elementdef-ocr_pageno"></a></dfn></span><a class="self-link" href="#ocr_pageno"></a></h4>
    <p><code>ocr_pageno</code></p>
-   <h4 class="heading settled" data-level="5.4.11" id="ocr_table"><span class="secno">5.4.11. </span><span class="content"><code>ocr_table</code></span><a class="self-link" href="#ocr_table"></a></h4>
+   <h4 class="heading settled" data-level="5.4.11" id="ocr_table"><span class="secno">5.4.11. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_table">ocr_table<a class="self-link" href="#elementdef-ocr_table"></a></dfn></span><a class="self-link" href="#ocr_table"></a></h4>
    <p><code>ocr_table</code></p>
    <h2 class="heading settled" data-level="6" id="inline-representations"><span class="secno">6. </span><span class="content">Inline Representations</span><a class="self-link" href="#inline-representations"></a></h2>
    <p>There is some content that should behave and flow like text</p>
    <h3 class="heading settled" data-level="6.1" id="classes-for-inline-representation"><span class="secno">6.1. </span><span class="content">Classes for Inline Representation</span><a class="self-link" href="#classes-for-inline-representation"></a></h3>
-   <h4 class="heading settled" data-level="6.1.1" id="ocr_glyph"><span class="secno">6.1.1. </span><span class="content"><code>ocr_glyph</code></span><a class="self-link" href="#ocr_glyph"></a></h4>
+   <h4 class="heading settled" data-level="6.1.1" id="ocr_glyph"><span class="secno">6.1.1. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_glyph">ocr_glyph<a class="self-link" href="#elementdef-ocr_glyph"></a></dfn></span><a class="self-link" href="#ocr_glyph"></a></h4>
    <p>An individual glyph represented as an image (e.g., an unrecognized character)</p>
    <p>Must contain a single <code>&lt;img></code> tag, or be present on one</p>
-   <h4 class="heading settled" data-level="6.1.2" id="ocr_glyphs"><span class="secno">6.1.2. </span><span class="content"><code>ocr_glyphs</code></span><a class="self-link" href="#ocr_glyphs"></a></h4>
+   <h4 class="heading settled" data-level="6.1.2" id="ocr_glyphs"><span class="secno">6.1.2. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_glyphs">ocr_glyphs<a class="self-link" href="#elementdef-ocr_glyphs"></a></dfn></span><a class="self-link" href="#ocr_glyphs"></a></h4>
    <p>Multiple glyphs represented as an image (e.g., an unrecognized word)</p>
    <p>Must contain a single <code>&lt;img></code> tag, or be present on one</p>
-   <h4 class="heading settled" data-level="6.1.3" id="ocr_dropcap"><span class="secno">6.1.3. </span><span class="content"><code>ocr_dropcap</code></span><a class="self-link" href="#ocr_dropcap"></a></h4>
+   <h4 class="heading settled" data-level="6.1.3" id="ocr_dropcap"><span class="secno">6.1.3. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_dropcap">ocr_dropcap<a class="self-link" href="#elementdef-ocr_dropcap"></a></dfn></span><a class="self-link" href="#ocr_dropcap"></a></h4>
    <p>An individual glyph representing a dropcap</p>
    <p>May contain text or an <code>&lt;img></code> tag; the <code>alt</code> of the image tag should contain
 the corresponding text</p>
-   <h4 class="heading settled" data-level="6.1.4" id="ocr_chem"><span class="secno">6.1.4. </span><span class="content"><code>ocr_chem</code></span><a class="self-link" href="#ocr_chem"></a></h4>
+   <h4 class="heading settled" data-level="6.1.4" id="ocr_chem"><span class="secno">6.1.4. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_chem">ocr_chem<a class="self-link" href="#elementdef-ocr_chem"></a></dfn></span><a class="self-link" href="#ocr_chem"></a></h4>
    <p>A chemical formula</p>
    <p>Must contain either a single <code>&lt;img></code> tag or <a data-link-type="biblio" href="#biblio-cml">[CML]</a> markup, or be present on
 one</p>
-   <h4 class="heading settled" data-level="6.1.5" id="ocr_math"><span class="secno">6.1.5. </span><span class="content"><code>ocr_math</code></span><a class="self-link" href="#ocr_math"></a></h4>
+   <h4 class="heading settled" data-level="6.1.5" id="ocr_math"><span class="secno">6.1.5. </span><span class="content"><dfn data-dfn-type="element" data-export="" id="elementdef-ocr_math">ocr_math<a class="self-link" href="#elementdef-ocr_math"></a></dfn></span><a class="self-link" href="#ocr_math"></a></h4>
    <p>A mathematical formula</p>
    <p>Must contain either a single <code>&lt;img></code> tag or <a data-link-type="biblio" href="#biblio-mathml">[MathML]</a> markup, or be present on
 one</p>
-   <p>Mathematical and chemical formulas that float must be put into an <code>ocr_float</code> section.</p>
+   <p>Mathematical and chemical formulas that float must be put into an <code><a data-link-type="element" href="#elementdef-ocr_float" id="ref-for-elementdef-ocr_float-1">ocr_float</a></code> section.</p>
    <p>Mathematical and chemical formulas that are “display” mode should be put into
-an <code>ocr_display</code> section.</p>
+an <code><a data-link-type="element" href="#elementdef-ocr_display" id="ref-for-elementdef-ocr_display-2">ocr_display</a></code> section.</p>
    <h4 class="heading settled" data-level="6.1.6" id="non-breaking-space"><span class="secno">6.1.6. </span><span class="content">Non-breaking space</span><a class="self-link" href="#non-breaking-space"></a></h4>
    <p>Non-breaking spaces must be represented using the HTML <code>&amp;nbsp;</code> entity.</p>
    <h4 class="heading settled" data-level="6.1.7" id="non-default-spaces"><span class="secno">6.1.7. </span><span class="content">Non-default spaces</span><a class="self-link" href="#non-default-spaces"></a></h4>
    <p>Different space widths should be indicated using HTML and <code>&amp;ensp;</code>, <code>&amp;emsp;</code>, <code>&amp;thinsp;</code>, <code>&amp;zwnj;</code>, <code>&amp;zwj;</code>.</p>
    <h4 class="heading settled" data-level="6.1.8" id="hyphenation"><span class="secno">6.1.8. </span><span class="content">Hyphenation</span><a class="self-link" href="#hyphenation"></a></h4>
    <p>Soft hyphens must be represented using the HTML <code>&amp;shy;</code> entity.</p>
-   <p>The HTML <code>&amp;lrm;</code> and <code>&amp;rlm;</code> entities (indicating writing direction) must not
-be used; all writing direction changes must be indicated with tags.</p>
+   <p>The HTML <a href="https://www.w3.org/TR/REC-html40/struct/dirlang.html#h-8.2.5"><code>&amp;lrm;</code> and <code>&amp;rlm;</code> entities</a> (indicating writing direction) must not be used; all
+writing direction changes must be indicated with tags.</p>
    <h4 class="heading settled" data-level="6.1.9" id="superscript-and-subscript"><span class="secno">6.1.9. </span><span class="content">Superscript and Subscript</span><a class="self-link" href="#superscript-and-subscript"></a></h4>
    <p>Other superscripts and subscripts must be represented using the HTML <code>&lt;sup></code> and <code>&lt;sub></code> tags, even if special Unicode characters are available.</p>
    <h4 class="heading settled" data-level="6.1.10" id="ruby-characters"><span class="secno">6.1.10. </span><span class="content">Ruby characters</span><a class="self-link" href="#ruby-characters"></a></h4>
@@ -2088,18 +2145,18 @@ <h2 class="heading settled" data-level="7" id="character-information"><span clas
    <h3 class="heading settled" data-level="7.1" id="classes-for-character-information"><span class="secno">7.1. </span><span class="content">Classes for Character Information</span><a class="self-link" href="#classes-for-character-information"></a></h3>
    <p>Character-level information may be put on any element that contains only a
 single "line" of text.</p>
-   <h4 class="heading settled" data-level="7.1.1" id="ocr_cinfo"><span class="secno">7.1.1. </span><span class="content"><code>ocr_cinfo</code></span><a class="self-link" href="#ocr_cinfo"></a></h4>
-   <p>If no other layout element applies, the <code>ocr_cinfo</code> element may be used.</p>
+   <h4 class="heading settled" data-level="7.1.1" id="ocr_cinfo"><span class="secno">7.1.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocr_cinfo">ocr_cinfo</dfn></span><a class="self-link" href="#ocr_cinfo"></a></h4>
+   <p>If no other layout element applies, the <code><a data-link-type="element" href="#elementdef-ocr_cinfo" id="ref-for-elementdef-ocr_cinfo-1">ocr_cinfo</a></code> element may be used.</p>
    <h3 class="heading settled" data-level="7.2" id="properties-for-character-information"><span class="secno">7.2. </span><span class="content">Properties for Character Information</span><a class="self-link" href="#properties-for-character-information"></a></h3>
-   <h4 class="heading settled" data-level="7.2.1" id="cuts"><span class="secno">7.2.1. </span><span class="content"><code>cuts</code></span><a class="self-link" href="#cuts"></a></h4>
+   <h4 class="heading settled" data-level="7.2.1" id="cuts"><span class="secno">7.2.1. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-cuts">cuts</dfn></span><a class="self-link" href="#cuts"></a></h4>
    <p><code>cuts c1 c2 c3 ...</code></p>
    <ul>
     <li data-md="">
      <p>character segmentation cuts (see below)</p>
     <li data-md="">
-     <p>there must be a bbox property relative to which the cuts can be interpreted</p>
+     <p>there must be a <a class="property" data-link-type="propdesc" href="#propdef-bbox" id="ref-for-propdef-bbox-4">bbox</a> property relative to which the <a class="property" data-link-type="propdesc" href="#propdef-cuts" id="ref-for-propdef-cuts-1">cuts</a> can be interpreted</p>
    </ul>
-   <h4 class="heading settled" data-level="7.2.2" id="nlp"><span class="secno">7.2.2. </span><span class="content"><code>nlp</code></span><a class="self-link" href="#nlp"></a></h4>
+   <h4 class="heading settled" data-level="7.2.2" id="nlp"><span class="secno">7.2.2. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-nlp">nlp</dfn></span><a class="self-link" href="#nlp"></a></h4>
    <p><code>nlp c1 c2 c3 ...</code></p>
    <ul>
     <li data-md="">
@@ -2112,12 +2169,12 @@ <h4 class="heading settled" data-level="7.2.2" id="nlp"><span class="secno">7.2.
    <div class="example" id="example-382eb02f">
     <a class="self-link" href="#example-382eb02f"></a> 
     <p>Assume a bounding box of <code>(0,0,300,100)</code>; then</p>
-<pre class="language-python highlight"><span class="n">cuts</span><span class="p">(</span><span class="s">"</span><span class="s">10 11 7 19</span><span class="s">"</span><span class="p">)</span> <span class="o">=</span>
+<pre class="language-python highlight"><span class="n">cuts</span><span class="p">(</span><span class="s2">"</span><span class="s2">10 11 7 19</span><span class="s2">"</span><span class="p">)</span> <span class="o">=</span>
     <span class="p">[</span> <span class="p">[</span><span class="p">(</span><span class="mi">10</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">10</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span><span class="p">,</span> <span class="p">[</span><span class="p">(</span><span class="mi">21</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">21</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span><span class="p">,</span> <span class="p">[</span><span class="p">(</span><span class="mi">28</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">28</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span><span class="p">,</span> <span class="p">[</span><span class="p">(</span><span class="mi">47</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">47</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span> <span class="p">]</span>
-<span class="n">cuts</span><span class="p">(</span><span class="s">"</span><span class="s">10,50,3 11,30,-3</span><span class="s">"</span><span class="p">)</span> <span class="o">=</span>
+<span class="n">cuts</span><span class="p">(</span><span class="s2">"</span><span class="s2">10,50,3 11,30,-3</span><span class="s2">"</span><span class="p">)</span> <span class="o">=</span>
     <span class="p">[</span> <span class="p">[</span><span class="p">(</span><span class="mi">10</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">10</span><span class="p">,</span><span class="mi">50</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">13</span><span class="p">,</span><span class="mi">50</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">13</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span><span class="p">,</span> <span class="p">[</span><span class="p">(</span><span class="mi">21</span><span class="p">,</span><span class="mi">0</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">21</span><span class="p">,</span><span class="mi">30</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">18</span><span class="p">,</span><span class="mi">30</span><span class="p">)</span><span class="p">,</span><span class="p">(</span><span class="mi">18</span><span class="p">,</span><span class="mi">100</span><span class="p">)</span><span class="p">]</span> <span class="p">]</span>
 </pre>
-<pre class="language-html highlight"><span class="nt">&lt;span</span> <span class="na">class=</span><span class="s">"ocr_cinfo"</span> <span class="na">title=</span><span class="s">"bbox 0 0 300 100; nlp 1.7 2.3 3.9 2.7; cuts 9 11 7,8,-2 15 3"</span><span class="nt">></span>hello<span class="nt">&lt;/span></span>
+<pre class="language-html highlight"><span class="p">&lt;</span><span class="nt">span</span> <span class="na">class</span><span class="o">=</span><span class="s">"ocr_cinfo"</span> <span class="na">title</span><span class="o">=</span><span class="s">"bbox 0 0 300 100; nlp 1.7 2.3 3.9 2.7; cuts 9 11 7,8,-2 15 3"</span><span class="p">></span>hello<span class="p">&lt;</span><span class="p">/</span><span class="nt">span</span><span class="p">></span>
 </pre>
    </div>
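As a rough illustration of the example above, the following Python sketch expands a `cuts` value into paths. It is not part of the specification: the function name `parse_cuts` is hypothetical, coordinates are taken relative to the bounding box, and the first delta of each cut is assumed to be measured from the starting x of the previous cut, which is what the numbers in the example imply.

```python
def parse_cuts(spec, height):
    """Expand a cuts string such as "10,50,3 11,30,-3" into paths.

    Hypothetical helper, not defined by the spec. Each path is a list of
    (x, y) points running from y = 0 to y = height, relative to the bbox.
    """
    paths = []
    start_x = 0  # starting x of the previous cut (assumption, see lead-in)
    for token in spec.split():
        deltas = [int(value) for value in token.split(",")]
        start_x += deltas[0]
        x, y = start_x, 0
        path = [(x, y)]
        # After the first delta, values alternate: move in y, then move in x.
        for index, delta in enumerate(deltas[1:]):
            if index % 2 == 0:
                y += delta
            else:
                x += delta
            path.append((x, y))
        # Close the path at the far edge of the bounding box.
        if path[-1] != (x, height):
            path.append((x, height))
        paths.append(path)
    return paths


# Bounding box (0, 0, 300, 100) has a height of 100.
simple = parse_cuts("10 11 7 19", 100)
stepped = parse_cuts("10,50,3 11,30,-3", 100)
```

With a bounding box height of 100, both calls give the same point lists as the two `cuts(...)` lines in the example.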
    <p>Cuts are between all codepoints contained within the element, including any
@@ -2135,7 +2192,7 @@ <h2 class="heading settled" data-level="8" id="ocr-engine-specific-markup"><span
 existing OCR output, say for workflow abstractions.</p>
    <p>Common suggested engine-specific markup are:</p>
    <h3 class="heading settled" data-level="8.1" id="classes-for-engine-specific-markup"><span class="secno">8.1. </span><span class="content">Classes for engine specific markup</span><a class="self-link" href="#classes-for-engine-specific-markup"></a></h3>
-   <h4 class="heading settled" data-level="8.1.1" id="ocrx_block"><span class="secno">8.1.1. </span><span class="content"><code>ocrx_block</code></span><a class="self-link" href="#ocrx_block"></a></h4>
+   <h4 class="heading settled" data-level="8.1.1" id="ocrx_block"><span class="secno">8.1.1. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocrx_block">ocrx_block</dfn></span><a class="self-link" href="#ocrx_block"></a></h4>
    <p class="issue" id="issue-66c198d9"><a class="self-link" href="#issue-66c198d9"></a> <a href="https://github.com/kba/hocr-spec/issues/28">ocr_carea vs ocrx_block</a></p>
    <ul>
     <li data-md="">
@@ -2143,15 +2200,15 @@ <h4 class="heading settled" data-level="8.1.1" id="ocrx_block"><span class="secn
     <li data-md="">
      <p>engine-specific because the definition of a "block" depends on the engine</p>
    </ul>
-   <h4 class="heading settled" data-level="8.1.2" id="ocrx_line"><span class="secno">8.1.2. </span><span class="content"><code>ocrx_line</code></span><a class="self-link" href="#ocrx_line"></a></h4>
+   <h4 class="heading settled" data-level="8.1.2" id="ocrx_line"><span class="secno">8.1.2. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocrx_line">ocrx_line</dfn></span><a class="self-link" href="#ocrx_line"></a></h4>
    <p class="issue" id="issue-8ef34561"><a class="self-link" href="#issue-8ef34561"></a> <a href="https://github.com/kba/hocr-spec/issues/19">ocr_line vs ocrx_line</a></p>
    <ul>
     <li data-md="">
-     <p>any kind of "line" returned by an OCR system that differs from the standard ocr_line above</p>
+     <p>any kind of "line" returned by an OCR system that differs from the standard <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-6">ocr_line</a></code> above</p>
     <li data-md="">
      <p>might be some kind of "logical" line</p>
    </ul>
-   <h4 class="heading settled" data-level="8.1.3" id="ocrx_word"><span class="secno">8.1.3. </span><span class="content"><code>ocrx_word</code></span><a class="self-link" href="#ocrx_word"></a></h4>
+   <h4 class="heading settled" data-level="8.1.3" id="ocrx_word"><span class="secno">8.1.3. </span><span class="content"><dfn class="dfn-paneled" data-dfn-type="element" data-export="" id="elementdef-ocrx_word">ocrx_word</dfn></span><a class="self-link" href="#ocrx_word"></a></h4>
    <ul>
     <li data-md="">
      <p>any kind of "word" returned by an OCR system</p>
@@ -2162,47 +2219,48 @@ <h4 class="heading settled" data-level="8.1.3" id="ocrx_word"><span class="secno
 attempt to ensure the following properties:</p>
    <ul>
     <li data-md="">
-     <p>an <code>ocrx_block</code> should not contain content from multiple ocr_careas</p>
+     <p>An <code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-2">ocrx_block</a></code> should not contain content from multiple <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-11">ocr_carea</a></code> elements.</p>
     <li data-md="">
-     <p>the union of all <code>ocrx_blocks</code> should approximately cover all <code>ocr_careas</code></p>
+     <p>The union of all <code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-3">ocrx_blocks</a></code> should approximately cover all <code><a data-link-type="element" href="#elementdef-ocr_carea" id="ref-for-elementdef-ocr_carea-12">ocr_carea</a></code> elements.</p>
     <li data-md="">
-     <p>an <code>ocrx_block</code> should contain either a float or body text, but not both</p>
+     <p>an <code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-4">ocrx_block</a></code> should contain either a float or body text, but not both</p>
     <li data-md="">
-     <p>an <code>ocrx_block</code> should contain either an image or text, but not both</p>
+     <p>an <code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-5">ocrx_block</a></code> should contain either an image or text, but not both</p>
     <li data-md="">
-     <p>an <code>ocrx_line</code> should correspond as closely as possible to an <code>ocr_line</code></p>
+     <p>an <code><a data-link-type="element" href="#elementdef-ocrx_line" id="ref-for-elementdef-ocrx_line-1">ocrx_line</a></code> should correspond as closely as possible to an <code><a data-link-type="element" href="#elementdef-ocr_line" id="ref-for-elementdef-ocr_line-7">ocr_line</a></code></p>
     <li data-md="">
-     <p><code>ocrx_cinfo</code> should nest inside <code>ocrx_line</code></p>
+     <p><code><a data-link-type="element">ocrx_cinfo</a></code> should nest inside <code><a data-link-type="element" href="#elementdef-ocrx_line" id="ref-for-elementdef-ocrx_line-2">ocrx_line</a></code></p>
     <li data-md="">
-     <p><code>ocrx_cinfo</code> should contain only <code>x_conf</code>, <code>x_bboxes</code>, and <code>cuts</code> attributes</p>
+     <p><code><a data-link-type="element">ocrx_cinfo</a></code> should contain only <a class="property" data-link-type="propdesc" href="#propdef-x_confs" id="ref-for-propdef-x_confs-1">x_confs</a>, <a class="property" data-link-type="propdesc" href="#propdef-x_bboxes" id="ref-for-propdef-x_bboxes-2">x_bboxes</a>, and <a class="property" data-link-type="propdesc" href="#propdef-cuts" id="ref-for-propdef-cuts-2">cuts</a> attributes</p>
    </ul>
+   <p class="issue" id="issue-000a0ed5"><a class="self-link" href="#issue-000a0ed5"></a> ocrx_cinfo?</p>
    <h3 class="heading settled" data-level="8.2" id="properties-for-engine-specific-markup"><span class="secno">8.2. </span><span class="content">Properties for engine-specific markup</span><a class="self-link" href="#properties-for-engine-specific-markup"></a></h3>
    <p>The following properties are defined:</p>
-   <h4 class="heading settled" data-level="8.2.1" id="x_font"><span class="secno">8.2.1. </span><span class="content"><code>x_font</code></span><a class="self-link" href="#x_font"></a></h4>
+   <h4 class="heading settled" data-level="8.2.1" id="x_font"><span class="secno">8.2.1. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-x_font">x_font<a class="self-link" href="#propdef-x_font"></a></dfn></span><a class="self-link" href="#x_font"></a></h4>
    <p><code>x_font s</code></p>
    <ul>
     <li data-md="">
      <p>OCR-engine specific font names</p>
    </ul>
-   <h4 class="heading settled" data-level="8.2.2" id="x_fsize"><span class="secno">8.2.2. </span><span class="content"><code>x_fsize</code></span><a class="self-link" href="#x_fsize"></a></h4>
+   <h4 class="heading settled" data-level="8.2.2" id="x_fsize"><span class="secno">8.2.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-x_fsize">x_fsize<a class="self-link" href="#propdef-x_fsize"></a></dfn></span><a class="self-link" href="#x_fsize"></a></h4>
    <p><code>x_fsize n</code></p>
    <ul>
     <li data-md="">
      <p>OCR-engine specific font size</p>
    </ul>
-   <h4 class="heading settled" data-level="8.2.3" id="x_bboxes"><span class="secno">8.2.3. </span><span class="content"><code>x_bboxes</code></span><a class="self-link" href="#x_bboxes"></a></h4>
+   <h4 class="heading settled" data-level="8.2.3" id="x_bboxes"><span class="secno">8.2.3. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-x_bboxes">x_bboxes</dfn></span><a class="self-link" href="#x_bboxes"></a></h4>
    <p><code>x_bboxes b1x0 b1y0 b1x1 b1y1 b2x0 b2y0 b2x1 b2y1 ...</code></p>
    <ul>
     <li data-md="">
      <p>OCR-engine specific boxes associated with each codepoint contained in the
 element</p>
     <li data-md="">
-     <p>note that the bbox property is a property for the bounding box of a layout
+     <p>note that the <a class="property" data-link-type="propdesc" href="#propdef-bbox" id="ref-for-propdef-bbox-5">bbox</a> property is a property for the bounding box of a layout
 element, not of individual characters</p>
     <li data-md="">
      <p>in particular, use <code>&lt;span class="ocr_cinfo" title="x_bboxes ...."></code>, not <code>&lt;span class="ocr_cinfo" title="bbox ..."></code></p>
    </ul>
-   <h4 class="heading settled" data-level="8.2.4" id="x_confs"><span class="secno">8.2.4. </span><span class="content"><code>x_confs</code></span><a class="self-link" href="#x_confs"></a></h4>
+   <h4 class="heading settled" data-level="8.2.4" id="x_confs"><span class="secno">8.2.4. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-x_confs">x_confs</dfn></span><a class="self-link" href="#x_confs"></a></h4>
    <p><code>x_confs c1 c2 c3 ...</code></p>
    <ul>
     <li data-md="">
@@ -2215,7 +2273,7 @@ <h4 class="heading settled" data-level="8.2.4" id="x_confs"><span class="secno">
      <p>if possible, convert character confidences to values between 0 and 100 and
 have them approximate posterior probabilities (expressed in %)</p>
    </ul>
-   <h4 class="heading settled" data-level="8.2.5" id="x_wconf"><span class="secno">8.2.5. </span><span class="content"><code>x_wconf</code></span><a class="self-link" href="#x_wconf"></a></h4>
+   <h4 class="heading settled" data-level="8.2.5" id="x_wconf"><span class="secno">8.2.5. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-x_wconf">x_wconf<a class="self-link" href="#propdef-x_wconf"></a></dfn></span><a class="self-link" href="#x_wconf"></a></h4>
    <p><code>x_wconf n</code></p>
    <ul>
     <li data-md="">
@@ -2248,14 +2306,14 @@ <h2 class="heading settled" data-level="10" id="alternative-segmentations-readin
    <p>Alternative segmentations and readings are indicated by a <code>&lt;span></code> with <code>class="alternatives"</code>. It must contain <code>&lt;ins></code> and <code>&lt;del></code> elements. The first
 contained element should be <code>&lt;ins></code> and represent the most probable interpretation,
 the subsequent ones <code>&lt;del></code>. Each <code>&lt;ins></code> and <code>&lt;del></code> element should have <code>class="alt"</code> and a
-property of either <code>nlp</code> or <code>x_cost</code>. These <code>&lt;span></code>, <code>&lt;ins></code>, and <code>&lt;del></code> tags can nest
+property of either <a class="property" data-link-type="propdesc" href="#propdef-nlp" id="ref-for-propdef-nlp-1">nlp</a> or <a class="property" data-link-type="propdesc">x_cost</a>. These <code>&lt;span></code>, <code>&lt;ins></code>, and <code>&lt;del></code> tags can nest
 arbitrarily.</p>
    <div class="example" id="example-861f64bd">
     <a class="self-link" href="#example-861f64bd"></a> 
-<pre class="language-html highlight"><span class="nt">&lt;span</span> <span class="na">class=</span><span class="s">"alternatives"</span><span class="nt">></span>
-<span class="nt">&lt;ins</span> <span class="na">class=</span><span class="s">"alt"</span> <span class="na">title=</span><span class="s">"nlp 0.3"</span><span class="nt">></span>hello<span class="nt">&lt;/ins></span>
-<span class="nt">&lt;del</span> <span class="na">class=</span><span class="s">"alt"</span> <span class="na">title=</span><span class="s">"nlp 1.1"</span><span class="nt">></span>hallo<span class="nt">&lt;/del></span>
-<span class="nt">&lt;/span></span>
+<pre class="language-html highlight"><span class="p">&lt;</span><span class="nt">span</span> <span class="na">class</span><span class="o">=</span><span class="s">"alternatives"</span><span class="p">></span>
+<span class="p">&lt;</span><span class="nt">ins</span> <span class="na">class</span><span class="o">=</span><span class="s">"alt"</span> <span class="na">title</span><span class="o">=</span><span class="s">"nlp 0.3"</span><span class="p">></span>hello<span class="p">&lt;</span><span class="p">/</span><span class="nt">ins</span><span class="p">></span>
+<span class="p">&lt;</span><span class="nt">del</span> <span class="na">class</span><span class="o">=</span><span class="s">"alt"</span> <span class="na">title</span><span class="o">=</span><span class="s">"nlp 1.1"</span><span class="p">></span>hallo<span class="p">&lt;</span><span class="p">/</span><span class="nt">del</span><span class="p">></span>
+<span class="p">&lt;</span><span class="p">/</span><span class="nt">span</span><span class="p">></span>
 </pre>
    </div>
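A minimal Python 3 sketch of reading such a group follows; it is not part of the spec. It assumes the fragment is well-formed XML without a namespace and that each `title` holds a single `nlp` value. The helper name `alternatives` and the `SAMPLE` string are hypothetical.

```python
import xml.etree.ElementTree as ET


def alternatives(span):
    """Return (text, nlp) pairs for each ins/del child, in document order."""
    pairs = []
    for child in span:
        if child.tag not in ("ins", "del"):
            continue
        fields = child.get("title", "").split()
        # assumption: the title is exactly "nlp <value>"; anything else gives None
        nlp = float(fields[1]) if len(fields) == 2 and fields[0] == "nlp" else None
        pairs.append(("".join(child.itertext()), nlp))
    return pairs


# hypothetical sample, mirroring the example above
SAMPLE = """<span class="alternatives">
<ins class="alt" title="nlp 0.3">hello</ins>
<del class="alt" title="nlp 1.1">hallo</del>
</span>"""

span = ET.fromstring(SAMPLE)
pairs = alternatives(span)
# the first contained element is expected to be the most probable reading
best = pairs[0][0]
print(best, pairs)
```

Iterating over element children only means that whitespace between the `ins` and `del` elements is skipped, in line with the rule that such whitespace is ignored.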
    <p>Whitespace within the <code>&lt;span></code> but outside the contained <code>&lt;ins></code>/<code>&lt;del></code> elements is ignored and should be inserted to improve readability of the HTML
@@ -2263,7 +2321,7 @@ <h2 class="heading settled" data-level="10" id="alternative-segmentations-readin
    <h2 class="heading settled" data-level="11" id="grouped-elements-and-multiple-hierarchies"><span class="secno">11. </span><span class="content">Grouped Elements and Multiple Hierarchies</span><a class="self-link" href="#grouped-elements-and-multiple-hierarchies"></a></h2>
    <p>The different levels of layout information (logical, physical, engine-specific)
 each form hierarchies, but those hierarchies may not be mutually compatible;
-for example, a single <code>ocr_page</code> may contain information from multiple sections
+for example, a single <code><a data-link-type="element" href="#elementdef-ocr_page" id="ref-for-elementdef-ocr_page-2">ocr_page</a></code> may contain information from multiple sections
 or chapters. To represent both hierarchies within a single document, elements
 may be grouped together.  That is, two elements with the same class may be
 treated as one element by adding a "groupid identifier" property to them and
@@ -2279,7 +2337,7 @@ <h2 class="heading settled" data-level="11" id="grouped-elements-and-multiple-hi
 then collapsing grouped elements into single elements.  For example, output
 that contains both logical and physical layout information, where the logical
 layout information uses grouped elements, can be transformed by removing all
-the physical layout information, and then collapsing all split <code>ocr_chapter</code> elements into single <code>ocr_chapter</code> elements based on the groupid.  The result is
+the physical layout information, and then collapsing all split <code><a data-link-type="element" href="#elementdef-ocr_chapter" id="ref-for-elementdef-ocr_chapter-2">ocr_chapter</a></code> elements into single <code><a data-link-type="element" href="#elementdef-ocr_chapter" id="ref-for-elementdef-ocr_chapter-3">ocr_chapter</a></code> elements based on the groupid.  The result is
 a simple DOM tree.  This transformation can be provided generically as a
 pre-processor or JavaScript.</p>
    <p>The presence of grouped elements does not need to be indicated in the header;
@@ -2294,15 +2352,15 @@ <h2 class="heading settled" data-level="12" id="capabilities"><span class="secno
 document.</p>
    <p>The capability to generate specific properties is given by the prefix <code>ocrp_...</code>;
 the important properties are:</p>
-   <h3 class="heading settled" data-level="12.1" id="ocrp_lang"><span class="secno">12.1. </span><span class="content"><code>ocrp_lang</code></span><a class="self-link" href="#ocrp_lang"></a></h3>
+   <h3 class="heading settled" data-level="12.1" id="ocrp_lang"><span class="secno">12.1. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-for="ocr-capabilities" data-dfn-type="value" data-export="" id="valdef-ocr-capabilities-ocrp_lang">ocrp_lang</dfn></span><a class="self-link" href="#ocrp_lang"></a></h3>
    <p>Capable of generating <code>lang=</code> attributes</p>
-   <h3 class="heading settled" data-level="12.2" id="ocrp_dir"><span class="secno">12.2. </span><span class="content"><code>ocrp_dir</code></span><a class="self-link" href="#ocrp_dir"></a></h3>
+   <h3 class="heading settled" data-level="12.2" id="ocrp_dir"><span class="secno">12.2. </span><span class="content"><dfn class="css" data-dfn-for="ocr-capabilities" data-dfn-type="value" data-export="" id="valdef-ocr-capabilities-ocrp_dir">ocrp_dir<a class="self-link" href="#valdef-ocr-capabilities-ocrp_dir"></a></dfn></span><a class="self-link" href="#ocrp_dir"></a></h3>
    <p>Capable of generating <code>dir=</code> attributes</p>
-   <h3 class="heading settled" data-level="12.3" id="ocrp_poly"><span class="secno">12.3. </span><span class="content"><code>ocrp_poly</code></span><a class="self-link" href="#ocrp_poly"></a></h3>
+   <h3 class="heading settled" data-level="12.3" id="ocrp_poly"><span class="secno">12.3. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-for="ocr-capabilities" data-dfn-type="value" data-export="" id="valdef-ocr-capabilities-ocrp_poly">ocrp_poly</dfn></span><a class="self-link" href="#ocrp_poly"></a></h3>
    <p>Capable of generating <a href="#poly">polygonal bounds</a></p>
-   <h3 class="heading settled" data-level="12.4" id="ocrp_font"><span class="secno">12.4. </span><span class="content"><code>ocrp_font</code></span><a class="self-link" href="#ocrp_font"></a></h3>
+   <h3 class="heading settled" data-level="12.4" id="ocrp_font"><span class="secno">12.4. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-for="ocr-capabilities" data-dfn-type="value" data-export="" id="valdef-ocr-capabilities-ocrp_font">ocrp_font</dfn></span><a class="self-link" href="#ocrp_font"></a></h3>
    <p>Capable of generating font information (standard font information)</p>
-   <h3 class="heading settled" data-level="12.5" id="ocrp_nlp"><span class="secno">12.5. </span><span class="content"><code>ocrp_nlp</code></span><a class="self-link" href="#ocrp_nlp"></a></h3>
+   <h3 class="heading settled" data-level="12.5" id="ocrp_nlp"><span class="secno">12.5. </span><span class="content"><dfn class="css" data-dfn-for="ocr-capabilities" data-dfn-type="value" data-export="" id="valdef-ocr-capabilities-ocrp_nlp">ocrp_nlp<a class="self-link" href="#valdef-ocr-capabilities-ocrp_nlp"></a></dfn></span><a class="self-link" href="#ocrp_nlp"></a></h3>
    <p>Capable of generating <a href="#nlp">nlp confidences</a></p>
    <h3 class="heading settled" data-level="12.6" id="ocr_embeddedformat_formatname"><span class="secno">12.6. </span><span class="content"><code>ocr_embeddedformat_&lt;formatname></code></span><a class="self-link" href="#ocr_embeddedformat_formatname"></a></h3>
    <p>The capability to generate other specific embedded formats is given by the
@@ -2317,9 +2375,13 @@ <h3 class="heading settled" data-level="12.7" id="ocr_tag_unordered"><span class
    <h2 class="heading settled" data-level="13" id="metadata"><span class="secno">13. </span><span class="content">Metadata</span><a class="self-link" href="#metadata"></a></h2>
    <h3 class="heading settled" data-level="13.1" id="required-meta-information"><span class="secno">13.1. </span><span class="content">Required Meta Information</span><a class="self-link" href="#required-meta-information"></a></h3>
    <p>The OCR system is required to indicate the following using meta tags in the header:</p>
+   <h4 class="heading settled" data-level="13.1.1" id="ocr-system"><span class="secno">13.1.1. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-ocr-system">ocr-system<a class="self-link" href="#propdef-ocr-system"></a></dfn></span><a class="self-link" href="#ocr-system"></a></h4>
    <ul>
     <li data-md="">
      <p><code>&lt;meta name="ocr-system" content="name version"/></code></p>
+   </ul>
+   <h4 class="heading settled" data-level="13.1.2" id="ocr-capabilities"><span class="secno">13.1.2. </span><span class="content"><dfn class="dfn-paneled css" data-dfn-type="property" data-export="" id="propdef-ocr-capabilities">ocr-capabilities</dfn></span><a class="self-link" href="#ocr-capabilities"></a></h4>
+   <ul>
     <li data-md="">
      <p><code>&lt;meta name="ocr-capabilities" content="capabilities"/></code></p>
      <ul>
@@ -2327,10 +2389,15 @@ <h3 class="heading settled" data-level="13.1" id="required-meta-information"><sp
        <p>see <a href="#capabilities">§12 Capabilities</a></p>
      </ul>
    </ul>
+   <h3 class="heading settled" data-level="13.2" id="recommended-meta-information"><span class="secno">13.2. </span><span class="content">Recommended Meta Information</span><a class="self-link" href="#recommended-meta-information"></a></h3>
    <p>The OCR system should indicate the following information</p>
+   <h4 class="heading settled" data-level="13.2.1" id="ocr-number-of-pages"><span class="secno">13.2.1. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-ocr-number-of-pages">ocr-number-of-pages<a class="self-link" href="#propdef-ocr-number-of-pages"></a></dfn></span><a class="self-link" href="#ocr-number-of-pages"></a></h4>
    <ul>
     <li data-md="">
      <p><code>&lt;meta name="ocr-number-of-pages" content="number-of-pages"/></code></p>
+   </ul>
+   <h4 class="heading settled" data-level="13.2.2" id="ocr-langs"><span class="secno">13.2.2. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-ocr-langs">ocr-langs<a class="self-link" href="#propdef-ocr-langs"></a></dfn></span><a class="self-link" href="#ocr-langs"></a></h4>
+   <ul>
     <li data-md="">
      <p><code>&lt;meta name="ocr-langs" content="languages-considered-by-ocr"/></code></p>
      <ul>
@@ -2339,6 +2406,9 @@ <h3 class="heading settled" data-level="13.1" id="required-meta-information"><sp
       <li data-md="">
        <p>value may be <code>unknown</code></p>
      </ul>
+   </ul>
+   <h4 class="heading settled" data-level="13.2.3" id="ocr-scripts"><span class="secno">13.2.3. </span><span class="content"><dfn class="css" data-dfn-type="property" data-export="" id="propdef-ocr-scripts">ocr-scripts<a class="self-link" href="#propdef-ocr-scripts"></a></dfn></span><a class="self-link" href="#ocr-scripts"></a></h4>
+   <ul>
     <li data-md="">
      <p><code>&lt;meta name="ocr-scripts" content="scripts-considered-by-ocr"/></code></p>
      <ul>
@@ -2348,7 +2418,7 @@ <h3 class="heading settled" data-level="13.1" id="required-meta-information"><sp
        <p>value may be <code>unknown</code></p>
      </ul>
    </ul>
-   <h3 class="heading settled" data-level="13.2" id="document-metadata"><span class="secno">13.2. </span><span class="content">Document metadata</span><a class="self-link" href="#document-metadata"></a></h3>
+   <h3 class="heading settled" data-level="13.3" id="document-metadata"><span class="secno">13.3. </span><span class="content">Document metadata</span><a class="self-link" href="#document-metadata"></a></h3>
    <p>For document meta information, use the <a href="http://dublincore.org/documents/dcq-html/">Dublin Core Embedding into
 HTML</a>. See also <a href="http://dublincore.org/documents/dc-citation-guidelines/">Citation Guidelines
 for Dublin Core</a>.</p>
@@ -2384,21 +2454,21 @@ <h2 class="heading settled" data-level="14" id="profiles"><span class="secno">14
      <p>common commercial OCR output (e.g., Abbyy)</p>
      <ul>
       <li data-md="">
-       <p>ocr_page</p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_page" id="ref-for-elementdef-ocr_page-3">ocr_page</a></code></p>
       <li data-md="">
-       <p>ocrx_block, ocrx_line, ocrx_word</p>
+       <p><code><a data-link-type="element" href="#elementdef-ocrx_block" id="ref-for-elementdef-ocrx_block-6">ocrx_block</a></code>, <code><a data-link-type="element" href="#elementdef-ocrx_line" id="ref-for-elementdef-ocrx_line-3">ocrx_line</a></code>, <code><a data-link-type="element" href="#elementdef-ocrx_word" id="ref-for-elementdef-ocrx_word-1">ocrx_word</a></code></p>
       <li data-md="">
-       <p>ocrp_lang</p>
+       <p><a class="css" data-link-type="maybe" href="#valdef-ocr-capabilities-ocrp_lang" id="ref-for-valdef-ocr-capabilities-ocrp_lang-1">ocrp_lang</a></p>
       <li data-md="">
-       <p>ocrp_font</p>
+       <p><a class="css" data-link-type="maybe" href="#valdef-ocr-capabilities-ocrp_font" id="ref-for-valdef-ocr-capabilities-ocrp_font-1">ocrp_font</a></p>
      </ul>
     <li data-md="">
      <p>book target</p>
      <ul>
       <li data-md="">
-       <p>all logical structuring elements (as applicable), except ocr_linear</p>
+       <p>all logical structuring elements (as applicable), except <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-9">ocr_linear</a></code></p>
       <li data-md="">
-       <p>ocr_page</p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_page" id="ref-for-elementdef-ocr_page-4">ocr_page</a></code></p>
      </ul>
     <li data-md="">
      <p>newspaper target</p>
@@ -2406,9 +2476,9 @@ <h2 class="heading settled" data-level="14" id="profiles"><span class="secno">14
       <li data-md="">
        <p>all logical structuring elements (as applicable)</p>
       <li data-md="">
-       <p>articles map on ocr_linear</p>
+       <p>articles map on <code><a data-link-type="element" href="#elementdef-ocr_linear" id="ref-for-elementdef-ocr_linear-10">ocr_linear</a></code></p>
       <li data-md="">
-       <p>ocr_page</p>
+       <p><code><a data-link-type="element" href="#elementdef-ocr_page" id="ref-for-elementdef-ocr_page-5">ocr_page</a></code></p>
      </ul>
    </ul>
    <h2 class="heading settled" data-level="15" id="html-markup"><span class="secno">15. </span><span class="content">HTML Markup</span><a class="self-link" href="#html-markup"></a></h2>
@@ -2595,32 +2665,32 @@ <h2 class="heading settled" data-level="16" id="sample-usage"><span class="secno
 they’re actually pretty easy to manipulate. Here are some examples:</p>
 <pre class="language-python highlight"><span class="kn">import</span> <span class="nn">libxml2</span><span class="o">,</span><span class="nn">re</span><span class="o">,</span><span class="nn">os</span><span class="o">,</span><span class="nn">string</span>
 
-<span class="c"># convert the HTML to XHTML (if necessary)</span>
-<span class="n">os</span><span class="o">.</span><span class="n">system</span><span class="p">(</span><span class="s">"</span><span class="s">tidy -q -asxhtml &lt; page.html > page.xhtml 2> /dev/null</span><span class="s">"</span><span class="p">)</span>
+<span class="c1"># convert the HTML to XHTML (if necessary)</span>
+<span class="n">os</span><span class="o">.</span><span class="n">system</span><span class="p">(</span><span class="s2">"</span><span class="s2">tidy -q -asxhtml &lt; page.html > page.xhtml 2> /dev/null</span><span class="s2">"</span><span class="p">)</span>
 
-<span class="c"># parse the XML</span>
-<span class="n">doc</span> <span class="o">=</span> <span class="n">libxml2</span><span class="o">.</span><span class="n">parseFile</span><span class="p">(</span><span class="s">'</span><span class="s">page.xhtml</span><span class="s">'</span><span class="p">)</span>
+<span class="c1"># parse the XML</span>
+<span class="n">doc</span> <span class="o">=</span> <span class="n">libxml2</span><span class="o">.</span><span class="n">parseFile</span><span class="p">(</span><span class="s1">'</span><span class="s1">page.xhtml</span><span class="s1">'</span><span class="p">)</span>
 
-<span class="c"># search all nodes having a class of ocr_line</span>
-<span class="n">lines</span> <span class="o">=</span> <span class="n">doc</span><span class="o">.</span><span class="n">xpathEval</span><span class="p">(</span><span class="s">"</span><span class="s">//*[@class=</span><span class="s">'</span><span class="s">ocr_line</span><span class="s">'</span><span class="s">]</span><span class="s">"</span><span class="p">)</span>
+<span class="c1"># search all nodes having a class of ocr_line</span>
+<span class="n">lines</span> <span class="o">=</span> <span class="n">doc</span><span class="o">.</span><span class="n">xpathEval</span><span class="p">(</span><span class="s2">"</span><span class="s2">//*[@class=</span><span class="s2">'</span><span class="s2">ocr_line</span><span class="s2">'</span><span class="s2">]</span><span class="s2">"</span><span class="p">)</span>
 
-<span class="c"># a function for extracting the text from a node</span>
+<span class="c1"># a function for extracting the text from a node</span>
 <span class="k">def</span> <span class="nf">get_text</span><span class="p">(</span><span class="n">node</span><span class="p">)</span><span class="p">:</span>
-    <span class="n">textnodes</span> <span class="o">=</span> <span class="n">node</span><span class="o">.</span><span class="n">xpathEval</span><span class="p">(</span><span class="s">"</span><span class="s">.//text()</span><span class="s">"</span><span class="p">)</span>
+    <span class="n">textnodes</span> <span class="o">=</span> <span class="n">node</span><span class="o">.</span><span class="n">xpathEval</span><span class="p">(</span><span class="s2">"</span><span class="s2">.//text()</span><span class="s2">"</span><span class="p">)</span>
     <span class="n">s</span> <span class="o">=</span> <span class="n">string</span><span class="o">.</span><span class="n">join</span><span class="p">(</span><span class="p">[</span><span class="n">node</span><span class="o">.</span><span class="n">getContent</span><span class="p">(</span><span class="p">)</span> <span class="k">for</span> <span class="n">node</span> <span class="ow">in</span> <span class="n">textnodes</span><span class="p">]</span><span class="p">)</span>
-    <span class="k">return</span> <span class="n">re</span><span class="o">.</span><span class="n">sub</span><span class="p">(</span><span class="s">r'</span><span class="s">\</span><span class="s">s+</span><span class="s">'</span><span class="p">,</span><span class="s">'</span><span class="s"> </span><span class="s">'</span><span class="p">,</span><span class="n">s</span><span class="p">)</span>
+    <span class="k">return</span> <span class="n">re</span><span class="o">.</span><span class="n">sub</span><span class="p">(</span><span class="s1">r'</span><span class="s1">\</span><span class="s1">s+</span><span class="s1">'</span><span class="p">,</span><span class="s1">'</span><span class="s1"> </span><span class="s1">'</span><span class="p">,</span><span class="n">s</span><span class="p">)</span>
 
-<span class="c"># a function for extracting the bbox property from a node</span>
-<span class="c"># note that the title= attribute on a node with an ocr_ class must</span>
-<span class="c"># conform with the OCR spec</span>
+<span class="c1"># a function for extracting the bbox property from a node</span>
+<span class="c1"># note that the title= attribute on a node with an ocr_ class must</span>
+<span class="c1"># conform with the OCR spec</span>
 
 <span class="k">def</span> <span class="nf">get_bbox</span><span class="p">(</span><span class="n">node</span><span class="p">)</span><span class="p">:</span>
-    <span class="n">data</span> <span class="o">=</span> <span class="n">node</span><span class="o">.</span><span class="n">prop</span><span class="p">(</span><span class="s">'</span><span class="s">title</span><span class="s">'</span><span class="p">)</span>
-    <span class="n">bboxre</span> <span class="o">=</span> <span class="n">re</span><span class="o">.</span><span class="n">compile</span><span class="p">(</span><span class="s">r'</span><span class="s">\</span><span class="s">bbbox</span><span class="s">\</span><span class="s">s+(</span><span class="s">\</span><span class="s">d+)</span><span class="s">\</span><span class="s">s+(</span><span class="s">\</span><span class="s">d+)</span><span class="s">\</span><span class="s">s+(</span><span class="s">\</span><span class="s">d+)</span><span class="s">\</span><span class="s">s+(</span><span class="s">\</span><span class="s">d+)</span><span class="s">'</span><span class="p">)</span>
+    <span class="n">data</span> <span class="o">=</span> <span class="n">node</span><span class="o">.</span><span class="n">prop</span><span class="p">(</span><span class="s1">'</span><span class="s1">title</span><span class="s1">'</span><span class="p">)</span>
+    <span class="n">bboxre</span> <span class="o">=</span> <span class="n">re</span><span class="o">.</span><span class="n">compile</span><span class="p">(</span><span class="s1">r'</span><span class="s1">\</span><span class="s1">bbbox</span><span class="s1">\</span><span class="s1">s+(</span><span class="s1">\</span><span class="s1">d+)</span><span class="s1">\</span><span class="s1">s+(</span><span class="s1">\</span><span class="s1">d+)</span><span class="s1">\</span><span class="s1">s+(</span><span class="s1">\</span><span class="s1">d+)</span><span class="s1">\</span><span class="s1">s+(</span><span class="s1">\</span><span class="s1">d+)</span><span class="s1">'</span><span class="p">)</span>
     <span class="k">return</span> <span class="p">[</span>int<span class="p">(</span><span class="n">x</span><span class="p">)</span> <span class="k">for</span> <span class="n">x</span> <span class="ow">in</span> <span class="n">bboxre</span><span class="o">.</span><span class="n">search</span><span class="p">(</span><span class="n">data</span><span class="p">)</span><span class="o">.</span><span class="n">groups</span><span class="p">(</span><span class="p">)</span><span class="p">]</span>
 
-<span class="c"># this extracts all the bounding boxes and the text they contain</span>
-<span class="c"># it doesn’t matter what other markup the line node may contain</span>
+<span class="c1"># this extracts all the bounding boxes and the text they contain</span>
+<span class="c1"># it doesn’t matter what other markup the line node may contain</span>
 <span class="k">for</span> <span class="n">line</span> <span class="ow">in</span> <span class="n">lines</span><span class="p">:</span>
     <span class="k">print</span> <span class="n">get_bbox</span><span class="p">(</span><span class="n">line</span><span class="p">)</span><span class="p">,</span><span class="n">get_text</span><span class="p">(</span><span class="n">line</span><span class="p">)</span>
 </pre>
@@ -2807,6 +2877,85 @@ <h2 class="no-ref no-num heading settled" id="conformance"><span class="content"
 
 })();
 </script>
+  <h2 class="no-num no-ref heading settled" id="index"><span class="content">Index</span><a class="self-link" href="#index"></a></h2>
+  <h3 class="no-num no-ref heading settled" id="index-defined-here"><span class="content">Terms defined by this specification</span><a class="self-link" href="#index-defined-here"></a></h3>
+  <ul class="index">
+   <li><a href="#propdef-baseline">baseline</a><span>, in §3.2.5</span>
+   <li><a href="#propdef-bbox">bbox</a><span>, in §3.1.1</span>
+   <li><a href="#propdef-cflow">cflow</a><span>, in §3.2.4</span>
+   <li><a href="#propdef-cuts">cuts</a><span>, in §7.2.1</span>
+   <li><a href="#propdef-hardbreak">hardbreak</a><span>, in §5.3.4</span>
+   <li><a href="#propdef-image">image</a><span>, in §5.2.2</span>
+   <li><a href="#propdef-imagemd5">imagemd5</a><span>, in §5.2.3</span>
+   <li><a href="#propdef-lpageno">lpageno</a><span>, in §5.2.5</span>
+   <li><a href="#propdef-nlp">nlp</a><span>, in §7.2.2</span>
+   <li><a href="#elementdef-ocr_abstract">ocr_abstract</a><span>, in §4.4</span>
+   <li><a href="#elementdef-ocr_author">ocr_author</a><span>, in §4.3</span>
+   <li><a href="#elementdef-ocr_blockquote">ocr_blockquote</a><span>, in §4.10</span>
+   <li><a href="#propdef-ocr-capabilities">ocr-capabilities</a><span>, in §13.1.2</span>
+   <li><a href="#elementdef-ocr_caption">ocr_caption</a><span>, in §4.12</span>
+   <li><a href="#elementdef-ocr_carea">ocr_carea</a><span>, in §5.1.3</span>
+   <li><a href="#elementdef-ocr_chapter">ocr_chapter</a><span>, in §4.6</span>
+   <li><a href="#elementdef-ocr_chem">ocr_chem</a><span>, in §6.1.4</span>
+   <li><a href="#elementdef-ocr_cinfo">ocr_cinfo</a><span>, in §7.1.1</span>
+   <li><a href="#elementdef-ocr_column">ocr_column</a><span>, in §5.1.2</span>
+   <li><a href="#elementdef-ocr_display">ocr_display</a><span>, in §4.9</span>
+   <li><a href="#elementdef-ocr_document">ocr_document</a><span>, in §4.1</span>
+   <li><a href="#elementdef-ocr_dropcap">ocr_dropcap</a><span>, in §6.1.3</span>
+   <li><a href="#elementdef-ocr_float">ocr_float</a><span>, in §5.4.1</span>
+   <li><a href="#elementdef-ocr_footer">ocr_footer</a><span>, in §5.4.9</span>
+   <li><a href="#elementdef-ocr_glyph">ocr_glyph</a><span>, in §6.1.1</span>
+   <li><a href="#elementdef-ocr_glyphs">ocr_glyphs</a><span>, in §6.1.2</span>
+   <li><a href="#elementdef-ocr_header">ocr_header</a><span>, in §5.4.8</span>
+   <li><a href="#elementdef-ocr_image">ocr_image</a><span>, in §5.4.5</span>
+   <li><a href="#propdef-ocr-langs">ocr-langs</a><span>, in §13.2.2</span>
+   <li><a href="#elementdef-ocr_line">ocr_line</a><span>, in §5.1.4</span>
+   <li><a href="#elementdef-ocr_linear">ocr_linear</a><span>, in §4.11.1</span>
+   <li><a href="#elementdef-ocr_linedrawing">ocr_linedrawing</a><span>, in §5.4.6</span>
+   <li><a href="#elementdef-ocr_math">ocr_math</a><span>, in §6.1.5</span>
+   <li><a href="#elementdef-ocr_noise">ocr_noise</a><span>, in §5.1.6</span>
+   <li><a href="#propdef-ocr-number-of-pages">ocr-number-of-pages</a><span>, in §13.2.1</span>
+   <li><a href="#elementdef-ocr_page">ocr_page</a><span>, in §5.1.1</span>
+   <li><a href="#elementdef-ocr_pageno">ocr_pageno</a><span>, in §5.4.10</span>
+   <li><a href="#elementdef-ocr_par">ocr_par</a><span>, in §4.11</span>
+   <li><a href="#elementdef-ocr_part">ocr_part</a><span>, in §4.5</span>
+   <li><a href="#valdef-ocr-capabilities-ocrp_dir">ocrp_dir</a><span>, in §12.2</span>
+   <li><a href="#valdef-ocr-capabilities-ocrp_font">ocrp_font</a><span>, in §12.4</span>
+   <li><a href="#elementdef-ocr_photo">ocr_photo</a><span>, in §5.4.7</span>
+   <li><a href="#valdef-ocr-capabilities-ocrp_lang">ocrp_lang</a><span>, in §12.1</span>
+   <li><a href="#valdef-ocr-capabilities-ocrp_nlp">ocrp_nlp</a><span>, in §12.5</span>
+   <li><a href="#valdef-ocr-capabilities-ocrp_poly">ocrp_poly</a><span>, in §12.3</span>
+   <li><a href="#propdef-ocr-scripts">ocr-scripts</a><span>, in §13.2.3</span>
+   <li><a href="#elementdef-ocr_section">ocr_section</a><span>, in §4.7</span>
+   <li>
+    ocr_separator
+    <ul>
+     <li><a href="#elementdef-ocr_separator">(element)</a><span>, in §5.1.5</span>
+     <li><a href="#elementdef-ocr_separator0">(element)</a><span>, in §5.4.2</span>
+    </ul>
+   <li><a href="#elementdef-ocr_subsubsection">ocr_subsubsection</a><span>, in §4.8</span>
+   <li><a href="#propdef-ocr-system">ocr-system</a><span>, in §13.1.1</span>
+   <li><a href="#elementdef-ocr_table">ocr_table</a><span>, in §5.4.11</span>
+   <li><a href="#elementdef-ocr_textfloat">ocr_textfloat</a><span>, in §5.4.3</span>
+   <li><a href="#elementdef-ocr_textimage">ocr_textimage</a><span>, in §5.4.4</span>
+   <li><a href="#elementdef-ocr_title">ocr_title</a><span>, in §4.2</span>
+   <li><a href="#elementdef-ocrx_block">ocrx_block</a><span>, in §8.1.1</span>
+   <li><a href="#elementdef-ocrx_line">ocrx_line</a><span>, in §8.1.2</span>
+   <li><a href="#elementdef-ocrx_word">ocrx_word</a><span>, in §8.1.3</span>
+   <li><a href="#propdef-order">order</a><span>, in §3.2.2</span>
+   <li><a href="#propdef-poly">poly</a><span>, in §3.2.1</span>
+   <li><a href="#propdef-ppageno">ppageno</a><span>, in §5.2.4</span>
+   <li><a href="#propdef-presence">presence</a><span>, in §3.2.3</span>
+   <li><a href="#propdef-scan_res">scan_res</a><span>, in §5.3.1</span>
+   <li><a href="#propdef-textangle">textangle</a><span>, in §3.1.2</span>
+   <li><a href="#propdef-x_bboxes">x_bboxes</a><span>, in §8.2.3</span>
+   <li><a href="#propdef-x_confs">x_confs</a><span>, in §8.2.4</span>
+   <li><a href="#propdef-x_font">x_font</a><span>, in §8.2.1</span>
+   <li><a href="#propdef-x_fsize">x_fsize</a><span>, in §8.2.2</span>
+   <li><a href="#propdef-x_scanner">x_scanner</a><span>, in §5.3.2</span>
+   <li><a href="#propdef-x_source">x_source</a><span>, in §5.3.3</span>
+   <li><a href="#propdef-x_wconf">x_wconf</a><span>, in §8.2.5</span>
+  </ul>
   <h2 class="no-num no-ref heading settled" id="references"><span class="content">References</span><a class="self-link" href="#references"></a></h2>
   <h3 class="no-num no-ref heading settled" id="normative"><span class="content">Normative References</span><a class="self-link" href="#normative"></a></h3>
   <dl>
@@ -2838,8 +2987,271 @@ <h2 class="no-num no-ref heading settled" id="issues-index"><span class="content
 properties for floating elements; properties need to be defined for this.<a href="#issue-3f2f70ed"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/28">ocr_carea vs ocrx_block</a><a href="#issue-66c198d9"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/19">ocr_line vs ocrx_line</a><a href="#issue-8ef34561"> ↵ </a></div>
+   <div class="issue"> ocrx_cinfo?<a href="#issue-000a0ed5"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/9">Delete x_cost</a><a href="#issue-b35297dd"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/2">XML namespace for hOCR HTML?</a><a href="#issue-f6d39356"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/1">What DOCTYPE for hOCR HTML?</a><a href="#issue-a3899b99"> ↵ </a></div>
    <div class="issue"> <a href="https://github.com/kba/hocr-spec/issues/27">correct MIME type for hOCR?</a><a href="#issue-19855aac"> ↵ </a></div>
-  </div>
\ No newline at end of file
+  </div>
+  <aside class="dfn-panel" data-for="propdef-bbox">
+   <b><a href="#propdef-bbox">#propdef-bbox</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-bbox-1">3.1.1. bbox</a>
+    <li><a href="#ref-for-propdef-bbox-2">3.2.1. poly</a>
+    <li><a href="#ref-for-propdef-bbox-3">3.2.5. baseline</a>
+    <li><a href="#ref-for-propdef-bbox-4">7.2.1. cuts</a>
+    <li><a href="#ref-for-propdef-bbox-5">8.2.3. x_bboxes</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-presence">
+   <b><a href="#propdef-presence">#propdef-presence</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-presence-1">3.2.3. presence</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_document">
+   <b><a href="#elementdef-ocr_document">#elementdef-ocr_document</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_document-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_title">
+   <b><a href="#elementdef-ocr_title">#elementdef-ocr_title</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_title-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_author">
+   <b><a href="#elementdef-ocr_author">#elementdef-ocr_author</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_author-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_abstract">
+   <b><a href="#elementdef-ocr_abstract">#elementdef-ocr_abstract</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_abstract-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_part">
+   <b><a href="#elementdef-ocr_part">#elementdef-ocr_part</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_part-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_chapter">
+   <b><a href="#elementdef-ocr_chapter">#elementdef-ocr_chapter</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_chapter-1">4. Logical Structuring Elements</a>
+    <li><a href="#ref-for-elementdef-ocr_chapter-2">11. Grouped Elements and Multiple Hierarchies</a> <a href="#ref-for-elementdef-ocr_chapter-3">(2)</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_section">
+   <b><a href="#elementdef-ocr_section">#elementdef-ocr_section</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_section-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_display">
+   <b><a href="#elementdef-ocr_display">#elementdef-ocr_display</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_display-1">4. Logical Structuring Elements</a>
+    <li><a href="#ref-for-elementdef-ocr_display-2">6.1.5. ocr_math</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_blockquote">
+   <b><a href="#elementdef-ocr_blockquote">#elementdef-ocr_blockquote</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_blockquote-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_par">
+   <b><a href="#elementdef-ocr_par">#elementdef-ocr_par</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_par-1">4. Logical Structuring Elements</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_linear">
+   <b><a href="#elementdef-ocr_linear">#elementdef-ocr_linear</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_linear-1">3.2.4. cflow</a>
+    <li><a href="#ref-for-elementdef-ocr_linear-2">4. Logical Structuring Elements</a>
+    <li><a href="#ref-for-elementdef-ocr_linear-3">4.11.1. ocr_linear</a> <a href="#ref-for-elementdef-ocr_linear-4">(2)</a> <a href="#ref-for-elementdef-ocr_linear-5">(3)</a> <a href="#ref-for-elementdef-ocr_linear-6">(4)</a> <a href="#ref-for-elementdef-ocr_linear-7">(5)</a>
+    <li><a href="#ref-for-elementdef-ocr_linear-8">5.1.3. ocr_carea</a>
+    <li><a href="#ref-for-elementdef-ocr_linear-9">14. Profiles</a> <a href="#ref-for-elementdef-ocr_linear-10">(2)</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_caption">
+   <b><a href="#elementdef-ocr_caption">#elementdef-ocr_caption</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_caption-1">4.12. ocr_caption</a> <a href="#ref-for-elementdef-ocr_caption-2">(2)</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_page">
+   <b><a href="#elementdef-ocr_page">#elementdef-ocr_page</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_page-1">5.1.1. ocr_page</a>
+    <li><a href="#ref-for-elementdef-ocr_page-2">11. Grouped Elements and Multiple Hierarchies</a>
+    <li><a href="#ref-for-elementdef-ocr_page-3">14. Profiles</a> <a href="#ref-for-elementdef-ocr_page-4">(2)</a> <a href="#ref-for-elementdef-ocr_page-5">(3)</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_carea">
+   <b><a href="#elementdef-ocr_carea">#elementdef-ocr_carea</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_carea-1">3.2.4. cflow</a> <a href="#ref-for-elementdef-ocr_carea-2">(2)</a> <a href="#ref-for-elementdef-ocr_carea-3">(3)</a>
+    <li><a href="#ref-for-elementdef-ocr_carea-4">5.1.2. ocr_column</a>
+    <li><a href="#ref-for-elementdef-ocr_carea-5">5.1.3. ocr_carea</a> <a href="#ref-for-elementdef-ocr_carea-6">(2)</a> <a href="#ref-for-elementdef-ocr_carea-7">(3)</a> <a href="#ref-for-elementdef-ocr_carea-8">(4)</a> <a href="#ref-for-elementdef-ocr_carea-9">(5)</a> <a href="#ref-for-elementdef-ocr_carea-10">(6)</a>
+    <li><a href="#ref-for-elementdef-ocr_carea-11">8.1.3. ocrx_word</a> <a href="#ref-for-elementdef-ocr_carea-12">(2)</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_line">
+   <b><a href="#elementdef-ocr_line">#elementdef-ocr_line</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_line-1">2. Introduction</a>
+    <li><a href="#ref-for-elementdef-ocr_line-2">5.1.4. ocr_line</a> <a href="#ref-for-elementdef-ocr_line-3">(2)</a>
+    <li><a href="#ref-for-elementdef-ocr_line-4">5.3.3. x_source</a>
+    <li><a href="#ref-for-elementdef-ocr_line-5">5.3.4. hardbreak</a>
+    <li><a href="#ref-for-elementdef-ocr_line-6">8.1.2. ocrx_line</a>
+    <li><a href="#ref-for-elementdef-ocr_line-7">8.1.3. ocrx_word</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_float">
+   <b><a href="#elementdef-ocr_float">#elementdef-ocr_float</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_float-1">6.1.5. ocr_math</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocr_cinfo">
+   <b><a href="#elementdef-ocr_cinfo">#elementdef-ocr_cinfo</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocr_cinfo-1">7.1.1. ocr_cinfo</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-cuts">
+   <b><a href="#propdef-cuts">#propdef-cuts</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-cuts-1">7.2.1. cuts</a>
+    <li><a href="#ref-for-propdef-cuts-2">8.1.3. ocrx_word</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-nlp">
+   <b><a href="#propdef-nlp">#propdef-nlp</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-nlp-1">10. Alternative Segmentations / Readings</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocrx_block">
+   <b><a href="#elementdef-ocrx_block">#elementdef-ocrx_block</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocrx_block-1">3.2.4. cflow</a>
+    <li><a href="#ref-for-elementdef-ocrx_block-2">8.1.3. ocrx_word</a> <a href="#ref-for-elementdef-ocrx_block-3">(2)</a> <a href="#ref-for-elementdef-ocrx_block-4">(3)</a> <a href="#ref-for-elementdef-ocrx_block-5">(4)</a>
+    <li><a href="#ref-for-elementdef-ocrx_block-6">14. Profiles</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocrx_line">
+   <b><a href="#elementdef-ocrx_line">#elementdef-ocrx_line</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocrx_line-1">8.1.3. ocrx_word</a> <a href="#ref-for-elementdef-ocrx_line-2">(2)</a>
+    <li><a href="#ref-for-elementdef-ocrx_line-3">14. Profiles</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="elementdef-ocrx_word">
+   <b><a href="#elementdef-ocrx_word">#elementdef-ocrx_word</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-elementdef-ocrx_word-1">14. Profiles</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-x_bboxes">
+   <b><a href="#propdef-x_bboxes">#propdef-x_bboxes</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-x_bboxes-1">3.1.1. bbox</a>
+    <li><a href="#ref-for-propdef-x_bboxes-2">8.1.3. ocrx_word</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-x_confs">
+   <b><a href="#propdef-x_confs">#propdef-x_confs</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-x_confs-1">8.1.3. ocrx_word</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="valdef-ocr-capabilities-ocrp_lang">
+   <b><a href="#valdef-ocr-capabilities-ocrp_lang">#valdef-ocr-capabilities-ocrp_lang</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-valdef-ocr-capabilities-ocrp_lang-1">14. Profiles</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="valdef-ocr-capabilities-ocrp_poly">
+   <b><a href="#valdef-ocr-capabilities-ocrp_poly">#valdef-ocr-capabilities-ocrp_poly</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-valdef-ocr-capabilities-ocrp_poly-1">3.2.1. poly</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="valdef-ocr-capabilities-ocrp_font">
+   <b><a href="#valdef-ocr-capabilities-ocrp_font">#valdef-ocr-capabilities-ocrp_font</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-valdef-ocr-capabilities-ocrp_font-1">14. Profiles</a>
+   </ul>
+  </aside>
+  <aside class="dfn-panel" data-for="propdef-ocr-capabilities">
+   <b><a href="#propdef-ocr-capabilities">#propdef-ocr-capabilities</a></b><b>Referenced in:</b>
+   <ul>
+    <li><a href="#ref-for-propdef-ocr-capabilities-1">3.2.1. poly</a>
+   </ul>
+  </aside>
+<script>/* script-dfn-panel */
+
+        document.body.addEventListener("click", function(e) {
+            var queryAll = function(sel) { return [].slice.call(document.querySelectorAll(sel)); }
+            // Find the dfn element or panel, if any, that was clicked on.
+            var el = e.target;
+            var target;
+            var hitALink = false;
+            while(el.parentElement) {
+                if(el.tagName == "A") {
+                    // Clicking on a link in a <dfn> shouldn't summon the panel
+                    hitALink = true;
+                }
+                if(el.classList.contains("dfn-paneled")) {
+                    target = "dfn";
+                    break;
+                }
+                if(el.classList.contains("dfn-panel")) {
+                    target = "dfn-panel";
+                    break;
+                }
+                el = el.parentElement;
+            }
+            if(target != "dfn-panel") {
+                // Turn off any currently "on" or "activated" panels.
+                queryAll(".dfn-panel.on, .dfn-panel.activated").forEach(function(el){
+                    el.classList.remove("on");
+                    el.classList.remove("activated");
+                });
+            }
+            if(target == "dfn" && !hitALink) {
+                // open the panel
+                var dfnPanel = document.querySelector(".dfn-panel[data-for='" + el.id + "']");
+                if(dfnPanel) {
+                    console.log(dfnPanel);
+                    dfnPanel.classList.add("on");
+                    var rect = el.getBoundingClientRect();
+                    dfnPanel.style.left = window.scrollX + rect.right + 5 + "px";
+                    dfnPanel.style.top = window.scrollY + rect.top + "px";
+                    var panelRect = dfnPanel.getBoundingClientRect();
+                    var panelWidth = panelRect.right - panelRect.left;
+                    if(panelRect.right > document.body.scrollWidth && (rect.left - (panelWidth + 5)) > 0) {
+                        // Reposition, because the panel is overflowing
+                        dfnPanel.style.left = window.scrollX + rect.left - (panelWidth + 5) + "px";
+                    }
+                } else {
+                    console.log("Couldn't find .dfn-panel[data-for='" + el.id + "']");
+                }
+            } else if(target == "dfn-panel") {
+                // Switch it to "activated" state, which pins it.
+                el.classList.add("activated");
+                el.style.left = null;
+                el.style.top = null;
+            }
+
+        });
+        </script>
\ No newline at end of file
diff --git a/1.2/metadata b/1.2/metadata
index 38cdc08..97c54c1 100644
--- a/1.2/metadata
+++ b/1.2/metadata
@@ -9,4 +9,4 @@ Editor: Konstantin Baierer, UB Mannheim http://github.com/UB-Mannheim, konstanti
 Former Editor: Thomas Breuel, http://www.9x9.com/
 Previous Version: https://github.com/kba/hocr-spec/blob/master/1.1/spec.md
 Abstract: A subset of HTML for marking up OCR results
-Markup Shorthands: markdown on, biblio on
+Markup Shorthands: markdown on, biblio on, markup on
diff --git a/1.2/spec.md b/1.2/spec.md
index 78f1135..ca94594 100644
--- a/1.2/spec.md
+++ b/1.2/spec.md
@@ -13,7 +13,7 @@ arrive at a representation that makes it easy to reuse OCR results.
 
 This document describes many tags and a lot of information that can be output.
 However, getting started with hOCR is easy: you only need to output the tags
-and information you actually want to.  For example, just outputting `ocr_line`
+and information you actually want to.  For example, just outputting <{ocr_line}>
 tags with bounding boxes is already very useful for many applications.  Just
 start simple and add more output information as the need arises.
 
@@ -68,7 +68,7 @@ multiple properties are separated by semicolons.
 
 The following properties can apply to most elements (where it makes sense):
 
-### `bbox`
+### <dfn property>bbox</dfn>
 
 `bbox x0 y0 x1 y1`
 
@@ -79,8 +79,8 @@ the lower-right corner (x1, y1).
   * the values are with reference to the the top-left corner of the document image
     and measured in pixels
   * the order of the values are `x0 y0 x1 y1` = "left top right bottom"
-  * use `x_bboxes` below for character bounding boxes
-  * do not use `bbox` unless the bounding box of the layout component is, in
+  * use 'x_bboxes' below for character bounding boxes
+  * do not use 'bbox' unless the bounding box of the layout component is, in
     fact, rectangular
   * some non-rectangular layout components may have rectangular bounding boxes
     if the non-rectangularity is caused by floating elements around which text flows
@@ -106,7 +106,7 @@ the document image which border is drawn in black.
 
 </div>
 
-### `textangle`
+### <dfn property>textangle</dfn>
 
 `textangle alpha`
 
@@ -121,7 +121,7 @@ which should be indicated using standard HTML properties
 The following properties can apply to most elements but should not be used
 unless there is no alternative:
 
-### `poly`
+### <dfn property>poly</dfn>
 
 `poly x0 y0 x1 y1 ...`
 
@@ -134,11 +134,11 @@ A closed polygon for elements with non-rectangular bounds
   * note that the natural and correct representation of many non-rectangular
     layouts is in terms of rectangular content areas and rectangular floats
   * documents using polygonal borders anywhere must indicate this by adding
-    [[#ocrp_poly]] to the list of `ocr-capabilities` in the
-    [[#required-meta-information]]
-  * documents should attempt to provide a reasonable bbox equivalent as well
+    ''ocr-capabilities/ocrp_poly'' to the list of 'ocr-capabilities' (see
+    [[#required-meta-information]])
+  * documents should attempt to provide a reasonable 'bbox' equivalent as well
 
-### `order`
+### <dfn property>order</dfn>
 
 `order n`
 
@@ -148,27 +148,27 @@ The reading order of the element (an integer)
     the reading order of the page by element ordering within the page, since
     many tools will not be able to deal with content that is not in reading order
 
-### `presence`
+### <dfn property>presence</dfn>
 
 Issue: [Use of property presence](https://github.com/kba/hocr-spec/issues/10)
 
-`presence` presence must be declared in the document meta data
+'presence' presence must be declared in the document meta data
 
-### `cflow`
+### <dfn property>cflow</dfn>
 
 `cflow s`
 
-This property relates the flow between multiple [[#ocr_carea]] elements,
-and between [[#ocr_carea]] and [[#ocr_linear]] elements.
+This property relates the flow between multiple <{ocr_carea}> elements,
+and between <{ocr_carea}> and <{ocr_linear}> elements.
 
 The content flow on the page that this element is a part of
 
   * s must be a unique string for each content flow
-  * must be present on [[#ocr_carea]] and [[#ocrx_block]] tags when reading
+  * must be present on <{ocr_carea}> and <{ocrx_block}> tags when reading
     order is attempted and multiple content flows are present
   * presence must be declared in the document meta data
 
-### `baseline`
+### <dfn property>baseline</dfn>
 
 `baseline pn pn-1 ... p0`
 
@@ -191,7 +191,7 @@ contains the following information:
     title="bbox 105 66 823 113; baseline 0.015 -18">...</span>
 ```
 
-bbox is the bounding box of the line in image coordinates (blue). The two
+'bbox' is the bounding box of the line in image coordinates (blue). The two
 numbers for the baseline are the slope (1st number) and constant term (2nd
 number) of a linear equation describing the baseline relative to the bottom
 left corner of the bounding box (red). The baseline crosses the y-axis at `-18`
@@ -208,30 +208,30 @@ and its slope angle is `arctan(0.015) = 0.86°`.
 
 We recognize the following logical structuring elements:
 
-  * `ocr_document`
-    * `ocr_linear`
-      * `ocr_title`
-      * `ocr_author`
-      * `ocr_abstract`
-      * `ocr_part` [`<h1>`]
-        * `ocr_chapter` [`<h1>`]
-          * `ocr_section` [`<h2>`]
+  * <{ocr_document}>
+    * <{ocr_linear}>
+      * <{ocr_title}>
+      * <{ocr_author}>
+      * <{ocr_abstract}>
+      * <{ocr_part}> [`<h1>`]
+        * <{ocr_chapter}> [`<h1>`]
+          * <{ocr_section}> [`<h2>`]
             * `ocr_sub*section` [`<h3>`,`<h4>`]
-              * `ocr_display` 
-              * `ocr_blockquote` [`<blockquote>`]
-              * `ocr_par` [`<p>`]
-
-## `ocr_document`
-## `ocr_title`
-## `ocr_author`
-## `ocr_abstract`
-## `ocr_part`
-## `ocr_chapter`
-## `ocr_section`
-## `ocr_subsubsection`
-## `ocr_display`
-## `ocr_blockquote`
-## `ocr_par`
+              * <{ocr_display}> 
+              * <{ocr_blockquote}> [`<blockquote>`]
+              * <{ocr_par}> [`<p>`]
+
+## <dfn element>ocr_document</dfn>
+## <dfn element>ocr_title</dfn>
+## <dfn element>ocr_author</dfn>
+## <dfn element>ocr_abstract</dfn>
+## <dfn element>ocr_part</dfn>
+## <dfn element>ocr_chapter</dfn>
+## <dfn element>ocr_section</dfn>
+## <dfn element>ocr_subsubsection</dfn>
+## <dfn element>ocr_display</dfn>
+## <dfn element>ocr_blockquote</dfn>
+## <dfn element>ocr_par</dfn>
 
 These logical tags have their standard meaning as used in the publishing
 industry and tools like LaTeX, MS Word, and others.
@@ -241,15 +241,15 @@ with those logical structuring elements, but it may not be possible or
 desirable to actually chose those tags (e.g., when adding hOCR information to
 an existing HTML output routine).
 
-## `ocr_linear`
+### <dfn element>ocr_linear</dfn>
 
-For all of these elements except `ocr_linear`, there exists a natural linear
-ordering defined by reading order (`ocr_linear` indicates that the elements
-contained in it have a linear ordering). At the level of `ocr_linear`, there
-may not be a single distinguished order. A common example of `ocr_linear` is a
+For all of these elements except <{ocr_linear}>, there exists a natural linear
+ordering defined by reading order (<{ocr_linear}> indicates that the elements
+contained in it have a linear ordering). At the level of <{ocr_linear}>, there
+may not be a single distinguished order. A common example of <{ocr_linear}> is a
 newspaper, in which a single newspaper may contain many linear, but there is no
 unique reading order for the different linear. OCR evaluation tools should
-therefore be sensitive to the order of all elements other than `ocr_linear`.
+therefore be sensitive to the order of all elements other than <{ocr_linear}>.
 
 Tags must be nested as indicated by nesting above, but not all tags within the
 hierarchy need to be present.
@@ -260,11 +260,11 @@ text inside the containing element.
 Documents whose logical structure does not map naturally onto these logical
 structuring elemetns must not use them for other purpose.
 
-## `ocr_caption`
+## <dfn element>ocr_caption</dfn>
 
-Image captions may be indicated using the `ocr_caption` element; such an
+Image captions may be indicated using the <{ocr_caption}> element; such an
 element refers to the image(s) contained within the same float, or the
-immediately adjacent image if both the image and the `ocr_caption` element are
+immediately adjacent image if both the image and the <{ocr_caption}> element are
 in running text.
 
 
@@ -303,57 +303,57 @@ properties for floating elements; properties need to be defined for this.
 The following classes, as well as [floats](#classes-for-floats) are used for type-setting
 elements.
 
-### `ocr_page`
+### <dfn element>ocr_page</dfn>
 
-The `ocr_page` element must be present in all hOCR documents.
+The <{ocr_page}> element must be present in all hOCR documents.
 
-### `ocr_column`
+### <dfn element>ocr_column</dfn>
 
 <div class="annoying-warning">
 **OBSOLETE**
 
-Please use [[#ocr_carea]] instead
+Please use <{ocr_carea}> instead
 </div>
 
-### `ocr_carea`
+### <dfn element>ocr_carea</dfn>
 
 "ocr content area" or "body area"
 
 Used to be called <del>ocr_column</del>
 
-The `ocr_carea` elements should appear in reading order unless this is impossible
+The <{ocr_carea}> elements should appear in reading order unless this is impossible
 because of some other structuring requirement. If the document contains multiple
-`ocr_linear` streams, then each `ocr_carea` must indicate which stream it belongs
+<{ocr_linear}> streams, then each <{ocr_carea}> must indicate which stream it belongs
 to.
 
 Note that for many documents, the actual ground truth careas are well-defined
 by the document style of the original document before printing and scanning.
 From a single page, the `careas` of the original document style cannot be
-recovered exactly. However, the partition of a document by `ocr_carea` for an
+recovered exactly. However, the partition of a document by <{ocr_carea}> for an
 individual page shall be considered correct relative to ground truth if
 
   1. all the text contained in a ground truth carea is fully contained within a
-    single `ocr_carea`,
+    single <{ocr_carea}>,
   2. no text outside a ground truth `carea` is contained within an
-    `ocr_carea`, and 
-  3. the `ocr_careas` appear in the same order as the text flow
+    <{ocr_carea}>, and 
+  3. the <{ocr_carea|ocr_careas}> appear in the same order as the text flow
     relationships between the ground truth careas.
 
-### `ocr_line`
+### <dfn element>ocr_line</dfn>
 
 In typesetting systems, content areas are filled with “blocks”, but most of
 those blocks are not recoverable or semantically meaningful. However, one type
 of block is visible and very important for OCR engines: the line. Lines are
 typesetting blocks that only contain glyphs (“inlines” in XSL terminology).
-They are represented by the `ocr_line` area.
+They are represented by the <{ocr_line}> area.
 
-`ocr_line` should be in a `<span>`
+<{ocr_line}> should be in a `<span>`
 
-### `ocr_separator`
+### <dfn element>ocr_separator</dfn>
 
 Any separator or similar element
 
-### `ocr_noise`
+### <dfn element>ocr_noise</dfn>
 
 Any noise element that isn't part of typesetting
 
@@ -366,7 +366,7 @@ The following properties should be present:
 The bounding box of the page; for pages, the top left corner must be at
 `(0,0)`, so a typical page bounding box will look like `bbox 0 0 2300 3200`
 
-### `image`
+### <dfn property>image</dfn>
 
 `image imagefile`
 
@@ -378,14 +378,14 @@ The bounding box of the page; for pages, the top left corner must be at
   * if the hOCR file is present in a directory hierarchy or file archive, should
     resolve to the corresponding image file
 
-### `imagemd5`
+### <dfn property>imagemd5</dfn>
 
 `imagemd5 checksum`
 
   * MD5 fingerprint of the image file that this page was derived from
   * allows re-associating pages with source images
 
-### `ppageno`
+### <dfn property>ppageno</dfn>
 
 `ppageno n`
 
@@ -395,7 +395,7 @@ The bounding box of the page; for pages, the top left corner must be at
   * must not be present unless the pages in the document have a physical ordering
   * must not be present unless it is well defined and unique
 
-### `lpageno`
+### <dfn property>lpageno</dfn>
 
 `lpageno string`
 
@@ -408,19 +408,19 @@ The bounding box of the page; for pages, the top left corner must be at
 
 The following properties MAY be present:
 
-### `scan_res`
+### <dfn property>scan_res</dfn>
 
 `scan_res x_res y_res`
 
   * scanning resolution in DPI
 
-### `x_scanner`
+### <dfn property>x_scanner</dfn>
 
 `x_scanner string`
 
   * a representation of the scanner
 
-### `x_source`
+### <dfn property>x_source</dfn>
 
 `x_source string`
 
@@ -433,9 +433,9 @@ The following properties MAY be present:
     * `x_source http://pageserver/012345678911&page=17`
 
 In addition to the standard
-properties, the `ocr_line` area supports the following additional properties:
+properties, the <{ocr_line}> area supports the following additional properties:
 
-### `hardbreak`
+### <dfn property>hardbreak</dfn>
 
 `hardbreak n`
 
@@ -444,7 +444,7 @@ properties, the `ocr_line` area supports the following additional properties:
   * a one indicates that the line is a hard (explicit) line break
 
 Any special characters representing the desired end-of-line processing must be
-present inside the `ocr_line` element. Examples of such special characters are a
+present inside the <{ocr_line}> element. Examples of such special characters are a
 soft hyphen ("­", `U+00AD`), a hard line break (`<br>`), or whitespace (` `) for soft
 line breaks.
 
@@ -454,48 +454,48 @@ Floats should not be nested.
 
 The following floats are defined:
 
-### `ocr_float`
+### <dfn element>ocr_float</dfn>
 
 `ocr_float`
 
-### `ocr_separator`
+### <dfn element>ocr_separator</dfn>
 
-`ocr_separator`
+`ocr_separator` in the context of float classes.
 
-### `ocr_textfloat`
+### <dfn element>ocr_textfloat</dfn>
 
 `ocr_textfloat`
 
-### `ocr_textimage`
+### <dfn element>ocr_textimage</dfn>
 
 `ocr_textimage`
 
-### `ocr_image`
+### <dfn element>ocr_image</dfn>
 
 `ocr_image`
 
-### `ocr_linedrawing`
+### <dfn element>ocr_linedrawing</dfn>
 
 Something that could be represented well and naturally in a vector graphics
 format like SVG (even if it is actually represented as PNG)
 
-### `ocr_photo`
+### <dfn element>ocr_photo</dfn>
 
 Something that requires JPEG or PNG to be represented well
 
-### `ocr_header`
+### <dfn element>ocr_header</dfn>
 
 `ocr_header`
 
-### `ocr_footer`
+### <dfn element>ocr_footer</dfn>
 
 `ocr_footer`
 
-### `ocr_pageno`
+### <dfn element>ocr_pageno</dfn>
 
 `ocr_pageno`
 
-### `ocr_table`
+### <dfn element>ocr_table</dfn>
 
 `ocr_table`
 
@@ -505,44 +505,44 @@ There is some content that should behave and flow like text
 
 ## Classes for Inline Representation
 
-### `ocr_glyph`
+### <dfn element>ocr_glyph</dfn>
 
 An individual glyph represented as an image (e.g., an unrecognized character)
 
 Must contain a single `<img>` tag, or be present on one
 
-### `ocr_glyphs`
+### <dfn element>ocr_glyphs</dfn>
 
 Multiple glyphs represented as an image (e.g., an unrecognized word)
 
 Must contain a single `<img>` tag, or be present on one
 
-### `ocr_dropcap`
+### <dfn element>ocr_dropcap</dfn>
 
 An individual glyph representing a dropcap
 
 May contain text or an `<img>` tag; the `alt` of the image tag should contain
 the corresponding text
 
-### `ocr_chem`
+### <dfn element>ocr_chem</dfn>
 
 A chemical formula
 
 Must contain either a single `<img>` tag or [[CML]] markup, or be present on
 one
 
-### `ocr_math`
+### <dfn element>ocr_math</dfn>
 
 A mathematical formula
 
 Must contain either a single `<img>` tag or [[MathML]] markup, or be present on
 one
 
-Mathematical and chemical formulas that float must be put into an `ocr_float`
+Mathematical and chemical formulas that float must be put into an <{ocr_float}>
 section.
 
 Mathematical and chemical formulas that are “display” mode should be put into
-an `ocr_display` section.
+an <{ocr_display}> section.
 
 ### Non-breaking space
 
@@ -557,8 +557,9 @@ Different space widths should be indicated using HTML and `&ensp;`, `&emsp`,
 
 Soft hyphens must be represented using the HTML `&shy;` entity.
 
-The HTML `&lrm;` and `&rlm;` entities (indicating writing direction) must not
-be used; all writing direction changes must be indicated with tags.
+The HTML <a href="https://www.w3.org/TR/REC-html40/struct/dirlang.html#h-8.2.5">`&lrm;` and
+`&rlm;` entities</a> (indicating writing direction) must not be used; all
+writing direction changes must be indicated with tags.
 
 ### Superscript and Subscript
 
@@ -577,20 +578,20 @@ must be represented using their correct Unicode encoding.
 Character-level information may be put on any element that contains only a
 single "line" of text.
 
-### `ocr_cinfo`
+### <dfn element>ocr_cinfo</dfn>
 
-If no other layout element applies, the `ocr_cinfo` element may be used.
+If no other layout element applies, the <{ocr_cinfo}> element may be used.
 
 ## Properties for Character Information
 
-### `cuts`
+### <dfn property>cuts</dfn>
 
 `cuts c1 c2 c3 ...`
 
   * character segmentation cuts (see below)
-  * there must be a bbox property relative to which the cuts can be interpreted
+  * there must be a 'bbox' property relative to which the 'cuts' can be interpreted
 
-### `nlp`
+### <dfn property>nlp</dfn>
 
 `nlp c1 c2 c3 ...`
 
@@ -641,21 +642,21 @@ Common suggested engine-specific markup are:
 
 ## Classes for engine specific markup
 
-### `ocrx_block`
+### <dfn element>ocrx_block</dfn>
 
 Issue: [ocr_carea vs ocrx_block](https://github.com/kba/hocr-spec/issues/28)
 
   * any kind of "block" returned by an OCR system
   * engine-specific because the definition of a "block" depends on the engine
 
-### `ocrx_line`
+### <dfn element>ocrx_line</dfn>
 
 Issue: [ocr_line vs ocrx_line](https://github.com/kba/hocr-spec/issues/19)
 
-  * any kind of "line" returned by an OCR system that differs from the standard ocr_line above
+  * any kind of "line" returned by an OCR system that differs from the standard <{ocr_line}> above
   * might be some kind of "logical" line
 
-### `ocrx_word`
+### <dfn element>ocrx_word</dfn>
 
   * any kind of "word" returned by an OCR system
   * engine specific because the definition of a "word" depends on the engine
@@ -663,42 +664,44 @@ Issue: [ocr_line vs ocrx_line](https://github.com/kba/hocr-spec/issues/19)
 The meaning of these tags is OCR engine specific. However, generators should
 attempt to ensure the following properties:
 
-* an `ocrx_block` should not contain content from multiple ocr_careas
-* the union of all `ocrx_blocks` should approximately cover all `ocr_careas`
-* an `ocrx_block` should contain either a float or body text, but not both
-* an `ocrx_block` should contain either an image or text, but not both
-* an `ocrx_line` should correspond as closely as possible to an `ocr_line`
-* `ocrx_cinfo` should nest inside `ocrx_line`
-* `ocrx_cinfo` should contain only `x_conf`, `x_bboxes`, and `cuts` attributes
+* an <{ocrx_block}> should not contain content from multiple <{ocr_carea|ocr_careas}>
+* the union of all <{ocrx_block|ocrx_blocks}> should approximately cover all <{ocr_carea|ocr_careas}>
+* an <{ocrx_block}> should contain either a float or body text, but not both
+* an <{ocrx_block}> should contain either an image or text, but not both
+* an <{ocrx_line}> should correspond as closely as possible to an <{ocr_line}>
+* <{ocrx_cinfo}> should nest inside <{ocrx_line}>
+* <{ocrx_cinfo}> should contain only 'x_confs', 'x_bboxes', and 'cuts' attributes
+
+Issue: ocrx_cinfo?
 
 ## Properties for engine-specific markup
 
 The following properties are defined:
 
-### `x_font`
+### <dfn property>x_font</dfn>
 
 `x_font s`
 
   * OCR-engine specific font names
 
-### `x_fsize`
+### <dfn property>x_fsize</dfn>
 
 `x_fsize n`
 
   * OCR-engine specific font size
 
-### `x_bboxes`
+### <dfn property>x_bboxes</dfn>
 
 `x_bboxes b1x0 b1y0 b1x1 b1y1 b2x0 b2y0 b2x1 b2y1 ...`
 
   * OCR-engine specific boxes associated with each codepoint contained in the
     element
-  * note that the bbox property is a property for the bounding box of a layout
+  * note that the 'bbox' property is a property for the bounding box of a layout
     element, not of individual characters
   * in particular, use `<span class="ocr_cinfo" title="x_bboxes ....">`, not
     `<span class="ocr_cinfo" title="bbox ...">`
 
-### `x_confs`
+### <dfn property>x_confs</dfn>
 
 `x_confs c1 c2 c3 ...`
 
@@ -708,7 +711,7 @@ The following properties are defined:
   * if possible, convert character confidences to values between 0 and 100 and
     have them approximate posterior probabilities (expressed in %)
 
-### `x_wconf`
+### <dfn property>x_wconf</dfn>
 
 `x_wconf n`
 
@@ -748,7 +751,7 @@ Alternative segmentations and readings are indicated by a `<span>` with
 `class="alternatives"`. It must contains `<ins>` and `<del>` elements. The first
 contained element should be `<ins>` and represent the most probable interpretation,
 the subsequent ones `<del>`. Each `<ins>` and `<del>` element should have `class="alt"` and a
-property of either `nlp` or `x_cost`. These `<span>`, `<ins>`, and `<del>` tags can nest
+property of either 'nlp' or 'x_cost'. These `<span>`, `<ins>`, and `<del>` tags can nest
 arbitrarily.
 
 <div class="example">
@@ -769,7 +772,7 @@ when viewed in a browser.
 
 The different levels of layout information (logical, physical, engine-specific)
 each form hierarchies, but those hierarchies may not be mutually compatible;
-for example, a single `ocr_page` may contain information from multiple sections
+for example, a single <{ocr_page}> may contain information from multiple sections
 or chapters. To represent both hierarchies within a single document, elements
 may be grouped together.  That is, two elements with the same class may be
 treated as one element by adding a "groupid identifier" property to them and
@@ -787,8 +790,8 @@ removing tags that are not of interest for the subsequent processing step, and
 then collapsing grouped elements into single elements.  For example, output
 that contains both logical and physical layout information, where the logical
 layout information uses grouped elements, can be transformed by removing all
-the physical layout information, and then collapsing all split `ocr_chapter`
-elements into single `ocr_chapter` elements based on the groupid.  The result is
+the physical layout information, and then collapsing all split <{ocr_chapter}>
+elements into single <{ocr_chapter}> elements based on the groupid.  The result is
 a simple DOM tree.  This transformation can be provided generically as a
 pre-processor or Javascript.
 
@@ -809,23 +812,23 @@ document.
 The capability to generate specific properties is given by the prefix `ocrp_...`;
 the important properties are:
 
-## `ocrp_lang`
+## <dfn value for="ocr-capabilities">ocrp_lang</dfn>
 
 Capable of generating `lang=` attributes
 
-## `ocrp_dir`
+## <dfn value for="ocr-capabilities">ocrp_dir</dfn>
 
 Capable of generating `dir=` attributes
 
-## `ocrp_poly`
+## <dfn value for="ocr-capabilities">ocrp_poly</dfn>
 
 Capable of generating [polygonal bounds](#poly)
 
-## `ocrp_font`
+## <dfn value for="ocr-capabilities">ocrp_font</dfn>
 
 Capable of generating font information (standard font information)
 
-## `ocrp_nlp`
+## <dfn value for="ocr-capabilities">ocrp_nlp</dfn>
 
 Capable of generating [nlp confidences](#nlp)
 
@@ -851,16 +854,31 @@ corresponding element or attribute must not be present in the document.
 
 The OCR system is required to indicate the following using meta tags in the header:
 
+### <dfn property>ocr-system</dfn>
+
   * `<meta name="ocr-system" content="name version"/>`
+
+### <dfn property>ocr-capabilities</dfn>
+
   * `<meta name="ocr-capabilities" content="capabilities"/>`
     * see [[#capabilities]]
 
+## Recommended Meta Information
+
 The OCR system should indicate the following information
 
+### <dfn property>ocr-number-of-pages</dfn>
+
   * `<meta name="ocr-number-of-pages" content="number-of-pages"/>`
+
+### <dfn property>ocr-langs</dfn>
+
   * `<meta name="ocr-langs" content="languages-considered-by-ocr"/>`
     * use [ISO 639-1](https://www.loc.gov/standards/iso639-2/php/code_list.php) codes
     * value may be `unknown`
+
+### <dfn property>ocr-scripts</dfn>
+
   * `<meta name="ocr-scripts" content="scripts-considered-by-ocr"/>`
     * use [ISO 15924](http://www.unicode.org/iso15924/codelists.html) letter codes
     * value may be `unknown`
@@ -901,17 +919,17 @@ Other possible profiles might be defined for specific engines or specific
 document classes:
 
   * common commercial OCR output (e.g., Abbyy)
-    * ocr_page
-    * ocrx_block, ocrx_line, ocrx_word
-    * ocrp_lang
-    * ocrp_font
+    * <{ocr_page}>
+    * <{ocrx_block}>, <{ocrx_line}>, <{ocrx_word}>
+    * ''ocr-capabilities/ocrp_lang''
+    * ''ocr-capabilities/ocrp_font''
   * book target
-    * all logical structuring elements (as applicable), except ocr_linear
-    * ocr_page
+    * all logical structuring elements (as applicable), except <{ocr_linear}>
+    * <{ocr_page}>
   * newspaper target
     * all logical structuring elements (as applicable)
-    * articles map on ocr_linear
-    * ocr_page
+    * articles map on <{ocr_linear}>
+    * <{ocr_page}>
 
 # HTML Markup
 
@@ -1171,3 +1189,7 @@ Issue: [correct MIME type for hOCR?](https://github.com/kba/hocr-spec/issues/27)
   : Applications which use this media type:
   : File extension(s):
   :: `*.html`, `*.hocr`
+
+
+
+<!-- vim: set textwidth=120: -->