Methodology and practice
This document explains how the online texts in HTML for use with a plain Web browser are generated from the TEI master copies.
Representation of the markup
HTML does not possess sufficient capability to represent adequately all the various nuances of markup which are encoded in the original copies. It is therefore necessary to make some typographical judgments about how best to retain as much of the original value as possible, without overloading the display.
To see the original TEI file, with all the markup preserved, you can use a real SGML browser like Panorama or MultiDoc Pro for which stylesheets and resource files have been developed. In the meantime, the following representations have been adopted:
The ISO 8859-1 (Latin-1) character set is used by default, but this handles only one length mark, the fada or acute, and only over vowels. When XML software becomes available, UniCode (ISO 10646) will be used.
Editorial length marks given as macrons in the source texts are shown in the HTML copies online as acute accents on vowels, but with the character underlined. Macrons on non-vowels are shown by underlining only. A rare suspension dí (dī) is spelled out in full as didiu.
The séimhiú (dot-over) accent is rendered with a following `h' and underlined. The dot-under accent used to indicate elision is rendered with underlining alone.
The notum for uel, graphically identical with stroked-l is rendered l^, L^.
E, e with an ogonek which represents tall e (e-caudata) are rendered as simple E, e, but underlined.
The Insular Ampersand is rendered &.
If you have any suggestions about how the information could be better presented in HTML, please let us know.
Corpus of Electronic Texts (UCC)