Encoding

From Chswiki

Jump to: navigation, search

Encoding texts for the MTH project

To encode a text:

  • characters in a digital character set encode characters in the source text's writing system
  • markup encodes other information (metadata)

Digital character set

For the Homer Multitext, we encode alphabetic characters (including breathing and accents) and punctuation following the conventions of the TLG's beta-code system. (Here is a summary to TLG beta code.)

Markup

The Homer Multitext project marks the semantic structure of texts in XML following the guidelines of the Text Encoding Initiative. (See the home page.)

Because the TEI Guidelines allow many different solutions to some markup problems, we are developing project standards specifying a specific subset of the TEI possibilities we follow.