We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Using standoff properties for marking-up historical documents in the humanities.
- Authors
Schmidt, Desmond Allan
- Abstract
Markup in the form of tags is often embedded into documents to describe formatting structures and other features, as in HTML on the Web. But in the humanities, the use of embedded markup for the transcription of historical documents leads to problems in the representation of overlapping features, and subjective variation in the use of different markup tags for the same features compromises interoperability of the transcriptions. "Standoff" techniques, in which the markup and the text it describes are stored separately, can help alleviate these problems. "Standoff properties" is a technique for recording textual properties that do not conform to a context-free grammar, and can freely overlap. This allows a divide-and-conquer approach to markup, whereby sets of markup properties can record different aspects of a text, which can then be recombined as needed. Despite these advantages, standoff techniques are usually considered impractical when both the underlying text and its markup are subject to change. To circumvent this problem, this paper describes a practical algorithm for updating a set of standoffmarkup properties separately from the text.
- Subjects
HTML (Document markup language); TEXT editors (Computer programs); COMPUTER network resources
- Publication
IT: Information Technology, 2016, Vol 58, Issue 2, p63
- ISSN
1611-2776
- Publication type
Article
- DOI
10.1515/itit-2015-0030