Home :: Introduction :: TranscriptionIntroductionThe Purpose of This ProjectThe aim of this project was to learn about making electronic editions of texts generally and XML specifically, as well as to gain more experience working with manuscripts. To that end, I have created an online transcription of the Gospel of John from two manuscripts, designated by New Testament scholars GA 713 and GA 2786. I transcribed John in GA 713 during the Autumn 2009 term for the International Greek New Testament Project (IGNTP) as as part of a course on Greek paleography. GA 713 was chosen for two reasons. First, the IGNTP needed a new transcription of it for their edition of John. Second, it provided an ideal scenario for working with a manuscript. High-quality digital images of it are readily available online, allowing to me gain experience working from a good surrogate. Moreover, the manuscript itself is part of the Mingana Collection at the University of Birmingham, so it was available to me to work from directly, providing valuable experience in working with a manuscript in person. As for the manuscript, it is well preserved and written in a clear, confident hand, making it a good starting point for a beginner, as may be seen in the following image, which shows the last page of John. For the current project, I have transcribed a second manuscript, GA 2786, which was chosen, again, for two reasons. First, like GA 713, the IGNTP needed a transcription of it. Second, unlike GA 713, it presents a number of challenges to the transcriber. Its current whereabouts are unknown, and the only surrogate available is black and white microfilm (or the digital images made from it). Moreover, the text is much more difficult to read, being highly abbreviated and in places damaged and illegible. These factors made for a good contrast to working with GA 713. Below is the last page of John in GA 2786. The Transcription ProcessEach manuscript was initially transcribed according to the guidelines of the International Greek New Testament Project (IGNTP).1 Thus the transcription was done in Unicode using the encoding for the program Collate, used by the IGNTP for manuscript comparison. Below is the text from the above page of GA 713 transcribed using Collate encoding.2 |F 351v| |L|αυτου <V 25> εστι δε και αλλα πολλα α εποι= |L|ησεν ο ις̅ ενωπιον των μαθητων αυτου |L|ατινα εαν γραφηται καθ εν |L|ουδε αυτον οιμαι τον |L|κοσμον χωρη= |L|σαι τα |L|γραφο= |L|με= |L|να βιβλια |L|α= |L|μην |L|<K 22> <V 0> ευαγγελιον κατα ιωαννην The IGNTP transcription focuses on the main text of the manuscript, since its aim is to construct an exhaustive critical apparatus. Alternate readings and corrections are included, but lectionary markings, comments, headings, and other paratextual features are ignored. Moreover, basic textual layout is represented with page breaks and line breaks. The encoding seen here, however, is for use with the Collate software only, and is in no way designed for displaying texts via the web. Thus for the online version, I re-encoded the transcriptions by hand using XML (Extensible Markup Language) according to the guidelines of the Text Encoding Initiative (TEI), a consortium which develops "a general-purpose encoding scheme which makes it possible to encode different views of text, possibly intended for different applications, serving the majority of scholarly purposes of text studies in the humanities."3 As this quote makes clear, the TEI guidelines are highly flexible, and so the transcriber must make choices about what to record based on the transcription's intended purpose. For example, someone interested in grammar might morphologically tag each word. Someone interested in Greek paleography and the rate at which Greek majuscule letters were reintroduced alongside minuscule script might tag each letter as being majuscule or minuscule. The possibilities are almost limitless. Such exercises, worthy though they may be, are beyond the time limit for a term project when two transcriptions of John are involved. I decided therefore to record only features stipulated in the IGNTP guidelines. Below is the same page from GA 713 as above, but encoded using XML. <pb n="351v" /> <lb n="1" />αυτου </ab><ab xml:id="B4-K21-V25"> εστι δε και αλλα πολλα α εποι= <lb n="2" />ησεν ο ις̅ ενωπιον των μαθητων αυτου <lb n="3" />ατινα εαν γραφηται καθ εν <lb n="4" />ουδε αυτον οιμαι τον <lb n="5" />κοσμον χωρη= <lb n="6" />σαι τα <lb n="7" />γραφο= <lb n="8" />με= <lb n="9" />να βιβλια <lb n="10" />α= <lb n="11" />μην </ab></div> <lb n="12" /><div type="chapter" xml:id="B4-K22"><ab xml:id="B4-K22-V0"> ευαγγελιον κατα ιωαννην</ab></div> Currently, XML is not directly viewable in most internet browsers, but must be transformed into HTML (HyperText Markup Language). I had two main options for this transformation, namely, SDPublisher and XSLT. SDPublisher has the advantage of seeing "XML as a stream as well as a hierarchy," thus making the program "highly suited to processing documents characterized by multiple overlapping hierarchies,"4 such as New Testament manuscripts, which are divided by pages and lines on the one hand, and by books, chapters, and verses on the other. However, SDPublisher is primarily designed to handle large amounts of data and numerous versions of a text; since the current project involves only two manuscripts, I chose XSLT. XSL (Extensible Stylesheet Language) is a language used for transforming XML into HTML (or other other things, such as another form of XML) using a stylesheet. "The term stylesheet reflects the fact that one of the important roles of XSLT is to add styling information to an XML source document, by transforming it into a document consisting of XSL formatted objects . . . or into another presentation-oriented format such as HTML, XHTML, or SVG."6 Thus the stylesheet specifies what each XML element in the source document should be transformed into. For example, line breaks are indicated in my XML source documents with the tag <lb n="x" /> (x being replaced with consecutive line numbers for each page). In HTML, line breaks are indicated by the tag </br>. Thus in the XSL stylesheet one finds the following entry. <xsl:template match="lb"> </br> </xsl:template> This specifies that every XML tag <lb n="x" /> should be transformed into the HTML tag </br>. Thus every XML element is transformed into HTML and made viewable in an internet browser. Encoding the text and making the XSL stylesheet required diligence, caution, and patience. At varies times, typos made visible a tag which should be invisible, changed the color of all the text from black to red, and made all the text disappear. Reading the TranscriptionsIn compliance with the IGNTP transcription guidelines, the following conventions are used.
For this project, I have also used the following methods for displaying certain information about the text.
The ManuscriptsGA 2786 GA 713 Notes1 IGNTP-INTF, "Guidelines for the Transcription of Manuscripts Using Unicode," (2010). The guidelines may be downloaded from the IGNTP website at http://www.igntp.org/. For a recent discussion of transcribing and collating New Testament manuscripts see D. C. Parker, An Introduction to the New Testament Manuscripts and Their Texts (Cambridge: Cambridge University Press, 2008), 95-107. 2 For more on Collate, see http://www.itsee.bham.ac.uk/software/collate/. 3 Lou Burnard and Syd Bauman, eds., TEI P5: Guidelines for Electronic Text Encoding and Interchange (Oxford, Providence, Charlotte, Nancy: The TEI Consortium, 2009), xxvi. In addition to the print edition, the guidelines may be read online or downloaded at http://www.tei-c.org/Guidelines/P5/. 4 See the website of Scholarly Digital Editions (the developers of SDPublisher) at http://www.sd-editions.com/index.html. The documentation for SDPublisher may be viewed or downloaded at http://www.sd-editions.com/SDPublisher/. 5 For a thorough treatment of Greek abbreviations, see Al. N. Oikonomides, ed., Abbreviations in Greek (Chicago: Ares, 1974). 6 Michael Kay, ed., "XSL Transformations (XSLT) Version 2.0," (World Wide Web Consortium, 2007), online: http://www.w3.org/TR/xslt20/. 7 Martin Faßnacht and Ulrich Schmid, "NT.VMR Handschriftenliste," version 1.0 (INTF, 2009), online: http://intf. uni-muenster.de/vmr/NTVMR/ListeHandschriften.php; Lucy-Anne Hunt, The Mingana and Related Collections: a Survey of Illustrated Arabic, Greek, Eastern Christian, Persian and Turkish Manuscripts in the Selly Oak Colleges, Birmingham (Birmingham: The Mingana Collection of the Edward Cadbury Charitable Trust), 42-51. 8 Faßnacht and Schmid, "NT.VMR Handschriftenliste." Home :: Introduction :: Transcription |