<?xml version="1.0" encoding="utf-8"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title level="a">Bytes, Words, Texts: The Anglo-Norman Dictionary and its
               Text-Base</title>
            <author>
               <name>David Trotter</name>
               <address>
                  <addrLine>Aberystwyth University</addrLine>
                  <addrLine><ref target="mailto:dtt@aber.ac.uk">dtt@aber.ac.uk
                  </ref></addrLine>
               </address>
            </author>
            <editor role="acceptingeditor">
               <name>Christine McWebb</name>
               <address>
            <addrLine>University of Waterloo</addrLine>
          </address>
            </editor>
            <editor role="recommendingreader">
               <name>Helen Swift</name>
               <address>
            <addrLine>St Hilda's, Oxford</addrLine>
          </address>
            </editor>
         </titleStmt>
         <publicationStmt>
            <publisher>Digital Medievalist, University of Lethbridge</publisher>
            <pubPlace>Lethbridge AB, Canada T1K 3M4 </pubPlace>
            <availability>
               <p>© David Trotter, 2011. Creative Commons Attribution-NonCommercial licence</p>
            </availability>
            <date n="received" when="2011-11-03">November 3, 2011</date>
            <date n="revised" when="2011-11-27">November 27, 2011</date>
            <date n="published" when="2012-02-07">February 7, 2012</date>
         </publicationStmt>
         <seriesStmt>
            <title>Digital Medievalist</title>
            <idno type="issue">7</idno>
            <idno type="date">2011</idno>
         </seriesStmt>
         <sourceDesc>
            <p>Born digital</p>
         </sourceDesc>
      </fileDesc>
      <encodingDesc>
         <projectDesc>
            <p>Article from Digital Medievalist Journal (URL: <ref
                  target="http://www.digitalmedievalist.org/"/>)</p>
         </projectDesc>
         <refsDecl>
            <p>Citations from the text of this article should be by paragraph number (found on the
               ID attribute of the p element).</p>
         </refsDecl>
      </encodingDesc>
      <profileDesc>
         <creation/>
         <langUsage>
            <language ident="en-GB">en-GB</language>
         </langUsage>
         <textClass>
            <keywords scheme="DM">
               <term type="DMType">Article</term>
               <term type="keyword">Dictionaries</term>
               <term type="keyword">Anglo-Norman</term>
               <term type="keyword">Text-bases</term>
               <term type="keyword">Encoding</term>
               <term type="keyword">eXtensible Markup Language (XML)</term>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <change who="http://www.digitalmedievalist.org/about/#cf"><date when="2011-11-27"/>cf
            initial encoding</change>
         <change who="raw" when="2011-12-12">checking and finishing encoding</change>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <argument n="abstract">
            <p>This paper summarizes how the Anglo-Norman Dictionary project has developed from an
               originally paper-based and print dictionary, to a fully digital, freely available
               online resource, comprising not only the Dictionary itself, but a substantial
               text-base in the same XML format as the Dictionary. The paper reviews the history of
               the Dictionary; the principles of structured encoding; the relationship between the
               citations (attestations) in the Dictionary, as instances of
               <foreign>parole</foreign>, and the broader Anglo-Norman language (the langue); and
               the detailed encoding of entries. </p>
         </argument>
      </front>
      <body>
         <div>
            <p xml:id="p0001">The purpose of this paper is to outline what might be called the
                  <soCalled>constructional principles</soCalled> of the present, online Anglo-Norman
               Dictionary, or AND (<ref target="http://www.anglo-norman.net/"
                  >www.anglo-norman.net</ref>).<note place="end" n="1"><p>Further material (some now
                     out of date) on the subject of this paper may be found on the AND site, and in
                        <ref target="#Beddow2007">Beddow 2007</ref>; <ref target="#Trotter2000"
                        >Trotter 2000</ref>, <ref target="#Trotter2007">2007</ref>; <ref
                        target="#TrotterRothwell2007">Trotter and Rothwell 2007</ref>.</p></note>
               Essentially, as the title perhaps suggests, this has to do with the relationship
               between (i) the texts on which the dictionary draws (whether these have been
               digitized or not), (ii) the citations which it draws from them, and (iii) the
               dictionary entries. As will be apparent, the underlying methodology is no different
               from what it was before the computerization of the project, and more generally,
               before the advent of computing in the humanities from the 1980s onwards. I appreciate
               that Humanities computing started earlier than the date suggested here, but for the
               purposes of this project (and for the vast majority of researchers in Humanities in
               the UK at least), the 1980s saw the emergence of affordable personal computers
               capable of significantly changing working methods. What is different, however, is how
               the lexicographical process is implemented in a digital dictionary, and (to some
               extent) what the implications of this process might be. </p>
         </div>
         <div>
            <head>Production of concordances</head>
            <p xml:id="p0002">At the heart of any dictionary lies the assembling of the core data on
               which it is based. In the pre-electronic age, there were three major components of
               the process. The first was the production of concordances, exemplified by Alexander
               Cruden's manually-compiled concordance to the King James Bible in the eighteenth
               century. As William Youngman's biographical memoir of Cruden, appended to the printed
               version, indicates (<ref target="#Youngman1891">Youngman 1891</ref>, xiv), Cruden
               appears to have spent a good deal of his life in <q>happy and harmless lunacy</q>,
               probably in part as a result of this enormous and painstaking undertaking. Around
               1320, the English Franciscan Nicole Bozon describes the technique of the friars as
               follows: <quote>
                  <p><foreign>Pur ceo vodereie qe chescun feseit com fierent jadis les freres qe
                        compilerent [les] concordaunces. Chescun prist gard a la lettre qe a lui
                        fist mandee. Cil qe aveit <emph>A</emph> ne avoit qe fere de <emph>B</emph>,
                        e cil qi out gard de <emph>B</emph>, rien se entirmettout de <emph>C</emph>;
                        et si qe chescun lettre del abicee a divers estoit liveree, et chescun se
                        prist a sa lettre, e nul de vousist de autri fet se entremetter
                        ...</foreign> (<ref target="#MeyerSmith1889">Meyer and Smith 1889</ref>,
                     160)</p>
               </quote></p>
            <p xml:id="p0003">This, in fact, looks ominously like the methodology often deployed in
               the construction of dictionaries.</p>
         </div>
         <div>
            <head>Gleaning of quotations</head>
            <p xml:id="p0004">The second component of any dictionary project is typically the
               gleaning of quotations from texts. Traditionally, this entailed reading, and copying
               out onto slips a quotation, perhaps with a translation, a bibliographical reference,
               a date, and so forth. The <title level="m">Anglo-Norman Dictionary</title>
               (henceforth: <title level="m">AND</title>) still has a substantial number of these
               slips, deriving from the early days of the dictionary in the 1940s, or, in the case
               of the one million or so slips of the J.P. Collas collection, bequeathed to Professor
               William Rothwell.<note place="end" n="2"><p>See <ref target="#Appendix1">Appendix
                        1</ref>, in which the principal dates of the history of the AND are
                     provided.</p></note> They are surprisingly durable and largely legible, as
               these original (1940s) examples show: <figure>
                  <figDesc>Facsimiles of original paper dictionary slips from the 1940s.</figDesc>
                  <graphic url="support/Figure1.png"/>
               </figure>
               <figure>
                  <figDesc>Facsimiles of original paper dictionary slips from the 1940s.</figDesc>
                  <graphic url="support/Figure2.png"/>
               </figure></p>
            <p xml:id="p0005">Later examples from the Collas collection continue also to furnish
               invaluable material: <figure>
                  <figDesc>Facsimiles of original paper dictionary slips from the Collas
                     collection.</figDesc>
                  <graphic url="support/Figure3.png"/>
               </figure>
               <figure>
                  <figDesc>Facsimiles of original paper dictionary slips from the Collas
                     collection.</figDesc>
                  <graphic url="support/Figure4.png"/>
               </figure></p>
            <p xml:id="p0006">Finally, the first edition of the AND was compiled via manually-typed
               and subsequently annotated on 210mm × 125mm slips, in which Professor William
               Rothwell's hand is (to the connoisseur) instantly identifiable: <figure>
                  <figDesc>Typed paper slips with manuscript annotations by William
                     Rothwell.</figDesc>
                  <graphic url="support/Figure5.jpg"/>
               </figure></p>
         </div>
         <div>
            <head>Use of glossaries</head>
            <p xml:id="p0007">The third component in the compilation of the majority of dictionaries
               is the use of glossaries provided by the editors to critical texts. For a variety of
               reasons, not least a well-founded scepticism about the reliability of glossaries, the
               AND has not usually made much direct use of glossaries, and certainly does not rely
               on them. One conspicuous problem, familiar to anyone who regularly uses critical
               editions of medieval texts, is that glossaries typically omit precisely those words
               which the lexicographer finds interesting, or difficult. Whether this is because
               these words are of no interest to the editor of the text, or because their meaning is
               deemed self-evident and only problematic for the dim-witted lexicographer, must
               remain a matter for speculation. Less charitable interpretations to account for such
               sins of omission may well come to mind.</p>
            <p xml:id="p0008">At the heart of a dictionary (any serious dictionary) thus lies the
               assembling, and subsequent ordering (according to semantic or historical criteria) of
               a number of citations drawn from as wide as possible a range of relevant texts.
               Whilst dictionaries of modern languages can reasonably aspire to exhaustiveness, in
               particular by the use of vast electronic corpora, this is probably never achievable
               for medieval languages. In the first instance, it is very difficult to imagine
               successfully creating a properly <emph>representative</emph> corpus; and clearly, it
               is also in practice unlikely that we will ever digitize the entirety of the surviving
               documents in even as relatively circumscribed a language as Anglo-Norman. It would be
               more correct to describe even the most comprehensive dictionaries of medieval
               languages as making use of textual data banks or what I have called here
                  <emph>text-bases</emph>, rather than corpora <foreign>stricto sensu</foreign>.</p>
            <p xml:id="p0009">The problem in this respect is not solely one for the lexicographer.
               It is fundamental to our understanding of any past state of a language, of necessity
               (if it is older than approximately a century) mediated exclusively through written
               texts. These, however wide-ranging in register and type, and however much they may
               appear to approximate to a reproduction of speech, are of course never an accurate
               record of the living language in its spoken form. Moreover, as the following diagram
               tries to show, the relationship between texts and the <soCalled>Anglo-Norman
                  language</soCalled> is analogous to Saussure's dichotomy between
                  <foreign>langue</foreign> and <foreign>parole</foreign>. <figure>
                  <figDesc>The relationship between texts and <soCalled>Anglo-Norman</soCalled> is
                     analogous to that between <foreign xml:lang="fra">langue</foreign> and <foreign
                        xml:lang="fra">parole</foreign>.</figDesc>
                  <graphic url="support/Figure6.png"/>
               </figure></p>
            <p xml:id="p0010">In this, the text (the product of an individual author) is the
               Saussurean <foreign>parole</foreign>, a manifestation and an emanation of the
               irrecoverable <foreign>langue</foreign>, the phonetic, grammatical and lexical system
               of (in this case) Anglo-Norman. It is irrecoverable not only because we have no
               access to the spoken form, but also because (in the absence of native-speaker
               consciousness) we have no real way of fully grasping the complexities and the rules
               of the <foreign>langue</foreign>. Both denotational meaning and particular
               connotations may well escape us. The lexicographer of medieval languages is thus left
               with a quantity of examples of <foreign>parole</foreign> from which it may (or may
               not) be possible to reconstruct the underlying <foreign>langue</foreign> which is
               ostensibly the object of his survey.</p>
            <p xml:id="p0011">From the text, the lexicographer takes his citations. Those citations,
               in turn, are fed into the dictionary, constituted into articles, conventionally but
               not invariably organized alphabetically(although a rare but useful alternative is an
               onomasiological ordering by concepts), and, depending somewhat on the thoroughness
               and comprehensiveness of the dictionary, the dictionary itself becomes, at least as
               far as the lexis of the language is concerned, an attempt, however imperfect, to
               encompass the language as a whole (<foreign>langue</foreign>). </p>
            <p xml:id="p0012">Thus, a properly-conceived dictionary ought to be a means of access to
               the vocabulary of (ideally) an entire language, structured in a way which makes
               access for the reader as easy as possible. Digitization does not change this,
               although, as we shall see, the digitization process itself can impose order and
               consistency on major publications which are otherwise all too open to variability
               because of human factors, especially over the protracted time-scales within which the
               typical dictionary is put together. Dictionaries, in fact, lend themselves
               particularly well to digitization, not least because they are structurally very
               predictable. By this, I mean that they are very conventional in the way they set out
               information. It is not, for example, necessary to know any Spanish at all to make
               sense of the entry below, derived from a major etymological dictionary of Castilian,
               J. Corominas's <title level="m">Diccionario crítico etimológico de la lengua
                  castellana</title> (<ref target="#Corominas1954">Corominas 1954</ref>): <figure>
                  <figDesc>Sample entry from J. Corominas's <title level="m">Diccionario crítico
                        etimológico de la lengua castellana</title>.</figDesc>
                  <graphic url="support/Figure7.jpg"/>
               </figure></p>
            <p xml:id="p0013">The typographical and organizational conventions adopted mean that
               even a reader with no knowledge whatever of Spanish will immediately work out that
               the first word (in capitals) is the headword, that what follows (in inverted commas)
               is the explanation of it, that the information that follows is etymological, and that
               the last paragraph supplies additional information about derivatives of one sort or
               another. Likewise, no knowledge of Sanskrit is needed in order to understand the gist
               of the entry <foreign>dushta</foreign> from the late-nineteenth-century <title
                  level="m">Sanskrit-English Dictionary</title> by Sir Monier Monier-Williams (<ref
                  target="#Monier-Williams1899">Monier-Williams 1899</ref>): <figure>
                  <figDesc>Sample entry from Sir Monier Monier-Williams, <title level="m"
                        >Sanskrit-English Dictionary</title>.</figDesc>
                  <graphic url="support/Figure8.jpg"/>
               </figure></p>
            <p xml:id="p0014">In other words, experience of handling dictionaries mean that the
               organizational structure is intuitively understood and (with the help, in particular,
               of typographical features) decoded. The process is not unlike that of reading a map,
               where experience leads us to know which way up to hold it, that the top of the map is
               (other things being equal) likely to be north, and so forth. The <title level="m"
                  >AND</title> is no exception to this general rule. The sample entry below
                  (<foreign>janglure</foreign>) illustrates the point. The initial words in bold are
               the headword, and the variant spellings attested. This is followed by an abbreviation
               meaning that the word is a substantive (s.) and a gloss, in English, and in italics.
               A series of quotations with bibliographical references (explained in the associated
               list of texts) follows, and the article concludes with cross references to related or
               otherwise relevant words. Component parts of the dictionary entry are identifiable by
               position and by typography. Therefore, structure and meaning go hand in hand. <quote><p>
                     <hi rend="bold">janglure, janglur</hi> (<hi rend="bold">janlur</hi>
                     <hi>TLL</hi> ii 59) <lb/>s. <hi>babbling, (foolish) chatter</hi>: buccum:
                     gabur, gangeler, janglur <hi>TLL</hi> ii 42; Malveise gent [...] Mult estes ore
                     de mal escole Ke onur [...] ne feites A la reine [...] Ne lessates vostre
                        janglure <hi>Mir N-D</hi> 179.130 → <hi rend="bold">jangle</hi>, <hi
                        rend="bold">jangleis</hi>, <hi rend="bold">janglement</hi>, <hi rend="bold"
                        >jangler</hi>, <hi rend="bold">janglerie</hi>.</p></quote></p>
            <p xml:id="p0015">This, then, is an arrangement of material which, by virtue of its
               structural regularity, lends itself very readily to digitization in an orderly and
               structured way. Component parts of the entry are easily identified and easily fitted
               into a schema, and marked up accordingly. Even in the case of the <title level="m"
                  >AND</title>, substantial parts of which were compiled in Microsoft Word and
               subsequently converted to XML (not the best solution by any means), the process was
               reasonably straightforward. Inevitably, in a work compiled over a period in excess of
               forty years, inconsistencies of abbreviation, textual reference, and so forth
               emerged, but not to such an extent that computerization of the original Word files
               posed insurmountable problems. Since 2003, however, the composition of <title
                  level="m">AND</title> articles has been done directly in XML, using a piece of
               editing software called EpcEdit (<ref target="http://www.epcedit.com/"
                  >www.epcedit.com</ref>). From this it is possible to see the structure of the same
               entry <foreign>janglure</foreign> (see below). <figure>
                  <figDesc>Structure of AND entry for <foreign>janglure</foreign> in the
                     dictionary's <soCalled>editorial XML</soCalled>.</figDesc>
                  <graphic url="support/Figure9.jpg"/>
               </figure></p>
            <p xml:id="p0016">This is the <emph>editorial XML</emph>, that is, the simplified form
               in which articles are produced. The dictionary's DTD (<q>document type
               definition</q>, which prescribes the categories, and options within them, for markup
               and encoding, and imposes the overall structure of the document and of its component
               parts) imposes a template on the entry, incorporates markup for the different
               structural elements and makes explicit those components of the dictionary which in
               print form would be identifiable by typography or position or both. The DTD enforces
               a uniform ordering of component parts of the article, and limits editorial choices in
               a number of key areas (bibliographical abbreviations, usage labels, parts of speech)
               to those which are pre-supplied in a drop-down menu. Variation resulting from human
               inconsistency is thus largely eliminated and the disambiguating discipline of having
               to abide by the constraints of the DTD is directly beneficial to enforcing regularity
               of presentation. Articles in this form are subsequently transformed on the project's
               server into <emph>canonical XML</emph> for storage and to form the version of the
               data from which articles are generated, on request, by users. In fact, the canonical
               XML is not (and could not be) dramatically different from the editorial version, but
               it does add certain details and in particular, system-internal identifiers: <figure>
                  <figDesc>Entry in the AND's <soCalled>Canonical XML</soCalled>.</figDesc>
                  <graphic url="support/Figure10.png"/>
               </figure></p>
            <p xml:id="p0017">The online articles which the user of the AND sees on his or her
               computer screen are produced directly from the XML of the canonical data, rendered
               on-screen in HTML. The exact date and time that appears on the screen is the date of
               production of the article, on that occasion, and on that screen. </p>
            <p xml:id="p0018">Looking in more detail at the XML encoding of a component part of the
               entry <foreign>janglure</foreign>, the extent to which the XML mark up identifies and
               classifies elements which the reader of a print dictionary would simply absorb more
               or less unconsciously, is apparent. For the following citation: <quote><p>buccum:
                     gabur, gangeler, janglur <emph>TLL</emph> ii 42</p></quote></p>
            <p>The XML looks like this:</p>
            <figure>
               <figDesc>Structural components of an AND entry.</figDesc>
               <graphic url="support/Figure11.png"/>
            </figure>
            <p xml:id="p0019">Each structural element (here coloured, though not in the original) is
               marked up: within the overall &lt;cit&gt; or citation is &lt;quote&gt;, the quotation
               from the source, the language of the (Latin) word <emph>buccum</emph> being glossed
               is indicated by a value "LA" (Latin) of the segment by &lt;seg lang="LA"&gt;, the
               bibliographical siglum (TLL) is accompanied by a volume and page-reference which is
               separately encoded &lt;loc&gt;, and so on. The citation, the lowest-level component
               of the text-citation-dictionary triangle, is thus encoded in detail so that all
               elements will be identifiable and recoverable by XML-based search and query
               programs.</p>
            <p xml:id="p0020">In what follows, I will consider the way in which citations and texts
               are linked, and how both are exploited in dictionary production. There are three
               principal constituent elements in the Anglo-Norman On-Line Hub, within which the
                  <title level="m">AND</title> is one. The other two components are, firstly, a text
               base, consisting of digitized Anglo-Norman texts, and a couple of editions produced
               by W. Rothwell; the third element, and the one which underlies the other two, is the
               canonical XML data for the dictionary and text base alike. <figure>
                  <figDesc>The AND system.</figDesc>
                  <graphic url="support/Figure12.png"/>
               </figure></p>
            <p xml:id="p0021">Digitization of the dictionary and of the texts and of the academic
               articles which are also housed on the hub follow the same DTD, and thus the
               constituent elements are entirely interchangeable. From the admittedly somewhat
               narrow perspective of a lexicographer, a text is just a series of (potential)
               citations, to be mined and exploited for the dictionary. Equally, the dictionary
               itself can be regarded as a series of articles, senses of words, and then, at the
               most basic level of the article, a series of citations. In other words, the entire
               operation can be seen (both in the dictionary and in the texts) as a concatenation of
               citations. In an ideal world, a given text containing (for example) six citations
               will feed into a series of articles. A citation from a given text is simply taken and
               transformed into a citation within a dictionary article. This, of course, is nothing
               new: it is how dictionaries have always been produced. Digitization, however,
               concentrates the mind on the underlying process, because the technology has to
               reflect what is happening. Identical XML encoding, following the same DTD, is the key
               to the process whereby the citation in a text <emph>becomes</emph> the citation in an
               article. Needless to say, this process only really applies in as direct a form as
               that to those texts which are digitized, and on the Anglo-Norman on line hub site,
               since for all other citations (and texts), they have no digitized existence outside
               the dictionary article into which they are inserted. For citations which do exist in
               digitized online texts, though, the transposition into a dictionary-article citation
               is (apparently) straightforward. A given citation in a text becomes a citation in an
               article:</p>
            <figure>
               <figDesc>A citation from a text becomes a citation in a dictionary article.</figDesc>
               <graphic url="support/Figure13.png"/>
            </figure>
            <p xml:id="p0022">Let us take a concrete example of how this works. The following
               excerpt, from Walter of Bibbesworth's treatise, contains, amongst other things, two
               words of possible interest, in verses 317 and 319: <quote>
                  <l> Veez ci veint devaunt vous </l>
                  <l> Un chivaler bieau tut rou </l>
                  <l n="312"> Qui une destrere sor se est munté            <hi>reed</hi>
                  </l>
                  <l> Esku de goules ad porté            <hi>reed</hi>
                  </l>
                  <l> Un launce rouge en l'uyn mein, </l>
                  <l> De vin vermaille l'autre plein, </l>
                  <l n="316"> Qi ne manjuwe point de peschoun </l>
                  <l> S[i] de le <hi rend="bold"><hi rend="underlined">haranc</hi></hi> sor
                        noun            <hi>reed</hi>
                  </l>
                  <l> Je vie une reyne sanz rey            <hi>quene</hi>
                  </l>
                  <l> Pur une reyne fere <hi rend="bold"><hi rend="underlined"
                        >desray</hi></hi>            <hi>frock</hi>
                  </l>
                  <p>(<ref target="#Rothwell2009">Rothwell 2009</ref>, vv. 310-319)</p>
               </quote></p>

            <p xml:id="p0023">Both of these, as it happens, are found in the relevant dictionary
               entries (<foreign>harang</foreign> and <foreign>desrei</foreign>). From the
               lexicographer's point of view, in other words, they constitute separable and
               exploitable citations. This is, admittedly, a peculiarly mechanistic interpretation
               of what a text consists of, but it is, nonetheless, key to understanding the
               relationship between the different component parts of a dictionary/text-base set-up.
               Naturally, a given citation can be used more than once (and often is) thus, for
               example, in the Bibbesworth excerpt just quoted, line 317 contains also an instance
               of the word <foreign>sor,</foreign> of <foreign>noun</foreign>, and so forth. </p>
            <p xml:id="p0024">One of the most powerful facilities which the <title level="m"
                  >AND</title> offers is the capacity to search the entirety of the citations in the
               dictionary, which are thus, from that point of view, simply one enormous electronic
               text. For the reasons given above, this does not mean that the dictionary is a corpus
               in the technical sense in which the word would be used by corpus linguists; but it
               does provide to the user, and indeed for purposes completely unrelated to the
               dictionary, a valuable extra resource. Occasionally, of course, this facility exposes
               shortcomings in the dictionary itself when, for example, the search of citations
               revealed words for which there is no corresponding dictionary article. So, for
               example, under <foreign>feu</foreign><hi rend="sup">2</hi>, we find the word
                  <foreign>examen</foreign>, which (as a rapid perusal of the alphabetical list of
               AND articles will reveal), is absent as an entry in the dictionary. The message
               provided by a search of the citations in the dictionary is unambiguous in identifying
               the gap: <quote>
                  <p>
                     <hi>Your search for </hi>
                     <hi rend="bold">examen</hi>
                     <hi> did not match any headwords or variant forms</hi>.<lb/>The form was,
                     however, found in 1 citation:<lb/>transcrire et copier, ou jugement, par bon
                     collacion et <hi rend="bold">examen</hi>, le testament de feu Thomas de Uvedale
                        <hi>Foedera</hi> iii 846 [sub <hi rend="bold">feu2</hi>]</p>
               </quote></p>
            <p xml:id="p0025">In the light of this, the online <title level="m">AND</title> has now
               of course quickly added the missing entry <foreign>examen</foreign>. The relationship
               between citations and articles operates also in the opposite direction, that is, in
               allowing the user to go from a citation in the dictionary to its context. This, of
               course, only functions in the case of those texts which have been digitized as part
               of the project. Thus, for example, within the entry <foreign>coillage</foreign>,
                  <q>levy, tax</q>, the last quotation (from the <title level="m">Black Book of the
                  Admiralty</title>) can be followed into the text itself: <quote><p><hi rend="bold"
                        >coillage, cueillage; quilage</hi>
                     <lb/>s. <hi>levy, tax</hi>: soloient paier la greindre partie de les dismes de
                     la dite ville, leins et autres coillages […] <hi>Rot Parl</hi>
                     <hi rend="sup">1</hi> ii 213; une custume qe l'em apele hildenrath (<hi>l.</hi>
                     hildevrath), c'est assaver, quilage des aveynes <hi>King's Bench</hi> v.cxxxv;
                        ♦ <hi>keelage</hi>: que coillage (var. cueillage) ne soit payé par la coste
                     d'Angleterre, mais ancorage <hi>B</hi>
                     <hi>lk Bk</hi> 74 → <hi rend="bold">kylage</hi>.</p></quote></p>
            <p xml:id="p0026">This, clearly, is something which is not exclusive to a digitized
               dictionary that functions in this way, since the user has always been able to turn to
               a text which has been excerpted from the dictionary and revisit a citation in the
               broader context – provided, of course, that the text is available and at hand.
               Digitization makes this instantaneously possible anywhere in the world, and without
               needing recourse to a library. In this case, the wider context makes more explicit
               the specific sub-sense of <foreign>keelage</foreign> which, or so the dictionary
               argues (and see, by the way of comparison, <ref target="#Trotter2003">Trotter
                  2003</ref>), the word bears in this particular citation: <quote><p>Item, ordonné
                     estoit illecques q'une manere de coustume seroit pris par tout le royalme
                     d'Angleterre en eaue, et les admiralx estoient de ce fermement chargez qu'ilz
                     ou leurs lieutenants deux foiz ou troiz foiz en l'an enquerront de ce
                     fermement, ainsi que nul allene marchant ne privé ne soit endommagé par cause
                     des coustumes, <hi rend="bold">et que coillage ne soit paié par la coste
                        d'Angleterre mais ancorage</hi> …(<ref target="#Twiss1871">Twiss 1871</ref>,
                     74)</p></quote></p>
            <p xml:id="p0027">The relationship between texts, the dictionary, and the citations
               which are the constituent elements of both, is not unique to the <title level="m"
                  >AND</title>, and is also implemented in (for example) the <title level="m"
                  >Dictionnaire du Moyen Français</title>. It is likely that in the future, with the
               growing availability of significant quantities of digitized texts available online
               (for example, <title level="m">Base de Français Médiéval</title>, <title level="m"
                  >Nouveau Corpus d'Amsterdam</title>), the practice will be extended and indeed
               generalized, with the proviso that, for this to be possible, strict adherence to
               mutually comprehensible encoding frameworks will be essential. At present, despite
               (or perhaps because of) systems such as TEI, there is wide variation in how materials
               are encoded, and many projects display a somewhat disturbing instability in terms of
               the target which they offer and to which links could be created. </p>
            <p xml:id="p0028">The question of electronic links to other cognate projects bring me to
               the final part of this paper. No dictionary exists in isolation, and all dictionaries
               are part of an international and multilingual network of lexicographical resources
               which collectively attempt to record and explain the vocabulary of the languages of
               the world. The <title level="m">AND</title> is no exception. However, it has a
               slightly unusual additional function, in that it stands between English and French,
               as the record of the variety of French introduced into medieval England after the
               Norman conquest, and which subsequently dramatically relexified the English language.
               Just as Anglo-Norman was the vector for the transmission of Romance vocabulary into
               (Germanic) Anglo-Saxon, so the <title level="m">AND</title> is the link between
               French and English lexicography. It has, of course, connections also to other Romance
               dictionaries, and to dictionaries of related Germanic varieties (Dutch, German, the
               Scandinavian languages) as well as to medieval Latin. This intellectual network has
               always existed, and digitization does not transform the essential relationship
               between the <title level="m">AND</title> and other dictionaries. But where
               digitization does substantially change the picture is in its potential for enabling
               direct connections to be made from one online dictionary to another. This, like the
               proper exploitation of online databases to which I have already alluded, is dependent
               on the existence of properly encoded and, above all, stable targets for links going
               from one dictionary to another, and for the moment this is not altogether
               unproblematic. Nevertheless, as a dictionary like the <title level="m">DMF</title>
               already shows, the potential is there and, in the not too distant future, it is
               possible to envisage a situation where all the relevant related dictionaries are
               interlinked in such a way that it will become possible to rapidly review the entirety
               of lexicographical evidence, irrespective of language. That would be a significant
               step forwards, and would allow us to reassemble in its full multilingual complexity
               the lexical landscape of medieval Europe. Digitization is thus, ironically for so
                  <q>modern</q> a process, a way to return to medieval reality, via the <q>Digital
                  Middle Ages</q> of the title of the MARGOT conference.</p>
         </div>
      </body>
      <back>
         <div type="appendix" xml:id="Appendix1">
            <head>Appendix 1: Evolution and structure of the Anglo-Norman Dictionary (AND)</head>
            <list type="simple">
               <item><label>c. 1947</label>: <soCalled>glossary committee</soCalled>, founded in
                  Oxford, under aegis of ANTS<list>
                     <item>General editor: †Louise W. Stone; subsequently, William Rothwell; also
                        involved: †T.B.W. Reid (to fascicle 3), †Dafydd Evans, Stewart Gregory, D.A.
                        Trotter, †P.R. Staniforth (DMLBS)</item>
                     <item>Initial combing and creation of traditional slips (ANTS), cf. Publisher's
                        Foreword to AND1 (fasc. 7) </item>
                  </list></item>
               <item><label>1962</label>: W. Rothwell became involved (and subsequently general
                  editor) </item>
               <item><label>1977 – 1992</label>: publication of AND1 in seven fascicles (MHRA/ANTS) <list>
                     <item>from "P" onwards: incorporation of J.P. Collas fichier (? c. 1m slips)
                        and Shanks Dictionary of Law French (CUL; unpublished, 800 pp. typescript,
                        Selden Society) </item>
                  </list></item>
               <item><label>1989</label>: start of work on second edition (Stewart Gregory, D.A.
                  Trotter) </item>
               <item><label>1995</label>: grants from British Academy and Humanities Research Board
                  to support research assistant (Lisa Jefferson) </item>
               <item><label>2002 – 2004</label>: AHRB Resource Enhancement Grant (£109,000) to fund
                  digitization of AND2, A–E (complete), and putting online of scholarly articles
                  (republication), and source texts ("The Anglo-Norman Online Hub", <ptr
                     target="http://www.anglo-norman.net/"/>), based in Aberystwyth but work carried
                  out in Swansea under direction of Andrew Rothwell (technical consultant: Michael
                  Beddow) </item>
               <item><label>2003 – 2007</label>: AHRB Major Research Grant (£426,000), to fund
                  revision of F–H; Virginie Derrien (Poitiers) and Geert De Wilde Belgium/Leeds)
                  appointed as assistant editors (Aberystwyth) Oct. 2003; AHRB-funded Ph.D.
                  studentship available but unfilled (no U.K. candidates) </item>
               <item><label>2004 – 2007</label>: AHRB Resource Enhancement Grant (£309,000) to fund
                  continuation of digitization programme in Swansea </item>
               <item><label>2007-2012</label>: AHRC Major Research Grant (£870,000) to fund I-M;
                  Heather Pagan (Toronto) replaced Virginie Derrien </item>
            </list>
         </div>
         <div type="appendix" xml:id="Appendix2">
            <head>Appendix 2: Current materials </head>
            <list type="simple">
               <item>AND1</item>
               <item>combings for AND2 by D.A. Trotter and W. Rothwell, 1989 –</item>
               <item>continuing programme of combing new and especially newly-discovered
                  texts</item>
               <item>Collas; Shanks</item>
               <item>concordanced texts ("Concordance"): see <ptr
                     target="http://www.concordancesoftware.co.uk"/></item>
               <item>online texts on <ptr target="http://www.anglo-norman.net/"/></item>
               <item>AND2, A–E online (c. 1m words; 10,600 entries) marked up in TEI-compliant XML
                  and stored as canonical data, rendered on demand in HTML by server; all Open
                  Source software, no proprietary software</item>
               <item>new entries (F –) entered direct in XML via "EpcEdit"software (proprietary XML
                  editing software), <ptr target="http://www.epcedit.com/"/></item>
            </list>
         </div>
         <div>
            <listBibl>
               <bibl xml:id="LéquipeBFM2011">L'équipe BFM 2011. <title level="m">Base de français
                     médiéval</title>.<ptr target="http://bfm.ens-lyon.fr/"/>. Accessed January 25,
                  2012.</bibl>
               <bibl xml:id="Beddow2007"><author>Beddow, Michael</author>. 2007. <title level="a"
                     >L'Anglo-Norman on-line hub: une présentation technique</title>. In <title
                     level="m">Actes du XXIV <hi rend="sup">e</hi> Congrès International de
                     Linguistique et de Philologie Romanes</title>, Aberystwyth. Ed. David Trotter.
                  1: 305-310. Tübingen: Niemeyer.</bibl>
               <bibl xml:id="Corominas1954">Corominas, Juan. 1954. <title level="m">Diccionario
                     crítico etimológico de la lengua castellana</title> Berne: Francke.</bibl>
               <bibl xml:id="KunstmannStein2012">Kunstmann, Pierre and Achim Stein 2012. <title
                     level="m">Nouveau Corpus d'Amsterdam</title>. <ptr
                     target="http://www.uni-stuttgart.de/lingrom/stein/corpus/"/>. Accessed January
                  25, 2012.</bibl>
               <bibl xml:id="MeyerSmith1889">Meyer, Paul and Lucy Toulmin Smith, eds. 1889. <title
                     level="m">Nicole Bozon, Contes moralisés</title>. Paris: SATF.</bibl>
               <bibl xml:id="Monier-Williams1899"><author>Monier-Williams, Sir Monier.</author>
                  1899. <title level="m">Sanskrit-English dictionary</title>. Oxford: Oxford
                  University Press.</bibl>
               <bibl xml:id="Rothwell2009"><author>Rothwell, William.</author> 2009. <title
                     level="m">Walter de Bibbesworth: le Tretiz</title>, Aberystwyth: Anglo-Norman
                  Online Hub. <ptr target="http://www.anglo-norman.net/"/>. Accessed January 25,
                  2012.</bibl>
               <bibl xml:id="Trotter2000"><author>Trotter, David.</author> 2000. <title level="a"
                     >L'avenir de la lexicographie anglo-normande: vers une refonte de l'<title
                        level="m">Anglo-Norman Dictionary</title>?</title>. <title level="j">Revue
                     de Linguistique romane</title> 64:391-407.</bibl>
               <bibl xml:id="Trotter2003"><author>Trotter, David.</author> 2003. <title level="a"
                     >Langues en contact en Gascogne médiévale</title>. In <title level="m">Actas
                     del XXIII Congreso Internacional de Lingüística  y Filología Románica,
                     Salamanca, septiembre 2001</title>. Ed. F. Sánchez Miret. 3, 479-486. Tübingen:
                  Niemeyer.</bibl>
               <bibl xml:id="Trotter2007"><author>Trotter, David. 2007</author>. <title level="a"
                     >Habeas corpus ad testificandum: l'<title level="m">Anglo-Norman
                        Dictionary</title> et son corpus</title>. In <title level="m">Le nouveau
                     corpus d'Amsterdam. Actes de l'atelier de Lauterbad, 23-26 février
                  2006</title>. Ed. Pierre Kunstmann and Achim Stein. 153-157. Stuttgart:
                  Steiner.</bibl>
               <bibl xml:id="TrotterRothwell2007"><author>Trotter, David and Andrew
                     Rothwell</author>. 2007. <title level="a">Présentation de
                     l'<emph>AND</emph></title>. In <title level="m">Actes du XXIV <hi rend="sup"
                        >e</hi> congrès international de linguistique et de philologie
                     romanes</title>, Aberystwyth. Ed. David Trotter. 2: 413-421. Tübingen:
                  Niemeyer.</bibl>
               <bibl xml:id="Twiss1871"><author>Twiss, T.</author> 1871. <title level="m">Black book
                     of the Admiralty</title>. <title level="s">Rolls Series.</title> 1. London:
                  Longman et al.</bibl>
               <bibl xml:id="Youngman1891"><author>Youngman, William.</author> 1891. <title
                     level="a">Sketch of the life and character of Alexander Cruden</title>. In
                     <title level="m">A complete concordance to the Old and New Testament</title>.
                  Ed. Alexander Cruden. xii-xv. London and New York: Warne.</bibl>
            </listBibl>
         </div>
      </back>
   </text>
</TEI>
