<TEI xmlns="http://www.tei-c.org/ns/1.0">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title level="a">The ARQUIBANC Project: Location, Recovery, Arrangement, and
               Dissemination of Catalan Private Archives and Documents</title>
            <author>
               <name>Elena Cantarell</name>
               <address>
              <addrLine>University of Barcelona</addrLine>
            <addrLine>
              <ref target="mailto:ecantarell@ub.edu">ecantarell@ub.edu</ref>
            </addrLine>
          </address>
            </author>
            <author>
               <name>Mireia Comas</name>
               <address>
              <addrLine>University of Barcelona</addrLine>
            <addrLine>
              <ref target="mailto:mireia_comas@ub.edu">mireia_comas@ub.edu</ref>
            </addrLine>
          </address>
            </author>
            <editor role="acceptingeditor">
               <name>Christine McWebb</name>
               <address>
            <addrLine>University of Waterloo</addrLine>
          </address>
            </editor>
            <editor role="recommendingreader">
               <name>Richard Cunningham</name>
               <address>
            <addrLine>Acadia University</addrLine>
          </address>
            </editor>
            <respStmt>
               <resp>TEI-encoding by</resp>
               <name>Takako Kato</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <publisher>Digital Medievalist, University of Lethbridge</publisher>
            <pubPlace>Lethbridge AB, Canada T1K 3M4 </pubPlace>
            <availability>
               <p>© Elena Cantarell and Mireia Comas, 2012. Creative Commons
                  Attribution-NonCommercial licence</p>
            </availability>
            <date n="received" when="2011-09-11">September 11, 2011</date>
            <date n="revised" when="2011-11-14">November 14, 2011</date>
            <date n="published" when="2012-02-07">February 7, 2012</date>
         </publicationStmt>
         <seriesStmt>
            <title>Digital Medievalist</title>
            <idno type="issue">7</idno>
            <idno type="date">2011</idno>
         </seriesStmt>
         <sourceDesc>
            <p>Original Composition</p>
         </sourceDesc>
      </fileDesc>
      <encodingDesc>
         <projectDesc>
            <p>Article from Digital Medievalist Journal (URL: <ref
                  target="http://www.digitalmedievalist.org"/>)</p>
         </projectDesc>
         <refsDecl>
            <p>Citations from the text of this article should be by paragraph number (found on the
               ID attribute of the p element).</p>
         </refsDecl>
      </encodingDesc>
      <profileDesc>
         <creation/>
         <langUsage>
            <language ident="en-GB">en-GB</language>
            <language ident="lat">lat</language>
         </langUsage>
         <textClass>
            <keywords scheme="DM">
               <term type="DMType">Article</term>
               <term type="keyword">Databases</term>
               <term type="keyword">Catalan private archives</term>
               <term type="keyword">Digitization of parchments</term>
               <term type="keyword">Arquibanc</term>
            </keywords>
         </textClass>
      </profileDesc>
   </teiHeader>
   <text>
      <front>
         <argument n="abstract">
            <p>This paper presents the first steps of the Arquibanc Project, whose main objective
               is to recover archives held in private hands and make them available to the
               scientific community. Access to these private documents is not always easy and, in
               some cases, the preservation of the records cannot be assured. Our project
               therefore focuses on creating two online databases that allow the documents to be
               consulted and used for research purposes. The databases are designed to provide a
               description of each document, linked to a digital reproduction that is easy to
               read on screen. </p>
         </argument>
      </front>
      <body>
         <div>
            <head xml:id="section1">Introduction</head>
            <p xml:id="cantarell.p0001">This article is the result of the work carried out as part
               of two research projects, <title level="m">Written Memory in the Catalan Private
                  Domain: the Recovery of Archives and Documents</title> (Project reference:
               Ministerio de Ciencia e Innovación, HAR2008-01748, <title level="m">La memoria
                  escrita en el ámbito privado catalana: recuperación y estudio de archivos y
                  documentos</title>, PI: Daniel Piñol. Researchers: Ignasi J. Baiges; Elena
               Cantarell; Mireia Comas; Carme Muntaner), and <title level="m">The Recovery of
                  Catalan Private Archives</title> (Project reference: Universitat de Barcelona,
               PGIR/08-09 <title level="m">Recuperació d'arxius privats catalans</title>, PI: Daniel
               Piñol), under the leadership of Dr. Daniel Piñol Alabart. These projects, launched in
               2008, focus on the unusually large body of privately-owned historical documentation
               in Catalonia discovered by [contra]TAEDIUM, the University of Barcelona’s research
               team in Medieval History.</p>
         </div>
         <div>
            <head xml:id="section2">Context</head>
            <p xml:id="cantarell.p0002">The study and dissemination of such a large volume of
               privately-owned documentation in Catalonia is obviously of interest to historians. In
               fact, for the historical period from the Middle Ages to Modernity the documentation
               that remains <soCalled>hidden</soCalled>, that is, owned by private individuals and
               largely beyond the reach of researchers, is as plentiful as that preserved in public
               archives. In our view, drawing attention to these materials is essential to the
               development of historical research in our country, since we cannot attempt to write
               the history of medieval and modern Catalonia on the basis of only a part of the
               documentation preserved. </p>
            <p xml:id="cantarell.p0003">Catalonia possesses more medieval documents than
               any other nation or institution in Europe, with the exception of the Vatican Archive
                  (<ref target="#Alturo1998">Alturo 1998</ref>). Accessing the
                  <soCalled>hidden</soCalled> or inaccessible documentation represents a
               considerable challenge. Our project aims to take up this challenge by providing
               researchers with access to part of the documentation currently preserved in private
               hands in Catalonia.</p>
            <p>
               <figure xml:id="figure0001">
                  <graphic n="1001" url="support/Figure1.jpg"/>
                  <figDesc>963/04/23 Arxiu Fontcuberta. Doc. S1601001</figDesc>
               </figure>
            </p>
         </div>

         <div>
            <head xml:id="section3">Objectives</head>
            <p xml:id="cantarell.p0004"> The main goals of the <title level="m">Arquibanc</title>
               project are to locate, recover, arrange and disseminate the archives and collections
               of documents belonging to private owners in Catalonia. <title level="m"
                  >Arquibanc</title> is the Catalan word for a chest in which documents were kept in
               the Middle Ages. The abundance of these documents and the long historical period that
               they cover fully justify our project and its contribution to the study of the
               country’s history. However, a large number of these materials are privately owned and
               remain unpublished. Our aim is to find ways of providing researchers with access to
               these documents. A few of the heritage collections are already deposited in Catalan
               public archives and are attracting renewed interest owing to their importance as
               historical resources. </p>
            <div>
               <head xml:id="section3.1">Location</head>
               <p xml:id="cantarell.p0005"> We first contacted the owners of two large private
                  archive collections in order to introduce our project and gauge their possible
                  reactions to it. The response was positive and led to other contacts. Our classes
                  at the university provided us with another possible source of interesting
                  documents; in the discussions of the project, some of our students told us of the
                  existence of other small individual archives belonging to wealthy farmers, who
                  kept these documents as a guarantee of their rights. We also heard from owners of
                  historical documents who were keen to have these materials published. As a result,
                  we can divide our materials into two kinds: well-established, well-organized
                  archives, and single documents. At present the <title level="m">Arquibanc</title>
                  project is working with 33 archives at various stages of development.</p>
            </div>
            <div>
               <head xml:id="section3.2">Recovery</head>
               <p xml:id="cantarell.p0006">In some cases the archives are well preserved and well
                  arranged, and the recovery process involves digitization and little else.
                  Documents in poor condition, by contrast, require a great deal of work to
                  recover; in those cases we also advise the owners on better methods of
                  preservation. </p>
            </div>
            <div>
               <head xml:id="section3.3">Arrangement</head>
               <p xml:id="cantarell.p0007">Some archives were already very well organized. The
                  owners of the Fontcuberta archive, for example, an important heritage archive with
                  seventy linear meters of documentation from the tenth to the twenty-first
                  centuries covering the counties of Osona, Alt i Baix Empordà and Vallès
                  Occidental, had already added indices, master books, and so on. In other cases
                  we have organized the collections following the ISAD(G) international
                  standard (<ref
                     target="#ConseilInternationaldesArchivesInternationalCouncilonArchives2000"
                     >Conseil International des Archives/International Council on Archives
                     2000</ref>) as applied to heritage archives (<ref
                     target="#Fernandez1991">Fernandez 1991</ref>; <ref
                     target="#GifreMatasSoler2002">Gifre, Matas, and Soler 2002</ref>). </p>
            </div>
            <div>
               <head xml:id="section3.4">Dissemination</head>
               <p xml:id="cantarell.p0008">In addition to the distribution of documents in printed
                  format, online databases constitute the main tool at our disposal. Online
                  databases can provide access to materials preserved in small or privately-owned
                  archives which are hidden or difficult to track down. We currently have two
                  databases: <title level="m">Scripta</title>, which deals with the vast Fontcuberta
                  archive, and comprises three sets of documents, and <title level="m"
                     >Memoria</title>, the <title level="m">Arquibanc</title> database, which
                  includes documents from a variety of sources.</p>
               <p xml:id="cantarell.p0009">The databases contain public, semiprivate and private
                  registers and fields for the different kinds of materials or collections. They are
                  hosted on a University of Barcelona server, where they are properly maintained,
                  and can be accessed through the <title level="m">Arquibanc</title> research
                  project website: <ptr target="http://www.ub.edu/arquibanc/home.html"/>.</p>
            </div>
         </div>
         <div>
            <head xml:id="section4">Methodology</head>
            <div>
               <head xml:id="section4.1">First objective</head>
               <p xml:id="cantarell.p0010">To fulfill our first objective – the location and
                  evaluation of private documentary sources – we first contacted the owners of
                  archives that were already arranged, in order to draw up cooperation agreements
                  covering the study, classification, description and cataloguing of the
                  materials. As a result of these first contacts, other owners asked us to consider
                  including their documents in our project. Some of our university students also
                  alerted us to the existence of new archives covering a surprisingly long
                  chronological period, from the Middle Ages to the twentieth century. Because of
                  the huge volume of the material available, we were obliged to make a selection.
                  Among the materials we chose was the Fontcuberta archive. We also decided to
                  compile and study materials in which we considered the risk of deterioration or
                  disappearance to be high. </p>
               <p xml:id="cantarell.p0011">One of the most important tasks in this first stage was
                  to assess the volume of the materials and the difficulties they presented, so
                  that we could plan the work teams and the time needed for their study.
               </p>
            </div>
            <div>
               <head xml:id="section4.2">Second objective</head>
               <p xml:id="cantarell.p0012">To fulfill the second objective – the recovery of
                  documents at risk – we digitized all the materials to prepare them for study and
                  also to preserve the ones at clear risk of deterioration or loss. In many cases,
                  we advised the owners of fragile and poorly preserved documents to entrust them to
                  the National Archive of Catalonia, if they themselves were unable to supervise the
                  restoration needed for adequate preservation.</p>

               <p>
                  <figure xml:id="figure0002">
                     <graphic n="1002" url="support/Figure2.jpg"/>
                     <figDesc>Some of the document collections are in very poor condition.</figDesc>
                  </figure>
               </p>
            </div>
            <div>
               <head xml:id="section4.3">Organization</head>
               <p xml:id="cantarell.p0013">With regard to organization, there is a clear
                  distinction to be made between two sets of materials. The first is the Fontcuberta
                  archive, which is extremely well organized and described according to the owners’
                  own criteria of use, namely the oversight and management of their patrimony. In
                  this case, no intervention on our part was required. The second is the body of
                  materials that had not previously been classified or described. Their inclusion
                  in the database only partially remedies this situation, but it is a great help in
                  classifying the documents in compliance with the ISAD(G) (General International
                  Standard Archival Description) standard. Once the process is concluded, the
                  documents can be accessed using the database’s search engine or through the
                  documentation classification chart. </p>
            </div>
            <div>
               <head xml:id="section4.4">Dissemination</head>
               <p xml:id="cantarell.p0014">Our priorities are the needs and desires of each owner.
                  Owners do not usually object to the processes of digitization, systemization and
                  description, but on the question of providing free access to the documentation,
                  their opinions vary widely: some stipulate that the publication should be
                  exclusively in printed form, while others opt to restrict access in different
                  ways. We must not forget that we are talking about private archives situated in
                  private homes. To deal with this situation, the owners and the University of
                  Barcelona sign agreements stating that access will be via the Internet, and will
                  be supervised by the database’s administrators. Our main dissemination tool is the
                  project’s website (<ptr target="http://www.ub.edu/arquibanc/home.html"/>), where
                  the databases containing the materials can be accessed. However, we also plan to
                  publish much of the material in printed form, and in fact two projects of this
                  kind are currently underway (<ref target="#CantarellComasMuntaner2011">Cantarell,
                     Comas, and Muntaner 2011</ref>; <ref
                     target="#BaigesCantarellComasPiñolSolerInpreparation">Baiges, Cantarell, Comas,
                     Piñol, and Soler</ref>).</p>

               <p xml:id="cantarell.p0015">This is where the value of the database is particularly
                  evident. As we noted in the introduction, a large amount of historical material
                  remains unpublished. We feel that any instrument that enables the scientific
                  community to consult these materials is valuable, since it may lead us to
                  reformulate or qualify our previously held ideas. But our aim was not to create an
                  instrument to replace the critical editions of documentary corpora, which we
                  consider to be essential. Indeed, in many cases, so much of the documentation is
                  unpublished that these editions cannot be carried out because of the technical
                  difficulty and economic cost involved. The challenge, then, was to design a tool
                  that was in a way comparable with traditional editions and could at the same time
                  complement them and help to provide new insights. </p>

               <p xml:id="cantarell.p0016">Our starting point, then, was the belief that this online
                  database should not be limited to a collection of document images, but should
                  contain the standard elements of diplomatic editions of documentary corpora. At
                  the same time, it should be able to generate indices to aid consultation. Finally,
                  another key feature of the database is its ability to support teamwork. This
                  matters particularly for large-scale research projects, since projects of this
                  size require large teams of researchers.</p>
            </div>
         </div>

         <div>
            <head xml:id="section5">The <title level="m">Scripta</title> and <title level="m"
                  >Memoria</title> databases: characteristics and use</head>
            <p xml:id="cantarell.p0017">Let us now analyze the main features of the <title level="m"
                  >Scripta</title> and <title level="m">Memoria</title> databases to establish to
               what degree the initial expectations have been met and to identify the areas which
               have not yet been resolved. As a trial, we started with the <title level="m"
                  >Cubellis</title> database, which catalogues and publishes documentation from the
               municipal archive of Cubells, a small town in inland Catalonia. The relatively small
               volume of materials made this database ideal for proof-of-concept testing. Many of
               the shortcomings that we were able to detect in this trial stage were corrected, and
               the database was adapted to the needs of the Arquibanc Project. The result was a tool
               that facilitated teamwork and allowed several different levels of collaboration.</p>
            <div>
               <head xml:id="section5.1">Design</head>
               <p xml:id="cantarell.p0018">Each document is given a register comprising all the
                  fields necessary for identification, description and classification. These fields
                  can be easily created and defined by the database editors without the need for
                  specific training. Therefore, without the aid of computer technicians, researchers
                  can adapt the structure of the database to meet the particular characteristics and
                  objectives of each project.</p>
               <p xml:id="cantarell.p0019">A special field in each register contains the image(s) of
                  the digitized document in a readable, downloadable format. If there are multiple
                  images, they can be consulted sequentially and as thumbnails. In fact, the size of
                  the images was the first problem we encountered in the project. Especially in the
                  case of the parchments, the size and the state of preservation of many of the
                  documents generated a digital image that was too large to be included in the
                  database. It has been very difficult, and in many cases impossible, to obtain
                  high-quality, high-resolution images of an acceptable size (no more than 1
                  MB).</p>
               <p>
                  <figure xml:id="figure0003">
                     <graphic n="1003" url="support/Figure3.png"/>
                     <figDesc>Screenshot of the database interface showing the image
                        thumbnail.</figDesc>
                  </figure>
               </p>
               <p xml:id="cantarell.p0020">Initially, this caused certain problems for browsing, but
                  these have been largely overcome thanks to the improvements in web technology in
                  recent years; however, we still have the problem of server space. The small number
                  of large-format documents in the <title level="m">Memoria</title> database does
                  not present difficulties. Larger projects are another matter: <title level="m"
                     >Scripta</title>, which contains the documentation of the Fontcuberta archive,
                  includes hundreds of large-format documents, and the size of their images both
                  increases the storage space required and prevents the tool from functioning
                  properly. In these cases, we supply researchers with a high-resolution digital
                  copy on request and, if they wish to publish a facsimile of the document, we can
                  provide an image that meets the printer’s requirements. </p>
            </div>
            <div>
               <head xml:id="section5.2">Indexed fields, data searches and the generation of
                  indices</head>
               <p xml:id="cantarell.p0021">The database editors can create all the indexed fields
                  they need for the objectives of each project. In the case we are describing, the
                  public fields are signature (a unique, required field), date and place, and
                  document type. The private fields, that is, fields accessible only to the
                  database editors and administrators, are collaborator, image control, and state
                  of revision. Private fields are useful for internal control of the state of each
                  register, because they provide answers to important questions: who is the author
                  of the description; whether it has been revised and, if so, by whom; who is in
                  charge of the digitization, compression and publication of the image; which
                  registers contain images and which do not; and what the level of access of each
                  document is. They also provide access data such as the date of creation or
                  modification, number of visits, and so on. </p>
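               <p>By way of illustration, the distinction between public and private fields can be
                  sketched in Python as follows. This is a minimal sketch under stated
                  assumptions, not the implementation used by the databases: the class
                  <code>Register</code>, the functions <code>add_register</code> and
                  <code>visible_fields</code>, the in-memory store and the sample values are all
                  hypothetical; only the field names are taken from the description above.</p>
               <eg xml:space="preserve"><![CDATA[
```python
from dataclasses import dataclass

# Field names follow the article; everything else here is a hypothetical sketch.
PUBLIC_FIELDS = ("signature", "date", "place", "document_type")
PRIVATE_FIELDS = ("collaborator", "image_control", "revision_state")


@dataclass
class Register:
    """One register (record) describing a single document."""
    signature: str            # unique and required
    date: str = ""
    place: str = ""
    document_type: str = ""
    collaborator: str = ""    # private: who wrote the description
    image_control: str = ""   # private: status of the digitized image
    revision_state: str = ""  # private: whether the description was revised


def add_register(store, register):
    """Add a register to the store, enforcing a required, unique signature."""
    if not register.signature:
        raise ValueError("the signature field is required")
    if register.signature in store:
        raise ValueError("the signature must be unique: " + register.signature)
    store[register.signature] = register


def visible_fields(register, is_staff):
    """Return public fields for everyone, plus private fields for staff only."""
    names = PUBLIC_FIELDS + (PRIVATE_FIELDS if is_staff else ())
    return {name: getattr(register, name) for name in names}
```
]]></eg>
               <p>In this sketch an ordinary reader calling <code>visible_fields(register,
                  is_staff=False)</code> receives only the four public fields, while editors and
                  administrators also see the control fields.</p>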
               <p xml:id="cantarell.p0022">All these fields can be explored using the browse option
                  and can generate the corresponding indices. All the fields, with the obvious
                  exception of the image, can also be explored with the <term>search</term> option.
                  As we noted above, the search engine of the databases identifies every element
                  located in any of the fields and produces a variety of lists according to the
                  search results obtained. This means that each and every one of the words
                  contained in the abstract of the document, whichever language was used (Catalan,
                  Spanish, English, etc.), can be found by the search engine. Each document
                  therefore needs an abstract worded as broadly as possible, so that users can
                  refine their searches as far as possible. Searches can be made for proper names
                  (name and surname, or name and rank) using the <soCalled>search for adjacent
                     words</soCalled> option, for political or administrative posts, and so on. The
                  search engine also allows the use of wildcard characters (? or *). </p>
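               <p>The behaviour of such a search can be sketched in Python. The sketch assumes the
                  conventional meaning of the two characters (? for exactly one character, * for
                  any run of characters within a word) and treats adjacent-word search as matching
                  consecutive words of the abstract; the article does not document how the search
                  engine actually implements either feature, and the function names and the sample
                  abstract are invented for illustration.</p>
               <eg xml:space="preserve"><![CDATA[
```python
import re


def wildcard_to_regex(pattern):
    """Translate a search term into a regex: ? is one character, * is any run."""
    parts = []
    for ch in pattern:
        if ch == "?":
            parts.append(r"\w")
        elif ch == "*":
            parts.append(r"\w*")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.IGNORECASE)


def tokenize(text):
    """Split an abstract into words (accented letters count as word characters)."""
    return re.findall(r"\w+", text)


def matches_word(abstract, pattern):
    """True if any single word of the abstract matches the wildcard pattern."""
    rx = wildcard_to_regex(pattern)
    return any(rx.fullmatch(word) for word in tokenize(abstract))


def matches_adjacent(abstract, patterns):
    """True if the patterns match consecutive words of the abstract, in order."""
    regexes = [wildcard_to_regex(p) for p in patterns]
    if not regexes:
        return False
    words = tokenize(abstract)
    n = len(regexes)
    for start in range(len(words) - n + 1):
        window = words[start:start + n]
        if all(rx.fullmatch(word) for rx, word in zip(regexes, window)):
            return True
    return False
```
]]></eg>
               <p>With an invented abstract such as <q>Sale of a vineyard by Guillem de Fontcuberta
                  to the monastery</q>, the term <code>vine*</code> matches the word
                  <q>vineyard</q>, and the adjacent-word query <code>["guillem", "de",
                  "font*"]</code> matches the proper name.</p>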
            </div>
            <div>
               <head xml:id="section5.3">Data entry and teamwork</head>
               <div>
                  <head xml:id="section5.3.1">Designed for teamwork</head>
                  <p xml:id="cantarell.p0023">The tool allows participation at several levels:
                        <soCalled>Masters</soCalled> are able to modify the internal structure of
                     the database and to grant permission to lower levels;
                        <soCalled>Administrators</soCalled> are authorized to manage fields,
                     deciding on their inclusion or exclusion, and to prepare the instructions for
                     collaborators, etc.; <soCalled>Editors</soCalled> are collaborating
                     researchers, who can create new registers and edit the document as they see fit
                     (this point will be discussed in more detail below); and
                        <soCalled>Collaborators</soCalled> can carry out brief collaborations in the
                     management of images, etc. This structure helps to build up teams with
                     different levels of involvement in the project and makes the tool flexible
                     enough to deal with the editing of different kinds of documentary resources.
                     We have also been able to involve students as collaborators in the initial
                     identification tasks – mainly Master’s students, but in some cases
                     undergraduates as well. As we said above, this tool is easy to use, with a
                     user-friendly interface for data entry. The instructions for collaborators
                     appear in red, and help to establish common criteria for all editors. We have
                     also prepared a style sheet to ensure uniformity of data entry.</p>
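                  <p>The four levels of participation can be modelled as an ordered set of roles.
                     The Python sketch below assumes that the levels are strictly nested, so that
                     each level inherits everything the levels below it may do; the article does
                     not state this explicitly. The action names and the function
                     <code>can</code> are hypothetical labels for the tasks listed above.</p>
                  <eg xml:space="preserve"><![CDATA[
```python
from enum import IntEnum


class Role(IntEnum):
    """Participation levels, ordered from least to most privileged."""
    COLLABORATOR = 1
    EDITOR = 2
    ADMINISTRATOR = 3
    MASTER = 4


# Lowest role allowed to perform each action (action names are hypothetical).
REQUIRED_ROLE = {
    "manage_images": Role.COLLABORATOR,
    "create_register": Role.EDITOR,
    "edit_document": Role.EDITOR,
    "manage_fields": Role.ADMINISTRATOR,
    "prepare_instructions": Role.ADMINISTRATOR,
    "modify_structure": Role.MASTER,
    "grant_permission": Role.MASTER,
}


def can(role, action):
    """True if the role is at or above the level the action requires."""
    return role >= REQUIRED_ROLE[action]
```
]]></eg>
                  <p>Under this assumption a student Collaborator may manage images but not create
                     registers, whereas an Editor may do both.</p>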

                  <p>
                     <figure xml:id="figure0004">
                        <graphic n="1004" url="support/Figure4.png"/>
                        <figDesc>Example of an edition record.</figDesc>
                     </figure>
                  </p>

               </div>
               <div>
                  <head xml:id="section5.3.2">Made-to-measure levels of edition</head>
                  <p xml:id="cantarell.p0024">The ease with which fields can be added means that
                     users can produce anything ranging from a simple inventory of the contents of a
                     document to a complete critical edition, in particular because the empty fields
                     are not shown. In addition to entering the date, document type, abstract and
                     image, users can also add an annotated transcription, ex-libris information,
                     and bibliographical references. </p>
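                  <p>The effect of hiding empty fields can be sketched in a few lines of Python.
                     The function name <code>fields_to_display</code>, the field names and the
                     sample values are assumptions made for illustration; the sketch only shows
                     how the same register structure can serve both a bare inventory entry and a
                     fuller edition.</p>
                  <eg xml:space="preserve"><![CDATA[
```python
def fields_to_display(register):
    """Return only the fields that have content, so empty ones are not shown."""
    return {
        name: value
        for name, value in register.items()
        if value not in ("", None, [])
    }


# An inventory-level register: only identification data has been entered.
inventory_entry = {
    "signature": "S1601001",
    "date": "963/04/23",
    "document_type": "",
    "abstract": "",
    "transcription": None,
    "bibliography": [],
}

print(fields_to_display(inventory_entry))
```
]]></eg>
                  <p>As editors later fill in the abstract, transcription or bibliography, those
                     fields appear in the displayed register without any change to its
                     structure.</p>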
               </div>
               <div>
                  <head xml:id="section5.3.3">User identification</head>
                  <p xml:id="cantarell.p0025">As we noted above, this database is housed on a
                     University of Barcelona server, which means that we can monitor users by means
                     of a proxy. This is particularly important when we make the large step from
                        <title level="m">Cubellis</title>, a small public archive without any access
                     restrictions, to <title level="m">Memoria</title> and <title level="m"
                        >Scripta</title>, which contain the privately-owned heritage archives on
                     which we are currently working. Indeed, as we explained above, the owners have
                     their own needs for and opinions on the dissemination of their archives.
                     Researchers wishing to study documentary resources must complete the
                     registration form and will receive an access code with the corresponding
                     authorizations.</p>
               </div>
            </div>
         </div>
         <div>
            <head xml:id="section6">Conclusion</head>
            <p xml:id="cantarell.p0026">The <title level="m">Arquibanc</title> project was set up to
               promote the study of the abundant and extremely important documentation owned by
               private individuals in Catalonia. Our main objective at present is not just to
               publish the documents in the Fontcuberta archive and to preserve the documents
               classified as at risk; we are also determined to disseminate the project as far as
               possible. We hope to involve new researchers who are interested either in the history
               of our country or in the digital edition of historical documentation. The amount of
               work involved has turned out to be far greater than we had originally anticipated,
               and we would welcome support from the Digital Humanities community as we move forward
               with the project. The design of the database is ideally suited to fluid teamwork on a
               large scale. We are confident that the intrinsic interest of the materials will
               attract researchers from different areas of knowledge and thus help us to build up an
               interdisciplinary team that will establish itself as a leader in the field. </p>
         </div>
      </body>
      <back>
         <div>
            <listBibl>
               <bibl xml:id="Alturo1998">Alturo Perucho, Jesús. 1998. <title level="a"
                     >Consideracions sobre l'estudi del patrimoni documental i bibliogràfic</title>.
                     <title level="j">Terme</title> 13: 19-23.</bibl>

               <bibl xml:id="BaigesCantarellComasPiñolSolerInpreparation">Baiges, Ignasi J., Elena
                  Cantarell, Mireia Comas, Daniel Piñol, and Joan Soler. <title level="u">Els
                     pergamins de Terrassa a l’arxiu Fontcuberta</title>. In preparation.</bibl>

               <bibl xml:id="CantarellComasMuntaner2011">Cantarell, Elena, Mireia Comas, and Carme
                  Muntaner. 2011. <title level="m">El llibre de la Baronia d’Eramprunyà</title>.
                  Lleida: Pagès. </bibl>

               <bibl xml:id="ConseilInternationaldesArchivesInternationalCouncilonArchives2000"
                  >Conseil International des Archives/International Council on Archives. 2000.
                     <title level="m">ISAD(G) (General International Standard Archival
                     Description)</title>. Madrid: Consejo General de Archivos. <ptr
                     target="http://www.mcu.es/archivos/docs/isad.pdf"/>
               </bibl>

               <bibl xml:id="Fernandez1991">Fernandez Trabal, Josep. 1991. <title level="a">Els
                     arxius familiars i patrimonials. Problemàtica, caracterització i
                     metodologia</title>. <title level="j">Lligall. Revista Catalana
                     d’Arxivística</title> 4: 95-114.</bibl>

               <bibl xml:id="GifreMatasSoler2002">Gifre, Pere, Josep Matas, and Santi Soler. 2002.
                     <title level="m">Els arxius patrimonials</title>. Girona: Associació d’Història
                  Rural de les Comarques Gironines, Centre de Recerca d’Història Rural de la
                  Universitat de Girona.</bibl>
            </listBibl>
         </div>
      </back>
   </text>
</TEI>
