Technical Summary

The Journals of the Lewis and Clark Expedition has gone through several technology updates during its lifespan. You may read the legacy technical summary here. Currently, The Journals of the Lewis and Clark Expedition Online documents are encoded following the Text Encoding Initative P5 standard. Native Nations, places, and people are encoded within named entities to identify them specifically, while navigating spelling variations, historical names, and modern conventions. You may access the TEI-XML file for individual dates when viewing the journal entries.

Technologies
  • TEI-XML P5: journal and secondary resource documents
  • XSLT: XML transformation to create searchable text and HTML for display
  • Apache Solr: search and browse functionality
  • Ruby on Rails 5: web framework

Process

The TEI of The Journals of the Lewis and Clark Expedition was updated from the P4 specification to P5 using the Abbot XML Interoperability tool and XSLT scripts. To populate the site's contents, a script is run which processes each document, creating a static HTML view and updating Solr with the text and metadata (author, date, location, etc). Ruby on Rails then pulls information from Solr to build the contents of the site, such as the journal entry page, searching, and browsing by named entity.