What is HathiTrust (HT)?
- A consortium: an international partnership of more than 100 institutions.
- A digital library containing about 13.5 million books, ~5 million (38%) of which are in the public domain. All items are fully indexed, so every volume is full-text searchable. You can log in with your Cornell NetID to:
  - Create collections (public or private)
  - Download PDFs of any item available in full text
- A trustworthy preservation repository providing long-term stewardship, redundant and robust backup, continuous monitoring, and persistent identifiers for all content.
What is the HathiTrust Research Center (HTRC)?
- Research arm of HT: computational analysis of text at scale
- The role of indexing (Apache Solr)
- Experimental by nature, although taking steps to move into production
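To make the indexing point concrete: Solr is built on Apache Lucene, which answers full-text queries from an inverted index (a map from each term to the documents that contain it). The toy sketch below illustrates that idea only. It is not HathiTrust's or Solr's actual code, and the volume IDs, sample texts, and function names (`build_index`, `search`) are hypothetical.

```python
import re
from collections import defaultdict

def tokenize(text):
    # Lowercase and split on anything that is not a letter or digit.
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(volumes):
    # Map each term to the set of volume IDs whose text contains it.
    index = defaultdict(set)
    for vol_id, text in volumes.items():
        for token in tokenize(text):
            index[token].add(vol_id)
    return index

def search(index, query):
    # Return the volumes that contain every term in the query (AND search).
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result

# Hypothetical sample data, for illustration only.
volumes = {
    "vol1": "The whale surfaced near the ship.",
    "vol2": "A ship left the harbor at dawn.",
    "vol3": "Whales migrate long distances.",
}
index = build_index(volumes)
print(sorted(search(index, "ship")))
print(sorted(search(index, "whale ship")))
```

Because the index is built once, up front, a query only has to look up a few terms rather than scan every volume, which is what makes search across millions of books practical. A real engine such as Solr adds stemming, relevance ranking, and phrase queries on top of this basic structure.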
What specific services does the HTRC offer scholars?
- The "portal"
- Workset builder
- Algorithms
- Bookworm
- Data Capsule