You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »

What is HathiTrust (HT)?

  • A consortium - international partnership of over 100 institutions.
  • a digital library containing about 13.5 million books, ~5 million (38%) of which are in the public domain. All items are fully indexed, allowing for full text search within all volumes. You can login with your Cornell NetID to

    • Create Collections (public or private)
    • Download PDF’s of any item available in full text
  • a trustworthy preservation repository providing long-term stewardship, redundant robust backup, continuous monitoring, and persistent identifiers for all content.

What is the HathiTrust Research Center (HTRC)?

  • Research arm of HT - computational analysis of text at scale
  • The role of indexing (SOLR)
  • Experimental by nature - although taking steps to move into production

What specific services does the HTRC offer scholars?

The "portal"

 

Workset builder

 

Algorithms

 

Bookworm

 

Data Capsule

 

 

  • No labels