  • allows researchers to create a virtual machine environment, configure it with tools, and analyze texts; see the Portal & Workset Builder Tutorial for v3.0, "Use the HTRC Data Capsule", for details
  • designed to be a secure analytical environment that respects access restrictions to text while allowing for computational analysis
  • not yet tied to worksets
  • currently restricted to the "open-open" (non-restricted) corpus; the eventual objective is to allow access to the full HT corpus

Bookworm

  • same functionality as the Google Ngram Viewer
  • base data is currently "open-open" data (free of Google stipulations); work is under way on the legal aspects required to shift the base data to the entire HT corpus, regardless of viewability
  • plans, and an allocated grant, to develop a tie-in to worksets
  • see the wiki for a tutorial
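
For readers unfamiliar with n-gram viewers, the kind of query such a tool answers can be sketched as follows. This is a minimal illustration only, not Bookworm's implementation or API: the function name, the (year, text) corpus format, and the whitespace tokenization are all assumptions made for the example.

```python
from collections import Counter


def yearly_relative_frequency(docs, term):
    """Return {year: share of that year's tokens that equal `term`}.

    `docs` is an iterable of (year, text) pairs. This toy corpus format is
    an assumption for illustration, not Bookworm's data model.
    """
    term = term.lower()
    hits = Counter()    # occurrences of `term` per year
    totals = Counter()  # all tokens per year
    for year, text in docs:
        tokens = text.lower().split()  # naive whitespace tokenization
        totals[year] += len(tokens)
        hits[year] += tokens.count(term)
    # Skip years with no tokens to avoid dividing by zero.
    return {y: hits[y] / totals[y] for y in sorted(totals) if totals[y]}


# Hypothetical three-document corpus.
corpus = [
    (1850, "the whale and the sea"),
    (1850, "the ship"),
    (1900, "a whale a whale"),
]
trend = yearly_relative_frequency(corpus, "whale")
```

Plotting the values of `trend` against the years gives the familiar word-frequency-over-time curve. A real service would work from a precomputed index over a far larger corpus rather than scanning texts on each query.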

Extracted Features datasets