...
- allows researchers to create a virtual machine environment, configure with tools, and analyze texts, see Portal & Workset Builder Tutorial for v3.0, " for details
- designed to be a secure analytical environment that respects access restrictions to text while allowing for computational analysis
- not yet tied to worksets
- currently restricted to "open-open" (non-restricted) corpus; eventual objective is to allow for access to full HT corpus
Bookworm
- same functionality as Google nGrams
- base data is currently "open-open" data (liberated from Google stipulations); working on legal aspects required for base data to shift to entire HT corpus, regardless of viewability.
- plans and allocated grant to develop tie-in to worksets
- See wiki for tutorial
Extracted Features datasets
- rationale and features explained explained
- details on leveraging the dataset using a workset and the EF_Rsync_Script_Generator algorithm