You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

This page consists of our draft README for the data release related to our paper,

For the sake of simplicity: Unsupervised extraction of lexical simplifications from Wikipedia
Mark Yatskar, Bo Pang, Cristian Danescu-Niculescu-Mizil and Lillian Lee
Proceedings of the NAACL, 2010 (short paper).

We plan to complete a final version of the README later, but wanted to quickly provide enough details here for interested parties to be able to start making use of the data beforehand.

The page from which the data can be downloaded is at the homepage data for lexical simplification experiments.  

The files are:

  •  enwiki.tar.gz (1.56GB)
  • simplewiki.tar.gar (84.8MB)
  • No labels