{toc}

h4. Preparation

This page is a companion to the guest lecture on text mining for {menulink:custom|link=http://guides.library.cornell.edu/writ2100|target=_blank}Writing 2100{menulink}{menuicon:elements1}, given on 4/29/2014.
*Please do bring a laptop\!* If you have one, bring your own.  If you do not have one, please feel free to {menulink:custom|link=http://olinuris.library.cornell.edu/Computing/Laptops|target=_blank}check one out at the Olin circulation desk{menulink}{menuicon:elements1}.  Having a laptop will allow you to participate in the exercises and get the most out of class.
*No special software will be needed.* All exercises will be done through a Web browser.

h4. Agenda

h5. Presentation

There will be a {menulink:custom|link=http://prezi.com/b-q4a-hcha-o/?utm_campaign=share&utm_medium=copy&rc=ex0share|target=_blank}presentation{menulink}{menuicon:elements1} during which your questions and comments are welcome. My aim is to discuss as much as is useful to you.  Please feel free to chime in at any time.  

h5. Exercises

h6. Voyant

There will be exercise time using {menulink:custom|link=http://voyant-tools.org/|target=_blank}Voyant{menulink}{menuicon:elements1}, a low barrier text analysis tool that delivers a rich, interactive interface.  All exercises will be demonstrated, so no prior knowledge of the tool is required. The room is equipped with a jack to allow easy sharing of your desktop on the screen, so if you discover something that you would like to share and discuss, we can easily do so, and I will encourage that.  Please feel free to bring your own material for upload and analysis to the workshop, understanding that upload of any material will be subject to the {menulink:custom|link=http://docs.voyant-tools.org/privacy/|target=_blank}Voyant privacy policy{menulink}{menuicon:elements1}.
* {menulink:custom|link=http://voyant-tools.org/?corpus=1381949351436.5394&type=dream&stopList=stop.en.taporware.txt&skin=simple&event=documentTypeSelected|target=_blank}I Have a Dream - seen through Voyant{menulink}{menuicon:elements1}


h6. nGrams

There will be exercise time using Google's {menulink:custom|link=https://books.google.com/ngrams|target=_blank}nGram tool{menulink}{menuicon:elements1}. nGrams depict the frequency of a word or word combination analyzed by publication year.  Note that many modifications can be made to refine the analysis, so please consider the links below as starting points. Syntax for refinement is found on the {menulink:custom|link=https://books.google.com/ngrams/info|target=_blank}About page{menulink}{menuicon:elements1}. 
* {menulink:custom|link=https://books.google.com/ngrams/graph?content=Martin+Luther+King%2C%28segregation+*+.5%29%2C%28bussing*10%29%2C%28forced+integration+*+100%29%2Caffirmative+action%2C%28quota+system+*+15%29&case_insensitive=on&year_start=1800&year_end=2008&corpus=15&smoothing=3&share=&direct_url=t4%3B%2CMartin%20Luther%20King%3B%2Cc0%3B%2Cs0%3B%3BMartin%20Luther%20King%3B%2Cc0%3B%3BMARTIN%20LUTHER%20KING%3B%2Cc0%3B.t1%3B%2C%28segregation%20*%20.5%29%3B%2Cc0%3B.t1%3B%2C%28bussing%20*%2010%29%3B%2Cc0%3B.t1%3B%2C%28forced%20integration%20*%20100%29%3B%2Cc0%3B.t4%3B%2Caffirmative%20action%3B%2Cc0%3B%2Cs0%3B%3Baffirmative%20action%3B%2Cc0%3B%3BAffirmative%20Action%3B%2Cc0%3B%3BAffirmative%20action%3B%2Cc0%3B%3BAFFIRMATIVE%20ACTION%3B%2Cc0%3B.t1%3B%2C%28quota%20system%20*%2015%29%3B%2Cc0|target=_blank}Exploring terms used for describing racial integration{menulink}{menuicon:elements1}
* {menulink:custom|link=https://books.google.com/ngrams/graph?content=macho%2Ctaco%2Cloco%2Ccasa%2Cfiesta%2Cmanana&year_start=1800&year_end=2000&corpus=15&smoothing=3&share=&direct_url=t1%3B%2Cmacho%3B%2Cc0%3B.t1%3B%2Ctaco%3B%2Cc0%3B.t1%3B%2Cloco%3B%2Cc0%3B.t1%3B%2Ccasa%3B%2Cc0%3B.t1%3B%2Cfiesta%3B%2Cc0%3B.t1%3B%2Cmanana%3B%2Cc0|target=_blank}Spanish words in English publications{menulink}{menuicon:elements1}

h6. ManyEyes
{menulink:custom|link=http://www.manyeyes.com/software/analytics/manyeyes/|target=_blank}Many Eyes{menulink}{menuicon:elements1} is a site run by IBM that allows for various visualizations of data.  A few of the visualizations allow for free text entry, and analyze the text directly.  Please feel free to upload your own work into the site as primary data (upload requires an account), understanding that the {menulink:custom|link=http://www.ibm.com/privacy/us/en/|target=_blank}IBM Online Privacy statement{menulink}{menuicon:elements1} will apply.  

* {menulink:custom|link=http://www-958.ibm.com/v/371435|target=_blank}I Have a Dream Phrase Net{menulink}{menuicon:elements1}
* {menulink:custom|link=http://www-958.ibm.com/v/371464|target=_blank}I Have a Dream Word Tree{menulink}{menuicon:elements1}