Preparation

This page is a companion to the guest lecture on text mining for Writing 2100, scheduled for 4/29/2014.
Please do bring a laptop! If you have one, bring your own. If you do not have one, please feel free to check one out at the Olin circulation desk. Having a laptop will allow you to participate in the exercises and get the most out of class.
No special software will be needed. All exercises will be done through a Web browser.

Agenda

Presentation

There will be a presentation during which your questions and comments are welcome. My aim is to discuss as much as is useful to you. Please feel free to chime in at any time.

Exercises

All exercises will be demonstrated, so no prior knowledge of the tools is required. The room is equipped with a jack that allows easy sharing of your desktop on the screen, so if you discover something that you would like to share and discuss, we can easily do so, and I encourage that.

Voyant
Voyant is a low-barrier text analysis tool that delivers a rich, interactive interface. Please feel free to bring your own material for upload and analysis, understanding that any material you upload will be subject to the Voyant privacy policy.

  • I Have a Dream - seen through Voyant
  • Cornell Daily Sun through the centuries - seen through Voyant - there's lots of noise in this example - see if you can improve the signal.
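One common source of "noise" in word-frequency views is very common function words, and filtering them out is one way to improve the signal. The sketch below is only an illustration of that idea, not how Voyant itself works: the function name `top_words`, the tiny `STOPWORDS` set, and the sample sentence are all made up for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; real tools ship much longer ones.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "that", "it"}

def top_words(text, n=3, stopwords=frozenset()):
    """Return the n most frequent words in text, skipping any stopwords."""
    # Lowercase the text and pull out runs of letters and apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(w for w in words if w not in stopwords)
    return counts.most_common(n)

# Made-up sample text, used only to show the effect of filtering.
sample = "the sun rose and the sun set and the paper reported the sun"

raw = top_words(sample)                            # dominated by "the"
filtered = top_words(sample, stopwords=STOPWORDS)  # content words surface

print(raw)
print(filtered)
```

Without the filter, "the" is the most frequent word; with it, "sun" rises to the top. In Voyant, the comparable adjustment is made through the tool's own stopword options rather than in code.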
nGrams

There will be exercise time using Google's nGram tool. nGrams depict the frequency of a word or word combination by publication year. Many modifications can be made to refine the analysis, so please consider the links below as starting points. The syntax for refinement is found on the About page.

  • Exploring terms used for describing racial integration
  • Spanish words in English publications
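To make the idea of "frequency by publication year" concrete, here is a minimal sketch that counts how often a phrase occurs in each year of a toy corpus. It assumes that frequency means the phrase's share of all same-length word sequences in that year; the exact normalization Google uses is described on the tool's About page. The function names and the two-line corpus are hypothetical.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return each run of n consecutive tokens as a tuple."""
    return zip(*(tokens[i:] for i in range(n)))

def relative_frequency_by_year(corpus, phrase):
    """Map each year to (occurrences of phrase) / (total n-grams of that length)."""
    target = tuple(phrase.lower().split())
    n = len(target)
    result = {}
    for year, text in corpus.items():
        grams = list(ngrams(text.lower().split(), n))
        # Guard against texts shorter than the phrase.
        result[year] = Counter(grams)[target] / len(grams) if grams else 0.0
    return result

# Made-up corpus: one short "publication" per year.
toy_corpus = {
    1950: "civil rights and civil liberties",
    1960: "civil rights civil rights now",
}

freq = relative_frequency_by_year(toy_corpus, "civil rights")
print(freq)  # {1950: 0.25, 1960: 0.5}
```

Plotting these per-year values on a line chart gives the same kind of curve the nGram tool draws, only at a vastly smaller scale.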
ManyEyes
Many Eyes is a site run by IBM that allows for various visualizations of data. A few of the visualizations allow free text entry and analyze the text directly. Please feel free to upload your own work into the site as primary data (upload requires an account), understanding that the IBM Online Privacy statement will apply.

  • I Have a Dream Phrase Net
  • I Have a Dream Word Tree