Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • The Features describing the image of handwritten digits: For each handwritten digit, a 113 103 dimensional feature vector is extracted based on the image of the hand written digit. This is provided to you in the features.csv file. The file has 12000 lines (one for each hand written digit) in the comma separated values format. 
  • A similarity graph: a graph connecting the 12000 data points is provided in the adjacency matrix form in the file adjacency.csv consisting of the adjacency matrix in comma separated value format. Two nodes are connected in this graph if the corresponding noisy image of the digits are similar enough (dissimilarity smaller than a fixed threshold).
  • 3 labeled points for each of the 10 classes: To help you identify your 10 clusters with the right digit from '0'-'9' we provide 3 example data-points for each digit in the data set in the file seed.csv. The file consists of 10 lines, one for each digit from '0'-'9'. Each line has 3 numbers providing the line number or index of 3 data point belonging to that class. The line numbering starts from 1 (not 0).

Task: For each handwritten digit, predict what the corresponding label from '0' to '9' is. The competition will be hosted on in-class-Kaggle. Details will be posted soon ...

Kaggle Link: https://inclass.kaggle.com/c/cs-4786-competition-1

 

Download the data below as zip file. When unzipped you will find the three files, Adjacency.csv, seed.csv and features.csv

...

Deliverables:

  1. Early Report: Each member of the group should submit a one to two page preliminary report that includes preliminary thoughts about how you plan to attempt the competition. For each individual's preliminary attempts and ideas so farindividual in the group, ;asp include what the individual has done so far and plan to do for the competition. This report is due on October 4th. All the group members can merge their preliminary reports into one preliminary_writeup.pdf on CMS. (worth 10% of the competition grade)
  2. Report: In the end of the competition each group should submit a 5-15 page writeup that includes visualization, clear explanation of methods etc. See grading guidelines for details about what is expected from the writeup. (worth 50% of the competition grade)
  3. Predictions: Competition is held on Kaggle in-class as a competition. You can submit your predictions to kaggle to compete with your friends. You should also submit your predictions on CMS.  (worth 40% of the competition grade)

...