MACS 30000: Perspectives on Computational Analysis University of Chicago
Annoucement
All Assignment 1 grades are finalized.
Assignment 2 revision due on Wed Nov 29
Peer reviews are assigned for Thursday’s discussion
Please submit written feedbacks on Canvas by 11:59 pm on Wed and
read your peers’ comments before coming to class.
15 mins in class on Thursday for clarifying your feedbacks
followed by presentation & QA
Participation
You will receive
a check (10/10) if you submit your peer reviews in time and speak up at least once in class this week
a check plus (12/10) if you submit your peer reviews and present your proposal draft in class on Thursday
a check minus (8/10) if you submit your peer reviews
a zero (0/10) if you fail to participate at all
Final Proposal
Due on Fri Nov 8
All submissions graded by our graders and no revision
You will receive feedbacks from me if you submit early by Wed Nov 6
Sample proposals from last year are on Canvas
Remote OHs next week
Mass collaboration
Major types
Human computation (week 8)
Open call (today)
Distributed data collection (today)
Open Calls
Pose a problem asking for specific, measurable solutions from other people
Offer a reward/incentive for participation
Compare and evaluate the solutions using a consistent and measurable metric
Generate broad participation from a wide range of researchers
Netflix prize
Need to predict what movies customers would enjoy
Internal research plateaus
Release an anonymized dataset of 100 million movie ratings to predict 3 million held-out ratings
Anyone who could create an algorithm that improved the existing model by 10% or better would win 1 million dollars
Clear and unbiased evaluation criteria
Solicited over 40,000 solutions
Discussion
The best predictive models in the Netflix Prize open call were hybrids of multiple models (ensemble methods). What characteristic of one model relative to other models made it improve the overall prediction when blended with the other models?
In your opinion, what kind of tasks are better suited for open call contests? What kind of tasks are not?
What are the benefits to the researchers proposing the problem?
What are the benefits to the participants proposing the solutions?
Are open calls better tailored to questions of prediction or questions of explanation? How might we utilize open calls to tackle explanations?
Using BibTeX or other citation managers ensures consistency in formatting
If you still have trouble understanding how to integrate your sources into your writing (e.g. when to cite, how to paraphrase) read chapter 14 in Booth or ask me.