View on GitHub

pcda17.github.io

Course materials for 'Critical Perspectives in Cultural Data Analysis' at UT Austin's iSchool

##Week 10: Machine Learning

Topics to Cover

Provide collection of several hundred texts grouped by genre:

Have students choose two or three categories to work with.

Using scikit-learn rain ML model using small number of texts and measure classification accuracy for remaining set.

Train model using more texts and see if there’s any improvement.

Compare models.

Look at mis-classified texts and discuss what features make them outliers.

Break

Demonstrate cluster analysis.

Sentiment analysis: Evaluate/classify Twitter data.