Algorithms, Lede 2017
These notes are for Algorithms, a Summer 2017 course for the Lede Program at Columbia University Graduate School of Journalism.
Housekeeping
Classes
servers
cron mturk s3
sqlite
- SQLite presentation
- Jupyter Sqlite and pandas
fuzziness matplotlib
text analysis
more text analysis
- Jupyter Counting and stemming
- Jupyter Hip Hop Lyrics
- Jupyter NRC Emotional Lexicon
- Jupyter Trump vs State of the Union addresses
- Jupyter Word counting for comparing books
clustering
- Jupyter A tiny introduction to Seaborn
- Jupyter K-Means Clustering with scikit-learn
- Jupyter KMeans clustering template
networks
- Jupyter Reshaping and building graphs
- Jupyter Network Centrality
- Jupyter NetworkX Graphs from Source-Target DataFrame
- Jupyter Marilyn Monroe Loves Visualization
Servers
- Creating a Digital Ocean scraping server
- Setting up a Digital Ocean server for Selenium, Chrome, and Python
- Using Jupyter Notebooks on your Digital Ocean server
- Using scp to send files to/from our server
- Using cron jobs for repeating tasks
- Using Tabula on your Digital Ocean or AWS EC2 server
- Using OpenRefine on your Digital Ocean or AWS EC2 server