Persine 🔎

Persine is an automated tool to study and reverse-engineer algorithmic recommendation systems, like YouTube videos and Amazon products.

J Soma

slides at jonathansoma.com/projects/persine

Hi, I’m Soma

Lede Program, Columbia Journalism School

js4571@columbia.edu @dangerscarf

investigate.ai littlecolumns.com ledeprogram.com

Persine

github.com/jsoma/persine

persine.readthedocs.io

Who’s to blame for a bad vaccine rollout? Algorithms!
Who’s to blame for radicalization? Algorithms!
Who’s to blame for racism? Algorithms!

Researching algorithms

A paper from 2020
Zeynep Tufekci in the NYT

Enter: Persine

Persine on GitHub

“Persine is an automated tool to study and reverse-engineer algorithmic recommendation systems, like YouTube videos and Amazon products. It has a simple interface and encourages reproducible results.”

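from persine import PersonaEngine

engine = PersonaEngine()

with engine.persona() as persona:
    # Commands run in order: open a video, click the third
    # "next up" video, then visit the YouTube homepage
    persona.run_batch([
        "https://www.youtube.com/watch?v=hZw23sWlyG0",
        "youtube:next_up#3",
        "youtube:homepage"
    ])
    # Save the pages visited and the recommendations seen along the way
    persona.history.to_csv("history.csv")
    persona.recommendations.to_csv("recs.csv")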
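
Once a run finishes, the two CSV files can be explored with pandas. Here is a minimal sketch of that kind of analysis. The column names (`step`, `channel`, `title`) and every row are made up for illustration and are not taken from Persine's actual output, so check the real files before reusing any of this.

```python
import pandas as pd

# Hypothetical stand-in for recs.csv: the column names and rows are
# invented for this sketch, not Persine's real schema.
recs = pd.DataFrame({
    "step": [1, 1, 1, 2, 2, 2],
    "channel": ["A", "B", "A", "A", "C", "A"],
    "title": ["v1", "v2", "v3", "v4", "v5", "v6"],
})

# How often does each channel show up among the recommendations?
channel_counts = recs["channel"].value_counts()

# Which channel is recommended most, and what share of each step's
# recommendations does it take up?
top_channel = channel_counts.idxmax()
share_by_step = (recs["channel"] == top_channel).groupby(recs["step"]).mean()

print(channel_counts)
print(share_by_step)
```

With real data, `pd.read_csv("recs.csv")` would replace the hand-built DataFrame.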

Let’s run Persine!

Run Persine on Google Colab

What can it do?

YouTube and Amazon

YouTube, the Great Radicalizer by Zeynep Tufekci (NYT)

Does watching a politically charged video prompt YouTube to send you down a rabbit hole of radicalization?

Children’s YouTube is still churning out blood, suicide and cannibalism by K.G. Orphanides (Wired)

Do kids watch super weird videos if YouTube is left on autoplay?
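
One way to approximate autoplay is to start from a video and keep following the first "next up" recommendation. The sketch below only builds the list of commands. `autoplay_commands` is a hypothetical helper, and `youtube:next_up#1` assumes the `#N` suffix picks the Nth recommendation, as the `youtube:next_up#3` example suggests. The start URL is the one from the earlier example; a real run would start from a children's video.

```python
def autoplay_commands(start_url, hops):
    """Open start_url, then follow the first 'next up' video `hops` times.

    Hypothetical helper: it only builds command strings and never calls Persine.
    """
    # Assumption: "#1" selects the first "next up" recommendation
    return [start_url] + ["youtube:next_up#1"] * hops


commands = autoplay_commands("https://www.youtube.com/watch?v=hZw23sWlyG0", 5)
for command in commands:
    print(command)
```

The resulting list has the same shape as the one passed to `persona.run_batch` in the example earlier in the deck.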

YouTube’s Algorithm Accused of Facilitating Paedophile Rings by Tim Cross (Videoweek)

Does watching bikini haul videos lead you to young girls in suggestive swimsuits?

‘Fiction is outperforming reality’: how YouTube’s algorithm distorts truth by Paul Lewis (The Guardian)

Does YouTube promote conspiracy theories?

The New Zealand shooting shows how YouTube and Facebook spread hate and violent images — yet again (Washington Post)

How well does YouTube police its content, and what kind of content is policed?

And Amazon, too!

Persine also works on Amazon, although that support is less well tested and Amazon’s website is kind of insane. You can see what’s surfaced in search results and what’s listed as recommendations on product pages or on the Amazon homepage.

It’s also easily extendable to other websites

Thanks!

js4571@columbia.edu @dangerscarf

investigate.ai littlecolumns.com ledeprogram.com

github.com/jsoma/persine

persine.readthedocs.io
