Patronage Project Progress

I've been scraping data on and off for the past couple of months (i don't think my page requests will impact the website, but it seems ethical to not bombard patreon.com), and i've got a dataset made up of  approximately 6000 creators and 15000 patrons. This data has been cleaned and  new classifications formed before putting it into Gephi, a social network analysis package.

The software will create layouts based on the connections between nodes. I use the Force Atlas 2 algorithm which essentially creates a gravity that pulls together nodes which share connections (edges), and pushes apart nodes with few or no connections. In the image below you can see 'satellites' of patrons and creators around the edges which are isolated from the rest of the network.

 Nodes in grey, edges in black.

Nodes in grey, edges in black.

The density of nodes and edges obscures the detail. A close up:

Each creator has been given attributes including: the genre they work in (e.g. games, podcasts, animation), their location (where it is provided), and number of patrons. These are then used to filter the graphs and visually detect patterns in the data. Below is is a graph showing genres.

 This is just edges, coloured to represent 14 different genres. The light green on the right half of the graph is video and film, the pink on the left is comics, and the purple at the bottom is drawing and painting. 

This is just edges, coloured to represent 14 different genres. The light green on the right half of the graph is video and film, the pink on the left is comics, and the purple at the bottom is drawing and painting. 

These patterns tell us patrons concentrate their sponsorship on one or a relatively low number of genres. In this sample, on average patrons support 3.4 creators, with a range of 1-91. I'll include some descriptive statistics in future posts.