I was aiming for a high-level overview of the conversations in the Epstein files. Starting from the full data set in Rye Howard-Stone's Epstein research data GitHub repository, I identified all the emails and counted the occurrences of every word in them with a custom Python script, dropping common English stopwords because they are uninformative. The word cloud was then built from the top 10,000 remaining words using a custom fork of Andreas Mueller's word_cloud package and a custom coloring function that assigns each letter a color based on the average color at that coordinate in the guide image.
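The counting step can be illustrated with a minimal sketch. This is not the actual script: the tokenizer regex, the tiny `STOPWORDS` set, the `count_words` helper, and the sample `emails` are all hypothetical stand-ins chosen to keep the example self-contained.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real run would use a fuller one.
STOPWORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "is", "it",
    "for", "on", "at", "be", "we", "you", "i",
}

# Assumed tokenizer: runs of lowercase letters and apostrophes.
TOKEN_RE = re.compile(r"[a-z']+")


def count_words(texts, stopwords=STOPWORDS, top_n=10_000):
    """Count word occurrences across texts, skipping stopwords.

    Returns a list of (word, count) pairs for the top_n most frequent words.
    """
    counts = Counter()
    for text in texts:
        for token in TOKEN_RE.findall(text.lower()):
            word = token.strip("'")  # drop stray leading/trailing apostrophes
            if len(word) > 1 and word not in stopwords:
                counts[word] += 1
    return counts.most_common(top_n)


# Made-up sample texts, only to show the shape of the input and output.
emails = [
    "Please send the schedule for the flight.",
    "The flight schedule is attached. Send it to the office.",
]
top_words = count_words(emails, top_n=3)
print(top_words)
```

A frequency mapping like `dict(top_words)` is the kind of input the upstream word_cloud package accepts through `WordCloud.generate_from_frequencies`; whether the custom fork keeps that interface is an assumption.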

    by Flat_Telephone1951
