Mapping the vocabulary of rappers

Tuesday, May 6, 2014


NYC designer and programmer Matt Daniels has created an interactive chart that ranks 85 of the biggest names in hip hop by the number of unique words used in their lyrics. He used the first 35,000 words from the lyrics of each artist (which cover about 3-5 studio albums and EPs), based on data provided by Rap Genius.
Daniels explains his method: "I used a research methodology called token analysis to determine each artist’s vocabulary. Each word is counted once, so pimps, pimp, pimping, and pimpin are four unique words. To avoid issues with apostrophes (e.g., pimpin’ vs. pimpin), they’re removed from the dataset. It still isn’t perfect. Hip hop is full of slang that is hard to transcribe (e.g., shorty vs. shawty), compound words (e.g., king shit), featured vocalists, and repetitive choruses."
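For the curious, here is a rough Python sketch of that kind of count. It is not Daniels's actual code: the function name, the lowercasing step, and the rule for splitting lyrics into words are all assumptions made for illustration. Only the apostrophe removal, the 35,000-word cutoff, and the "each distinct spelling counts once" rule come from the description above.

```python
import re


def unique_word_count(lyrics: str, limit: int = 35_000) -> int:
    """Count distinct words among the first `limit` words of `lyrics`."""
    # Drop straight and curly apostrophes so "pimpin'" and "pimpin" match.
    cleaned = re.sub(r"['’‘]", "", lyrics.lower())  # lowercasing is an assumption
    # Assumed tokenization: runs of letters and digits count as words.
    tokens = re.findall(r"[a-z0-9]+", cleaned)
    # No stemming: "pimp", "pimps", and "pimping" stay separate words.
    return len(set(tokens[:limit]))


sample = "Pimps pimp pimping pimpin' pimpin"
print(unique_word_count(sample))
```

A count like this still has the weaknesses mentioned above: alternate spellings such as shorty and shawty count as two words, and a repetitive chorus uses up part of the 35,000-word window without adding any new vocabulary.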
Source: The Largest Vocabulary in Hip Hop