The goal of this project is to analyze and compare songs that occupy top chart positions in different countries - to discover similarities and differences, and ultimately to answer the question if the popular songs content reflects cultural differences between people around the world. The top-charts data for the following 29 countries are analyzed in this project: Argentina, Australia, Austria, Belgium, Brazil, Bulgaria, Canada, Chile, China, Denmark, Finland, France, German, Greece, India, Ireland, Italy, Japan, Netherlands, New Zealand, Norway, Portugal, Russia, Spain, Sweden, Switzerland, UK, Ukraine, and USA.
The steps for data collection phase are fully described here.
The steps for data analytics phase are fully described here.
We created two visualizations to show the most frequent lyrics words in each country. One is based on the total number of words, and the other one is based on the tf-idf score of each word.
Top words w/ Counts Top words w/ tf-idf