Which pages are most often looked at on this site?
Now that this site has been running for a few years it is possible to consider the pages that people access on the site as a rough indication of which topics are interesting to the readers. This page outlines some of the access levels of different parts of the site.
There are twelve different categories of pages on the site. The figure above shows how frequently they are accessed. The user's focus is clearly on songs (rather than albums). Interestingly users look for the songs of a particular year more that the songs of a particular artist, but they look for albums of a particular artist much more than the albums of a particular year. The high volume of traffic for the monthly number one pages surprised us, of course as purists we worry that our presentation of this data is too simplistic, most users clearly would rather have a simple picture than an accurate one.
The plot above shows how many times each of the "Song Year" pages were viewed. By far the most frequently viewed year was 1952 (the third box up in the 1950s, given that was both exactly 60 years ago and also that the British Queen's Diamond Jubilee was celebrated during the period being measured that is not a surprise. The low values for the 2000s was also expected, the site is not particularly focused on (or indeed good for) music after about 2005. Also the period before the 1930s is outside most people's interests. The high overall viewings of the 1950s and 1990s are a surprise (at least to us).
The following artist's pages were most commonly viewed:
The following song titles were the most commonly viewed:
Monthly Number 1 Songs
The monthly number 1s pages are much more heavily used than we had expected. The plot above shows how frequently the pages were viewed across each of the 12 month's pages for the years 1940 to 2011. The high number views of January pages is an artifact of the navigation that tends to direct people via the January pages, the decade pages (Jan 1940, Jan 1950 etc) have high counts for the same reason. The peak between 1962 and 1995 can be interpreted to show that people really do look for the months they were born (assuming most users are 18-50 years old).
The album pages are visited about one quarter as many times as the corresponding song pages.
Why this list should be headed by what are (to us anyway) a pair of obscure Japanese and French acts is a mystery. At first we assumed these were high up because we only looked at a few day's logs, but extending the period made no difference.
As with all such reviews the impact of search engine crawlers and other web spiders have to be accounted for. When the site first started (in 2007) a large proportion of the accesses were from automated spiders. When we first started our estimate was that 90% of requests came from spiders, even as late as 2011 it was clear that this proportion was about 40%. However the significant growth in real use means that for the early part of 2012 the number is less than 7% (and continues to fall).
The picture above shows the number of pages that had smaller number of accesses over an extended period in the early part of 2012. The peak at 23 page views shows that many of the less frequented parts of the web site were mainly exercised by crawlers.
The most frequently accessed individual pages are those like the list of top songs, artists, titles and charts that are the normal way of getting into the site. As was shown above the annual song charts are the most popular type of data page. 1952 was far above all the other years, as has been mentioned above that is easy to explain. As the yellow line shows the impact of spiders on these results is so small as to be insignificant.
Here are the 50 most visited pages on the site:
Another question is which countries visitors come from. We used a simple country identifier to map the IP addresses of users to countries. It managed to assign countries for more than 94% of our visitors over a two month period (Aug-Sep 2010). It showed that we had visits from more than 150 different countries in that period.
The top 50 countries are shown in the figure above. The US accounts for just under half of the activity on the site.
As we would expect the number of visits varies according to the time of day in the visitor's country. The plot above shows that most users look at the site in the evening (5pm to 8pm) with fewest being active at 3am.
Surprisingly while the number of visits is greater at the weekend it is not that much greater. The plot above shows how the number of visits varied by week day over the two month period for the 10 most active countries.
Here is a list of the number of full page views by users within each country for a 5 month period during 2011.