What proportion of our web traffic is robots?

Archive - Originally posted on "The Horse's Mouth" - 2007-06-19 06:44:20 - Graham Ellis

We welcome search engines to our site - to index our content and point their visitors back to us where appropriate, but such search engines are a means to an end and not an end in themselves. How much traffic to our web site is true visitor traffic, and how much is automata? An interesting pointer is a graph of log file size for the last four weeks - you'll see it at the top of this page. We noticed a distinct weekly "cyclic" flow in this graph (it will automatically update as this blog archives, so you may not see it if you come back here in 2008 or later!) and as automata run 24 x 7 for the most part, we're pretty sure that the peaks and troughs are caused by real visitors.
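One rough way to put a number on that weekly cycle: if we assume that automata generate about the same volume every day, then the quietest day sets a ceiling on daily robot traffic, and everything above that floor is down to people. The Python sketch below works on that assumption. The function name and the daily figures are made up for illustration; they are not readings from our logs.

```python
def human_share_lower_bound(daily_sizes):
    """Share of total traffic that sits above the quietest day's level.

    Assumes automata contribute the same volume every day, so the
    quietest day is an upper bound on the daily robot volume.
    """
    if not daily_sizes:
        raise ValueError("need at least one daily figure")
    floor = min(daily_sizes)
    total = sum(daily_sizes)
    if total == 0:
        return 0.0
    # Volume left over once the constant floor is removed from every day
    cyclic = total - floor * len(daily_sizes)
    return cyclic / total


# Hypothetical log sizes in MB, Monday to Sunday (illustrative only)
week = [50, 52, 51, 49, 45, 30, 31]
share = human_share_lower_bound(week)
print(f"At least {share:.0%} of this traffic follows the weekly cycle")
```

This gives a lower bound only, since real visitors also turn up on the quiet days and are counted as part of the floor.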

Our Current Visitors page allows us to take a snapshot of the HTML and PHP pages that use our standard template that have been called up, and from where, in the last fifteen minutes. The database records which browser was in use, the referring page and the country of origin and makes a fascinating read.
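As a minimal sketch of that kind of fifteen-minute snapshot, the Python below tallies browsers from a list of visit records. It is not the code behind our Current Visitors page; the record layout, the field names and the sample visits are all assumptions made for the example.

```python
from collections import Counter
from datetime import datetime, timedelta


def recent_browsers(visits, now, window_minutes=15):
    """Count visits per browser within the last window_minutes."""
    cutoff = now - timedelta(minutes=window_minutes)
    return Counter(
        v["browser"] for v in visits if cutoff <= v["when"] <= now
    )


# Hypothetical visit records (illustrative only)
now = datetime(2007, 6, 19, 6, 44)
visits = [
    {"when": datetime(2007, 6, 19, 6, 40), "browser": "Firefox",
     "referrer": "search engine", "country": "UK"},
    {"when": datetime(2007, 6, 19, 6, 35), "browser": "Googlebot",
     "referrer": "", "country": "US"},
    {"when": datetime(2007, 6, 19, 6, 31), "browser": "Firefox",
     "referrer": "bookmark", "country": "UK"},
    # Older than fifteen minutes, so left out of the snapshot
    {"when": datetime(2007, 6, 19, 6, 10), "browser": "MSIE",
     "referrer": "search engine", "country": "DE"},
]

for browser, count in recent_browsers(visits, now).most_common():
    print(browser, count)
```

The same tally could be run on the referrer or country field to see where the current visitors are coming from.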


Our Most Popular Pages display looks back at yesterday's log files and tells us where people have visited - and a bias towards a particular page will tend to indicate heavy real traffic, as automata tend to spider evenly.
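One simple way to measure that bias is the share of all hits taken by the most requested pages: the closer it is to an even spread, the more the traffic looks like a spider working through the site. The Python sketch below assumes the log has already been reduced to a list of requested page paths; the function name and the sample paths are invented for illustration.

```python
from collections import Counter


def top_share(hits, top_n=10):
    """Fraction of all hits that went to the top_n most requested pages."""
    counts = Counter(hits)
    if not counts:
        return 0.0
    top = sum(n for _, n in counts.most_common(top_n))
    return top / sum(counts.values())


# Hypothetical page requests (illustrative only)
skewed = ["/a.html"] * 6 + ["/b.html"] * 2 + ["/c.html", "/d.html"]
even = ["/a.html", "/b.html", "/c.html", "/d.html"]

print("skewed:", top_share(skewed, top_n=1))
print("even:  ", top_share(even, top_n=1))
```

A single figure like this is only a pointer, as a popular page will also be fetched by robots, but a day-to-day comparison should show when one page is drawing real visitors.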

I started with the question "What proportion of our web traffic is robots?" when perhaps I should have asked "What proportion of our web traffic is generated by real human beings browsing at the time?". From the current peaks and troughs on the graph, and the other evidence, I'm guesstimating that the figure is somewhere between 40% and 65%, with a very high proportion of that being business rather than home users.

Update - December 2009: The weekly cycle continues ... there's an update on this story [here], although little has changed except the total traffic volume.