Main Content
What proportion of our web traffic is robots? Archive - Originally posted on "The Horse's Mouth" - 2007-06-19 06:44:20 - Graham Ellis
We welcome search engines to our site - to index our content and point their visitors back to us where appropriate, but such search engines are a means to an end and not an end in themselves. How much traffic to our web site is true visitor traffic, and how much is automata? An interesting pointer is a graph of log file size for the last four weeks - you'll see it at the top of this page. We noticed a distinct weekly "cyclic" flow in this graph (it will automatically update as this blog archives, so you may not see it if you come back here in 2008 or later!) and as automata run 24 x 7 for the most part, we're pretty sure that the peaks and troughs are caused by real visitors.
Our Current Visitors page allows us to take a snapshot of the HTML and PHP pages that use our standard template that have been called up, and from where, in the last fifteen minutes. The database records which browser was in use, the referring page and the country of origin and makes a fascinating read.
Our Most Popular Pages display looks back at yesterday's log files and tells us where people have visited - and a bias towards a particular page will tend to indicate heave real traffic as automata tend to spider evenly.
I started with the question "What proportion of our web traffic is robots? when perhaps I should have asked "What proportion of our web traffic is generated by real human beings browsing at the time?" . From the current peaks and troughs on the graph, and the other evidence, I'm guestimating that the figure is somewhere between 40% and 65%, with a very high proportion of that being business rather than home users.
Update - December 2009 The weekly cycle continues ... there's an update on this story [here] although little has changed except the total traffic volume.
Some other articles
G998 - Newsletter Highlighted Box Web Sites - Subject to Advertising Standards from 1st March - check your sites Global and Enable - two misused words! Are you wanting to learn PHP? Reading all our recent news from a single source A (biased?) comparison of PHP courses in the UK We have lost a regular business guest Why the Pony Tail? LinkedIn - Thrice Asked, and joined. How many cups of coffee? Public Training Course Dates until July 2009 A short introduction to our courses Linux and Java Course in London Evening drive across the roof of Wiltshire Tcl/Tk - updating your display while tasks are running Python v Ruby Troy, up state New York This article Well House Manor, Melksham, Art Gallery ls -l report, Linux / Unix - types and permssions Well House Manor and Beechfield House, Hotels, Melksham G902 - Web site techniques, utility and visibility Almost so wrong, but perhaps it's right for some? Effect on external factors on traffic to our web sites - an update Selecting RECENT and POPULAR news and trends for your web site users Well House Consultants, Well House Manor, First Great Western Coffee shop, TransWilts / 2014 web site reports Facebook marketing - early experiences How do I post automatically from a PHP script to my Twitter account? More or less back - what happened to our server the other day Web site - fully back! Helping search engines with appropriate 400 error codes TV show appearance - how does it effect your web site? An email marathon Some traps it's so easy to fall into in designing your web site Legal change - You need to obtain user consent if you use cookies on your website Short Web Addresses for Melksham QR codes with marketing logos embedded Some TestWise examples - helping use Ruby code to check your web site operation Promoting a single one of your domains on the search engines How big is a web page these days? Does the size of your pages matter? Learning more about our web site - and learning how to learn about yours Sharing the user experience - designing a form with the customer in mind Who is knocking at your web site door? Are you well set up to deal with allcomers? Automed web site testing scripted in Ruby using watir-webdriver Google +1 - what is it? Finding and diverting image requests from rogue domains Looking back at www.wellho.net Making the most of critical emails - reading behind the scene Retaining web site visitors - reducing the one page wonders How to set up short and meaningfull alternative URLs Is it worth it? How to run a successful online poll / petition / survey / consultation Web site traffic - real users, or just noise? Analysing Google arrivals by country of origin Status Page / breaks of service in early December Removal of technical resources from this site Writing with our customers words Koulutus, Open Source tietokone kielillä ldning, Open Source dator sprÃ¥k ldning, Open Source dator sprÃ¥k Opplæring, Open Source datamaskinen sprÃ¥k Uddannelse, Open Source computer sprog Opleiding, Open Source computertalen Formação, Open Source computador lÃnguas Ausbildung, die Open-Source-Sprachen Formazione, Open Source computer lingue Formación, de los lenguajes de código abierto Formation, des langages Open Source How important is a front page ranking on a search engine? Static mirroring through HTTrack, wget and others Web Site Loading - experiences and some solutions shared Cooking bodies and URLs Plagarism - who is copying my pages? Making our things easier to find How to avoid duplicating web page maintainance Find the link A few of my favourite things Web Bloopers - good form design - avoiding pitfalls I have been working hard but I do not expect you noticed Which country does a search engine think you are located in? Ever had One of THOSE mornings? Who is watching you? Rapid growth leads to server move How do Google Ads work? Kiss and Book To provide external links, or not? PHP course dot co, dot uk Online hotel reservations - Melksham, Wiltshire (near Bath) Colour, Composition or Content Where in the world / country is my visitor from? Perl, PHP or Python? No - Perl AND PHP AND Python! Ongoing Image Copyright Issues, PHP and MySQL solutions Script to present commonly used images - PHP A time to update pictures Above the fold with First Great Western Stuffing content into a web page - easy maintainance This article What brought YOU to our web site? Simple but effective use of mod_rewrite (Apache httpd) From Web to Web 2 Two new pages / sites Finding resources - some pointers Sorting out for a site map Drawing dynamic graphs in PHP Above the fold Our search engine placement is dropping. Search engine placement - long term strategy and success Training on Cascading Style Sheets Santa at the station Driving customers away Visibility Effective web campaign? Finding the language preference of a web site visitor Horse and Python training Where is a web site visitor browsing from Protecting images from theft Mirroring a dynamic site Keeping the visitors happy and browsing Denial of Service ''attack'' Bigger Box Campaign Getting favicon to work - avoiding common pitfalls Dynamic Web presence - next generation web site New Navigation Aid - Launch of My Wellho Form Madness What brings people to my web site? CMS - the minefield of Choices Graveyard pages Frightening and from-friend viruses and spams More maps Ordnance Survey - using a 'Get a map' What language is this written in? Growth pains Colour blindness for web developers The Iconish language Cover all the options An apology to Mr Boneparte Our most popular resources Information request forms, cleaning up spam Putting a form online Responding to spam Who are all these visitors? Searching for numbers Allow for peak traffic on your web site Your personal Google ranking The hunt for unique words Data Mining Implementing an effective site search engine Colour for access A case of case URLs - a service and not a hurdle No more 'Error 404' pages. Something better. Web design platoon Skills and responsibilities A606 - Apache httpd - log files and log tools Web Server Admin - some of those things that happen, and solutions Which (virtual) host was visited? Tuning Apache log files, and Python analysis Identifying and clearing denial of service attacks on your Apache server 20 minutes in to our 15 minutes of fame TV show appearance - how does it effect your web site? Reading Google Analytics results, based on the relative populations of countries Learning more about our web site - and learning how to learn about yours Who is knocking at your web site door? Are you well set up to deal with allcomers? Needle in a haystack - finding the web server overload Getting more log information from the Apache http web server Making the most of critical emails - reading behind the scene Server logs - drawing a graph of gathered data Apache httpd Server Status - monitoring your server Logging the performance of the Apache httpd web server libwww-perl and Indy Library in your server logs? Server overloading - turns out to be feof in PHP Logging Cookies with the Apache httpd web server Be careful of misreading server statistics Every link has two ends - fixing 404s at the recipient Web page (http) error status 405 This article What brings people to my web site?