We all know we should be using web analytics to analyse web site visitor behaviour and online marketing channel performance. However what type of web analysis should we use? Should you go for log file analysis or page tagging or a bit of both? First of all lets define what we mean by these terms.
Page tagging involves placing a piece of code usually externalised JavaScript on each page of your site and is sometimes referred to as client-side data collection. Every time a tagged page is opened by a visitors browser the script is processed and visitor information collected. Log file analysis refers to data collected by your web server. Whats the difference from a web analytics point of view?
The bad news is that both strategies have their advantages and disadvantages so here goes.
Page Tagging Advantages
Because data is collected client side this gets around any proxy and caching problems
Will give you information on web design parameters such as browser versions, platform versions, screen resolution, connection speed etc
Track client side events such as JavaScript and flash events
Page Tagging Disadvantages
Firewalls can prevent or interfere with script processing
Set up costs associated with insertion of code.
Insertion of code can lead to errors
Will not pick up page errors such as 404s
Because robots ignore scripts can not track search engine spiders
Unable to directly track non html pages
Vendor Specific
Logfile Analysis Advantages
Historical Data can be analysed
Little set up cost
No firewall issues
Easily track page errors
Can track Search Engine spiders
Vendor Independent
Can track non html pages such as pdfs
Logfile Disadvantages
Proxy/caching inaccuracies. If a page is cached no record is logged on your web server
No web design parameters
No event tracking
If you are used to looking at web statistics using Web Trends for instance you may see significant differences in visitor numbers. When moving to logfile analysis visitor numbers may increase by 20-30%. If your site is not using persistent cookies your web analytics programme can not identify unique visitors therefore all visitors are lumped together as total. Typically unique visitors represent about 20 -30% of total web site visits so this metric will be inflated by this amount. Sometimes youll see a dramatic reduction in site visits. This is usually because web analytics programmes strip out the loading of graphics which are erroneously counted as visits by other programs.
Other differences in visitor numbers are usually due to how programs define a visit. A visit duration of 30 minutes means that multiple visits from the same IP address with-in this time period will be counted as a single visit. Change this parameter to 15 minutes and these visits could be counted several times and your total visits will increase. Finally, when a web browser loads a PDF file is brings down different parts of the file at different time and some programs can count this as multiple requests for the same file. A good web analytics programme will collapse these multiple downloads into a single.
It is important to understand these differences and manage the expectations of your colleagues as surprise drops in web site metrics can sometimes lead to disenchantment with measuring web site performance altogether.
For more information on web analytics speak to us at www.ju2.com and keep an eye on our blog at www.ju2analytics.com
Jim Williams is the Managing Director of Web Strategy Consultancy http://www.ju2.com