Remove junk traffic from your Google Analytics data

Gerald Thulbourn, WebSanity Internet Marketing, HuntingdonSome of you might have noticed a horrible creeping problem in Google Analytics: traffic from junk sources such as web bots and spiders. Remove this non-human traffic with a new filtering option from Google.

If you’ve ever seen traffic in your Google analytics from a source called SEMALT it’s total junk. Some kind of semi-legitimate automated program that is spidering your website looking for SEO information (to sell back to you? or your competitors?) For low traffic sites or for niche pages it can really throw your stats off. (and there are more where it came from).

How come Google doesn’t filter it out? Normally “nice bots” identify themselves as such and so Google’s Analytics code can strip their traffic out: you’ll never know they were there, OR, because they aren’t triggering the JavaScript on your site, they simply don’t trigger the analytics tracking code.

As spiders have grown more sophisticated in their analysis of websites, they now do sometimes trigger JavaScript code, therefore looking just like a human visitor.

Google have countered this by adding a setting in your Google Analytics where you can tick a box under Bot Filtering to “Exclude all hits from known bots and spiders“. From that point forwards (NOT retrospectively) Google Analytics will ignore all such traffic.

How does it know what to ignore? It bases it on an independently maintained list of “bots” identified by the global community; so it should keep up to date with all the latest releases.

Why didn’t they turn it on automatically? This could make a big change to some people’s traffic reports (and often what is more important in SEO is the relative numbers month on month, not the absolute numbers).

So, armed with the understanding that you might see a traffic drop – but in future see only real human traffic – head into Bot Filtering in your Google Analytics settings and tell spam bots and spiders where to go…


Help and advice on Web Design in HuntingdonPeterborough & Cambridge AND SEO in Huntingdon, Peterborough & Cambridge from WebSanity Internet Marketing.

This entry was posted in Search Engine Optimisation (SEO), Web Hints & Tips, Web News and tagged , , , . Bookmark the permalink.

Comments are closed.

About Gerald Thulbourn

Gerald Thulbourn setup WebSanity in 2004. He has a 1st class honours MEng in Microelectronics & Software Engineering (i.e. he's a techy), 5 A grades at A level (i.e. he works hard) and loves to communicate (i.e. odd for a techy). He hates tech speak, sloppy/badly tested code, and technology for the sake of technology's sake. He loves helping people understand marketing concepts and seeing how their application makes a real difference to their business. In particular he loves training; SEO, Website Analysis, WordPress etc. Read more about us on Google+