Busting the Google Analytics Mythbuster

Web Log Storming

An interactive web server log analyzer and analytics tool - with a twist.

Web Log Storming screenshot

  • Dig out from your web log files pretty much anything you can imagine
  • Browse through website statistics
  • Drill-down to the level of individual visitors
  • Easily check accuracy by inspecting details, and exclude irrelevant data
  • Free 30-day trial available

Learn more about Web Log Storming

In a recent article on the Google Analytics Blog, the author tries to bust several myths circulating in public. It contains a few half-truths and (intentional?) deceptions that I simply can't ignore.

As I mentioned earlier, Google Analytics can be a nice addition to your main analytics solution (even we use it occasionally), provided that you don't mind the baggage that comes with the "free" label (What price Google Analytics?, Google Analytics - is it worth its price?, Google Analytics is not free, or search Google :) for more). JavaScript-based systems give some information that log files can't, but they also suffer from several drawbacks that limit their value if used alone.

As each product has its own audience, I won't question anyone's decision to choose one type of solution or another, but some things simply must be said, regardless of how many people will read this compared to the original article. :)

MYTH 1: "You get what you pay for." Google Analytics is free, which means the system is down a lot.

I do agree that the GA system is not down very often (if ever). Why would it be? They have more than enough resources to keep it alive, and imagine how much data they would lose in just one minute of downtime. But no matter how powerful their servers are, your website will inevitably be a bit slower, because every page has to load and run an extra tracking script. I doubt that you'll find this particularly alarming, but still...

MYTH 4: Google Analytics is not really accurate

Google Analytics uses JavaScript tags to collect data. This industry-standard method [...] discrepancies greater than 10%, it's due to an installation issue. Common problems include JavaScript errors, redirects, untagged pages and slow client-side load times.

[...]

All web analytics tools face the same technical limitations posed by JavaScript tags [...]

Ouch. This one is the main motivation for me to write this article. I'll just comment on the phrases in bold (in order of appearance).

  1. JavaScript tags are just one of the methods used today. Even if we ignore custom in-house systems (based on whatever web developers use: PHP, ASP, Python, Ruby on Rails, ...), pretending that the still widely accepted server log file analysis doesn't exist is, at the very least, intentional delusion.
  2. Expected discrepancies of 10% or below among JavaScript-based analyzers could be true, but compared to log file analysis, they show 2 to 5 times less traffic.
  3. It could be an installation issue only if visitors can be tracked with JavaScript in the first place. What about other traffic?
  4. Again, we can talk about errors and slow connections only when JavaScript tracking is possible.
  5. Saying that all web analytics tools face the same limitations is simply not true. JavaScript-based web analytics tools do have these limitations, but log file analyzers don't.

Pardon me if you don't care about visitors that block JavaScript or click on a different link before the tracking script is loaded, websites that link directly to images on your website, downloads of non-HTML files (PDF, ZIP, EXE, images, ...), bandwidth usage, spiders, bots that pull down the whole website on a regular basis (wasting your bandwidth), direct access to scripts by hackers, etc. Sure, with JavaScript analytics you can see trends, and if you only care about marketing it could be good enough, but the total numbers are not even close, and you can forget about the other information that can be found in server log files.
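To make the list above a bit more tangible, here is a minimal Python sketch of how some of that "invisible" traffic could be counted from raw log lines. It is only an illustration, not how Web Log Storming or any other analyzer actually works. It assumes the log is in the common Apache "combined" format, and the names `summarize`, `BOT_HINTS`, `SAMPLE` and the `example.com` host are all made up for this example. The bot detection in particular is a crude substring check; real tools use far more elaborate rules.

```python
import re
from collections import Counter

# Assumed format: Apache "combined" log line, i.e.
# host ident user [time] "METHOD path protocol" status bytes "referer" "user-agent"
LINE_RE = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Illustrative hints only (hypothetical, far from complete).
BOT_HINTS = ("bot", "spider", "crawl")
DOWNLOAD_EXTS = (".pdf", ".zip", ".exe")
IMAGE_EXTS = (".png", ".jpg", ".jpeg", ".gif")


def summarize(lines, own_host="example.com"):
    """Count requests that a JavaScript tag would typically never report."""
    counts = Counter()
    total_bytes = 0
    for line in lines:
        m = LINE_RE.match(line)
        if m is None:
            counts["unparsed"] += 1
            continue
        counts["requests"] += 1
        if m["bytes"] != "-":
            total_bytes += int(m["bytes"])  # bandwidth, bots included
        path = m["path"].split("?", 1)[0].lower()
        agent = m["agent"].lower()
        referer = m["referer"]
        if any(hint in agent for hint in BOT_HINTS):
            counts["bot_requests"] += 1
        if path.endswith(DOWNLOAD_EXTS):
            counts["file_downloads"] += 1
        # An image requested from a page on another site: a hotlink.
        if (path.endswith(IMAGE_EXTS)
                and referer not in ("", "-")
                and own_host not in referer):
            counts["hotlinked_images"] += 1
    return counts, total_bytes


# Made-up sample lines, using documentation-only IP ranges and hosts.
SAMPLE = [
    '203.0.113.5 - - [10/Oct/2009:13:55:36 +0000] "GET /index.html HTTP/1.1" '
    '200 5120 "-" "Mozilla/5.0"',
    '203.0.113.9 - - [10/Oct/2009:13:56:01 +0000] "GET /files/manual.pdf HTTP/1.1" '
    '200 204800 "http://example.com/docs.html" "Mozilla/5.0"',
    '198.51.100.7 - - [10/Oct/2009:13:57:12 +0000] "GET /about.html HTTP/1.1" '
    '200 3072 "-" "ExampleBot/1.0 (+http://bot.example.org)"',
    '192.0.2.44 - - [10/Oct/2009:13:58:40 +0000] "GET /img/logo.png HTTP/1.1" '
    '200 10240 "http://other-site.example.net/page" "Mozilla/5.0"',
]

if __name__ == "__main__":
    counts, total_bytes = summarize(SAMPLE)
    print(dict(counts), total_bytes)
```

Of the four sample requests, only the first is one that a JavaScript tag would be likely to register: the PDF download and the hotlinked image never execute a tracking script, and most bots don't run JavaScript at all. All four still cost bandwidth, and all four are sitting in the log file.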

MYTH 6: With Google Analytics you can't control your data

Yes, you can control your data... to some degree. Google promises to resist the urge to analyze your data for its own purposes (if you don't forget to explicitly say so), but the fact is that they already have your data, right there. In this information era, knowledge is a big asset. Sorry, but I don't buy that they won't ever "peek", just a little. Probably under the excuse of "serving better search results" (or more likely, "serving better advertisements"). And I'm not talking about analytics only: they have search queries, e-mails, documents, appointments, instant messages, etc. They predicted the winner of the Eurovision 2009 contest based on what people search for, and I should believe that they won't silently use all the information they can for profit? Right...

Even if you do trust Google (and every one of its employees), you still can't say that you fully control your data, as it's still on their servers. Anything can happen in the future. What if Google goes bankrupt? Okay, not likely, but possible. :) Therefore, you can't fully control your data, but don't get me wrong: I admit that there are a few pros. For example, you don't need to think about backups - the data is much safer on Google's servers than on your computer. :)

* * *

Disclaimer: the purpose of this article is not to persuade anyone to use a server log analyzer instead of Google Analytics (I wrote another article for this :) ), but to point out a few things that are too easily overlooked these days, intentionally or not.

Related articles

10 strengths of web log analyzers compared to JavaScript based analytics
Which web log analyzer I should use?
The Remedy for a Web Analytics Headache
What price Google Analytics? (external, by Dave Collins)



