Introduction to Google Analytics Sampling

First, let's begin by discussing what sampling is. According to Wikipedia:

In statistics and survey methodology, sampling is concerned with the selection of a subset of individuals from within a population to estimate characteristics of the whole population.

Researchers rarely survey the entire population because the cost of a census is too high. The three main advantages of sampling are that the cost is lower, data collection is faster, and since the data set is smaller it is possible to ensure homogeneity and to improve the accuracy and quality of the data.

However, for some unfortunate reason, my geeky passion for math, statistics, and other engineering sciences is seldom shared by my friends and colleagues, so if you spent your college years in anything other than statistics classes, allow me to translate what this means.

Let's assume that your website generates a lot of traffic. Even though a product like Google Analytics collects all your website’s traffic data, it would take a significant amount of time to generate reports based on the complete set of data. Instead, the tool takes a subset of your data and presents a report based on that sample, not the whole thing.
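To make the idea concrete, here is a minimal sketch of sampling in general, not of how Google Analytics implements it. The traffic data, the 30% IE share, and the function name estimate_from_sample are all made up for illustration. The sketch counts matching sessions in a random subset and scales that count up to the full data set.

```python
import random

def estimate_from_sample(sessions, sample_size, predicate, seed=0):
    """Estimate how many sessions match `predicate` from a random sample."""
    rng = random.Random(seed)
    sample = rng.sample(sessions, sample_size)
    matches = sum(1 for s in sample if predicate(s))
    # Scale the sample count up to the size of the full data set.
    scale = len(sessions) / sample_size
    return round(matches * scale)

# Hypothetical traffic: 100,000 sessions, 30% of them from Internet Explorer.
sessions = ["IE"] * 30_000 + ["Other"] * 70_000

exact = sum(1 for s in sessions if s == "IE")
estimate = estimate_from_sample(sessions, 5_000, lambda s: s == "IE")
print("exact:", exact, "estimated from 5% sample:", estimate)
```

The estimate only touches 5% of the rows, which is why it is faster, and it will usually land close to the exact count without matching it.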

Until recently, it wasn’t possible to control sampling. If your data was sampled, you’d see the following message:

All Traffic Google Analytics Picture

A new feature in Google Analytics is the Adjust Sample Size tool. The slider, which is located below the date range, allows the user to choose between faster processing and higher precision. You can adjust the sample size from the default of 250,000 visits (the center of the slider) up to 500,000 visits. When you choose a sampling threshold, that preference is used in all reports until you close Google Analytics.

Google Analytics Faster Processing
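For a rough feel of what "higher precision" buys you, the textbook margin of error for a proportion is a useful yardstick. This is a sketch under the assumption of a simple random sample; Google Analytics may sample differently, and the 10% share used below is a made-up number.

```python
import math

def margin_of_error(p, n, z=1.96):
    """Approximate 95% margin of error for a proportion p estimated from n visits."""
    return z * math.sqrt(p * (1 - p) / n)

# Compare the default slider position with the maximum one.
for n in (250_000, 500_000):
    print(n, margin_of_error(0.10, n))
```

Because the margin shrinks with the square root of the sample size, doubling the sample from 250,000 to 500,000 visits narrows it by a factor of about 1.41, not 2.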

When does Google Analytics use data sampling?

Google Analytics samples data when your requested data size meets one of the following conditions:

    • 500,000 maximum sessions for special queries where the data is not already stored.
    • Any query that exceeds 1,000,000 unique dimension combinations.
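The two conditions above can be written down as a small check. This is only my reading of the thresholds, not an official rule engine: the function name is_likely_sampled and the precomputed flag (standing in for "the data is already stored") are hypothetical.

```python
SESSION_THRESHOLD = 500_000              # first condition above
DIMENSION_COMBO_THRESHOLD = 1_000_000    # second condition above

def is_likely_sampled(sessions_in_range, unique_dimension_combinations,
                      precomputed=False):
    """Return True if a report request would probably be sampled.

    `precomputed` is an assumed flag meaning the report data is already stored.
    """
    if unique_dimension_combinations > DIMENSION_COMBO_THRESHOLD:
        return True
    if not precomputed and sessions_in_range > SESSION_THRESHOLD:
        return True
    return False

# A special (not already stored) query over 600,000 sessions.
print(is_likely_sampled(600_000, 10))
# The same date range for a report whose data is already stored.
print(is_likely_sampled(600_000, 10, precomputed=True))
```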

Why should I worry about this?

Data sampling may lead to inconsistent results when you run Google Analytics reports. Here is a real-life example:

When the sampling is set to ‘Faster Processing’, the report is built based on 981 visits and shows 9,605 IE visits from the State of NY.

Google Analytics Faster Processing

Keeping everything the same, when the sampling is set to the middle of the scale, the number of IE visitors from the State of NY drops significantly to 7,918.

Google Analytics Faster Processing

Finally, for this example, the highest and the average precision yield the same result for IE visitors from the State of NY – 7,918.

Google Analytics Faster Processing
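To put the gap between the two figures in perspective, here is the arithmetic as a short sketch. The two numbers come from the reports above; the helper name relative_difference is just for illustration.

```python
def relative_difference(sampled, reference):
    """How far `sampled` is from `reference`, as a fraction of `reference`."""
    return (sampled - reference) / reference

faster = 9_605    # IE visits from NY at 'Faster Processing'
precise = 7_918   # IE visits from NY at the middle and highest settings

diff = relative_difference(faster, precise)
print(f"{diff:.1%}")  # prints 21.3%
```

In other words, the fastest setting overstated this segment by roughly a fifth compared with the more precise settings.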

Why were the results the same between the average and the high precision? Be the first to comment below or send us a note with your answer, and if you’re right we’ll send you an InfoTrust SuperHero T-shirt.

What can I do about it?

If you are concerned about the effect of data sampling, you have a couple of options:

1. You can change the Google Analytics Tracking Code (GATC) to collect only a percentage of your site’s traffic rather than all of it. We will cover more technical details in the next post on this subject.

2. Reduce your date range so you have less data.

3. Upgrade to Google Analytics Premium. We are working on a post about GA Premium, so stay tuned!
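To illustrate what the first option means conceptually, here is a sketch of collecting data for only a percentage of visitors. It is not the tracking code and not Google's implementation: the function is_tracked, the visitor IDs, and the hashing scheme are assumptions chosen for the example.

```python
import hashlib

def is_tracked(visitor_id: str, sample_rate_percent: float) -> bool:
    """Decide once per visitor whether their hits are collected at all."""
    digest = hashlib.sha256(visitor_id.encode("utf-8")).digest()
    # Map the visitor to a stable bucket in the range 0..9999.
    bucket = int.from_bytes(digest[:8], "big") % 10_000
    return bucket < sample_rate_percent * 100

# Hypothetical visitors, collecting data for about 50% of them.
visitors = [f"visitor-{i}" for i in range(20_000)]
tracked = [v for v in visitors if is_tracked(v, 50)]
print(len(tracked) / len(visitors))
```

The decision is made per visitor rather than per hit, so a tracked visitor's sessions stay complete instead of losing random pageviews.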

If your site gets more than 10M hits per month

First of all, congratulations! Now, you should keep in mind that the Google Analytics Terms & Conditions state that “if you exceed more than 10 million hits per month, there is no assurance that the excess hits will be processed.” We are going to talk about your options in a future post, but for the purposes of our discussion about sampling, we will assume that you have a free Google Analytics account and your traffic is under 10M hits per month. If not, and if you have any questions, don’t hesitate to reach out to me at Alex@InfoTrustLLC.com.

Stay tuned as we are going to publish another blog post on more advanced topics associated with sampling.

Article written by Alex Yastrebenetsky

    Alex Yastrebenetsky is a founder (and CEO) of InfoTrust. Known as "The Brain" (Pinky and the Brain) around the office, he enjoys traveling with his wife and young children.

Originally Published: June 11, 2012

October 13, 2023
