Tools & Platforms · July 09, 2024 · 3 min read

Navigating Data Differences Between Weld and Google Analytics 4

by Pedro Prazeres

With Weld you can integrate your Google Analytics 4 data for easy transformation, modelling, and combination with other data sources to build the valuable business insights you need.

However, as is common when combining data from different platforms, you might see some differences between the values in your Google Analytics dashboard and the values imported into Weld through Google's Reporting API. Don't worry, your data is safe and correct! These differences are expected, and they stem from the way Google handles data in its GA4 processes.

In this post, we look into the reasons behind the data discrepancies you might encounter, and what to keep in mind when you need to balance precision and efficiency. Let's start by taking a look at some of the approaches Google Analytics takes to handle your data analysis.

Data Sampling

Google Analytics 4 uses data sampling, a process in which only a subset of a dataset is used to estimate the characteristics of the entire dataset. This allows faster data retrieval and processing due to the smaller amounts of data involved.

In Google Analytics, data sampling may occur when the number of events used to create a report, exploration, or request exceeds the quota limit for your property.

[GA4] About data sampling - Analytics Help

The quota limits for event-level queries are, at the time of writing, 10 million events for standard Google Analytics properties and 1 billion events for Google Analytics 360 properties. When data sampling is applied, the data quality icon in the top right of the affected cards and explorations in your Google Analytics 4 dashboard indicates it.

Figure: GA4 Sampling

The higher the percentage of data used, the more accurate your results will be.
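
To make the effect of the sampling percentage concrete, here is a minimal Python sketch. It does not reproduce how Google Analytics samples data internally; it only illustrates the general idea of estimating a total from a random subset. The function `estimate_total`, the channel names, and the synthetic event list are all assumptions made up for this illustration.

```python
import random


def estimate_total(events, sample_rate, target="organic", seed=0):
    """Estimate how many events equal `target` using only a random sample."""
    rng = random.Random(seed)
    sample_size = max(1, round(len(events) * sample_rate))
    sample = rng.sample(events, sample_size)
    matches = sum(1 for event in sample if event == target)
    # Scale the sampled count back up to the size of the full dataset.
    return round(matches * len(events) / sample_size)


# Synthetic data: 100,000 events spread over three hypothetical channels.
data_rng = random.Random(42)
events = [data_rng.choice(["organic", "paid", "direct"]) for _ in range(100_000)]
exact = events.count("organic")

for rate in (0.01, 0.1, 1.0):
    estimate = estimate_total(events, rate)
    print(f"sample rate {rate:>4.0%}: estimate={estimate}, exact={exact}")
```

With a sample rate of 100% the estimate equals the exact count. With smaller rates it is only close to it, which is the trade-off between speed and accuracy described above.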

HyperLogLog

Counting the distinct items in a large dataset exactly (that is, computing its cardinality) requires significant amounts of memory and computing resources. To reduce memory usage and provide fast results, Google Analytics 4 uses the HyperLogLog++ (HLL++) algorithm, an augmented version of the HyperLogLog algorithm.

The HLL++ algorithm estimates the cardinality of several metrics in GA4, giving an approximation of the total. In practice, this means that the values in your Google Analytics 4 dashboard are delivered quickly and efficiently, but they are approximations. In most cases, the approximation is quite accurate, with a low error rate.
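
The following Python sketch shows how this kind of estimate works. It implements the plain HyperLogLog algorithm, not the HLL++ variant that Google uses (HLL++ adds refinements such as bias correction and a sparse representation), so treat it as an illustration of the idea rather than a description of GA4's implementation. The class name, the choice of SHA-1 as the hash, and the precision `p=14` are assumptions for this example.

```python
import hashlib
import math


class HyperLogLog:
    """Simplified HyperLogLog: approximate distinct counts in fixed memory."""

    def __init__(self, p=14):
        self.p = p                       # number of bits used to pick a register
        self.m = 1 << p                  # number of registers
        self.registers = [0] * self.m
        self.alpha = 0.7213 / (1 + 1.079 / self.m)  # constant valid for m >= 128

    def add(self, item):
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")         # 64-bit hash value
        index = h >> (64 - self.p)                    # first p bits: register
        rest = h & ((1 << (64 - self.p)) - 1)         # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[index] = max(self.registers[index], rank)

    def count(self):
        raw = self.alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction (linear counting).
            return round(self.m * math.log(self.m / zeros))
        return round(raw)


hll = HyperLogLog()
exact = set()
for i in range(100_000):
    session_id = f"session-{i}"
    for _ in range(2):                   # each session appears twice
        hll.add(session_id)
        exact.add(session_id)

print("estimated distinct sessions:", hll.count())
print("exact distinct sessions:    ", len(exact))
```

The sketch keeps only 16,384 small registers regardless of how many items it sees, while the exact count has to remember every distinct value. The price is a small relative error, typically around 1% at this precision.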

However, when you connect Google Analytics 4 to your Weld account, the values of the same distinct counts will likely differ. This comes down to where and how your data is stored and processed through Weld: the destinations we offer have the time and resources to perform the full calculations and, consequently, give you precise distinct counts for session metrics.

You can see the results of HLL++ in your own GA4 dashboard: the totals presented for some of the metrics do not correspond to the sum of the values in the corresponding columns.

Figure: Data Discrepancy

As you can see below, the values are different when the same data is explored through Weld's SQL editor. For example, your total session count from the Organic Search channel might show a value of 3959 in GA4, and a total of 3955 in your Weld account.
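
To put a difference like this in perspective, you can express it as a relative error. The short Python sketch below uses the two example values from this post; the helper name `relative_difference` is just an illustrative choice.

```python
def relative_difference(reported, exact):
    """Relative gap between a reported value and an exact reference value."""
    return abs(reported - exact) / exact


ga4_sessions = 3959    # value shown in the GA4 dashboard (example above)
weld_sessions = 3955   # value computed in Weld (example above)

print(ga4_sessions - weld_sessions)                                # prints 4
print(f"{relative_difference(ga4_sessions, weld_sessions):.2%}")   # prints 0.10%
```

In this example the two values differ by 4 sessions, or about 0.10%.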

Figure: Weld SQL Comparison

Considerations

Whenever you need a quick look at your Google Analytics data, the GA4 dashboard will give you fast, though slightly approximate, results due to the use of both data sampling and the HyperLogLog++ algorithm. But if precision is what you need, having your data connected through Weld will allow you to use the full power of our destinations to calculate the exact values of the metrics you require.
