Unlock the power of Google Analytics with Big Query

Mastering GA4 Users: Metrics, Identity, and BigQuery Analysis

GA4 completely changed how we think about website and app users. Instead of just counting every visitor like Universal Analytics (UA) did, it focuses on real engagement. To truly master GA4, you have to go beyond the basics and understand how it defines, measures, and reports on users, especially when you dive into the raw data in BigQuery.


The Core GA4 User Metrics

GA4’s data model is built around events, and users are defined as the people who trigger them. The biggest shift from UA is the emphasis on Active Users as the primary metric.

MetricDefinitionNotes
Active UsersUnique users with an engaged session or new users within the timeframeFilters out short, non-interactive visits; primary metric in GA4 reports
Total UsersAll unique users who triggered at least one eventIncludes both active and inactive users; always higher than Active Users
New UsersUsers with their first-ever session (first_visit / first_open)Can overlap with Returning Users if analyzing multiple days/weeks
Returning UsersUsers who have had a previous sessionCounts only users who had prior activity in your date range

⚠️ Note: A single user can be both New and Returning in the same date range (e.g., first visit Monday, returns Wednesday).


GA4 Reporting Identity: Connecting the Dots

GA4 tracks a single user across devices and platforms using a multi-layered identity system, which is more advanced than UA’s reliance on a single browser cookie.

Identity SpaceHow It WorksBest Use
User IDUnique ID assigned to logged-in usersTrack cross-device behavior; best for de-duplicated user analysis
Google SignalsUses Google account login info and ad personalization consentHelps GA4 deduplicate users across devices without login
Device IDClient ID (web) or app instance ID (mobile)Default fallback; counts same person on different devices separately
ModelingMachine learning fills gaps for users who decline cookiesEstimates behavior based on similar consenting users

In GA4 Admin, you can prioritize identity spaces: Blended (default), Observed, or Device-based. Best practice: use Blended and implement User ID whenever possible.


Diving Deep: GA4 Users in BigQuery

For advanced analysis, the GA4 interface isn’t enough—you need raw, event-level data in bigquery.

Note: if you haven’t started tour bigquery journey, start here: How to set up BigQuery linkage in your Google Analytics 4 property

user_pseudo_id (Default Identifier)
An anonymous ID tied to a user’s device and browser. Use COUNT(DISTINCT user_pseudo_id) for general user counts.

user_id (Cross-Device Key)
Populated if you implement User ID. Key to cross-device analysis and connecting GA4 data with internal systems.

By combining these IDs in SQL queries, you can analyze logged-in vs. anonymous users, cross-device behavior, and even replicate GA4’s Active Users metric with custom filters.


Implementation and Best Practices

PracticeReason
Always Implement User IDEnsures true, de-duplicated cross-device user counts
Understand MetricsGA4 counts may be lower than UA but reflect real engagement
Use GA4 ExplorationsSegment, path, and funnel analysis for advanced insights
Combine GA4 with BigQueryEnables 360° user view, cohort analysis, and CRM integration

TL;DR

Comments

3 responses

Discover more from GA4 BigQuery

Subscribe now to keep reading and get access to the full archive.

Continue reading