π GA Data to Azure Solution Option: GA4 β Server GTM β Azure Databricks
Architecture Overview
The following diagram illustrates the end-to-end data flow:
flowchart LR
U["User Browser"]
W["GTM Web Container Fires GA4 Config Tag"]
S["sGTM Server Container Receives events via transport_url"]
GA["Google Analytics (Reporting)"]
AF["Azure Function HTTP Trigger"]
EH["Azure Event Hub (Capture Enabled)"]
ADLS["ADLS Gen2 Raw Event Storage"]
DB["Azure Databricks Autoloader β Delta Tables"]
U --> W
W --> S
S --> GA
S --> AF
AF --> EH
EH --> ADLS
ADLS --> DB
Data Flow: Browser β analytics.beyondscreen.online (sGTM) β GA4 + Azure Function β Event Hub β ADLS β Databricks
1. Website Setup
Host website on GitHub Pages (prasoot.github.io).
Register custom domain in GoDaddy (beyondscreen.online).
Create storage account with hierarchical namespace.
Create container ga4-raw.
Event Hub Capture writes Avro/Parquet files here.
8. Azure Databricks
Create workspace and mount ADLS Gen2.
Ingest via Autoloader:
9. Validation
Browser DevTools β only analytics.beyondscreen.online.
sGTM Preview β GA4 + Azure forwarders firing.
Azure Function logs β GA4 ingest function triggered.
Event Hub metrics increasing.
ADLS β new files arriving.
Databricks β row counts increasing.
β End State
All GA4 hits go via your first-party domain β sGTM β GA4 + Azure β Event Hub β ADLS β Databricks.
You get analytics in GA + raw data in Azure for advanced analysis & ML.