Your resource for web content, online publishing
and the distribution of digital products.

How did we use DBT and BigQuery to manage late arriving web events?

Tags: web
DATE POSTED: October 1, 2024

Here is how we process half a billion web records a day without spending half a billion every day. This post explains a key mechanism in our data platform infrastructure that lets our web data asset refresh hourly while still processing late-arriving web events.

Here’s a basic architecture diagram of our data platform:

As you can see, we use DBT as the workflow and data transformation layer, and BigQuery for data storage and querying.
