Cloud-based DWH solutions offer:
Sources: They are the green nodes in the DAG, sources represent the raw data tables or views in your data warehouse that serve as the starting point for your transformations.
schema.yml
) within your dbt project.version: 2
sources:
- name: raw_data # Name of the source (logical grouping of tables)
database: raw # Database where the source tables are located
schema: public # Schema where the source tables are located
tables: # List of tables in the source
- name: orders # Name of the table
description: "Raw orders data from the e-commerce platform"
columns: # List of columns in the table
- name: order_id
description: "Unique identifier for each order"
tests:
- unique
- not_null
- name: order_date
description: "Date the order was placed"
- name: amount
description: "Total amount of the order"
tests:
- not_null
- name: customers # Another table in the same source
description: "Raw customer data from the CRM system"
Models: the core building blocks used to transform and structure data in your data warehouse. They are essentially SQL files with select statements that define how raw data should be transformed into meaningful, reusable datasets for analysis.
Tests: They are used to validate the quality, accuracy, and integrity of your data and transformations. Tests ensure that your data models meet predefined expectations and help catch issues early in the data pipeline.
version: 2
models:
- name: orders
columns:
- name: order_id
tests:
- unique
- not_null
- name: status
tests:
- accepted_values:
values: ['pending', 'shipped', 'delivered']
not_null
: Ensures a column does not contain null values.unique
: Ensures all values in a column are unique.accepted_values
: Ensures a column contains only specified values.relationships
: Ensures referential integrity between two models (e.g., foreign key constraints).tests
directory.macros
directory.dbt run
→ runs your models
dbt test
→ Tests your model as they are being built
dbt docs generate
→ generate docs for your projects
dbt build
→ runs + tests your models