- It’s a SQL file that consists of SQL statements that defines how you want to create your view or materialized tables.
- DBT handles all the DML and DDL commands in the backend, creating the view/table.
dbt run --select <model_name>
→ runs a specific model
dbt run --select <model_name>+
→ runs a specific model and all its downstream models
Modularity: is the degree to which a system's components may be separated and recombined, often with the benefit of flexibility and variety in use.
Naming Conventions:
- Source Models: A way of referencing the raw data that’s in our data lake
- Staging Models: Focuses on transforming the source data by applying light transformations (e.g. renaming columns, Converting dtypes).
- Intermediate Models: They reside between the staging models and the rest of the pipeline, that should directly reference the staging models and not the source models
- Facts
- Dimensions