In Prequel Import, a Dataset is a materialized cache of data representing a connected Source table or custom SQL query. Datasets are maintained in the Datalake.

Types of Datasets

For efficiency, Prequel Import requires each Dataset to be assigned a type: Dimension or Fact.
  • Dimension: A table containing rows that may be updated over time (e.g., users, products, accounts). Prequel Import tracks changes and syncs updates to existing records.
  • Fact: An append-only table containing immutable events or transactions (e.g., events, page views, logs). New records are added, but existing records are never modified. Prequel Import therefore assumes that records already delivered to the Destination do not need to be monitored for changes.
To understand how Change Detection works for each Dataset type, see the Change Detection guide.
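
The difference between the two types can be illustrated with a toy model. This is only a sketch of the semantics described above, not Prequel Import's actual implementation: the function `apply_batch`, the in-memory `cache` dictionary, and the returned counters are all hypothetical names introduced for illustration.

```python
from typing import Any

Row = dict[str, Any]


def apply_batch(
    cache: dict[Any, Row],
    batch: list[Row],
    dataset_type: str,
    primary_key: str,
) -> dict[str, int]:
    """Apply a batch of source rows to an in-memory cache (hypothetical helper).

    dimension: rows may change, so an existing key is overwritten when the
               incoming row differs.
    fact:      rows are immutable, so an existing key is left untouched.
    """
    if dataset_type not in ("dimension", "fact"):
        raise ValueError(f"unsupported dataset type: {dataset_type!r}")

    counts = {"inserted": 0, "updated": 0, "skipped": 0}
    for row in batch:
        key = row[primary_key]
        if key not in cache:
            # New record: both types insert it.
            cache[key] = dict(row)
            counts["inserted"] += 1
        elif dataset_type == "dimension" and cache[key] != row:
            # Only dimension Datasets pick up changes to existing records.
            cache[key] = dict(row)
            counts["updated"] += 1
        else:
            # Unchanged dimension row, or any already-seen fact row.
            counts["skipped"] += 1
    return counts


# Usage: the same changed row is updated for a dimension but ignored for a fact.
users: dict[Any, Row] = {}
apply_batch(users, [{"id": 1, "plan": "free"}], "dimension", "id")
apply_batch(users, [{"id": 1, "plan": "pro"}], "dimension", "id")

events: dict[Any, Row] = {}
apply_batch(events, [{"id": 1, "name": "page_view"}], "fact", "id")
apply_batch(events, [{"id": 1, "name": "edited"}], "fact", "id")
```

In this sketch the fact path never compares row contents, which mirrors why an append-only type is cheaper to maintain: records that were already delivered are simply not revisited.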

Dataset configuration options

Field | Type | Required | Description
----- | ---- | -------- | -----------
source_id | uuid | Yes | The ID of the Source this Dataset is connected to.
source_object_name | string | Yes | The fully qualified name of the table or object in the Source (e.g., schema.table_name).
cache_table_name | string | Yes | The name of the table in the Datalake where this Dataset is cached.
type | string | Yes | The type of Dataset. Supported values: dimension, fact.
primary_key_column | string | Yes | The column used to uniquely identify records in this Dataset.
last_modified_column | string | No | The column used to track when records were last modified. Used for incremental sync. Set to null for full sync.
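
As a rough illustration of how these options fit together, the sketch below models a Dataset configuration in Python. The field names come from the options listed above; the `DatasetConfig` class, its validation rules, the `sync_mode` property, and the example values (`public.users`, `users_cache`, the all-zero UUID) are assumptions made for this example and are not part of the Prequel Import API.

```python
from dataclasses import dataclass
from typing import Optional
from uuid import UUID

SUPPORTED_TYPES = ("dimension", "fact")


@dataclass(frozen=True)
class DatasetConfig:
    """Hypothetical local model of the Dataset configuration options."""

    source_id: UUID
    source_object_name: str
    cache_table_name: str
    type: str
    primary_key_column: str
    last_modified_column: Optional[str] = None  # None (null) means full sync

    def __post_init__(self) -> None:
        if self.type not in SUPPORTED_TYPES:
            raise ValueError(
                f"type must be one of {SUPPORTED_TYPES}, got {self.type!r}"
            )
        # Required string fields must be non-empty.
        for name in ("source_object_name", "cache_table_name", "primary_key_column"):
            if not getattr(self, name):
                raise ValueError(f"{name} is required")

    @property
    def sync_mode(self) -> str:
        # Incremental sync needs a last-modified column; otherwise full sync.
        return "incremental" if self.last_modified_column else "full"


# Usage: a dimension Dataset synced incrementally on a last-modified column.
config = DatasetConfig(
    source_id=UUID("00000000-0000-0000-0000-000000000000"),  # placeholder ID
    source_object_name="public.users",
    cache_table_name="users_cache",
    type="dimension",
    primary_key_column="id",
    last_modified_column="updated_at",
)
print(config.sync_mode)
```

Leaving `last_modified_column` unset in this model corresponds to setting it to null in the configuration, which selects a full sync.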