Distinct🔗
Pass on only the distinct (or unique) input records to the next component.
Properties🔗
Name = string
A human-readable name for the component.
Columns = dual listbox
Only these selected columns are passed to the next component. Duplicate records are removed, leaving only distinct values.
Note
On Redshift, since records may be located on many nodes, determining uniqueness can be expensive.
Strategy🔗
Generates a select distinct query.
| Snowflake | Delta Lake on Databricks | Amazon Redshift | Google BigQuery | Azure Synapse Analytics |
|---|---|---|---|---|
| ✅ | ✅ | ✅ | ✅ | ✅ |