Skip to content

Distinct🔗

Pass on only the distinct (or unique) input records to the next component.

Properties🔗

Name = string

A human-readable name for the component.


Columns = dual listbox

Only these selected columns are passed to the next component. Duplicate records are removed, leaving only distinct values.

Note

On Redshift, since records may be located on many nodes, determining uniqueness can be expensive.


Strategy🔗

Generates a select distinct query.


Snowflake Delta Lake on Databricks Amazon Redshift Google BigQuery Azure Synapse Analytics
✅ ✅ ✅ ✅ ✅