The Difference between Duplicate and Reference in Power Query

A naukri.com initiative

New

The Differ...

Towards Data Science

371

Power Query offers Duplicate and Reference features for loading data twice with distinctions in functionality.
Duplicate copies M-Code to create a new table, while Reference creates a new table based on an existing one.
SQL Profiler and Power Query Diagnostics are tools used to analyze the behavior of these features.
When creating a Duplicate, the data is retrieved twice with separate connections, as shown in SQL Profiler.
Creating a Reference also results in the data being read twice, indicating no difference in load traffic compared to Duplicate.
The key difference is that Duplicate creates an independent new table, whereas Reference is based on the outcome of the referenced table.
Reference is suitable for extracting subsets without affecting the original table, while Duplicate is needed for operations like merging tables due to circular references.
Consider potential conflicts during data loading, especially from sources like Excel, where adjusting settings for parallel loading may be necessary.
In conclusion, there is no difference in load performance between Duplicate and Reference in Power Query as both load data independently.
Understanding the distinctions between these features is vital for efficient data loading and transformation processes in Power Query.

Read Full Article

22 Likes

For uninterrupted reading, download the app