Enhancements to Microsoft Fabric Data Factory Copy Job: Truncate Destination, Query-Based Subsets, and Multi-Folder Support
Microsoft Fabric Blog details new Copy job capabilities in Fabric Data Factory, with features for truncating destinations, query-based data selection, and copying multiple folders in one operation.
Enhancements to Microsoft Fabric Data Factory Copy Job: Truncate Destination, Query-Based Subsets, and Multi-Folder Support
Overview
Microsoft Fabric Data Factory’s Copy job is a key feature for moving data across clouds, on-premises, or between services. It supports various delivery styles: bulk copy, incremental copy, and change data capture (CDC) replication, making data movement both flexible and streamlined.
Key Updates
Truncate Destination Before Full Copy
- Feature: Optionally truncate destination data before performing a full copy, ensuring full synchronization without duplicates.
- How it Works:
- The first incremental copy with truncation enabled deletes existing destination data, then loads data.
- Subsequent incremental copies append or merge records without removing existing ones.
- Resetting to full copy after incrementals triggers another destination truncation if enabled.
- Benefits: Prevents data duplication, delivers clean syncs, and can improve performance for large loads.
Full & Incremental Copy of Data Subsets With Database Queries
- Feature: Copy subsets of data for both full and incremental loads using custom SQL queries.
- Use Cases:
- Load data only for a specific region (using a region column) to support data compliance.
- Copy top N rows from a table (for testing or sampling).
- Current Support: Available now for Azure SQL Database, with more data connectors expected in future updates.
- Advantage: Offers more efficient and precise data ingestion based on custom selection logic.
Copy Multiple Folders in One Copy Job
- Feature: Select and copy multiple folders or a mix of folders and files in a single job.
- Advantage: Reduces the need for multiple Copy jobs, making development and operation simpler for data engineers.
Screenshots and Links
Summary
These enhancements make Microsoft Fabric Data Factory’s Copy job more robust for data ingestion, allowing for more customizable, efficient, and reliable workflows.
This post appeared first on “Microsoft Fabric Blog”. Read the entire article here