LogoLogo
  • Datafold
  • Introduction
    • Data Diff
      • Continuous Integration
      • Manual Data Diff
      • Diff Results
    • Column-level lineage
      • Usage, popularity, & impact per table or column
    • Alerting
  • ⏱️Quickstart Guide
  • Getting Started
    • Data Warehouses
      • Snowflake
      • BigQuery
      • Redshift
      • Postgres
      • Databricks
    • Configuration
      • Indexing
      • Filtering
      • Profiling
      • Lineage
    • On-prem Deployment
      • AWS
      • GCP
    • SSO
      • Okta
      • Google OAuth
      • SAML
  • Integrations
    • Continuous Integration
      • Source Control with Git
        • GitHub
          • On-prem Github
        • GitLab
      • dbt Cloud
      • dbt Core / datafold-sdk
        • GitHub example
        • GitLab example
      • dbt Configurations
      • datafold-sdk
    • Alert Integrations
      • Slack integration
        • Slack Alerts
        • On-prem Slack Integration
      • Alerting webhooks
    • Data Apps
      • Mode
      • Hightouch
  • Developer
    • Datafold API
      • Alerting
      • GraphQL Metadata API
      • Data Diff
      • Error handling
    • Security
      • GDPR
      • Network Security
Powered by GitBook
On this page
  • Overview
  • Schemas
  • Primary Keys
  • Column Profiles
  • Values
  • Timelines

Was this helpful?

  1. Introduction
  2. Data Diff

Diff Results

Get familiar with how Data Diff results appear in the Datafold UI

PreviousManual Data DiffNextColumn-level lineage

Last updated 2 years ago

Was this helpful?

The Datafold UI allows for manual checks on the differences between two tables. We'll review what information is available in the Datafold platform and how you can use this to better inform your data changes.

Overview

The Overview tab shows a high-level report on the differences between the tables.

Schemas

The schemas tab highlights renamed, added, or removed columns:

Primary Keys

Primary Keys tab shows samples of the data meeting the following conditions for primary keys:

  1. Are there any PKs that exist in Table A but are missing from Table B and vice versa

  2. Are there any duplicate PKs in either table?

Column Profiles

Datafold profiles every column for both tables and plots value distributions side-by-side. Using the profiles, you can identify the sources of the differences faster:

Values

Shows values side-by-side for every primary key. When values diverge, they are highlighted. You can select which columns are shown in what order by clicking on the gear menu on the left:

Timelines

Plots the percentage of rows that are different between the two tables for every column. It can be helpful in identifying cases where the data is broken for particular partitions and for debugging time-based filters: