Secoda Docs
Get Started
  • Getting Started with Secoda
    • Secoda as an Admin
      • Deployment options
      • Sign in options
      • Settings
      • Connect your data
        • Define Service Accounts
        • Choose which schemas to extract
      • Customize the workspace
      • Populate Questions with FAQs
      • Invite your teammates
        • Joining & Navigating between Multiple Workspaces
      • Onboard new users
        • Onboarding email templates
        • Onboarding Homepage template
        • Training session guide
      • User engagement and adoption
        • Tips & Tricks to share with new users
    • Secoda as an Editor
    • Secoda as a Viewer
      • Introduction guide
      • Requesting changes in Secoda
  • Best practices
    • Setting up your workspace
    • Integrating Secoda into existing workflows
    • Documentation best practices
    • Glossary best practices
    • Data governance
    • Data quality
    • Clean up your data
    • Tool migrations using Secoda
    • Slack <> Questions workflow
    • Defining resources workflow
    • Streamline data access: Private and public teams workflow
    • Exposing Secoda to external clients
  • Resource Management
    • Editing Properties
      • AI Description Editor
      • Bulk Editing
      • Propagation
      • Templates
    • Resource Sidesheet
    • Assigning Owners
    • Custom Properties
    • Tags
      • Custom Tags
      • PII Identifier
      • Verified Identifier
    • Import and Export Resources
    • Related Resources
  • User Management
    • Roles
    • Teams
    • Groups
  • Integrations
    • Integration Settings
    • Data Warehouses
      • BigQuery
        • BigQuery Metadata Extracted
      • Databricks
        • Databricks Metadata Extracted
      • Redshift
        • Redshift Metadata Extracted
      • Snowflake
        • Snowflake Metadata Extracted
        • Snowflake Costs
        • Snowflake Native App
      • Apache Hive
        • Apache Hive Metadata Extracted
      • Azure Synapse
        • Azure Synapse Metadata Extracted
      • MotherDuck
        • MotherDuck Metadata Extracted
      • ClickHouse
        • ClickHouse Metadata Extracted
    • Databases
      • Druid
        • Druid Metadata Extracted
      • MySQL
        • MySQL Metadata Extracted
      • Microsoft SQL Server
        • Page
        • Microsoft SQL Server Metadata Extracted
      • Oracle
        • Oracle Metadata Extracted
      • Salesforce
        • Salesforce Metadata Extracted
      • Postgres
        • Postgres Metadata Extracted
      • MongoDB
        • MongoDB Metadata Extracted
      • Azure Cosmos DB
        • Azure Cosmos DB Metadata Extracted
      • SingleStore
        • SingleStore Metadata Extracted
      • DynamoDB
        • DynamoDB Metadata Extracted
    • Data Visualization Tools
      • Amplitude
        • Amplitude Metadata Extracted
      • Looker
        • Looker Metadata Extracted
      • Looker Studio
        • Looker Studio Metadata Extracted
      • Metabase
        • Metabase Metadata Extracted
      • Mixpanel
        • Mixpanel Metadata Extracted
      • Mode
        • Mode Metadata Extracted
      • Power BI
        • Power BI Metadata Extracted
      • QuickSight
        • QuickSight Metadata Extracted
      • Retool
        • Retool Metadata Extracted
      • Redash
        • Redash Metadata Extracted
      • Sigma
        • Sigma Metadata Extracted
      • Tableau
        • Tableau Metadata Extracted
      • ThoughtSpot
        • ThoughtSpot Metadata Extracted
      • Cluvio
        • Cluvio Metadata Extracted
      • Hashboard
        • Hashboard Metadata Extracted
      • Lightdash
        • Lightdash Metadata Extracted
      • Preset
        • Preset Metadata Extracted
      • Superset
        • Superset Metadata Extracted
      • SQL Server Reporting Services
        • SQL Server Reporting Services Metadata Extracted
      • Hex
        • Hex Metadata Extracted
      • Omni
        • Omni Metadata Extracted
    • Data Pipeline Tools
      • Census
        • Census Metadata Extracted
      • Stitch
        • Stitch Metadata Extracted
      • Airflow
        • Airflow Metadata Extracted
      • Dagster
        • Dagster Metadata Extracted
      • Fivetran
        • Fivetran Metadata Extracted
      • Glue
        • Glue Metadata Extracted
      • Hightouch
        • Hightouch Metadata Extracted
      • Apache Kafka
        • Apache Kafka Metadata Extracted
      • Confluent Cloud
        • Confluent Cloud Metadata Extracted
      • Polytomic
        • Polytomic Metadata Extracted
      • Matillion
        • Matillion Metadata Extracted
      • Airbyte
        • Airbyte Extracted Metadata
      • Informatica
        • Informatica Metadata Extracted
      • Azure Data Factory
        • Azure Data Factory Metadata Extracted
    • Data Transformation Tools
      • dbt
        • dbt Cloud
          • dbt Cloud Metadata Extracted
        • dbt Core
          • dbt Core Metadata Extracted
      • Coalesce
        • Coalesce Metadata Extracted
    • Data Quality Tools
      • Cyera
      • Dataplex
        • Dataplex Metadata Extracted
      • Great Expectations
        • Great Expectations Metadata Extracted
      • Monte Carlo
        • Monte Carlo Metadata Extracted
    • Data Lakes
      • Google Cloud Storage
        • GCS Metadata Extracted
      • AWS S3
        • S3 Metadata Extracted
    • Query Engines
      • Trino
        • Trino Metadata Extracted
    • Custom Integrations
      • File Upload
        • CSV File Format
        • JSONL File Format
        • Maintain your Resources
      • Marketplace
        • Secoda SDK
        • Upload and Connect your Marketplace Integration
        • Publish the Integration
        • Example Integrations
      • Secoda Fields Explained
    • Security
      • Connecting via Reverse SSH Tunnel
      • Connecting via SSH Tunnel
      • Connecting via VPC Peering
      • Connecting via AWS Cross Account Role
      • Connecting via AWS PrivateLink
        • Snowflake via AWS PrivateLink
        • AWS Service via AWS PrivateLink
      • Recommendations to Improve SSH Tunnel Concurrency on SSH Bastion
    • Push Metadata to Source
  • Extensions
    • Chrome
    • Confluence
      • Confluence Metadata Extracted
      • Confluence best practices
    • Git
    • GitHub
    • Jira
      • Jira Metadata Extracted
    • Linear
    • Microsoft Teams
    • PagerDuty
    • Slack
      • Slack user guide
  • Features
    • Access Requests
    • Activity Log
    • Analytics
    • Announcements
    • Audit Log
    • Automations
      • Automations Use Cases
    • Archive
    • Bookmarks
    • Catalog
    • Collections
    • Column Profiling
    • Data Previews
    • Data Quality Score
    • Documents
      • Comments
      • Embeddings
    • Filters
    • Glossary
    • Homepage
    • Inbox
    • Lineage
      • Manual Lineage
    • Metrics
    • Monitors
      • Monitoring Use Cases
    • Notifications
    • Policies
    • Popularity
    • Publishing
    • Queries
      • Query Blocks
        • Chart Blocks
      • Extracted Queries
    • Questions
    • Search
    • Secoda AI
      • Secoda AI User Guide
      • Secoda AI Use Cases
      • Secoda AI Security FAQs
      • Prompts
    • Sharing
    • Views
  • Enterprise
    • SAML
      • Okta SAML
      • OneLogin SAML
      • Microsoft Azure AD SAML
      • Google SAML
      • SCIM
      • SAML Attributes
    • Self-Hosted
      • Additional Resources
        • Additional Environment Variables
          • PowerBI OAuth Application (on-premise)
          • Google OAuth Application (on-premise)
          • Github Application (on-premise)
          • OpenAI API Key Creation (on-premise)
          • AWS Bucket with Access Keys (on-premise)
        • TLS/SSL (Docker compose)
        • Automatic Updates (Docker compose)
        • Backups (Docker compose)
        • Outbound Connections
      • Self-Hosted Changelog
    • SIEM
      • Google Chronicle
  • API
    • Get Started
    • Authentication
    • Example Workflows
    • API Reference
      • Getting Started
      • Helpful Information
      • Audit Logs
      • Charts
      • Collections
      • Columns
      • Custom Properties
      • Dashboards
      • Databases
      • Documents
      • Events
      • Groups
      • Integrations
      • Lineage
      • Monitors
      • Resources
      • Schemas
      • Tables
      • Tags
      • Teams
      • Users
      • Questions
      • Queries
  • FAQ
  • Policies
    • Terms of Use
    • Secoda AI Terms
    • Master Subscription Agreement
    • Privacy Policy
    • Security Policy
    • Accessibility Statement
    • Data Processing Agreement
    • Subprocessors
    • Service Level Agreement
    • Bug Bounty Program
  • System Status
  • Changelog
Powered by GitBook
On this page
  • Benefits of DQS
  • Enabling DQS
  • Accessing and Using DQS
  • Table-Level Scores
  • Tracking Changes with Analytics Reports
  • Search and Catalog Filters
  • Automations for DQS
  • Adding a DQS Homepage Widget
  • Scoring Categories
  • Feedback

Was this helpful?

  1. Features

Data Quality Score

Enhance your data quality with Secoda's DQS: A systematic tool to measure, track, and improve data quality.

Last updated 5 months ago

Was this helpful?

The Data Quality Score feature is available to customers on plans. Request access by clicking "Upgrade" under Data Quality Score in your workspace or by contacting Customer Support (support@secoda.co).

The Data Quality Score (DQS) in Secoda offers a comprehensive scoring system that evaluates your data's quality. Focused initially on tables, DQS not only helps teams understand their data quality but also provides actionable steps to improve their scores over time. It aggregates scores across several categories to deliver a comprehensive total out of 100%.

Benefits of DQS

  • Data Producers: Offers actionable steps and clear metrics to measure data quality, helping to address and reduce technical debt.

  • Data Consumers: Improves data discoverability and trustworthiness with transparent quality indicators.

  • Governance: Supplies a measurable metric to guide data governance investments, promoting a high standard of data quality.

Access to DQS motivates both data producers and consumers to uphold high data quality standards, fostering a culture of continuous improvement and accountability within your organization.

Enabling DQS

By default, DQS is enabled in all workspaces. Admins can disable it in the Quality Score settings.

Consider adjusting the Resource filter rules to selectively include or exclude specific resources from the DQS calculations. For instance, you might exclude resources from your 'dev' database, which are often incomplete and could skew the overall quality score.

DQS Settings

Accessing and Using DQS

Table-Level Scores

  1. Navigate to a Table: Go to the resource page of any table in your workspace.

  2. Open the Quality Tab: Find the 'Quality' tab where the DQS is displayed.

  3. View Score Details: Check the overall score and explore detailed breakdowns for each category.

  4. Improve Scores: Click on specific categories to see actionable steps to enhance scores.

Tracking Changes with Analytics Reports

  1. Open the Analytics Dashboard: Head to the Analytics section of Secoda.

  2. Create a Quality Score Report: Use the "Add report" button to set up a new report focused on DQS.

  3. Customize Your View: Apply filters, such as by Team, to pinpoint areas needing data quality improvements.

Search and Catalog Filters

  • Is not set: No score has been assigned.

  • Good: Represents a high-quality score.

  • Moderate: Indicates average quality.

  • Poor: Shows a low-quality score.

These filters help you quickly identify and prioritize areas for improvement directly from your search results or catalog listings.

Automations for DQS

Create an Automation to automatically apply actions based on the DQS of each table. For instance, when a table achieves a "Good" quality score, the system can automatically mark it as "Published" and "Verified." This helps users quickly identify which tables are confirmed as reliable and ready for business use.

Adding a DQS Homepage Widget

  1. Customize Your Homepage: Navigate to your personal or team-level homepage settings.

  2. Add the DQS Widget: Select the DQS widget to add it to your homepage.

  3. Access DQS Instantly: Enjoy quick and easy access to your data quality scores directly from the homepage.

Scoring Categories

DQS consists of four main dimensions, each reflecting essential aspects of data quality:

1. Stewardship (25 Points)

  • Ownership (10 Points): Full points if the table has an owner; otherwise, it scores 0.

  • Tagged (10 Points): Full points if the table has been tagged (e.g., PII, Verified); no tags scores 0.

  • Questions & Answers (5 Points): Points awarded based on the percentage of Questions on the table with answers. If no questions have been asked, full points are awarded.

2. Usability (25 Points)

  • Resource Description (10 Points): Full points if the table has a description; no description scores 0.

  • Column Descriptions (10 Points): Points awarded based on the percentage of columns with descriptions.

  • Schema Description (5 Points): Full points if the table schema has a description; no description scores 0.

3. Freshness (20 Points)

  • Freshness Monitor Set (10 Points): Full points awarded if a freshness monitor is set; otherwise, it scores 0.

  • Freshness Passing (10 Points): Full points if the data freshness is within the expected period; otherwise, it scores 0.

4. Accuracy (30 Points)

The Accuracy score integrates results from Secoda Monitors or 3rd party tests like dbt, Monte Carlo and Great Expectations.

  • Nullness (10 Points):

    • 5 points awarded if a nullness test is configured for the table; otherwise, it scores 0.

    • Points awarded based on the percentage of passing nullness tests.

  • Uniqueness (10 Points):

    • 5 points awarded if a uniqueness test is configured for the table; otherwise, it scores 0.

    • Points awarded based on the percentage of passing uniqueness tests.

  • Other Tests (10 Points): Additional points based on the percentage of other tests passing.

Feedback

Table DQS
DQS in Analytics

In the Search and Catalog views, you can resources based on their Data Quality Score. The available quality filters are:

Filtering on Quality
DQS Automation
DQS on the Homepage

We welcome your suggestions for enhancing DQS. Please contribute your ideas on our to help us evolve this tool to better meet your needs.

filter
feedback board
Premium and Enterprise