• About us
    About us
    question mark
    Who we are

    Learn more about Mantu values, governance and offices.

    hexagon
    Our brands

    11 brands united by a shared vision.

    leave
    Sustainability

    Our strategy through diversity, environment and innovation.

    bookshelf
    Pressroom

    Breakthroughs, partnerships, and voices behind the transformation.

  • What we do
    What we do
    mantu
    PRACTICES

    Four practices designed to empower organizations, connect talent, and shape sustainable growth.

    cpu
    Technology

    Deep industry knowledge & cutting edge technology to co-create meaningful solutions.

    handshake
    Total Talent Management

    Tech to boost talent and create strong links between companies and the minds they need.

    digital qr
    Creative Intelligence

    Ensure continuity between decision, activation, and adoption. One team, one trajectory, through to lasting impact.

    medal
    Leadership & Advocacy

    Equip executive teams to define their purpose, shape their positioning and drive their strategy.

  • Insights
    Insights
    book open 4
    Blog

    Bold thinking. Fresh perspectives.

    book check
    Client Stories

    Where audacious ideas turn into real stories.

    mantu best managed companies award
    Mantu awarded one of Switzerland’s Best Managed Companies 2025 by Deloitte

    This award highlights the exceptional performance of privately held Swiss companies that demonstrate excellence in strategy, governance, innovation, and long-term results.

    Read more
    WeMeet 2025-2772 1 1
    Mantu signs the DEI Charter

    At the beginning of July 2025, Mantu’s Executive Committee signed the DEI Charter to foster diversity, equity, and inclusion at Mantu.

    Read more
  • Careers
    Careers
    binoculars
    Life at Mantu

    Mantu, as seen by its team members.

    building
    Find a company

    Mantu brings together complementary brands that cover many sectors, all around the world.

mantu blog thumbnail data quality management
June 26, 2026

Incomplete and inaccurate data: how to ensure data quality management ?

This article addresses the practical challenge of data quality: what it means to define it rigorously, how the cleaning process actually works, and how to build the governance infrastructure to maintain data reliability over time.

Poor data quality: the #1 silent driver of bad decisions


The problem with poor data quality is that it is often invisible at the point of decision. Analysts work with the data they have. Executives read dashboards without knowing what went into them. Decisions that feel data-driven are, in reality, built on incomplete or inaccurate inputs and the gap only becomes visible when outcomes disappoint and the post-mortem traces the failure back to the data.

The cost of poor data quality: what the research shows

  • IBM estimated that poor data quality costs the US economy $3.1 trillion per year.

  • Gartner research puts the average cost of poor data quality at $12.9 million per year per organization.

  • Data professionals report spending 60–80% of project time on data cleaning often because upstream quality was never governed.

The poor data quality cost is not just financial. It erodes trust in analytics as a function. When business leaders discover that a key insight was built on dirty data, the damage extends beyond that single decision it undermines confidence in data-driven approaches generally, and rebuilding that trust is harder than building the governance infrastructure that would have prevented the problem.

The six dimensions of data reliability


Data reliability is not a single property it is a composite of several dimensions, each of which can fail independently. Effective data quality management requires understanding and monitoring all of them.

Accuracy: Does the data correctly represent the real-world entity or event it describes?

Completeness: Are all required fields populated? Is anything missing that would affect analysis?

Consistency: Does the same entity appear the same way across different systems and datasets?

Timeliness: Is the data current enough for the decision it is meant to inform?

Validity: Does the data conform to the defined format, range, and business rules?

Uniqueness: Are there duplicate records that would distort aggregations or counts?

Analysis conducted without checking these dimensions produces results that may appear precise but are structurally unreliable. A conversion rate calculated on a dataset with 30% missing entries, for instance, tells you something, but not what you think it does. Identifying which quality dimensions are compromised, and by how much, is the first step of any serious data quality effort.

The data cleaning process: what it actually involves


The data cleaning process is the set of operations that transform a raw, imperfect dataset into one that is fit for analysis. It is the most labor-intensive phase of any analytical project, and the one most frequently underestimated in scoping.

What data cleaning is not

Data cleaning is not simply removing rows with missing values. That approach while sometimes appropriate destroys information and can introduce bias if the missingness is not random. Effective cleaning requires understanding why data is missing or incorrect before deciding how to handle it, which demands both technical skill and domain knowledge.

The core operations

1- Profiling understand before fixing

Before any transformation, assess the actual state of the data: distribution of values, rate of nulls by field, identification of outliers, and consistency across sources. Profiling surfaces the problems; cleaning addresses them.

2- Deduplication

Identifying and resolving duplicate records same customer with two IDs, same transaction recorded twice is critical for any analysis involving counts, sums, or rates. Deduplication rules must reflect business logic, not just technical matching.

3- Standardization

Ensuring consistent formats across the dataset: date formats, country codes, product categories, currency units. Inconsistency in these fields silently breaks aggregations and joins between tables.

4- Missing value treatment

Depending on the pattern and volume of missingness: imputation (filling with mean, median, or a model-derived estimate), flagging (keeping the record but noting the gap), or exclusion (with documented justification). Each choice has analytical consequences that must be understood and communicated.

5- Validation against business rules

Checking that values conform to known constraints a contract end date cannot precede the start date, a discount percentage cannot exceed 100, a customer age cannot be negative. These rule-based checks catch input errors that profiling alone would miss.

Data quality analytics: measuring what you cannot manage blind


Data quality analytics is the practice of applying measurement to data quality itself treating it as a monitored dimension of system health rather than something assessed once during a project. It answers the question: how good is our data, systematically and continuously?

Data quality KPIs worth tracking

Effective data quality monitoring is built around metrics tied to the six dimensions outlined above. In practice, the most actionable include: completeness rate by field and source, duplicate rate by entity type, format validity rate by attribute, and freshness lag the time between data generation and availability in the analytical layer. These metrics should be visible, owned, and reviewed on a regular cadence not examined only when something breaks.

Automated quality checks in data pipelines

At scale, manual quality review is not viable. Modern data engineering embeds quality checks directly into data pipelines: validation rules that run on every ingestion, alerting when quality thresholds are breached, and quarantine mechanisms that prevent bad data from flowing into production datasets. This shift from reactive data cleaning to proactive quality governance is one of the most consequential maturity improvements an analytics function can make.

Data quality management as an ongoing discipline


The most important reframe in data quality management is treating it as an ongoing organizational capability rather than a project to complete. Data quality degrades naturally over time: systems change, processes evolve, new sources are integrated, and the business context that defines what "correct" means shifts. A dataset that was fit for purpose eighteen months ago may not be today.

Sustainable data quality requires three things working together: clear ownership (who is responsible for each data domain's quality), defined standards (what "good" looks like for each attribute), and continuous monitoring (automated checks that surface degradation before it affects decisions). Organizations that invest in building this infrastructure rather than treating quality as a one-time cleaning exercise accumulate a structural advantage: their analytical outputs can be trusted, and they know it.

Building that infrastructure is precisely the kind of work Mantu's data analytics consulting practice supports from quality assessment and remediation through to the governance design that prevents the problem from recurring.