Published
May 7, 2026

Most Data Analysts Are Doing Undocumented Engineering Work

By
Sarah Li
Founders Associate

The Data Analyst Role Has Quietly Changed

Most companies think they have hired a data analyst. What they actually hired was a human patch layer between broken systems. The modern “data analyst” in many organisations is no longer primarily analysing data. They are:

  • Reverse engineering undocumented schemas
  • Mapping inconsistent fields between systems
  • Interpreting ambiguous business definitions
  • Cleaning broken source data
  • Chasing SMEs for tribal knowledge
  • Validating transformation logic
  • Reconciling mismatched records
  • Building temporary logic that becomes permanent infrastructure

That is not analysis. That is undocumented engineering work. And it is one of the biggest hidden inefficiencies in modern data organisations.

Why This Happens

Most businesses underestimate how much operational knowledge sits outside formal systems Documentation is outdated. Source systems evolved organically. Business logic changed over years of projects, acquisitions, vendor migrations, and tactical fixes. So when a new data project begins, someone has to bridge the gaps.

That person is usually called a “data analyst.”

But the actual job often looks more like: “Figure out how these systems really work without anyone being able to fully explain them.”

This becomes especially common in:

  • Investment management
  • Banking
  • Insurance
  • Asset servicing
  • Enterprise ERP environments
  • Large-scale cloud migrations
  • Regulatory transformation programmes

The more legacy systems involved, the worse the problem becomes.

The Real Job of Many Data Analysts

A huge percentage of project-based data analysts spend their time on activities like:

1. Data Mapping

Understanding how:

  • Client_ID
  • CustomerNumber
  • AccountRef
  • InvestorCode

all represent the same conceptual entity across systems. This is semantic reconciliation.

2. Business Logic Discovery

Analysts constantly uncover undocumented rules such as:

  • “This field is only populated for EMEA clients”
  • “That status code was deprecated in 2019”
  • “This table excludes dormant accounts”
  • “That feed breaks on month-end processing”

These rules are often:

  • undocumented
  • inconsistent
  • hidden inside SQL scripts
  • known only by one SME

The analyst becomes the operational memory of the project.

3. Translation Between Business and Engineering

Analysts frequently act as:

  • interpreter
  • negotiator
  • validator
  • process designer

Engineering teams need precision. Business teams communicate intent. The analyst bridges the ambiguity gap.

4. Manual Data Quality Operations

A surprising amount of analyst time goes into:

  • identifying bad records
  • tracing lineage issues
  • validating transformations
  • reconciling outputs
  • manually investigating anomalies

Again, this resembles operational engineering more than traditional analysis.

Why This Is Dangerous

1. Knowledge Becomes Trapped in People

Most project knowledge never becomes reusable organisational knowledge.

It lives inside:

  • spreadsheets
  • meeting notes
  • Jira tickets
  • Slack messages
  • analyst memory

When analysts leave, the company loses critical implementation intelligence.

2. Every New Project Starts From Scratch

Teams repeatedly rediscover:

  • source system meanings
  • mappings
  • transformation logic
  • historical exceptions
  • validation rules

This creates enormous duplication across programmes.

3. Analysts Become Bottlenecks

The organisation quietly becomes dependent on a handful of individuals who understand:

  • how systems actually behave
  • where data breaks
  • which logic can be trusted

This creates operational fragility.

4. Delivery Slows Down Dramatically

Engineering delivery gets blamed. But often the real delay is upstream ambiguity. Engineers cannot automate what nobody has clearly defined.

The Industry Has Misclassified the Problem

Most companies think they have:

  • a reporting problem
  • a dashboard problem
  • a BI tooling problem

In reality, many have:

  • a knowledge capture problem
  • a semantic alignment problem
  • a data implementation problem

The issue is not lack of access to data. The issue is undocumented operational logic.

AI Will Change This — But Not In The Way People Think

There is constant discussion about whether AI will replace data analysts. That is the wrong question.

The more important question is: "Which parts of undocumented engineering work can be systemised?" Because most analyst effort today is not “analysis.”

It is:

  • context reconstruction
  • schema interpretation
  • rule extraction
  • mapping validation
  • implementation coordination

These are highly repetitive knowledge problems. And that makes them increasingly automatable.

Why DataSync Exists

Most data tooling focuses on moving data. Very little focuses on understanding what the data actually means. That is the gap DataSync is designed to solve.

DataSync helps organisations reduce the amount of undocumented engineering work hidden inside data projects by using AI agents to:

  • accelerate data mapping
  • identify semantic similarities between systems
  • surface undocumented relationships
  • capture implementation logic
  • reduce repetitive manual reconciliation
  • create reusable organisational knowledge

Instead of analysts repeatedly rebuilding understanding from scratch, DataSync helps teams preserve and scale that knowledge across projects. This matters because most delivery delays are not caused by writing pipelines.

They are caused by:

  • unclear definitions
  • inconsistent mappings
  • tribal knowledge
  • SME dependency
  • undocumented operational rules

The companies that solve this layer will deliver data programmes dramatically faster than competitors still relying on spreadsheets, meetings, and analyst memory.

The Future Data Analyst Looks Different

The highest-value analysts will not be the people manually maintaining spreadsheets of mappings.

They will be the people who:

  • structure business meaning
  • validate operational intent
  • define decision logic
  • govern data interpretation
  • supervise AI-assisted implementation workflows

The role shifts from:

  • manual reconciliation to
  • knowledge orchestration

Final Thought

The uncomfortable reality is this:

Many “data analysts” are functioning as undocumented systems engineers without the tooling, structure, or recognition to support the work properly. Companies that understand this early will build faster, scale better, and become dramatically less dependent on tribal knowledge. Companies that ignore it will continue paying highly skilled people to rediscover the same operational logic project after project. And that is exactly the inefficiency AI-native data delivery platforms like DataSync are being built to eliminate.