

Legacy ETL Is the Hidden Constraint on AI Execution
TL;DR
AI isn’t failing because models or platforms fall short.
It’s failing because legacy ETL cannot support continuous, reliable execution at scale.
As enterprises move from analytics to AI-driven workflows, the constraint shifts from building systems to trusting them to run.
AI Data Automation is emerging as a new architectural layer, embedding pipeline logic directly into the data environment and eliminating external dependencies.
Why AI systems fail in execution, not development
Enterprises have invested heavily in AI: 77% of CEOs now say it will have the single most significant impact on their industry by 2028.
The platforms are in place. The mandate is clear.
But many organizations are beginning to face a harder question: not whether they can build AI, but whether they can operate it reliably enough to trust it with real business processes.
AI models are built. Pilots succeed.
And then progress slows, sometimes quietly, sometimes all at once.
Not because the models don’t work.
And not because the platforms aren’t capable.
Because the data layer underneath them, often built on legacy ETL pipelines, cannot sustain continuous execution.
The constraint isn’t new. The stakes are.
Most enterprise data environments were designed for analytics.
Pipelines run on schedules. Data moves in batches—often across separate systems that must extract, move, and rebuild data before it can be used.
When something breaks, an engineer investigates.
That model worked when workflows moved at human speed.
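
To make the contrast concrete, here is a deliberately simplified sketch of the analytics-era pattern in Python: a nightly batch job whose only recovery path is a human. The table names, schedule, and paging hook are illustrative assumptions, not a description of any particular stack.

    # A simplified sketch of an analytics-era batch pipeline.
    # Source/target names, the nightly schedule, and the "page an engineer"
    # failure path are illustrative assumptions, not any specific product.
    import logging
    from datetime import datetime, timezone

    logging.basicConfig(level=logging.INFO)
    log = logging.getLogger("nightly_orders_etl")

    def extract():
        """Pull yesterday's rows from the operational system (stubbed)."""
        return [{"order_id": 1, "amount": 42.0, "ts": datetime.now(timezone.utc)}]

    def transform(rows):
        """Apply fixed, hand-written mapping logic."""
        return [{"id": r["order_id"], "amount_usd": r["amount"]} for r in rows]

    def load(rows):
        """Write the batch into the warehouse (stubbed)."""
        log.info("loaded %d rows", len(rows))

    def run_nightly_batch():
        try:
            load(transform(extract()))
        except Exception:
            # The only recovery path is a human: log it, page someone, and
            # wait. Downstream consumers see stale data until it is fixed.
            log.exception("pipeline failed; paging on-call engineer")

    if __name__ == "__main__":
        run_nightly_batch()  # in practice triggered by a scheduler such as cron
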
AI changes the equation.
Now, systems depend on continuous data pipelines and reliable operational signals.
When those systems fail, the impact is immediate: models stop retraining, applications lose context, and decisions become unreliable.
In some cases, the failure is even more visible: an automated workflow halts mid-process because an upstream pipeline didn’t complete, or worse, completes with stale data no one realizes is wrong.
This is the same pattern many teams are now recognizing as the Velocity Gap, the growing distance between AI ambition and production reality.
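
A hedged illustration of the staleness failure described above: at minimum, a downstream workflow can check how old its inputs are before acting on them. The table name, one-hour threshold, and metadata lookup below are hypothetical, and in many legacy ETL environments no such guard exists at all.

    # A minimal, hypothetical freshness guard: the consuming workflow checks
    # when its input table was last successfully loaded before acting on it.
    # Table name, threshold, and metadata source are illustrative only.
    from datetime import datetime, timedelta, timezone

    MAX_STALENESS = timedelta(hours=1)  # assumed tolerance for this workflow

    def last_successful_load(table: str) -> datetime:
        """Stub: in a real system this would come from pipeline run metadata."""
        return datetime.now(timezone.utc) - timedelta(hours=3)

    def assert_fresh(table: str) -> None:
        age = datetime.now(timezone.utc) - last_successful_load(table)
        if age > MAX_STALENESS:
            # Failing loudly is the point: acting on stale data produces
            # confident, wrong answers downstream.
            raise RuntimeError(f"{table} is {age} old; refusing to run inference")

    if __name__ == "__main__":
        try:
            assert_fresh("customer_features")
        except RuntimeError as err:
            print(f"workflow blocked: {err}")
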
At its core, the issue isn’t a lack of tooling or investment.
It’s that the data layer required to support continuous execution was never designed for it.
As the stack moves up, the foundation matters more
The industry is moving beyond analytics.
New execution layers, from workflow engines and agentic platforms to emerging capabilities like Snowflake’s SnowWork, promise to automate business processes end to end.
The architects building these platforms are clear-eyed about it: autonomous execution agents are only as reliable as the data they operate on. A flawed upstream pipeline doesn’t just break a report; it generates a confident, wrong answer at machine speed.
But these systems operate on top of the data layer.
And in most enterprises, that layer is still governed by legacy ETL.
These platforms assume data is continuously available, governed, and production-ready.
In reality, it is often manually maintained, fragmented across tools, and dependent on human intervention to recover when something breaks.
Execution doesn’t scale; it becomes inconsistent.
And at that point, the risk isn’t delay. It’s that the system cannot be trusted to run.
A single pipeline failure in a production AI system can stall downstream inference across every workflow it feeds, often requiring hours of manual intervention, and it resets confidence for every business stakeholder watching the rollout.
The real gap is data readiness for AI
This is why so many AI initiatives fail to move beyond pilot.
Not because the models aren’t effective, but because the data required to sustain them cannot be delivered reliably, continuously, and at scale.
And even when organizations attempt to modernize, the challenge often persists: execution still depends on systems operating outside the core data environment, which introduces latency, fragmentation, and control gaps.
As AI systems move from analytics to execution, the limitations of analytics-era data architecture become harder to ignore.
A data layer that depends on external engines to move, transform, or repair data before it can be used cannot support continuous model retraining, real-time decisioning, or autonomous business workflows.
Until that changes, AI remains constrained, not by innovation, but by execution.
A new layer is emerging
This is why a new layer is taking shape: AI Data Automation.
Not as another tool in the stack, but as a fundamentally different operating model for how data work gets done.
The shift is away from human-managed pipelines and reactive maintenance toward continuous execution: pipelines are created, maintained, and governed automatically, with no external dependencies, and schema drift, quality issues, and optimization are handled autonomously.
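
To be concrete about what handling schema drift autonomously might look like, the sketch below shows one generic way a pipeline could absorb a renamed or newly added upstream column instead of halting for a human. It illustrates the operating model only; it is not Maia’s API or any vendor’s implementation, and the column names are invented.

    # A generic, hypothetical illustration of autonomous schema-drift handling.
    # Not Maia's API or any vendor's implementation; column names are invented.
    EXPECTED_COLUMNS = {"id", "amount_usd", "created_at"}

    def reconcile_schema(incoming_row: dict) -> dict:
        """Map an incoming record onto the expected schema, adapting to drift."""
        out = {}
        for col in EXPECTED_COLUMNS:
            # Missing columns are backfilled with nulls and flagged, rather
            # than stopping the whole pipeline mid-run.
            out[col] = incoming_row.get(col)
        extras = set(incoming_row) - EXPECTED_COLUMNS
        if extras:
            # Newly appeared (or renamed) upstream columns are surfaced for
            # automatic schema evolution instead of a manual ticket.
            print(f"schema drift detected, new columns: {sorted(extras)}")
        return out

    if __name__ == "__main__":
        # 'amount' was renamed upstream and 'channel' is new; the run continues.
        print(reconcile_schema({"id": 7, "amount": 12.5, "channel": "web"}))
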
Maia is where this shift becomes operational, removing the need for external systems to build, maintain, and repair pipelines, and embedding that logic directly into the data environment itself.
The goal isn’t faster development. It’s something more fundamental:
A data layer that can support continuous execution of AI systems, without depending on humans to keep it running.
Execution is the real measure of AI
AI doesn’t fail because the platforms aren’t capable.
It fails because the data layer cannot reliably support, or be trusted to sustain, the systems built on top of it.
Until that changes, every new layer of innovation will inherit the same constraint.
And the gap between ambition and outcome will continue to grow: the very definition of the Velocity Gap.
See how AI Data Automation changes the equation.
