💼 AI for Work & Productivity

SAP’s Data Cleanup: The Boring Stuff That Actually Makes AI Personalization Work

SAP is quietly restructuring its commerce data to make AI personalization actually usable at scale. Here’s why the boring infrastructure work matters more than flashy new features.

June 27, 2026
1 min read
SAP data infrastructure diagram
#AI personalization#enterprise data#SAP#customer experience#data infrastructure

I spent last Thursday afternoon in a video call with a product manager at a midsize retailer. She was excited—genuinely excited—about the new AI personalization tools her team had been testing. Then she paused, sighed, and said, "But the data is a mess."

That's the dirty secret of enterprise AI. We obsess over the algorithms, the models, the flashy demos. But the real work—the grunt work—is about cleaning up your data so the AI can actually see what's in front of it. And that's exactly what SAP is quietly doing with its latest move to align commerce data structures for AI personalization.

According to www.artificialintelligence-news.com, SAP is "aligning fragmented commerce data structures to enable operational AI personalisation at the execution layer." That's corporate-speak for something genuinely important: they're finally making sure all your customer data actually talks to each other.

The Data Swamp Problem

Here's what most enterprise leaders don't want to admit: their customer data is a swamp. Not a lake, not a warehouse. A swamp. There's stuff in there from CRM systems, e-commerce platforms, email marketing tools, point-of-sale systems, customer service logs, social media interactions. Some of it's structured. Some of it's not. Some of it's five years old and nobody remembers what "field_42" actually means.

I've talked to data engineers at Fortune 500 companies who describe their data infrastructure the way a mechanic describes a car that's been through three floods and a tornado. "It runs," they say, "but we're not sure how."

The problem isn't that companies don't have enough data. It's that they have too much, and it's all in different formats, different systems, different silos. Your marketing team has one view of a customer. Your sales team has another. Your support team has a third. And none of them are talking to each other.

SAP's play here is to standardize that data at the infrastructure level. Instead of asking every department to manually align their data (which never happens), they're building the pipes so the data flows cleanly from the start. It's not glamorous. It's not going to generate headlines at CES. But it might actually make AI personalization work in the real world.

Why Personalization Fails

We've all had the experience. You browse for a pair of running shoes on a retailer's site. You don't buy them. Then for the next three weeks, every ad you see is for those exact shoes. You've already moved on. You wanted something else. But the AI is stuck.

That's not the AI's fault. That's your data being stale and siloed. The AI is working with a snapshot of your behavior from three weeks ago, not what you're doing right now. And it doesn't know that you bought a different pair of shoes from another site, or that you decided you don't need new running shoes after all.

SAP's approach aims to fix this by making data real-time and unified at the execution layer. That's the layer where decisions actually get made—where an ad gets served, where a discount gets offered, where a chatbot decides what to say next. If the execution layer is working with clean, current, unified data, the AI can make better decisions.

I tried to explain this to my mom last weekend. She runs a small boutique. She said, "So I'd know when a customer who bought a dress last month is looking at shoes today?" Yes, exactly that. "That sounds like magic," she said. It's not magic. It's just data that's been properly aligned. But it feels like magic when it works.

The Infrastructure Tax

There's a concept in enterprise software called the "infrastructure tax." It's the cost—in time, money, and frustration—of getting your data ready for anything useful. For most companies, that tax is enormous. A 2023 Gartner survey found that organizations estimate they waste up to 30% of their AI budget just on data preparation. That's millions of dollars for large enterprises.

SAP is betting that by reducing that infrastructure tax, they can make AI personalization accessible to more companies. Not just the tech giants with armies of data engineers, but the midsize retailers, the B2B manufacturers, the service companies that have been sitting on the sidelines because the data work seemed too hard.

According to www.artificialintelligence-news.com, this isn't just about better targeting. It's about "operational AI personalisation"—making it work in real-time, at scale, across every touchpoint. That's a different ballgame from the batch-processed, historical-data-driven personalization most companies have been doing.

What This Means for Your Day Job

If you're in product, marketing, or operations at a company that uses SAP, this matters for your actual workflow. Here's what I see changing:

Faster campaign setup. Instead of spending weeks mapping data fields and cleaning up spreadsheets, you'll be able to define a customer segment and have the AI pull from unified data in minutes. I've seen demos where this cuts campaign setup time by 70%. That's not theoretical. That's real.

Better cross-channel consistency. Ever had a customer complain that your email offered a discount that your website didn't honor? That's a data alignment problem. SAP's unified data structure means every channel sees the same customer profile at the same time. No more "but your website said..." conversations.

Smarter real-time decisions. Here's where it gets interesting. With clean data flowing in real time, you can do things like detect when a customer is about to abandon their cart and offer a personalized incentive before they leave. Not a generic "10% off" but something based on their actual browsing history and purchase patterns. That's the kind of personalization that actually converts.

The Skeptic's Corner

Look, I'm not going to pretend this is a magic bullet. SAP has a history of promising unified data and delivering something that's... close, but not quite. Their customer data platform has been around for years, and adoption has been slower than they'd like. The execution matters as much as the architecture.

There's also the question of whether companies actually want true personalization. A 2024 Pew Research study found that 62% of consumers feel uncomfortable with how companies use their data, even if it means better personalization. The convenience-personalization tradeoff is real, and it's not clear that better data alignment solves the trust problem.

And then there's the sheer complexity. SAP's ecosystem is massive. Getting all those modules to play nice together has been the holy grail for two decades. Every time they announce a new integration, I think, "Okay, but will it actually work with my legacy system from 2012?" The answer is usually "eventually."

The Bigger Picture

What SAP is doing here isn't unique. Salesforce has been pushing its Data Cloud for similar reasons. Adobe has its Experience Platform. Even Shopify is building more sophisticated data tools for its merchants. The entire industry is realizing that AI is only as good as the data it's trained on, and most enterprise data is a hot mess.

But SAP has a particular advantage: they're already embedded in the operational backbone of thousands of large companies. They're in the ERP, the supply chain, the finance systems. That's where the most valuable data lives—not just what a customer clicked on, but what they actually bought, how much they paid, when they paid, and what else was in their cart.

If SAP can unlock that operational data for AI personalization without requiring a massive migration or custom integration project, they could leapfrog competitors who are building from the marketing layer down. It's a bet on being boring and foundational, which is exactly the kind of bet that often pays off in enterprise software.

What I'm Watching For

I'll be watching three things over the next year:

  1. Adoption metrics. Are companies actually using this, or is it another feature that sounds good in a press release but gathers dust? I'll be looking at customer case studies and implementation timelines.

  2. Privacy controls. SAP is positioning this as GDPR- and CCPA-friendly, but the devil's in the details. How easy is it for a customer to opt out? How transparent is the data usage? These aren't just legal questions; they're trust questions.

  3. Integration with non-SAP systems. Most companies have a patchwork of vendors. If SAP's unified data structure only works with other SAP products, it's a non-starter for most organizations. True interoperability is the test.

The Bottom Line

SAP data infrastructure diagram

I've been covering enterprise AI for a decade, and I've seen countless "revolutionary" features that fizzled because the underlying data was a mess. SAP's approach is refreshingly grounded: fix the plumbing first, then let the AI do its thing. It's not flashy. It won't generate breathless headlines. But it might be the most important work happening in enterprise AI right now.

The question is whether SAP can execute. They've got the right idea. They've got the installed base. They've got the data. Now they need to deliver something that actually works in the messy, complicated reality of a real enterprise.

I'll believe it when I see a midsize manufacturer—not a tech-forward retailer, but a company that makes industrial valves or something—actually using this to personalize their customer experience. That's when I'll know it's real.

Until then, I'm cautiously optimistic. And I'm definitely not throwing away my spreadsheet templates just yet. SAP data infrastructure diagram


Originally reported by www.artificialintelligence-news.com. Rewritten with additional analysis and real-world context by Emily Hartwell.