The CIO’s playbook for AI data management success

AI & Automation31 Jul 2025

Darren Cunningham at Komprise explains why AI demands a new partnership between IT infrastructure and data leaders

As AI takes centre stage in the workforce and increasingly drives strategic initiatives, CIOs yet again find themselves tossed in the currents of disruption. Their roles are quickly evolving along with those of their staff. “CIOs must move beyond technology facilitation and take the lead in AI governance, cyber-security, skills development, sustainability, regulatory compliance, and technical debt management,” according to IDC analyst Daniel-Zoe Jimenez, in a recent IDC blog.

A central goal to AI success across any organisation is developing the right data infrastructure; this includes the people who can consistently achieve a high degree of data quality, data availability, data safety, performance and economic viability.

Now that unstructured data makes up an estimated 90% of enterprise data and is the vital source for AI, the game plan is changing for key players in the equation of AI execution: data storage teams and analytics platform teams are now tasked with preparing and delivering the right data to AI models.

Data use versus data value

Historically, storage teams have focused on data use by managing data availability, performance, cost, capacity and data protection with success measured by IOPS, uptime SLAs, and cost metrics. Data platform teams are concerned with discovering data value and rely on access to usable, clean, and relevant data. Until recently, they’ve worked primarily with structured and semi-structured data sources from applications and databases for their data warehouses and data lakes.

Data teams today are discovering that leveraging unstructured data for AI is difficult and much different from BI projects using ETL because the process is no longer linear. They can’t just move it to a data lake and process it. It’s too large and expensive to do so, and furthermore, AI runs in many different locations and applications. Moving large data swamps around is not practical nor even possible.

What data teams need is help from their data storage counterparts to curate data at the source, enabling them to copy the right data for each project to the appropriate AI tool. That requires knowledge of unstructured data content to accurately filter it and find desired data sets.

From ETL to metadata workflows with governance

The new imperative is to filter, curate, and deliver only the right data into a data workflow before any processing or AI model training begins. It requires automated metadata enrichment and orchestration. It also requires the ability to filter out sensitive data from the workflow.

Storage and data teams, under the leadership of the CIO, will need new tools and processes (including human verification) to efficiently discover, classify and move the right data sets to AI for each project, while preventing any potential quality issues and security violations. Historically siloed, these groups will now need to work together more closely to ensure safe, high-quality AI data pipelines.

Storage teams will need to evolve from infrastructure maintenance into data service providers to help data analytics teams as they gather the requirements and design processes for AI. Data platform teams will need to build common metadata definitions with storage teams and collaborate on an inclusive data governance strategy incorporating unstructured data.

To collaborate effectively, data teams will need to understand:

Basic file and object storage protocols (SMB, NFS, S3)
What unstructured data exists and is usable by AI
Storage access patterns and data tier and archiving strategies
Metadata enrichment strategies and tactics
Governance policies around sensitive and regulated data

Data storage teams will need to understand:

Data modelling, pipelines, and AI data preparation requirements
How to create metadata that is useful for data teams
New use cases that drive demand for unstructured data, such as AI customer chatbots or department-specific GenAI tools
Evolving performance, security and access needs for AI workloads

How AI impacts the CIO role

To support emerging AI business models and changing IT roles, CIOs will need to supply the right tools for these new requirements, including:

Unified visibility across all unstructured data: Data scientists need to discover relevant data regardless of where it lives across departmental and storage silos. Shared reports on data usage, growth, file types, top owners and top metadata categories help relay the critical metrics.
Automated metadata enrichment and tagging: Storage and data teams alike benefit from automated, AI-powered metadata capture, including content-aware tags, PII detection, access patterns and usage history. Storage stakeholders can use this data to ensure that they are adequately protecting sensitive and regulated data and right placing data for cost optimisation, while data teams appreciate the content-aware tags for rapid data curation.
Intelligent data ingestion pipelines: Instead of duplicating petabyte-scale data sets for AI use, institute policy-based movement of the right data sets continuously based on metadata tags.
Governed self-service access: Empower data teams with self-service search, and access to unstructured data sources.
AI data governance checks and balances: Through tools and manual inspection, teams must be empowered to segregate sensitive and proprietary data into a private, secure domain which restricts sharing with commercial applications, track the lineage of data, check data for biases and errors and facilitate audit trails of any data used in an AI tool and demonstrate that your organisation is complying.

Data: a shared enterprise asset

For AI to deliver on its promise, structured and unstructured data must be treated as a shared enterprise asset, not the domain of isolated teams. If storage and data teams remain misaligned, organisations risk:

Wasting valuable data locked in silos
Creating inefficient, costly and duplicative data pipelines
Violating governance and compliance policies
Slowing AI model development and time to value

The future of global data strategy in the enterprise relies upon how well teams across IT and departmental boundaries can work together to meet the CXO-level goal of using AI safely and ingeniously to deliver profoundly better products, services and business outcomes.

Darren Cunningham is the VP of Marketing at Komprise. Read more about their new preprocessing model for filtering unstructured data here.

Main image courtesy of iStockPhoto.com and NicoElNino

The CIO’s playbook for AI data management success

Business Reporter Team

You may also like

A future of prosperity and opportunity in the Middle EastSPONSORED ARTICLE

Achieving growth when it seems impossible

Advertising on Snapchat

Related Articles

Solving the hidden bottlenecks of AI growth

Making the business case for VPNs

Chinese AI startup DeepSeek releases upgraded model with domestic chip support

AI is building resilience to cyber-crime

Related Articles

Shadow IT: turning risk into reward

UK payments infrastructure: why it’s time for a reset

Most Viewed

Qualcomm considers buying UK semiconductor firm Alphawave

US says China's Huawei can't make more than 200,000 AI chips in 2025

Google launches new Ironwood chip to speed AI applications

China's Zhipu AI launches free AI agent, intensifying domestic tech race

ABB to spin off world's second biggest robotics business

Microsoft prepared to abandon high-stakes talks with OpenAI, FT reports

Exclusive-Arm expects its share of data center CPU market sales to rocket to 50% this year

China's ICBC launches $11 billion technology innovation fund

Samsung says trade turmoil raises chip business volatilities, may hit phone demand

South Korea plans $34 billion fund for strategic sectors like chips and autos

Winston House, 3rd Floor, Units 306-309, 2-4 Dollis Park, London, N3 1HF

23-29 Hendon Lane, London, N3 1RT

020 8349 4363

info@business-reporter.co.uk