14 data predictions for enterprise growth in 2023 Komprise Contract Senior Apply on EasyApply Create a free account to apply in seconds Shubham Sharma12:29 pm, PT, January 2, 2023The year 2023 is here, and enterprises are set to make the most of it. From startups to major conglomerates, every company has moved into the new year with the same mission – driving growth with a focus on operational efficiency, productivity, and resilience. Since data will play a key role in achieving this mission, leading industry experts and vendors have shared predictions on how the data space will take shape in the coming months. 1. CIOs will look to consolidate data and simplify architecture“Speaking with other CIOs, I've noticed that companies are growing exponentially without a plan to organize their data. When a company considers scaling at all costs but doesn’t invest in the right technology to support that growth, there will be issues."Part of the problem is that CIOs today have to manage too many systems. Too many disjointed data pools lead to duplicated, siloed and locked-up data, which is not only timely and costly to manage and analyze, but also leads to security issues. "For a company to truly move forward with digital transformation, they need to combine data science and data analytics and draw from a single source of truth. We’ll see more CIOs cutting back on vendor spending to simplify their data architecture. Companies that implement an architecture that combines hindsight and predictive analytics to deliver efficient and intelligent solutions will win in the end.” — Naveen Zutshi, CIO of Databricks2. Broader adoption of data contracts"Designed to prevent data quality issues that occur upstream when data-generating services unexpectedly change, data contracts are very much en vogue. Why? Thanks to changes made by software engineers who unknowingly create ramifications via updates that affect the downstream data pipeline and the rise of data modeling --- [these give] data engineers the option to deliver the data into the warehouse, premodeled. 2023 will see broader data contract adoption as practitioners attempt to apply these frameworks."— Lior Gavish, cofounder and CTO of Monte Carlo3. Availability will be the key to winning in 2023“One thing we have learned in recent years is outages can be crippling for a business. In 2023, availability will be the secret sauce differentiating the winners from the losers. Companies need to avoid lock-in and have the flexibility to scale. By diversifying cloud environments, companies will minimize the impact of outages on their ability to continue operations.”— Patrick Bossman, product manager for MariaDB4. 2023 will be the year of the data app “In the past 10 years, we’ve seen the rise of the web app and the phone app, but 2023 is the year of the data app. Reliable, high-performing data applications will be a critical tool for success as businesses seek new solutions to improve customer-facing applications and internal business operations. With on-demand data apps like Uber, Lyft and Doordash available at our fingertips, there’s nothing worse for a customer than to be stuck with the spinning wheel of doom and a request not going through. Powered by a foundation of real-time analytics, we will see increased pressure on data applications to not only be real-time but to be fail-safe.”— Dhruba Borthakur, cofounder and CTO at Rockset5. The rise of data processing agreement (DPA)“How organizations process data within on-premises systems has historically been a very controlled process that requires heavy engineering and security resources. However, using today’s SaaS data infrastructure, it’s never been easier to share and access data across departments, regions and companies. With this in mind, and as a result of the increase in data localization/sovereignty laws, the rules as to how one accesses, processes and reports on data use will need to be defined through contractual agreements --- also known as data processing agreements (DPA). "In 2023, we'll see DPAs become a standard element of SaaS contracts and data-sharing negotiations. How organizations handle these contracts will fundamentally change how they architect data infrastructure and will define the business value of the data. As a result, it will be in data leaders' best interest to fully embrace DPAs in 2023 and beyond. These lengthy documents will be complex, but the digitization of DPAs and the involvement of legal teams will make them far easier to understand and implement.”— Matt Carroll, cofounder and CEO of Immuta 6. No-copy data exchanges will take hold“In 2023, as data sharing continues to grow, and data and IT teams are strapped to keep up, no-copy data exchanges will become the new standard. As organizations productize their modern data stack, there will be an explosion in the size and number of datasets. Making copies before sharing just won’t be feasible anymore. In 2023, enterprises will flock to established platforms, like Snowflake’s Data Exchange and Databricks’ Delta Sharing protocol, to make it easier to share and monetize their data securely.”— Matt Carroll, cofounder and CEO of Immuta7. AI-based automation for unstructured data management will gain traction“Data management for file and object data is getting more sophisticated with adaptive machine learning and AI-based automation to intelligently guide data placement, lifecycle management, search and movement. Solutions can adapt based on the customer’s cost profile, data profile and target provisioning, and learn over time to refine recommendations. For example, an AI algorithm could be used to proactively identify sensitive datasets, such as files with extensions or tags related to financial documents, which have been stored out of compliance --- such as in the CMO's directory rather than a read-only directory owned by the CFO.”— Kumar Goswami, CEO and cofounder of Komprise8. Synthetic data will accelerate AI innovation“In 2023, synthetic data will be a game-changer in accelerating the development and deployment of AI while guarding against algorithmic bias. One of the significant challenges in developing AI is getting the right amount and diversity of data to train machine learning-based algorithms. These algorithms require massive amounts of data that are representative of the different people that will interact with it and the contexts in which it will be used. "It is difficult, time-consuming and costly to acquire this breadth and depth of data. Data synthesis enables AI companies to rapidly augment their existing datasets and simulate scenarios that are difficult to generate in the real world. "For example, in automotive, synthetic data tools can use a source image of a driver to create synthetic variations that use varying lighting conditions or head movements. It could even simulate a driver falling asleep behind the wheel --- data that is rare and very dangerous to capture in real life. Deploying synthetic data tools is key to not only solve these complex challenges of data collection, but also to combat algorithmic bias, by ensuring datasets are truly diverse.” Skills Machine Learning