市场调查报告书
商品编码
1154897
数据目录全球市场规模、份额、行业趋势分析报告:按元数据类型、行业、组件、部署(本地和云)、组织规模、数据消费者、地区展望和预测,2022~2028 年Global Data Catalog Market Size, Share & Industry Trends Analysis Report By Metadata Type, By Vertical, By Component, By Deployment (On-premise and Cloud), By Organization Size, By Data Consumer, By Regional Outlook and Forecast, 2022 - 2028 |
到 2028 年,数据目录的全球市场规模预计将达到 21 亿美元,预测期内復合年增长率为 19.6%。
目录是一个信息目录,提供有关数据集、文件和数据库的详细信息。它提供了诸如数据集的存储位置以及文件存储在何种类型的存储介质上等信息。从不同来源获得数据的综合视图的需求、海量数据的生成以及自助服务智能越来越多地被接受以改进决策过程是主要驱动因素。
COVID-19 影响分析
自从 COVID-19 出现以来,企业对支持数据分析的各种解决方案的需求已经凸显,并且可以看到引入的良好趋势。远程工作和向云端的迁移推动了对提高工作场所安全性和效率的解决方案的需求。因此,对帮助公司分析其数据的各种解决方案的需求显着增加,并且也出现了积极接受这些解决方案的趋势。因此,可以评估大流行对数据目录市场产生了积极影响。
市场增长因素
提高员工生产力和生计
为了让组织实现数据驱动的目标,他们需要建立系统和程序,让数据公民尽快获得他们需要的数据。 IBM 的一项研究发现,组织将一半以上的时间用于搜索数据,而只有一小部分时间用于使用数据。即使授予访问权限,也几乎看不到数据集所经历的更改。结果,数据集被重复使用,而僱用创建重复使用的数据集的工人变得多余。
促进数据治理和数据发现
数据治理是一组原则、规则和实践,可确保数据的准确性和一致性,以便数据可用于推动业务活动、制定决策和推动数字化转型。.通过端到端的数据沿袭,企业可以了解数据的来源、用途、使用对象和原因。每天都会产生数百万的信息和资产,企业越来越难以理解数据并从中获取有价值的信息。庞大的数据量也阻碍了适当的数据管理和治理。在您的组织内使用目录可以加快数据的搜索和评估。
市场製约因素
关于隐私和数据安全风险的误解
合规性和安全性是全球企业关注的问题。因此,企业对引入新的数据管理和跨平台数据传输解决方案犹豫不决。由于缺乏有关安全标准及其实践的信息,许多组织认为目录管理系统可能会导致广泛抽象的数据集出现安全漏洞。
元数据类型展望
数据目录市场按元数据类型分为技术元数据和业务元数据。在 2021 年的数据目录市场中,业务元数据部分占据了相当大的收入份额。业务元数据有助于维护按业务术语组织的文檔。提供数据文件和属性的业务术语。
行业展望
按行业划分,数据目录市场分为 BFSI、零售和电子商务、製造、政府和国防、医疗保健和生命科学、IT 和电信、媒体和娱乐、运输和物流以及其他行业。. BFSI 细分市场在 2021 年占据了数据目录市场最大的收入份额。银行业和金融业是最早采用数据目录的行业之一。财务分析领域使用统计方法来识别财务问题。
组件视角
数据目录市场根据组件细分为解决方案和服务。 2021 年,服务部门在数据目录市场中占据了很大的收入份额。公司使用元数据管理服务通过更好的可见性和可追溯性来提高数据质量。大多数数据目录服务提供跨平台、管道、数据集、图形、仪表板服务、汇总统计以及标记和文檔功能的数据沿袭。
部署前景
数据目录市场根据引入的类型分为云类型和内部部署类型。 2021 年,本地部分在数据目录市场中占据了最大的收入份额。大型企业通常选择本地数据目录服务。它们确保数据安全并保护数据免遭盗窃和其他形式的安全漏洞。本地数据目录允许公司为他们在其係统上拥有的所有数据构建一个企业范围的目录,无论其物理位置如何,都可以提供分析见解。
组织规模展望
数据目录市场根据组织规模分为中小企业和大型企业。在 2021 年的数据目录市场中,中小企业 (SME) 细分市场取得了相当大的收入份额。 SME 提供由 AI 驱动的数据目录,该目录为基于机器学习 (ML) 的发现算法提供支持,并从任何地方扫描和编目整个企业的数据资产,包括本地数据、云数据和大数据。
数据用户的观点
基于数据消费者,数据目录市场细分为商业智能工具、企业应用程序、移动和网络应用程序。商业智能工具部分将在 2021 年占据数据目录市场的最高收入份额。商业智能数据目录门户使组织能够最大限度地利用其数据目录投资。完全整合目录信息并使所有公司用户都可以访问它,从而创建了一个完全受控的自助服务监控环境。
区域展望
按地区划分,数据目录市场分为四个部分:北美、欧洲、亚太和 Ram 地区。 2021 年,北美市场在数据目录市场的收入份额最高。由于多种 BI 工具的数据使用量不断增加,以及 BFSI、医疗保健、电信和 IT 以及製造业等行业对数字创新的接受程度不断提高,预计该市场在预测期内将快速增长。该地区在 BI 技术开发、研究与开发 (R&D) 和技术进步方面处于领先地位。
产品发布是市场进入者采取的主要策略。根据基数矩阵中的分析,Microsoft Corporation 和 Google LLC 是数据目录市场的先驱。 Oracle Corporation、IBM Corporation、Informatica, LLC 等公司是数据目录市场的一些主要创新者。
The Global Data Catalog Market size is expected to reach $2.1 billion by 2028, rising at a market growth of 19.6% CAGR during the forecast period.
A Data Catalog is a unified data curation platform that brings together data supply and demand. It allows users to register data, obtain and utilize data, and evaluate and analyze data. Key components of a Data Catalog should include a data inventory and data discovery capabilities. Additional elements support data management, data evaluation, and data analytics, in addition to catalog management and data collaboration functions.
The volume of data and the number of forms in which it is available are constantly increasing. Thus, locating and retrieving data becomes a formidable task. Employees and companies are familiar with the situation where a market study or sales report cannot be located in the company's data systems. Finding the proper authority to ask for assistance wastes valuable time. For a corporation to leverage Big Data and participate in data insights, it is necessary to make datasets accessible to the entire organization. A data catalog facilitates data discovery, access, and utilization.
A catalog is a directory of information that provides details about data sets, files, and databases. It provides information about the location of a data set and the type of storage media on which the file is saved. The need to obtain a consolidated perspective of data acquired from disparate sources to improve the decision-making process, the generation of huge quantities of data, and the increasing acceptance of self-service intelligence are the primary dynamics.
COVID-19 Impact Analysis
Since the emergence of COVID-19, the necessity for a variety of solutions that assist businesses with data analytics has attracted considerable attention and a good adoption trend. The shift toward remote work and the need for the cloud have increased the need for solutions that enhance the security and efficiency of the workplace. This has induced a significant increase in demand for various solutions that aid organizations in data analytics, in addition to a positive acceptance trend for these solutions. Therefore, it can be assessed that the pandemic had a positive impact on the data catalog market.
Market Growth Factor
Improves Productivity and Life of Employees
For organizations to acknowledge their goal of becoming data-driven, they need to implement the systems and procedures that enable data citizens to obtain the necessary data as quickly as possible. According to research conducted by IBM, organizations spend more than half of their time searching for the data and only part of it in using it. Even when access is granted, there is little visibility into the alterations data sets undergo. This makes repetition of datasets, rendering the workers employed in making the repeated sets redundant.
Aids in Data Governance and Fastens Data Discovery
Data governance is a collection of principles, rules, and practices that assures that data is accurate and consistent and can be relied upon to drive business activities, inform choices, and fuel digital transformations. Companies can use end-to-end data lineage to comprehend where the data originates, what transpires, who uses it, and why. Every day, millions of information and assets are created, making it difficult for enterprises to comprehend and obtain valuable information from the data they possess. The huge amount of data also interferes with proper data management and governance. The use of a catalog within an organization can increase the rate of data search and evaluation.
Market Restraining Factor
Misunderstandings Regarding Privacy and Data Security Risks
Compliance and safety are concerns for businesses around the world. As a result, companies are hesitant to implement new data management or platform-to-platform data transfer solutions. Due to the lack of information regarding security standards and their operation, many organizations believe that catalog management systems may result in security breaches in their extensively abstracted data sets.
Metadata Type Outlook
Based on metadata type, the data catalog market is fragmented into technical metadata and business metadata. The business metadata segment garnered a substantial revenue share in the data catalog market in 2021. The maintenance of organized documentation inside the business vocabulary of terms is facilitated by business metadata. It offers business-speak explanations of data files and properties.
Vertical Outlook
On the basis of vertical, the data catalog market is segmented into BFSI, retail & e-commerce, manufacturing, government & defense, healthcare & life sciences, IT & telecom, Media & entertainment, transportation & logistics, and other verticals. The BFSI segment garnered the maximum revenue share in the data catalog market in 2021. The banking and financial industry was one of the first to implement data cataloging. The discipline of financial analysis identifies financial problems using statistical approaches.
Component Outlook
Based on component, the data catalog market is categorized into solutions and services. The services segment acquired a significant revenue share in the data catalog market in 2021. A company increases its data quality with better visibility and traceability by using metadata management services. Most data catalogue services provide a lineage of data across platforms, pipelines, datasets, graphs, dashboards services, summary statistics, and tagging and documentation capabilities.
Deployment Outlook
Based on deployment mode, the data catalog market is bifurcated into cloud and on-premises. The on-premise segment procured the maximum revenue share in the data catalog market in 2021. The on-premises data catalog services are opted, usually, by large enterprises. These ensure data security and protect data from theft and other forms of security breaches. The on-premise data catalogs are able to build an enterprise-scale catalog of all the data the enterprise has on their systems and provides insights into their analysis, irrespective of their physical locations.
Organization Size Outlook
On the basis of organization size, the data catalog market is divided into SMEs and large enterprises. The Small and Medium Enterprises (SMEs) segment recorded a substantial revenue share in the data catalog market in 2021. SME provides AI-powered data catalogs that give a machine-learning (ML) based discovery algorithm to scan and catalog enterprise-wide data assets, including on-premises, cloud, and big data from anywhere.
Data Consumer Outlook
On the basis of data consumer, the data catalog market is classified into business intelligence tools, enterprise applications, mobile and web applications. The business intelligence tools segment acquired the highest revenue share in the data catalog market in 2021. A Business Intelligence data catalog portal enables organizations to maximize their data catalog investment. By fully integrating catalog information and making it accessible to all enterprise users, a fully controlled self-service monitoring environment is created.
Regional Outlook
Based on region, the data catalog market is analyzed across North America, Europe, Asia Pacific, and LAMEA. The North America segment witnessed the highest revenue share in the data catalog market in 2021. Due to the rising use of data in multiple BI tools and the increased acceptance of digital innovations in industries such as BFSI, healthcare, telecom and IT, and manufacturing, the market is anticipated to expand fast over the forecast period. The region has become a leader in BI technology developments, research and development (R&D), and technological advances.
The major strategies followed by the market participants are Product Launches. Based on the Analysis presented in the Cardinal matrix; Microsoft Corporation and Google LLC are the forerunners in the Data Catalog Market. Companies such as Oracle Corporation, IBM Corporation, Informatica, LLC are some of the key innovators in Data Catalog Market.
The market research report covers the analysis of key stake holders of the market. Key companies profiled in the report include IBM Corporation, Microsoft Corporation, TIBCO Software, Inc., Collibra, Inc., Oracle Corporation, Google LLC, Informatica, LLC, Cloudera, Inc., Alteryx, Inc. and Alation, Inc.
Recent Strategies Deployed in Data Catalog Market
Partnerships, Collaborations and Agreements:
Sep-2022: Alation partnered with Fivetran, a global leader in modern data integration. Following this partnership, the companies aimed at enabling their customers to understand and locate the whole context of data in the latest data stack. Through the use of Fivetran Metadata API, the partnership unified reliable, governed data from various sources across one single view. This increases data visibility and improves decision-making and data pipelines.
Aug-2022: Oracle Cloud Infrastructure collaborated with Anaconda, a provider of the world's most recognized data science platform. The collaboration focused on delivering secure open-source Python and R tools and packages by allowing and embedding the latter company's repository throughout OCI Machine Learning and Artificial Intelligence services.
Jul-2022: Alteryx partnered with Polestar Solutions, a dominant player in enterprise performance management (EPM) and data analytics. With this partnership, the company aimed to enhance data science automation and analytics for enterprises. The partnership incorporated powerful analytics delivery abilities and domain expertise of Polestar Solutions with an end-to-end platform of Alteryx to allow enterprises to change data into innovative insights.
May-2022: IBM signed an agreement with Amazon Web Services, a subsidiary of Amazon. With this agreement, the company focused on providing a broad range of software catalogs as Software-as-a-Service (SaaS) on the AWS platform. The agreement further underpinned the importance of Red Hat, IBM, and AWS in providing adaptability and offering greater business value for customers across various industries.
Apr-2022: Informatica partnered with Snowflake, a Data Cloud Company. Through this partnership, the companies aimed to strengthen the combination of Informatica's Intelligent Data Management Cloud (IDMC) and Data Cloud to help expedite the move to the cloud by increasing the data governance and data management capabilities of customers.
Oct-2021: Informatica collaborated with Google Cloud, a suite of cloud computing services. With this collaboration, the companies formed a joint program for cloud migration to speed up the relocation of customers by 12 times from on-premises enterprise data warehouses to Google BigQuery. This increased the cost-effectiveness of migration. Additionally, the collaboration also promoted Informatica's Master Data Management and Data Governance services, making management and deployment of services easier on Google Cloud.
Aug-2021: IBM partnered with Cloudera, an American software company. Following this partnership, the companies reinforced their go-to-market programs and joint development strategies by incorporating the advanced analytical abilities of IBM Cloud Pak for Data, which is a unified platform for AI and data, into the Cloudera Data Platform.
Nov-2020: Alation partnered with Snowflake, the creator of the Data Cloud. Following this partnership, Alation aimed to increase data discovery & search, drive data cloud governance, and ease the migration to Snowflake's Data Cloud.
Jun-2020: Microsoft partnered with SAS, a leader in analytics. Under this partnership, the companies aimed at enabling users to run SAS workloads in the cloud. This would help the customers in enhancing their business solutions and determining critical value from digital transformation initiatives.
Product Launches and Product Expansions:
Aug-2022: Cloudera unveiled Cloudera Data Platform (CDP) One, a software-as-a-service (SaaS) offering incorporating all features of a data lakehouse. CDP allows easy and fast exploratory data science and self-service analytics on any data type. CDP One possesses built-in machine learning (ML) and enterprise security that demands zero clouds, monitoring, or security operations staff needed for reduced risk and lower TCO.
Jul-2022: Google announced the expansion of Google Cloud Data Catalog by unifying it with Dataplex to form a single interface for users. Through this expansion, the company focused on offering users a unified experience for discovering and searching data and combining it with important business information. It also organized data by logical data domains and facilitated the central monitoring and governance of distributed data with built-in automation capabilities and data intelligence.
Apr-2022: TIBCO unveiled TIBCO WebFOCUS 9.0.0. The product is developed with powerful capabilities such as TIBCO WebFOCUS Container Edition and a hub for a complete personalized customer experience, together with substantial enhancements for TIBCO WebFOCUS Designer. The innovative features of the product strengthened the ML/AI experience for company analysts, users, and engineers, utilizing analytics and data across the enterprise.
Sep-2021: Alation launched Data Governance App. The app transforms multi-cloud security and governance and provides continual, autonomous data governance using AI and machine learning. The Alation Data Governance App increases data governance abilities for the Snowflake Data Cloud.
Aug-2021: Cloudera introduced Cloudera DataFlow for the Public Cloud, a service built for data flows to operate streaming workloads from hybrid systems on the Cloudera Data Platform (CDP). This cloud-native service allows customers to automate intricate operations of data flow and enhances the operational potential with auto-scaling abilities to stream data flows. Additionally, it also helps organizations by eliminating the estimation of infrastructure size, thereby reducing cloud costs.
Jul-2021: Informatica introduced Cloud Data Governance & Catalog (CDGC) solutions. The solution offers seamless catalog as-a-service and streamlined data governance and is also the main component in Intelligent Data Management Cloud (IDMC) by Informatica. CDGC allowed companies to expedite reliable insights through analytics and data governance and increase business value from AI and cloud analytics.
Jul-2021: Oracle unveiled Oracle Cloud Infrastructure (OCI) Data Catalog. The product aims at the expedition of data cataloging for technical metadata harvesting by automating the creation and discovery of data assets. The product also simplifies the enrichment of metadata and provides bulk upload abilities.
May-2021: TIBCO introduced a range of enhancements to its Unify data management portfolio, comprising its TIBCO EBX and TIBCO Data Virtualization products. The updated solutions significantly strengthened the data fabric of an organization, allowing customers to experience the huge data potential.
Mar-2021: Oracle launched innovative additions to Oracle Autonomous Data Warehouse. With this launch, Oracle transformed cloud data warehousing from an intricate ecosystem of tools, products, and tasks that needs increased technical expertise, money, and time to perform data cleansing and transformation, data loading, machine learning, and business modeling into a suggestive drag-and-drop, point-and-click experience for data scientists, data analysts, and users of business.
Dec-2020: Microsoft unveiled Azure Purview, a platform with a data catalog incorporating data governance and discovery features. The unified data governance platform, Azure Purview automates the discovery, cataloging, mapping, and lineage tracking of data, intending to offer the customers an enhanced understanding of the range of the data estate.
Jun-2020: Collibra launched Collibra Data Intelligence Cloud. The Data Intelligence Cloud is an end-to-end integrated platform, which provides transparency into the data environment, automates data workflows, offers trusted insights, and ensures security. Collibra Data Intelligence Cloud offers privacy and data governance customers an optimal method of accessing reliable data that can then be assessed with the tools the businesses already use.
Apr-2020: Google expanded its services by releasing a data catalog. With this launch, the company aimed at increasing the visibility of customers into their data assets in platforms like Google Cloud and more. The features of this data catalog include crawlers that consume metadata automatically from Google Cloud, starting with Pub/Sub and BigQuery, and reaching to object storage systems of Google Cloud.
Acquisitions and Mergers:
Jul-2022: IBM acquired Databand, a provider of monitoring and visibility services for data pipelines to organizations. Through this acquisition, IBM aimed at incorporating the observability services of Databand into its data fabric platform that allowed the organization to use and govern data for BI, analytics, and machine learning. The IBM data fabric platform includes the IBM Watson Knowledge Catalog that offers data catalog and data governance abilities to allow users to detect and utilize data for training in machine learning or data analytics.
Feb-2022: Alteryx took over Trifacta, a San Francisco-based company. Through this acquisition, Alteryx focused on increasing its portfolio by providing a low code/no code, integrated end-to-end analytics automation interface in the cloud, thereby serving the demands of the enterprise.
Oct-2021: Alation acquired Lyngo Analytics, a data insights firm. With this acquisition, Alation focused on enhancing business user experiences with the data catalog further increasing data intelligence and assisting organizations in guiding data culture.
Feb-2021: Collibra acquired OwlDQ, a leading provider of predictive data quality software. The combination of Collibra Data Intelligence Cloud with OwlDQ, launched Collibra Data Quality, a new offering that enables organizations in automating and centralizing data quality workflows for smoothening the data and analytics processes and complying with global regulations throughout the organization.
Jul-2020: Informatica acquired Compact Solutions, a provider of innovative software and services for big data governance. Following this acquisition, Informatica focused on integrating its Intelligent Data Platform with Compact Solutions' expanded management of metadata. The acquisition enhanced the company's leadership in automation and AI-enabled by metadata.
Jun-2020: Microsoft took over ADRM Software, a leading provider of large-scale industry data models. Under this acquisition, Microsoft integrated Azure's limitless storage and compute with ADRM's comprehensive industry models. This integration enabled the development of an intelligent data lake in which data from various business lines can be harmonized quickly.
Market Segments covered in the Report:
By Metadata Type
By Vertical
By Component
By Deployment
By Organization Size
By Data Consumer
By Geography
Companies Profiled
Unique Offerings from KBV Research
List of Figures