Best Octopai Alternatives in 2024
Find the top alternatives to Octopai currently available. Compare ratings, reviews, pricing, and features of Octopai alternatives in 2024. Slashdot lists the best Octopai alternatives on the market: competing products that are similar to Octopai. Sort through the Octopai alternatives below to make the best choice for your needs.
-
1
MANTA
Manta
Manta is a unified data lineage platform that serves as the central hub of all enterprise data flows. Manta can construct lineage from report definitions, custom SQL code, and ETL workflows. Lineage is analyzed based on actual code, and both direct and indirect flows can be visualized on the map. Data paths between files, report fields, database tables, and individual columns are displayed to users in an intuitive user interface, enabling teams to understand data flows in context. -
2
Centralpoint
Oxcyon
Gartner's Magic Quadrant includes Centralpoint as a Digital Experience Platform. It is used by more than 350 clients around the world, and it goes beyond Enterprise Content Management. It securely authenticates (AD, SAML, OpenID, OAuth) all users for self-service interaction. Centralpoint automatically aggregates information from different sources and applies rich metadata against your rules to produce true Knowledge Management. This allows you to search for and relate disparate data sets from anywhere. Centralpoint's Module Gallery is the most robust and can be installed either on-premise or in the cloud. Check out our solutions for Automating Metadata and Automating Retention Policy Management. We also offer solutions to simplify the mashup of disparate data to benefit from AI (Artificial Intelligence). Centralpoint is often used to provide easy migration tools and an intelligent alternative to SharePoint. It can be used to secure portal solutions for public sites, intranets, members, or extranets. -
3
Talend Data Catalog
Qlik
Talend Data Catalog provides your organization with a single point of control for all your data. Data Catalog provides robust tools for search, discovery, and connectors that allow you to extract metadata from almost any data source. It makes it easy to manage your data pipelines, protect your data, and accelerate your ETL process. Data Catalog automatically crawls, profiles and links all your metadata. Data Catalog automatically documents up to 80% of the data associated with it. Smart relationships and machine learning keep the data current and up-to-date, ensuring that the user has the most recent data. Data governance can be made a team sport by providing a single point of control that allows you to collaborate to improve data accessibility and accuracy. With intelligent data lineage tracking and compliance tracking, you can support data privacy and regulatory compliance. -
4
IRI Voracity
IRI, The CoSort Company
IRI Voracity is an end-to-end software platform for fast, affordable, and ergonomic data lifecycle management. Voracity speeds, consolidates, and often combines the key activities of data discovery, integration, migration, governance, and analytics in a single pane of glass, built on Eclipse™. Through its revolutionary convergence of capability and its wide range of job design and runtime options, Voracity bends the multi-tool cost, difficulty, and risk curves away from megavendor ETL packages, disjointed Apache projects, and specialized software. Voracity uniquely delivers the ability to perform data:
* profiling and classification
* searching and risk-scoring
* integration and federation
* migration and replication
* cleansing and enrichment
* validation and unification
* masking and encryption
* reporting and wrangling
* subsetting and testing
Voracity runs on-premise, or in the cloud, on physical or virtual machines, and its runtimes can also be containerized or called from real-time applications or batch jobs. -
5
Select Star
Select Star
$270 per month
In just 15 minutes, you can set up your automated data catalogue and receive column-level lineage, Entity Relationship diagrams, and auto-populated documentation in 24 hours. You can easily tag, find, and add documentation to data so everyone can find the right data for them. Select Star automatically detects your column-level data lineage and displays it. Now you can trust the data by knowing where it came from. Select Star automatically displays how your company uses data. This allows you to identify relevant data fields without having to ask anyone else. Select Star ensures that your data is protected with AICPA SOC2 Security, Confidentiality and Availability standards. -
6
Atlan
Atlan
The modern data workspace. All your data assets, from data tables to reports, will be instantly discoverable. The combination of powerful search algorithms and easy browsing makes it easy to find the right asset. Atlan automatically generates data quality profiles that make it easy to detect bad data. We have you covered, from automatic variable type detection and frequency distribution to missing values or outlier detection. Atlan takes the hassle out of managing and governing your data ecosystem. Atlan's bots analyze SQL query history to automatically construct data lineage. They also auto-detect PII information. This allows you to create dynamic access policies and best-in-class governance. Our Excel-like query builder allows anyone to query multiple data lakes, warehouses, and DBs. Native integrations with tools such as Tableau and Jupyter make data collaboration possible. -
7
Datameer
Datameer
Datameer is your go-to data tool for exploring, preparing, visualizing, and cataloging Snowflake insights. From exploring raw datasets to driving business decisions – an all-in-one tool. -
8
Global IDs
Global IDs
Global IDs offers a variety of Enterprise Data Solutions, including data governance, cloud migration, compliance, privacy, analytics, and rationalization. Global IDs EDA Platform features include automated discovery and profiling as well as data classification, data lineage and data quality. These functions make data transparent, trustworthy, and easily understandable for all members of the ecosystem. Global IDs EDA platform architecture was designed to integrate from the ground up, with all platform functionality available via APIs. Global IDs EDA platform automates data administration for enterprises of all sizes and data ecosystems. -
9
Collibra
Collibra
The Collibra Data Intelligence Cloud offers a best-in-class catalog, flexible governance, continuous quality, and built-in privacy. Its data catalogue supports your users with embedded governance, privacy, and quality. You can raise the bar by ensuring that teams can quickly find, understand, and access data from all sources, including business applications and data science tools, in one central location. Your data deserves privacy. Automate, centralize and guide workflows to encourage collaboration and operationalize privacy. Collibra Data Lineage gives you the complete story about your data. Automatically map relationships between applications, systems, and reports to provide a context-rich view of the enterprise. Focus on the data that you are most concerned about and make sure it is accurate, complete, and trustworthy. -
10
Kylo
Teradata
Kylo is an enterprise-ready open-source data lake management platform for self-service data ingestion and data preparation. It integrates metadata management, governance, security, and best practices based on Think Big's 150+ big-data implementation projects. Self-service data ingest includes data validation, data cleansing, and automatic profiling. Visual SQL and interactive transformation through a simple user interface allow you to manage data. Search and explore data and metadata. View lineage and profile statistics. Monitor the health of feeds, services, and data lakes. Track SLAs and troubleshoot performance. To enable user self-service, create batch or streaming pipeline templates in Apache NiFi. While organizations can spend a lot of engineering effort to move data into Hadoop, they often struggle with data governance and data quality. Kylo simplifies data ingest and shifts it to data owners via a simple, guided UI. -
11
Data360 Govern
Precisely
Although your organization understands the value of data, and how it must be accessible to business users for maximum impact, enterprise data governance can make it difficult to trust, understand, or find that data. Data360 Govern, an enterprise data governance, metadata, and catalog management solution, gives you confidence in your data's quality, value, and trustworthiness. It automates governance tasks and stewardship tasks, helping you to answer critical questions about your data's origin, use, meaning, ownership and quality. Data360 Govern allows you to make faster decisions about data usage and management, foster collaboration across your organization, and give users the ability to get the answers they require - whenever they need them. Transparency into your company's data landscape allows you to track the most critical data that aligns with your business goals. -
12
Dataedo
Dataedo
$49 per month
Your metadata can be discovered, documented and managed. Dataedo has multiple automated metadata scanners. These scanners connect to different database technologies, extract data structures, and then load them into the metadata repository. In just a few clicks you can create a catalog of all your data and then describe each element. Decipher column and table names with business-friendly aliases, and give data assets meaning and purpose with descriptions and custom fields. To find out what data is stored in your data asset, you can use sample data. Make sure you have a better understanding of the data before you use it. Data profiling can help ensure high quality data and gives everyone access to data knowledge. A lightweight, on-premises data catalogue can help you build data literacy, democratize data, and empower your employees to make better use of data. -
13
Ataccama ONE
Ataccama
Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed and security while ensuring trust, security and governance of your data. -
14
Castor
Castor
$699 per month
Castor is a data catalogue that can be adopted by all employees. Get a complete overview of your data environment. Our powerful search engine makes it easy to find data quickly. Access data quickly and easily by joining a new data infrastructure. Expand beyond the traditional data catalog. Modern data teams have multiple data sources. Instead of building one truth, they build it. Castor's delightful and automated documentation makes it easy to trust data. In minutes, you can get a column-level view of your cross-system data lineage. To build trust in your data, get a bird's-eye view of your data pipelines. All you need to troubleshoot data issues, conduct impact analyses, and comply with GDPR is one tool. Optimize performance, cost, compliance, and security for your data. Our automated infrastructure monitoring system will keep your data stack healthy. -
15
Aggua
Aggua
Aggua is an AI platform with augmented data fabric that gives data and business teams access to their data. It creates Trust and provides practical Data Insights for a more holistic and data-centric decision making. With just a few clicks, you can find out what's happening under the hood of your data stack. You can access data lineage, cost insights and documentation without interrupting your data engineer's day. With automated lineage, data engineers and architects can spend less time manually tracing what data type changes will break in their data pipelines, tables, and infrastructure. -
16
SAP Information Steward software allows for data profiling, monitoring, and information policy management. It is the information governance layer of SAP Business Technology Platform and can help you to anticipate risk and achieve better business outcomes. To gain continuous insight into your enterprise's data model integrity, combine data profiling, metadata management, and data lineage. You will gain a better understanding of the data quality in your data management landscape while accessing and analysing metrics using intuitive dashboards and scorecards. Supporting analysts, data stewards, IT experts, and other professionals with consistent validation rules and guidelines can improve enterprise information management initiatives. Data profiling and metadata management are two solutions that can help you discover, assess, define and monitor the quality of your enterprise's data assets. Run what-if analyses to forecast the savings that improved data quality could bring.
-
17
Privacera
Privacera
Multi-cloud data security with a single pane of glass. The industry's first SaaS access governance solution. Cloud is fragmented and data is scattered across different systems. Sensitive data is difficult to access and control due to limited visibility. Complex data onboarding hinders data scientist productivity. Data governance across services can be manual and fragmented. It can be time-consuming to securely move data to the cloud. Maximize visibility and assess the risk of sensitive data distributed across multiple cloud service providers. One system enables you to manage multiple cloud services' data policies in a single place. Support RTBF, GDPR and other compliance requests across multiple cloud service providers. Securely move data to the cloud and enable Apache Ranger compliance policies. It is easier and quicker to transform sensitive data across multiple cloud databases and analytical platforms using one integrated system. -
18
erwin Data Intelligence
erwin
$299 per month
erwin Data Intelligence, or erwin DI, combines data literacy and data catalog capabilities to provide greater awareness and access to data assets, guidance on how to use them, and guardrails to ensure that data policies and best practices are followed. Automatically extract, transform, and feed metadata from a variety of data sources, operational processes, and data models into one central catalog. It is then made accessible and understandable through role-based, contextual views. This allows stakeholders to make strategic decisions based upon accurate insights. erwin DI supports enterprise information governance, digital transformation, and any other effort that relies upon data to achieve positive outcomes. You can schedule ongoing scans of metadata from a wide range of data sources. You can easily map data elements from source to target, including data in motion, and harmonize data integration across platforms. Data consumers can easily identify and find data that is relevant to their roles. -
19
DataGalaxy
DataGalaxy
DataGalaxy’s all-in-one data catalog provides out-of-the-box actionability, with fully customizable features, visualization tools, and AI integration, to give business teams a way to document, link and track their metadata assets. The Data Catalog 360° platform's user-centric approach is dedicated to metadata management, knowledge sharing, and mapping. This helps your organization manage data in the way that you want. A data catalog allows employees from different teams to collaborate by using homogeneous, centralized data sets. Our data catalog provides clarity for data definitions, synonyms and essential business attributes. It also includes a semantic layer to help all users understand and leverage data. If you are looking for answers about specific metadata, Data Catalog 360° will identify the data experts and owners. This will empower your team by facilitating collaboration. -
20
Secure and manage the data lifecycle, from Edge to AI in any cloud or data centre. Operates on all major public clouds as well as the private cloud with a public experience everywhere. Integrates data management and analytics experiences across the entire data lifecycle. All environments are covered by security, compliance, migration, metadata management. Open source, extensible, and open to multiple data stores. Self-service analytics that is faster, safer, and easier to use. Self-service access to multi-function, integrated analytics on centrally managed business data. This allows for consistent experiences anywhere, whether it is in the cloud or hybrid. You can enjoy consistent data security, governance and lineage as well as deploying the cloud analytics services that business users need. This eliminates the need for shadow IT solutions.
-
21
Apache Atlas
Apache Software Foundation
Atlas is a flexible and extensible set of core foundational governance services that enable enterprises to efficiently and effectively meet their compliance requirements within Hadoop. It also allows integration with the entire enterprise data ecosystem. Apache Atlas offers open metadata management and governance capabilities that allow organizations to create a catalog of their data assets, classify and govern these assets, and provide collaboration capabilities around them for data scientists, analysts, and the data governance group. Pre-defined types manage various Hadoop and non-Hadoop metadata. New types can be created to manage metadata. Types can inherit from other types, and can have simple attributes, complex attributes, and object references. Type instances, also known as entities, capture metadata object details and their relationships. REST APIs allow for easier integration with types and instances. -
22
ASG Data Intelligence
ASG Technologies
There is a greater demand for data-driven insight and innovation than ever before. To maintain a competitive edge in today’s global enterprises, it is essential to be able to use trusted data to make informed business decisions. Despite the fact that most companies have a lot of data, business leaders often don't know how to access it. ASG Data Intelligence is the solution to data distrust. It is a metadata-driven platform which makes technical data "smarter". It provides end-to-end views and movements of data (data lineage), as well as business meanings and usage guardrails. Data value can be unleashed when it is made available, understood, and trusted by all users in your organization, including data scientists, analysts and marketers. Improved understanding of data's origins, business context and processes will help you build trust in it. -
23
Tree Schema Data Catalog
Tree Schema
$99 per month
This is the essential tool for metadata management. In just 5 minutes, automatically populate your entire catalogue! Data Discovery: find the data you need from any part of your data ecosystem, starting with the database and ending with the specific values for each field. Automated documentation of your data from existing data storage. First-class support for unstructured and tabular data. Automated data governance actions. Data Lineage: explore your data lineage to understand where your data is coming from and where it is headed. View the impact analysis of changes. See all up- and downstream impacts. Visualize connections and relationships. API Access (new): the Tree Schema API allows you to manage your data lineage in code and keep your catalog current. Integrate data lineage in CI/CD pipelines, capture values and descriptions within your code, and analyze the impact of breaking changes. Data Dictionary: know the key terms and lingo which drive your business. Define the context and scope of keywords. -
24
Dremio
Dremio
Dremio provides lightning-fast queries as well as a self-service semantic layer directly on your data lake storage. No data moving to proprietary data warehouses, and no cubes, aggregation tables, or extracts. Data architects have flexibility and control, while data consumers have self-service. Apache Arrow and Dremio technologies such as Data Reflections, Columnar Cloud Cache (C3), and Predictive Pipelining combine to make it easy to query your data lake storage. An abstraction layer allows IT to apply security and business meaning while allowing analysts and data scientists to access data, explore it, and create new virtual datasets. Dremio's semantic layer is an integrated searchable catalog that indexes all your metadata so business users can make sense of your data. The semantic layer is made up of virtual datasets and spaces, which are all searchable and indexed. -
25
Informatica Enterprise Data Catalog
Informatica
You can scan and index metadata, discover and profile information, and provide detailed lineage across tens to millions of data sets. To maximize data reuse and data value, classify and organize data assets in any environment. Automate scanning across multiple cloud platforms, BI tools, ETL tools, and data types. AI-powered capabilities include domain discovery, data similarity detection, business term associations, and recommendations. You can track data movement from high-level system views through to fine column-level lineage and get detailed impact analysis. The Data Asset Analytics dashboard allows you to see asset usage, enrichment, and collaboration. You can view data quality rules, scorecards and metric groups in context. You can tap into shared data knowledge through certifications, ratings, reviews, a Q&A forum, and change notifications. Informatica stands out from the rest with its extensive range of enterprise-grade data management products. -
26
SCIKIQ
DAAS Labs
$10,000 per year
A platform for data management powered by AI that allows data democratization. It drives insights and innovation by integrating and centralizing all data sources, facilitating collaboration, and empowering organizations. SCIKIQ, a holistic business platform, simplifies data complexities for business users through a drag-and-drop user interface. This allows businesses to concentrate on driving value out of data, allowing them to grow and make better decisions. You can connect any data source and use out-of-the-box integration to ingest both structured and unstructured data. Built for business users, it is an easy-to-use, no-code platform with drag-and-drop data management. Self-learning platform. Cloud agnostic, environment agnostic. You can build on top of any data environment. The SCIKIQ architecture was specifically designed to address the complex hybrid data landscape. -
27
IBM Manta Data Lineage, a data lineage tool, increases the transparency of data pipelines so that businesses can ensure data accuracy across all their models and systems. Data quality, provenance and lineage become increasingly important as businesses integrate AI into workflows and data complexity increases. IBM's CEO study of 2023 found that the primary barrier to generative AI adoption was concerns about data lineage. IBM offers a data lineage platform which automatically scans all your applications and builds a powerful map of data flows. The platform delivers the information through a native user-interface (UI) or other channels for both technical and nontechnical audiences. IBM Manta Data Lineage gives data operations teams comprehensive visibility and control over their data pipeline. By improving your understanding of dynamic metadata and using it, you can ensure data is managed efficiently across complex systems.
-
28
Databricks Data Intelligence Platform
Databricks
The Databricks Data Intelligence Platform enables your entire organization to utilize data and AI. It is built on a lakehouse that provides an open, unified platform for all data and governance. It's powered by a Data Intelligence Engine, which understands the uniqueness in your data. Data and AI companies will win in every industry. Databricks can help you achieve your data and AI goals faster and easier. Databricks combines the benefits of a lakehouse with generative AI to power a Data Intelligence Engine which understands the unique semantics in your data. The Databricks Platform can then optimize performance and manage infrastructure according to the unique needs of your business. The Data Intelligence Engine speaks your organization's native language, making it easy to search for and discover new data. It is just like asking a colleague a question. -
29
Datakin
Datakin
$2 per month
You can instantly see the order in your complex data world and know exactly where to find answers. Datakin automatically tracks data lineage and displays your entire data ecosystem as a rich visual graph. It clearly shows the upstream and downstream relationships of each dataset. The Duration tab summarizes the job's performance and its upstream dependencies in a Gantt-style graph. This makes it easy to identify bottlenecks. The Compare tab allows you to see how your jobs and data have changed over time. Sometimes jobs that run well can produce poor output. The Quality tab shows you the most important data quality metrics and how they change over time. This makes anomalies easily visible. Datakin allows you to quickly identify the root cause of problems and prevent them from happening again. -
30
Metaphor
Metaphor Data
Automatically index warehouses, lakes and dashboards. Metaphor allows you to show your most trusted data to your users when combined with lineage, utilization, and other social popularity indicators. Open 360-degree views of your data are available to all employees. This allows for data conversations and data sharing. Meet your customers at their location - share artifacts and documentation natively via Slack. Tag your conversations in Slack and associate them with data. Collaboration across silos is possible through the organic discovery and use of key terms and patterns. You can easily discover data from the entire stack and write technical details. This wiki is easy to use by non-technical users. Slack allows you to support your users and the catalog can be used as a Data Enablement Tool to quickly onboard users. -
31
Blindata
Blindata
$2000/year/user
Blindata is a comprehensive Data Governance program that includes all functions. Data Catalog, Data Lineage and Business Glossary provide a complete and integrated view of your data. Data Classification gives data a semantic meaning, while the Data Quality, Issue Management and Data Stewardship modules increase the reliability and trust of data. Specific features also facilitate privacy compliance: a registry of processing activities, central management of privacy notes, and a consent registry with Blockchain integration. Blindata Agent is able to connect to multiple data sources and collect metadata such as data structures (tables, views, fields, ...), data quality metrics, and reverse lineage. Blindata's modular architecture is entirely API-based, allowing for systematic integration with business systems of the highest importance (DBMS, Active Directory, e-commerce and Data Platforms). Blindata can be purchased as SaaS, installed on-premise, or purchased from AWS Marketplace. -
32
EPMware
EPMware
Master Data Management and Data Governance. Plug and Play adapters for Oracle Hyperion, OneStream, Anaplan, and more. The leader in performance management master data, on-premise or in the cloud. Designed to include business users in MDM/Data Governance. With built-in application intelligence, managing hierarchies in EPMware and data governance becomes a seamless process. This creates dimensional consistency across all subscribing apps. Our one-click integration allows hierarchies to be visualized and modeled in a request. This allows for real-time data governance, which ensures that metadata updates are audited and error-proof. EPMware's workflow capabilities allow metadata to be reviewed, approved, and then deployed both on-premise and in the cloud. There are no files to load or extract, and no manual intervention. Just a seamless, audited metadata integration right out of the box. Integration and validation focus: EPMware provides native and pre-built integration support for the most popular EPM and CPM technologies. -
33
Mozart Data
Mozart Data
Mozart Data is the all-in-one modern data platform for consolidating, organizing, and analyzing your data. Set up a modern data stack in an hour, without any engineering. Start getting more out of your data and making data-driven decisions today. -
34
OpenText Magellan
OpenText
Machine Learning and Predictive Analytics Platform. Advanced artificial intelligence is a pre-built platform for machine learning and big-data analytics that can enhance data-driven decision making. OpenText Magellan makes predictive analytics easy to use and provides flexible data visualizations that maximize business intelligence. Artificial intelligence software reduces the need to manually process large amounts of data. It presents valuable business insights in a manner that is easily accessible and relevant to the organization's most important objectives. Organizations can enhance business processes by using a curated combination of capabilities such as predictive modeling, data discovery tools and data mining techniques. IoT data analytics is another way to use data to improve decision-making based on real business intelligence. -
35
Narrative
Narrative
$0
With your own data shop, create new revenue streams from the data you already have. Narrative focuses on the fundamental principles that make buying or selling data simpler, safer, and more strategic. You must ensure that the data you have access to meets your standards. It is important to know who collected the data and how. Access new supply and demand easily for a more agile, accessible data strategy. You can control your entire data strategy with full end-to-end access to all inputs and outputs. Our platform automates the most labor-intensive and time-consuming aspects of data acquisition so that you can access new data sources in days instead of months. You'll only ever have to pay for what you need with filters, budget controls and automatic deduplication. -
36
OvalEdge, a cost-effective data catalogue, is designed to provide end-to-end data governance and privacy compliance. It also provides fast, reliable analytics. OvalEdge crawls the databases, BI platforms and data lakes of your organization to create an easy-to-use, smart inventory. Analysts can quickly discover data and provide powerful insights using OvalEdge. OvalEdge's extensive functionality allows users to improve data access, data literacy and data quality.
-
37
Google Cloud Data Catalog
Google
$100 per GiB per month
Fully managed and highly scalable metadata and data discovery service. New customers receive $300 in Google Cloud credits for free during the Free Trial. All customers receive up to 1 MiB of business or ingested metadata storage and 1,000,000 API calls free of charge. A simple but powerful faceted search interface allows you to pinpoint your data. Automatically sync technical metadata and create schematized tags to support business metadata. Cloud Data Loss Prevention integration allows you to automatically tag sensitive data. Access your data immediately and scale without the need to manage or set up infrastructure. With a powerful UI built on the same search technology as Gmail, or through API access, empower any member of the team to find and tag data. Data Catalog is fully managed so that you can easily start and scale. Cloud IAM and Cloud DLP integrations allow you to enforce data security policies and ensure compliance. -
38
Secuvy AI
Secuvy
Secuvy, a next-generation cloud platform, automates data security, privacy compliance, and governance via AI-driven workflows. Unstructured data is treated with the best data intelligence. Automated data discovery, customizable subject access requests, user validations, and data maps and workflows help you comply with privacy regulations such as the CCPA or GDPR. Data intelligence is used to locate sensitive and private information in multiple data stores, both in motion and at rest. Our mission is to assist organizations in protecting their brand, automating processes, and improving customer trust in a world that is rapidly changing. We want to reduce human effort, costs and errors in handling sensitive data. -
39
Anzo
Cambridge Semantics
Anzo is a modern data integration and discovery platform that allows anyone to find, connect, and blend enterprise data into analytics-ready datasets. Anzo's unique use of semantics and graph models makes it possible for anyone in your company, from data scientists to novice business users, to drive data discovery and integration and create their own analytics-ready datasets. Anzo's graph models give business users a visual map of enterprise data that is easy for them to understand and navigate, regardless of how complex, siloed, or large their data may be. Semantics adds business content to data. It allows users to harmonize data using shared definitions and create blended, business-ready data on demand. -
40
DvSum, an AI-powered Data Intelligence platform, makes it remarkably easy for data and analytics teams to discover, monitor, and govern data. DvSum uses powerful AI-enabled algorithms to automatically catalog, classify, and curate your data and make it available as a Data Catalog. DvSum Data Intelligence will help you propel your enterprise towards its digital- and analytics-enabled transformation goals.
-
41
DataHawk
We-Bridge
Visualize data lineage by automatically extracting data flow from data source to target. DataHawk is data lineage management software that automatically collects and analyzes mission-critical data. It also visualizes data flow and derivation rules from data source to target. Data lineage refers to the flow of data between the source and the target. Tracking data lineage is about understanding the flow and derivation rules of data as it is processed, transformed, and used. Multi-tier column-level data lineage is shown as a graph and a list, from source to destination. Drill down into data lineage at the business system, table, and column levels. Parsers are provided to support analysis of Big Data technologies and various environments. Our patented technology allows for path-sensitive dynamic string analysis and data flow analysis within programs. -
42
Acryl Data
Acryl Data
No more data catalog ghost cities. Acryl Cloud accelerates time-to-value through Shift Left practices for data producers and an intuitive user interface for data consumers. Continuously detect data-quality incidents in real time, automate anomaly detection to prevent breakdowns, and drive quick resolution when they occur. Acryl Cloud supports both pull-based and push-based metadata ingestion to ensure information is reliable, current, and definitive. Data should be operational. Automated Metadata Tests go beyond simple visibility and can be used to uncover new insights and areas for improvement. Reduce confusion and speed up resolution with clear asset ownership and automatic detection. Streamlined alerts and time-based traceability are also available. -
43
Metaplane
Metaplane
$825 per month
In 30 minutes, you can monitor your entire warehouse. Automated warehouse-to-BI lineage can identify downstream impacts. Trust can be lost in seconds and regained in months. With modern data-era observability, you can have peace of mind. It can be difficult to get the coverage you need with code-based tests. They take hours to create and maintain. Metaplane allows you to add hundreds of tests in minutes. We support foundational tests (e.g. row counts, freshness, and schema drift), more complicated tests (distribution shifts, nullness shifts, enum changes), custom SQL, and everything in between. Manual thresholds can take a while to set and quickly become outdated as your data changes. Our anomaly detection algorithms use historical metadata to detect outliers. To minimize alert fatigue, monitor what is important, while also taking into account seasonality, trends, and feedback from your team. You can also override manual thresholds. -
44
Zaloni Arena
Zaloni
End-to-end DataOps built upon an agile platform that protects and improves your data assets. Arena is the leading augmented data management platform. Our active data catalog allows for self-service data enrichment to control complex data environments. You can create custom workflows to increase the reliability and accuracy of each data set. Machine learning can be used to identify and align master assets for better data decisions. Superior security is assured with complete lineage, including detailed visualizations and masking. Data management is easy with Arena. Arena can catalog your data from any location. Our extensible connections allow for analytics across all your preferred tools. Overcome data sprawl challenges with our software, which is designed to drive business and analytics success while also providing the controls and extensibility required in today's multicloud data complexity. -
45
IBM Watson Knowledge Catalog
IBM
$300 per instance
Intelligent cataloging enables you to activate business-ready data for AI/analytics. It is supported by active metadata management and policy management. IBM Watson® Knowledge Catalog is a data catalogue tool that powers intelligent, self-service discovery of data, models, and more. The cloud-based enterprise metadata repository activates data for AI, machine learning (ML), and deep learning. Access, categorize, and share knowledge assets, data, and their relationships wherever they are located. To provide the right context and drive value across requirements such as regulatory compliance and data monetization, organize, define, and manage enterprise data. Active policy management and dynamic masking allow clients to trust you. This will help protect data and manage compliance and audit readiness. With intuitive dashboards and flows, you can consume and transform data at the speed and convenience of business. These flows can be shared with colleagues or analytics tools. -
46
What if your data had a recommendation engine? Automated data inventory was created. A searchable catalog showed user behavior. Smart recommendations were made inline by the system as you typed queries. Alation, the first enterprise-wide collaborative data catalog, makes all this possible. It's a powerful tool that dramatically increases the productivity of analysts and the accuracy of analytics. It also empowers business decision-making for everyone. Alation provides proactive recommendations to data users through applications. Google inspired us to create a simple interface that connects the language of your business with the technical schema of your data. It is no longer difficult to find the data you need because of complicated semantic translations. Are you unfamiliar with the data environment and unsure which data to use in your query? Alation allows you to build your query and provides inline recommendations that indicate whether data is trustworthy.
-
47
Validio
Validio
Get a clear view of your data assets: popularity, usage, quality, and schema coverage. Find and filter data based on tags and descriptions in metadata. Drive data governance and ownership throughout your organization. Stream-lake-warehouse lineage facilitates data ownership and collaboration. Lineage maps are automatically generated at the field level to help you understand the entire data ecosystem. Anomaly detection is based on your data and seasonality patterns. It uses automatic backfilling from historical data. Machine learning thresholds are trained for each data segment and not just metadata. -
48
Microsoft Purview
Microsoft
$0.342
Microsoft Purview is a unified data governance service that helps you manage and govern your on-premises, multicloud, and software-as-a-service (SaaS) data. You can easily create a comprehensive, up-to-date map of your data landscape using automated data discovery, sensitive data classification, and end-to-end data lineage. Data consumers can find trustworthy, valuable data. Automated data discovery, lineage identification, and data classification work across on-premises, multicloud, and SaaS sources. For more effective governance, get a unified map of all your data assets and their relationships. Semantic search allows data discovery using technical or business terms. Get insight into the movement and location of sensitive data in your hybrid data landscape. Purview Data Map will help you establish the foundation for data usage and governance. Automate and manage metadata from mixed sources. Use built-in and customized classifiers to classify data and Microsoft Information Protection sensitivity labels to protect it. -
49
Oracle Enterprise Metadata Management (OEMM) is a comprehensive platform for managing metadata. OEMM can extract and catalog metadata from any metadata provider, including relational and Hadoop, ETL and BI, as well as data modeling and many others. OEMM is more than a metadata repository. It allows interactive searching and browsing of metadata, as well as providing data lineage and impact analysis, semantic definition, and semantic usage analysis for all metadata assets within the catalog. The advanced algorithms of OEMM combine metadata from all providers to provide the complete data path from source to report or vice versa. OEMM supports almost any metadata provider, including data modeling tools, databases and CASE tools, Hadoop engines, ETL engines, warehouses, BI, EAI environments, and many others.
-
50
FortressIQ
Automation Anywhere
FortressIQ is the industry's most advanced process-intelligence platform. It allows enterprises to decode work and transform experiences. FortressIQ combines innovative computer vision with artificial intelligence to provide unprecedented process insights. It is extremely fast and delivers detail and accuracy that are unattainable using traditional methods. The platform automatically acquires process data across multiple systems. This empowers enterprises to understand, monitor and improve their operations, employee and customer experience, and every business process. FortressIQ was established in 2017 and is supported by Lightspeed Venture Partners and Boldstart Ventures as well as Comcast Ventures and Eniac Ventures. Continuously and automatically identify inefficiencies and process variations to determine optimal process paths and reduce time to automate.