What Integrates with DataHub?
Find out what DataHub integrations exist in 2025. Learn what software and services currently integrate with DataHub, and sort them by reviews, cost, features, and more. Below is a list of products that DataHub currently integrates with:
-
1
STRM
STRM
FreeDeveloping and overseeing data policies can often be a tedious and frustrating process. However, with PACE by STRM, you gain a powerful tool to ensure the secure usage of data. You can implement data policies through code, no matter where they are situated, eliminating lengthy delays and expensive meetings in the process. Say hello to your new open-source data security engine that transforms the way you manage data governance. Data policies extend beyond merely controlling access; they are essential for deriving meaningful insights from data while maintaining appropriate safeguards. PACE empowers you to collaborate on the rationale and timing, while automating the execution through code. With PACE, you have the ability to programmatically define and enforce data policies across various platforms. This tool can seamlessly integrate into your existing data platform and catalog, taking advantage of the inherent capabilities of your current stack. By automating policy application across critical data platforms and catalogs, PACE simplifies your governance efforts significantly. This innovative solution streamlines the creation and execution of policies, allowing you to centralize oversight while decentralizing implementation. Additionally, it simplifies compliance with auditing requirements by enabling you to demonstrate how controls are executed effectively. Ultimately, PACE revolutionizes data governance, making it more efficient and user-friendly. -
2
MaxCompute
Alibaba Cloud
MaxCompute, formerly referred to as ODPS, is a comprehensive, fully managed platform designed for multi-tenant data processing, catering to large-scale data warehousing needs. This platform offers a variety of data import solutions and supports distributed computing models, empowering users to efficiently analyze vast datasets while minimizing production expenses and safeguarding data integrity. It accommodates exabyte-level data storage and computation, along with support for SQL, MapReduce, and Graph computational frameworks, as well as Message Passing Interface (MPI) iterative algorithms. MaxCompute delivers superior computing and storage capabilities compared to traditional enterprise private clouds, achieving a cost reduction of 20% to 30%. With over seven years of reliable offline analysis services, it also features robust multi-level sandbox protection and monitoring systems. Additionally, MaxCompute utilizes tunnels for data transmission, which are designed to be scalable, facilitating the daily import and export of petabyte-level data. Users can transfer either all data or historical records through multiple tunnels, ensuring flexibility and efficiency in data management. In this way, MaxCompute seamlessly integrates powerful data processing capabilities with cost-effective solutions for businesses. -
3
DataWorks
Alibaba Cloud
DataWorks, a comprehensive Big Data platform introduced by Alibaba Cloud, offers an all-in-one solution for Big Data development, management of data permissions, offline job scheduling, and more. The platform is designed to function seamlessly right from the start, eliminating the need for users to manage complex underlying clusters and operations. Users can effortlessly build workflows through a drag-and-drop interface, while also having the ability to edit and debug their code in real-time, inviting collaboration from fellow developers. The platform supports a wide range of functionalities, including data integration, MaxCompute SQL, MaxCompute MR, machine learning, and shell tasks. Additionally, it features robust task monitoring capabilities, providing alerts in case of errors to prevent service disruptions. With the ability to run millions of tasks simultaneously, DataWorks accommodates various scheduling options, including hourly, daily, weekly, and monthly tasks. As an exceptional platform for constructing big data warehouses, DataWorks delivers extensive data warehousing services, catering to all aspects of data aggregation, processing, governance, and services. Its user-friendly design and powerful features make it an indispensable tool for organizations looking to harness the power of Big Data effectively. -
4
Anomalo
Anomalo
Anomalo helps you get ahead of data issues by automatically detecting them as soon as they appear and before anyone else is impacted. -Depth of Checks: Provides both foundational observability (automated checks for data freshness, volume, schema changes) and deep data quality monitoring (automated checks for data consistency and correctness). -Automation: Use unsupervised machine learning to automatically identify missing and anomalous data. -Easy for everyone, no-code UI: A user can generate a no-code check that calculates a metric, plots it over time, generates a time series model, sends intuitive alerts to tools like Slack, and returns a root cause analysis. -Intelligent Alerting: Incredibly powerful unsupervised machine learning intelligently readjusts time series models and uses automatic secondary checks to weed out false positives. -Time to Resolution: Automatically generates a root cause analysis that saves users time determining why an anomaly is occurring. Our triage feature orchestrates a resolution workflow and can integrate with many remediation steps, like ticketing systems. -In-VPC Development: Data never leaves the customer’s environment. Anomalo can be run entirely in-VPC for the utmost in privacy & security -
5
AWS Marketplace
Amazon
AWS Marketplace serves as a carefully organized digital platform that allows users to explore, buy, implement, and oversee third-party software, data products, and services seamlessly within the AWS environment. This marketplace offers a vast array of options spanning various categories, including security, machine learning, business applications, and DevOps tools. By featuring adaptable pricing structures like pay-as-you-go, annual subscriptions, and free trials, AWS Marketplace makes it easier for customers to manage procurement and billing by consolidating expenses into a single AWS invoice. Additionally, it facilitates quick deployment of pre-configured software that can be easily launched on AWS infrastructure. This efficient model not only empowers businesses to spur innovation and reduce time-to-market but also enhances their ability to control software utilization and costs effectively. Ultimately, AWS Marketplace stands as an essential tool for organizations looking to optimize their software management and procurement processes. -
6
Astro
Astronomer
Astronomer is the driving force behind Apache Airflow, the de facto standard for expressing data flows as code. Airflow is downloaded more than 4 million times each month and is used by hundreds of thousands of teams around the world. For data teams looking to increase the availability of trusted data, Astronomer provides Astro, the modern data orchestration platform, powered by Airflow. Astro enables data engineers, data scientists, and data analysts to build, run, and observe pipelines-as-code. Founded in 2018, Astronomer is a global remote-first company with hubs in Cincinnati, New York, San Francisco, and San Jose. Customers in more than 35 countries trust Astronomer as their partner for data orchestration. -
7
Datafold
Datafold
Eliminate data outages by proactively identifying and resolving data quality problems before they enter production. Achieve full test coverage of your data pipelines in just one day, going from 0 to 100%. With automatic regression testing across billions of rows, understand the impact of each code modification. Streamline change management processes, enhance data literacy, ensure compliance, and minimize the time taken to respond to incidents. Stay ahead of potential data issues by utilizing automated anomaly detection, ensuring you're always informed. Datafold’s flexible machine learning model adjusts to seasonal variations and trends in your data, allowing for the creation of dynamic thresholds. Save significant time spent analyzing data by utilizing the Data Catalog, which simplifies the process of locating relevant datasets and fields while providing easy exploration of distributions through an intuitive user interface. Enjoy features like interactive full-text search, data profiling, and a centralized repository for metadata, all designed to enhance your data management experience. By leveraging these tools, you can transform your data processes and improve overall efficiency.
- Previous
- You're on page 1
- Next