What Integrates with OpenMetadata?
Find out what OpenMetadata integrations exist in 2026. Learn what software and services currently integrate with OpenMetadata, and sort them by reviews, cost, features, and more. Below is a list of products that OpenMetadata currently integrates with:
-
1
Oracle Cloud Infrastructure
Oracle
Oracle Cloud Infrastructure not only accommodates traditional workloads but also provides advanced cloud development tools for modern needs. It is designed with the capability to identify and counteract contemporary threats, empowering innovation at a faster pace. By merging affordability with exceptional performance, it effectively reduces total cost of ownership. As a Generation 2 enterprise cloud, Oracle Cloud boasts impressive compute and networking capabilities while offering an extensive range of infrastructure and platform cloud services. Specifically engineered to fulfill the requirements of mission-critical applications, Oracle Cloud seamlessly supports all legacy workloads, allowing businesses to transition from their past while crafting their future. Notably, our Generation 2 Cloud is uniquely equipped to operate Oracle Autonomous Database, recognized as the industry's first and only self-driving database. Furthermore, Oracle Cloud encompasses a wide-ranging portfolio of cloud computing solutions, spanning application development, business analytics, data management, integration, security, artificial intelligence, and blockchain technology, ensuring that businesses have all the tools they need to thrive in a digital landscape. This comprehensive approach positions Oracle Cloud as a leader in the evolving cloud marketplace. -
2
PostgreSQL
PostgreSQL Global Development Group
PostgreSQL stands out as a highly capable, open-source object-relational database system that has been actively developed for more than three decades, earning a solid reputation for its reliability, extensive features, and impressive performance. Comprehensive resources for installation and usage are readily available in the official documentation, which serves as an invaluable guide for both new and experienced users. Additionally, the open-source community fosters numerous forums and platforms where individuals can learn about PostgreSQL, understand its functionalities, and explore job opportunities related to it. Engaging with this community can enhance your knowledge and connection to the PostgreSQL ecosystem. In November 2022, the PostgreSQL Global Development Group announced updates for all supported versions, including 15.1, 14.6, 13.9, 12.13, 11.18, and 10.23, which addressed 25 bugs reported over the preceding months. Notably, 10.23 was the final release for PostgreSQL 10, which no longer receives security patches or bug fixes. Therefore, if you are still running PostgreSQL 10 in production, you should plan to upgrade to a supported version. Upgrading will not only help maintain the integrity of your data but also allow you to take advantage of the latest features and improvements introduced in newer releases. -
3
Amazon Kinesis
Amazon
Effortlessly gather, manage, and scrutinize video and data streams as they occur. Amazon Kinesis simplifies the process of collecting, processing, and analyzing streaming data in real-time, empowering you to gain insights promptly and respond swiftly to emerging information. It provides essential features that allow for cost-effective processing of streaming data at any scale while offering the adaptability to select the tools that best align with your application's needs. With Amazon Kinesis, you can capture real-time data like video, audio, application logs, website clickstreams, and IoT telemetry, facilitating machine learning, analytics, and various other applications. This service allows you to handle and analyze incoming data instantaneously, eliminating the need to wait for all data to be collected before starting the processing. Moreover, Amazon Kinesis allows for the ingestion, buffering, and real-time processing of streaming data, enabling you to extract insights in a matter of seconds or minutes, significantly reducing the time it takes compared to traditional methods. Overall, this capability revolutionizes how businesses can respond to data-driven opportunities as they arise. -
4
AWS Storage Gateway
Amazon
AWS Storage Gateway is a hybrid cloud storage solution that allows on-premises users to tap into virtually limitless cloud storage options. It is utilized by clients to streamline storage management while also cutting costs across various hybrid cloud scenarios. Such scenarios encompass transferring tape backups to the cloud, minimizing local storage by leveraging cloud-based file shares, and offering quick access to AWS data for on-site applications, in addition to serving numerous migration, archiving, processing, and disaster recovery needs. To facilitate these functions, the service offers three distinct gateway types: Tape Gateway, File Gateway, and Volume Gateway, which all provide a smooth connection between local applications and cloud storage while caching data locally to ensure rapid access. Applications interact with the service via either a virtual machine or a dedicated hardware gateway appliance, utilizing standard storage protocols like NFS, SMB, and iSCSI. This versatility enables businesses to adapt their storage solutions to meet varying needs and optimize performance. -
5
Presto
Presto
Introducing our innovative Contactless Dining Solution, which comes with a $0 monthly fee. As the leading provider of contactless dining technology globally, we have over 100 million active users each month and have successfully shipped more than 300,000 systems. Our solution allows restaurants to deliver a seamless, end-to-end contactless dining experience, enabling guests to browse the full menu, place orders, and pay at their table—all without any physical interaction. By signing up today, you can transition to a completely contactless service in just three days, and enjoy the benefits of no recurring fees (though standard payment processing charges do apply), without needing to modify your current POS system. While our solution is available worldwide, supplies are limited due to high demand, so it's essential to reserve your spot promptly. Join the growing number of over 100 million monthly users already benefiting from Presto, as we continue to dominate the contactless dining market in both the U.S. and Europe. Experience the future of dining and enhance your restaurant's service by embracing this technology today! -
6
Delta Lake
Delta Lake
Delta Lake serves as an open-source storage layer that integrates ACID transactions into Apache Spark™ and big data operations. In typical data lakes, multiple pipelines operate simultaneously to read and write data, which often forces data engineers to engage in a complex and time-consuming effort to maintain data integrity because transactional capabilities are absent. By incorporating ACID transactions, Delta Lake enhances data lakes and ensures a high level of consistency with its serializability feature, the most robust isolation level available. For further insights, refer to Diving into Delta Lake: Unpacking the Transaction Log. In the realm of big data, even metadata can reach substantial sizes, and Delta Lake manages metadata with the same significance as the actual data, utilizing Spark's distributed processing strengths for efficient handling. Consequently, Delta Lake is capable of managing massive tables that can scale to petabytes, containing billions of partitions and files without difficulty. Additionally, Delta Lake offers data snapshots, which allow developers to retrieve and revert to previous data versions, facilitating audits, rollbacks, or the replication of experiments while ensuring data reliability and consistency across the board. -
7
MLflow
MLflow
MLflow is an open-source suite designed to oversee the machine learning lifecycle, encompassing aspects such as experimentation, reproducibility, deployment, and a centralized model registry. The platform features four main components that facilitate various tasks: tracking and querying experiments encompassing code, data, configurations, and outcomes; packaging data science code to ensure reproducibility across multiple platforms; deploying machine learning models across various serving environments; and storing, annotating, discovering, and managing models in a unified repository. Among these, the MLflow Tracking component provides both an API and a user interface for logging essential aspects like parameters, code versions, metrics, and output files generated during the execution of machine learning tasks, enabling later visualization of results. It allows for logging and querying experiments through the Python, REST, R, and Java APIs. Furthermore, an MLflow Project is a structured, convention-based format for organizing data science code so that it can be reused and reproduced easily. Additionally, the Projects component comes equipped with an API and command-line tools specifically designed for executing these projects effectively. Overall, MLflow streamlines the management of machine learning workflows, making it easier for teams to collaborate and iterate on their models. -
8
Glue
Glue
Even if your team is located across different continents, Glue’s engaging virtual environments allow you to interact as if you were physically together. Each user is represented by a 3D avatar that captures their movements and gestures, facilitating non-verbal communication alongside spoken words. The implementation of spatial audio enables you to perceive the proximity of others, allowing you to distinguish whether someone is close by or farther away, as well as their direction. Within this shared virtual environment, avatars accurately reflect the users’ head and hand movements, which enhances the non-verbal interaction that traditional video calls often miss. This immersive experience transforms collaboration, making conversations in Glue feel as authentic as they would in a real-world setting. The unique technology provides 3D directional sound, ensuring that audio intensity decreases with distance, further enriching the sense of presence among participants. Ultimately, Glue redefines remote teamwork by creating a collaborative atmosphere that closely resembles face-to-face interactions. -
9
Amundsen
Amundsen
Uncover and rely on data for your analyses and models while enhancing productivity by dismantling silos. Gain instant insights into data usage by others and locate data within your organization effortlessly through a straightforward text search. Utilizing a PageRank-inspired algorithm, the system suggests results based on names, descriptions, tags, and user activity associated with tables or dashboards. Foster confidence in your data with automated and curated metadata that includes detailed information on tables and columns, highlights frequent users, indicates the last update, provides statistics, and offers data previews when authorized. Streamline the process by linking the ETL jobs and the code that generated the data, making it easier to manage table and column descriptions while minimizing confusion about which tables to utilize and their contents. Additionally, observe which data sets are commonly accessed, owned, or marked by your colleagues, and discover the most frequent queries for any table by reviewing the dashboards that leverage that specific data. This comprehensive approach not only enhances collaboration but also drives informed decision-making across teams. -
10
Apache Superset
Apache
Superset is a speedy, efficient, and user-friendly tool that offers a wide array of features enabling users of varying expertise to analyze and visualize their data, ranging from basic line graphs to intricate geospatial visualizations. It has the capability to link with any SQL-based data source via SQLAlchemy, accommodating contemporary cloud-native databases and systems that handle vast amounts of data, even at petabyte levels. Not only is Superset lightweight, but it also boasts impressive scalability, making the most of your current data infrastructure without the need for an additional ingestion layer. This flexibility ensures that users can seamlessly integrate Superset into their existing data workflows. -
11
Apache NiFi
Apache Software Foundation
Apache NiFi is a user-friendly, robust, and dependable system for data processing and distribution, which facilitates the creation of efficient and scalable directed graphs for routing, transforming, and mediating data. Among its various high-level functions and goals, Apache NiFi provides a web-based user interface that ensures an uninterrupted experience for design, control, feedback, and monitoring. It is designed to be highly configurable, loss-tolerant, and capable of low latency and high throughput, while also allowing for dynamic prioritization of data flows. Additionally, users can alter the flow in real-time, manage back pressure, and trace data provenance from start to finish. It is built with extensibility in mind: you can develop custom processors and more, which fosters rapid development and thorough testing. Security features are robust, including SSL, SSH, HTTPS, and content encryption, among others. The system supports multi-tenant authorization along with internal policy and authorization management. Also, NiFi consists of various web applications, such as a web UI, web API, documentation, and custom user interfaces, so your mapping must be configured to the root path. This flexibility and range of features make Apache NiFi an essential tool for modern data workflows. -
12
OpenSearch
OpenSearch
OpenSearch is an open-source search and analytics suite that is community-driven and based on the Apache 2.0 licensed versions of Elasticsearch 7.10.2 and Kibana 7.10.2. It includes the OpenSearch search engine daemon and the OpenSearch Dashboards for visualization and user interaction. This platform allows users to easily ingest, secure, search, aggregate, visualize, and analyze their data. It is particularly well-suited for various applications, including application search and log analytics. Users gain the advantage of an open-source solution that they can customize, enhance, monetize, and resell according to their needs. Furthermore, OpenSearch is committed to delivering a secure and high-quality search and analytics environment, continuously evolving with a promising roadmap of innovative features and enhancements to meet users' needs effectively. -
13
Apache Pinot
Apache Software Foundation
Pinot is built to efficiently handle OLAP queries on static data with minimal latency. It incorporates various pluggable indexing methods, including Sorted Index, Bitmap Index, and Inverted Index. While it currently lacks support for joins, this limitation can be mitigated by utilizing Trino or PrestoDB for querying purposes. The system offers an SQL-like language that enables selection, aggregation, filtering, grouping, ordering, and distinct queries on datasets. It comprises both offline and real-time tables, with real-time tables being utilized to address segments lacking offline data. Additionally, users can tailor the anomaly detection process and notification mechanisms to accurately identify anomalies. This flexibility ensures that users can maintain data integrity and respond proactively to potential issues. -
14
SQLAlchemy
SQLAlchemy
SQLAlchemy serves as a Python toolkit for SQL and an object-relational mapper, allowing developers to harness the complete capabilities of SQL with great flexibility. As the size and performance of SQL databases become critical, they tend to deviate from functioning merely as object collections; similarly, when abstraction is prioritized, object collections lose their resemblance to traditional tables and rows. SQLAlchemy seeks to bridge these opposing principles effectively. It views the database as a relational algebra engine rather than simply a set of tables, enabling selection of rows not only from tables but also from joins and various select statements, which can be integrated into more complex structures. The expression language of SQLAlchemy is built upon this foundational idea, enhancing its functionality. Additionally, SQLAlchemy is widely recognized for its object-relational mapper (ORM) feature, which is an optional element that implements the data mapper pattern, providing a robust framework for developers to work with databases seamlessly. This dual functionality of SQLAlchemy makes it a versatile tool for both simple and intricate database interactions. -
15
LDAP
LDAP
LDAP serves as a vital hub for information pertaining to the Lightweight Directory Access Protocol (LDAP), a robust and adaptable standards-compliant framework that facilitates interactions with directory servers. Frequently employed for user authentication and the management of details regarding users, groups, and applications, an LDAP directory server acts as a versatile data repository suited for a wide range of applications. It provides essential insights into directory services and the intricacies of the LDAP protocol. Additionally, it aids users in identifying the right directory server, client API, or LDAP-oriented tools that are tailored to various environments. The platform also includes links to a variety of standards documents and reference materials for those seeking a deeper understanding of the protocol. Moreover, it showcases articles related to directory services, featuring updates on software releases, newly established standards documents, specifications, and practical guides to enhance user knowledge and experience with LDAP. -
16
MariaDB
MariaDB
MariaDB Platform is an enterprise-level open-source database solution. It supports transactional, analytical, and hybrid workloads, as well as relational and JSON data models. It can scale from standalone databases to data warehouses to fully distributed SQL, executing millions of transactions per second and performing interactive, ad-hoc analytics on billions of rows. MariaDB can be deployed on-premises on commodity hardware. It is also available on all major public cloud providers and through MariaDB SkySQL, a fully managed cloud database. More information is available at MariaDB.com. -
17
Flink
Flink
Regardless of your grocery needs, we bring fresh, organic fruits and vegetables straight to your doorstep. Enjoy delivery in just 10 minutes at prices comparable to supermarkets. Flink serves as your portable supermarket, offering fresh options daily at those same supermarket rates. We currently operate in all major German cities and have recently expanded to certain areas in the Netherlands and France! You can check our specific delivery areas using the app, with more cities on the way soon. Occasionally, our rapid packing may lead to an incorrect item being included in your order. If that happens, simply reach out through the Support feature in the app, and we’ll resolve it promptly. Our delivery service runs from Monday to Saturday between 8am and 11pm, providing everything from fresh produce to your favorite sweets. You can conveniently pay for your groceries within the app, selecting from a variety of online payment options. Our delivery hubs are strategically located in densely populated urban centers, and we utilize electric bikes for an eco-friendly approach to delivery, ensuring that your groceries arrive quickly and sustainably. Plus, with our commitment to expanding our service area, you can expect even more convenient grocery delivery options in the near future. -
18
Apache Airflow
The Apache Software Foundation
Airflow is a community-driven platform designed for the programmatic creation, scheduling, and monitoring of workflows. With its modular architecture, Airflow employs a message queue to manage an unlimited number of workers, making it highly scalable. The system is capable of handling complex operations through its ability to define pipelines using Python, facilitating dynamic pipeline generation. This flexibility enables developers to write code that can create pipelines on the fly. Users can easily create custom operators and expand existing libraries, tailoring the abstraction level to meet their specific needs. The pipelines in Airflow are both concise and clear, with built-in parametrization supported by the robust Jinja templating engine. Eliminate the need for complex command-line operations or obscure XML configurations! Instead, leverage standard Python functionalities to construct workflows, incorporating date-time formats for scheduling and utilizing loops for the dynamic generation of tasks. This approach ensures that you retain complete freedom and adaptability when designing your workflows, allowing you to efficiently respond to changing requirements. Additionally, Airflow's user-friendly interface empowers teams to collaboratively refine and optimize their workflow processes.