Best Data Management Software for Cloudera

Find and compare the best Data Management software for Cloudera in 2025

Use the comparison tool below to compare the top Data Management software for Cloudera on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    DataBuck Reviews
    See Software
    Learn More
    Big Data Quality must always be verified to ensure that data is safe, accurate, and complete. Data is moved through multiple IT platforms or stored in Data Lakes. The Big Data Challenge: Data often loses its trustworthiness because of (i) Undiscovered errors in incoming data (iii). Multiple data sources that get out-of-synchrony over time (iii). Structural changes to data in downstream processes not expected downstream and (iv) multiple IT platforms (Hadoop DW, Cloud). Unexpected errors can occur when data moves between systems, such as from a Data Warehouse to a Hadoop environment, NoSQL database, or the Cloud. Data can change unexpectedly due to poor processes, ad-hoc data policies, poor data storage and control, and lack of control over certain data sources (e.g., external providers). DataBuck is an autonomous, self-learning, Big Data Quality validation tool and Data Matching tool.
  • 2
    QuerySurge Reviews
    Top Pick
    QuerySurge is the smart Data Testing solution that automates the data validation and ETL testing of Big Data, Data Warehouses, Business Intelligence Reports and Enterprise Applications with full DevOps functionality for continuous testing. Use Cases - Data Warehouse & ETL Testing - Big Data (Hadoop & NoSQL) Testing - DevOps for Data / Continuous Testing - Data Migration Testing - BI Report Testing - Enterprise Application/ERP Testing Features Supported Technologies - 200+ data stores are supported QuerySurge Projects - multi-project support Data Analytics Dashboard - provides insight into your data Query Wizard - no programming required Design Library - take total control of your custom test desig BI Tester - automated business report testing Scheduling - run now, periodically or at a set time Run Dashboard - analyze test runs in real-time Reports - 100s of reports API - full RESTful API DevOps for Data - integrates into your CI/CD pipeline Test Management Integration QuerySurge will help you: - Continuously detect data issues in the delivery pipeline - Dramatically increase data validation coverage - Leverage analytics to optimize your critical data - Improve your data quality at speed
  • 3
  • 4
    OvalEdge Reviews

    OvalEdge

    OvalEdge

    $1,300/month
    1 Rating
    OvalEdge, a cost-effective data catalogue, is designed to provide end-to-end data governance and privacy compliance. It also provides fast, reliable analytics. OvalEdge crawls the databases, BI platforms and data lakes of your organization to create an easy-to use, smart inventory. Analysts can quickly discover data and provide powerful insights using OvalEdge. OvalEdge's extensive functionality allows users to improve data access, data literacy and data quality.
  • 5
    Alation Reviews
    What if your data had a recommendation engine? Automated data inventory was created. A searchable catalog showed user behavior. Smart recommendations were made inline by the system as you typed queries. Alation, the first enterprise-wide collaborative data catalog, makes all this possible. It's a powerful tool that dramatically increases the productivity of analysts and the accuracy of analytics. It also empowers business decision-making for everyone. Alation provides proactive recommendations to data users through applications. Google inspired us to create a simple interface that connects the language of your business with the technical schema of your data. No more is it difficult to find the data you need due to complicated semantic translations. Are you unfamiliar with the data environment and unsure which data to use in your query. Alation allows you to build your query and provides inline recommendations that indicate whether data is trustworthy.
  • 6
    Google Cloud Platform Reviews
    Top Pick

    Google Cloud Platform

    Google

    Free ($300 in free credits)
    25 Ratings
    Google Cloud is an online service that lets you create everything from simple websites to complex apps for businesses of any size. Customers who are new to the system will receive $300 in credits for testing, deploying, and running workloads. Customers can use up to 25+ products free of charge. Use Google's core data analytics and machine learning. All enterprises can use it. It is secure and fully featured. Use big data to build better products and find answers faster. You can grow from prototypes to production and even to planet-scale without worrying about reliability, capacity or performance. Virtual machines with proven performance/price advantages, to a fully-managed app development platform. High performance, scalable, resilient object storage and databases. Google's private fibre network offers the latest software-defined networking solutions. Fully managed data warehousing and data exploration, Hadoop/Spark and messaging.
  • 7
    IRI DarkShield Reviews

    IRI DarkShield

    IRI, The CoSort Company

    $5000
    IRI DarkShield uses several search techniques to find, and multiple data masking functions to de-identify, sensitive data in semi- and unstructured data sources enterprise-wide. You can use the search results to provide, remove, or fix PII simultaneously or separately to comply with GDPR data portability and erasure provisions. DarkShield jobs are configured, logged, and run from IRI Workbench or a restful RPC (web services) API to encrypt, redact, blur, etc., the PII it discovers in: * NoSQL & RDBs * PDFs * Parquet * JSON, XML & CSV * Excel & Word * BMP, DICOM, GIF, JPG & TIFF using pattern or dictionary matches, fuzzy search, named entity recognition, path filters, or image area bounding boxes. DarkShield search data can display in its own interactive dashboard, or in SIEM software analytic and visualization platforms like Datadog or Splunk ES. A Splunk Adaptive Response Framework or Phantom Playbook can also act on it. IRI DarkShield is a breakthrough in unstructured data hiding technology, speed, usability and affordability. DarkShield consolidates, multi-threads, the search, extraction and remediation of PII in multiple formats and folders on your network and in the cloud, on Windows, Linux, and macOS.
  • 8
    Style Intelligence Reviews
    Style Intelligence from InetSoft is a complete business intelligence platform that empowers companies with the ability to analyze, monitor, report and collaborate on business and operational data coming from different sources in real-time. Its top features include a data mashup Data Block architecture and professional atomic block modeling tool. There is also a database write-back option. Style Intelligence is robust and easy-to-use. It offers granular security, multitenancy support, multiple integrations, and is fully scalable.
  • 9
    Immuta Reviews
    Immuta's Data Access Platform is built to give data teams secure yet streamlined access to data. Every organization is grappling with complex data policies as rules and regulations around that data are ever-changing and increasing in number. Immuta empowers data teams by automating the discovery and classification of new and existing data to speed time to value; orchestrating the enforcement of data policies through Policy-as-code (PaC), data masking, and Privacy Enhancing Technologies (PETs) so that any technical or business owner can manage and keep it secure; and monitoring/auditing user and policy activity/history and how data is accessed through automation to ensure provable compliance. Immuta integrates with all of the leading cloud data platforms, including Snowflake, Databricks, Starburst, Trino, Amazon Redshift, Google BigQuery, and Azure Synapse. Our platform is able to transparently secure data access without impacting performance. With Immuta, data teams are able to speed up data access by 100x, decrease the number of policies required by 75x, and achieve provable compliance goals.
  • 10
    IRI FieldShield Reviews

    IRI FieldShield

    IRI, The CoSort Company

    IRI FieldShield® is a powerful and affordable data discovery and de-identification package for masking PII, PHI, PAN and other sensitive data in structured and semi-structured sources. Front-ended in a free Eclipse-based design environment, FieldShield jobs classify, profile, scan, and de-identify data at rest (static masking). Use the FieldShield SDK or proxy-based application to secure data in motion (dynamic data masking). The usual method for masking RDB and other flat files (CSV, Excel, LDIF, COBOL, etc.) is to classify it centrally, search for it globally, and automatically mask it in a consistent way using encryption, pseudonymization, redaction or other functions to preserve realism and referential integrity in production or test environments. Use FieldShield to make test data, nullify breaches, or comply with GDPR. HIPAA. PCI, PDPA, PCI-DSS and other laws. Audit through machine- and human-readable search reports, job logs and re-ID risks scores. Optionally mask data when you map it; FieldShield functions can also run in IRI Voracity ETL and federation, migration, replication, subsetting, and analytic jobs. To mask DB clones run FieldShield in Windocks, Actifio or Commvault. Call it from CI/CD pipelines and apps.
  • 11
    iCEDQ Reviews
    iCEDQ, a DataOps platform that allows monitoring and testing, is a DataOps platform. iCEDQ is an agile rules engine that automates ETL Testing, Data Migration Testing and Big Data Testing. It increases productivity and reduces project timelines for testing data warehouses and ETL projects. Identify data problems in your Data Warehouse, Big Data, and Data Migration Projects. The iCEDQ platform can transform your ETL or Data Warehouse Testing landscape. It automates it from end to end, allowing the user to focus on analyzing the issues and fixing them. The first edition of iCEDQ was designed to validate and test any volume of data with our in-memory engine. It can perform complex validation using SQL and Groovy. It is optimized for Data Warehouse Testing. It scales based upon the number of cores on a server and is 5X faster that the standard edition.
  • 12
    Protegrity Reviews
    Our platform allows businesses to use data, including its application in advanced analysis, machine learning and AI, to do great things without worrying that customers, employees or intellectual property are at risk. The Protegrity Data Protection Platform does more than just protect data. It also classifies and discovers data, while protecting it. It is impossible to protect data you don't already know about. Our platform first categorizes data, allowing users the ability to classify the type of data that is most commonly in the public domain. Once those classifications are established, the platform uses machine learning algorithms to find that type of data. The platform uses classification and discovery to find the data that must be protected. The platform protects data behind many operational systems that are essential to business operations. It also provides privacy options such as tokenizing, encryption, and privacy methods.
  • 13
    IBM Cognos Analytics Reviews
    Cognos Analytics with Watson brings BI to a new level with AI capabilities that provide a complete, trustworthy, and complete picture of your company. They can forecast the future, predict outcomes, and explain why they might happen. Built-in AI can be used to speed up and improve the blending of data or find the best tables for your model. AI can help you uncover hidden trends and drivers and provide insights in real-time. You can create powerful visualizations and tell the story of your data. You can also share insights via email or Slack. Combine advanced analytics with data science to unlock new opportunities. Self-service analytics that is governed and secures data from misuse adapts to your needs. You can deploy it wherever you need it - on premises, on the cloud, on IBM Cloud Pak®, for Data or as a hybrid option.
  • 14
    Querona Reviews
    We make BI and Big Data analytics easier and more efficient. Our goal is to empower business users, make BI specialists and always-busy business more independent when solving data-driven business problems. Querona is a solution for those who have ever been frustrated by a lack in data, slow or tedious report generation, or a long queue to their BI specialist. Querona has a built-in Big Data engine that can handle increasing data volumes. Repeatable queries can be stored and calculated in advance. Querona automatically suggests improvements to queries, making optimization easier. Querona empowers data scientists and business analysts by giving them self-service. They can quickly create and prototype data models, add data sources, optimize queries, and dig into raw data. It is possible to use less IT. Users can now access live data regardless of where it is stored. Querona can cache data if databases are too busy to query live.
  • 15
    Ataccama ONE Reviews
    Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed and security while ensuring trust, security and governance of your data.
  • 16
    PHEMI Health DataLab Reviews
    Unlike most data management systems, PHEMI Health DataLab is built with Privacy-by-Design principles, not as an add-on. This means privacy and data governance are built-in from the ground up, providing you with distinct advantages: Lets analysts work with data without breaching privacy guidelines Includes a comprehensive, extensible library of de-identification algorithms to hide, mask, truncate, group, and anonymize data. Creates dataset-specific or system-wide pseudonyms enabling linking and sharing of data without risking data leakage. Collects audit logs concerning not only what changes were made to the PHEMI system, but also data access patterns. Automatically generates human and machine-readable de- identification reports to meet your enterprise governance risk and compliance guidelines. Rather than a policy per data access point, PHEMI gives you the advantage of one central policy for all access patterns, whether Spark, ODBC, REST, export, and more
  • 17
    GigaSpaces Reviews
    Smart DIH is a data management platform that quickly serves applications with accurate, fresh and complete data, delivering high performance, ultra-low latency, and an always-on digital experience. Smart DIH decouples APIs from SoRs, replicating critical data, and making it available using event-driven architecture. Smart DIH enables drastically shorter development cycles of new digital services, and rapidly scales to serve millions of concurrent users – no matter which IT infrastructure or cloud topologies it relies on. XAP Skyline is a distributed in-memory development platform that delivers transactional consistency, combined with extreme event-based processing and microsecond latency. The platform fuels core business solutions that rely on instantaneous data, including online trading, real-time risk management and data processing for AI and large language models.
  • 18
    IRI Voracity Reviews

    IRI Voracity

    IRI, The CoSort Company

    IRI Voracity is an end-to-end software platform for fast, affordable, and ergonomic data lifecycle management. Voracity speeds, consolidates, and often combines the key activities of data discovery, integration, migration, governance, and analytics in a single pane of glass, built on Eclipse™. Through its revolutionary convergence of capability and its wide range of job design and runtime options, Voracity bends the multi-tool cost, difficulty, and risk curves away from megavendor ETL packages, disjointed Apache projects, and specialized software. Voracity uniquely delivers the ability to perform data: * profiling and classification * searching and risk-scoring * integration and federation * migration and replication * cleansing and enrichment * validation and unification * masking and encryption * reporting and wrangling * subsetting and testing Voracity runs on-premise, or in the cloud, on physical or virtual machines, and its runtimes can also be containerized or called from real-time applications or batch jobs.
  • 19
    Qlik Data Integration Reviews
    Qlik Data Integration platform automates the process for providing reliable, accurate and trusted data sets for business analysis. Data engineers are able to quickly add new sources to ensure success at all stages of the data lake pipeline, from real-time data intake, refinement, provisioning and governance. This is a simple and universal solution to continuously ingest enterprise data into popular data lake in real-time. This model-driven approach allows you to quickly design, build, and manage data lakes in the cloud or on-premises. To securely share all your derived data sets, create a smart enterprise-scale database catalog.
  • 20
    Prophecy Reviews

    Prophecy

    Prophecy

    $299 per month
    Prophecy allows you to connect with many more people, including data analysts and visual ETL developers. To create your pipelines, all you have to do is click and type a few SQL expressions. You will be creating high-quality, readable code for Spark or Airflow by using the Low-Code Designer. This code is then committed to your Git. Prophecy provides a gem builder that allows you to quickly create and roll out your own Frameworks. Data Quality, Encryption and new Sources are just a few examples. Prophecy offers best practices and infrastructure as managed service - making your life and operations easier! Prophecy makes it easy to create workflows that are high-performance and scale out using the cloud.
  • 21
    Progress DataDirect Reviews
    Progress DataDirect is passionate about empowering applications with enterprise data. We offer cloud and on-premises connectivity solutions for relational, NoSQL and Big Data data sources. We design solutions for thousands of companies and top vendors in analytics, data management, and BI. Our high-value connectors are designed to reduce development costs for a variety data sources. For greater security and peace of mind, you can get 24/7 support from experts around the world. For faster SQL access, connect with easy-to-use and time-saving drivers. Our mission is to keep up with the changing trends in data connectivity. If we don't have the connector you need, we will help you design it. Integrate connectivity into an application or service.
  • 22
    Indexima Data Hub Reviews

    Indexima Data Hub

    Indexima

    $3,290 per month
    Reframe your perception of time with data analytics. Instantly access the data of your business and work directly in your dashboard, without having to go back and forth with your IT team. Indexima DataHub is a new space where operational and functional users can instantly access their data. Indexima's unique indexing engine, combined with machine learning, allows businesses to quickly and easily access their data. The robust and scalable solution allows businesses to query their data directly from the source in volumes of up to tens billions of rows within milliseconds. With our Indexima platform, users can implement instant analytics for all their data with just one click. Indexima’s new ROI and TCO Calculator will help you determine the ROI of your data platform in just 30 seconds. Infrastructure costs, project deployment times, and data engineering cost, while boosting analytical performances.
  • 23
    Tabular Reviews

    Tabular

    Tabular

    $100 per month
    Tabular is a table store that allows you to create an open table. It was created by the Apache Iceberg creators. Connect multiple computing frameworks and engines. Reduce query time and costs up to 50%. Centralize enforcement of RBAC policies. Connect any query engine, framework, or tool, including Athena BigQuery, Snowflake Databricks Trino Spark Python, Snowflake Redshift, Snowflake Databricks and Redshift. Smart compaction, data clustering and other automated services reduce storage costs by up to 50% and query times. Unify data access in the database or table. RBAC controls are easy to manage, enforce consistently, and audit. Centralize your security at the table. Tabular is easy-to-use and has RBAC, high-powered performance, and high ingestion under the hood. Tabular allows you to choose from multiple "best-of-breed" compute engines, based on their strengths. Assign privileges to the data warehouse database or table level.
  • 24
    Hue Reviews
    Hue provides the best querying experience by combining the most intelligent autocomplete components and query editor. The tables and storage browses use your existing data catalog in a transparent way. Help users find the right data among thousands databases and document it themselves. Help users with their SQL queries, and use rich previews of links. Share directly from the editor in Slack. There are several apps, each specialized in one type of querying. Browsers are the first place to explore data sources. The editor excels at SQL queries. It has an intelligent autocomplete and risk alerts. Self-service troubleshooting is also available. Dashboards are primarily used to visualize indexed data, but they can also query SQL databases. The results of a search for specific cell values are highlighted. Hue has one of the most powerful SQL autocompletes on the planet to make your SQL editing experience as easy as possible.
  • 25
    Data Virtuality Reviews
    Connect and centralize data. Transform your data landscape into a flexible powerhouse. Data Virtuality is a data integration platform that allows for instant data access, data centralization, and data governance. Logical Data Warehouse combines materialization and virtualization to provide the best performance. For high data quality, governance, and speed-to-market, create your single source data truth by adding a virtual layer to your existing data environment. Hosted on-premises or in the cloud. Data Virtuality offers three modules: Pipes Professional, Pipes Professional, or Logical Data Warehouse. You can cut down on development time up to 80% Access any data in seconds and automate data workflows with SQL. Rapid BI Prototyping allows for a significantly faster time to market. Data quality is essential for consistent, accurate, and complete data. Metadata repositories can be used to improve master data management.
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next