Best Data Cleansing Software for Hadoop

Find and compare the best Data Cleansing software for Hadoop in 2025

Use the comparison tool below to compare the top Data Cleansing software for Hadoop on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Composable DataOps Platform Reviews

    Composable DataOps Platform

    Composable Analytics

    $8/hr - pay-as-you-go
    4 Ratings
    Composable is an enterprise-grade DataOps platform designed for business users who want to build data-driven products and create data intelligence solutions. It can be used to design data-driven products that leverage disparate data sources, live streams, and event data, regardless of their format or structure. Composable offers a user-friendly, intuitive dataflow visual editor, built-in services that facilitate data engineering, as well as a composable architecture which allows abstraction and integration of any analytical or software approach. It is the best integrated development environment for discovering, managing, transforming, and analysing enterprise data.
  • 2
    Zuar Runner Reviews
    It shouldn't take long to analyze data from your business solutions. Zuar Runner allows you to automate your ELT/ETL processes, and have data flow from hundreds of sources into one destination. Zuar Runner can manage everything: transport, warehouse, transformation, model, reporting, and monitoring. Our experts will make sure your deployment goes smoothly and quickly.
  • 3
    Ataccama ONE Reviews
    Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed and security while ensuring trust, security and governance of your data.
  • 4
    IRI Voracity Reviews

    IRI Voracity

    IRI, The CoSort Company

    IRI Voracity is an end-to-end software platform for fast, affordable, and ergonomic data lifecycle management. Voracity speeds, consolidates, and often combines the key activities of data discovery, integration, migration, governance, and analytics in a single pane of glass, built on Eclipse™. Through its revolutionary convergence of capability and its wide range of job design and runtime options, Voracity bends the multi-tool cost, difficulty, and risk curves away from megavendor ETL packages, disjointed Apache projects, and specialized software. Voracity uniquely delivers the ability to perform data: * profiling and classification * searching and risk-scoring * integration and federation * migration and replication * cleansing and enrichment * validation and unification * masking and encryption * reporting and wrangling * subsetting and testing Voracity runs on-premise, or in the cloud, on physical or virtual machines, and its runtimes can also be containerized or called from real-time applications or batch jobs.
  • 5
    Talend Data Fabric Reviews
    Talend Data Fabric's cloud services are able to efficiently solve all your integration and integrity problems -- on-premises or in cloud, from any source, at any endpoint. Trusted data delivered at the right time for every user. With an intuitive interface and minimal coding, you can easily and quickly integrate data, files, applications, events, and APIs from any source to any location. Integrate quality into data management to ensure compliance with all regulations. This is possible through a collaborative, pervasive, and cohesive approach towards data governance. High quality, reliable data is essential to make informed decisions. It must be derived from real-time and batch processing, and enhanced with market-leading data enrichment and cleaning tools. Make your data more valuable by making it accessible internally and externally. Building APIs is easy with the extensive self-service capabilities. This will improve customer engagement.
  • 6
    Informatica MDM Reviews
    Our multidomain, market-leading solution supports any master domain, implementation style, or use case. It can be used in the cloud or on premises. Integrates best-in class data integration, data quality management and data privacy. Trusted views of master data that are critical to business operations allow you to tackle complex issues head-on. Automatedly link master, transaction, or interaction data relationships across master domains. Contact data verification, B2B enrichment and B2C enrichment services increase data accuracy. With one click, update multiple master data records, dynamic models, and collaborative workflows. AI-powered match tuning, rule recommendations and optimization can reduce maintenance costs and speed up deployment. Use pre-configured, highly granular charts or dashboards to increase productivity. With trusted, relevant data, you can create high-quality data that will help you improve your business results.
  • 7
    matchit Reviews
    Matchit®, the core of our matching software, is specifically designed to produce results that are human-like. It does this at scale and without any preprocessing. Matchit uses Artificial Intelligence, which is a proprietary phonetic algorithm and lexicons as well as a contextual scoring engine to overcome the inconsistencies and challenges often found in business and contact data. Matching logic is a combination function and off-the shelf fuzzy algorithms that produces an alphanumeric value. Conventional matching solutions require the user to define it. This alphanumeric value (or'match key') is used to compare two records and ultimately find matches. Matchit is not like traditional matching solutions. It doesn't rely solely on one comparison between match keys to find matches. Matchit evaluates records contextually by running a variety comparisons and scoring them individually to determine the similarity between all elements in your data.
  • Previous
  • You're on page 1
  • Next