Atlan
Introducing the contemporary data workspace, where all your data assets, ranging from data tables to BI reports, are made effortlessly discoverable. Our advanced search algorithms, coupled with a user-friendly browsing interface, ensure that locating the right asset is a simple task. Atlan simplifies the identification of poor-quality data by automatically generating data quality profiles, allowing users to easily spot issues. With features such as automatic variable type detection, frequency distribution analysis, missing value identification, and outlier detection, Atlan covers all aspects of data quality management. This platform alleviates the challenges associated with governing and managing your data ecosystem effectively. Atlan’s intelligent bots analyze SQL query histories to automatically build data lineage and identify PII data, enabling the creation of dynamic access policies and top-tier governance. Additionally, even those without a technical background can effortlessly query across various data lakes, warehouses, and databases using our intuitive, Excel-like query builder. Moreover, seamless integrations with tools like Tableau and Jupyter enhance collaboration around data, transforming the way teams work together. This holistic approach not only empowers users but also fosters a more data-driven culture within organizations.
Learn more
DataHub
DataHub is a free and open-source metadata platform that streamlines data discovery, observability and governance across diverse data ecologies. It allows organizations to discover trustworthy data with experiences tailored to each user and eliminates breaking updates with detailed cross-platform, column-level lineage. DataHub gives you a complete view of your data, including its business, operational and technical context. The platform provides automated data quality checks, AI-driven anomaly identification and alerts teams when problems arise. It also centralizes incident tracking. DataHub's detailed ownership, documentation, and lineage information allows for quick issue resolution. It automates governance by classifying assets in real-time, reducing manual work with GenAI documentation, AI classification, and smart propagation. DataHub’s extensible architecture supports more than 70 native integrations.
Learn more
Castor
Castor serves as a comprehensive data catalog aimed at facilitating widespread use throughout an entire organization. It provides a holistic view of your data ecosystem, allowing you to swiftly search for information using its robust search capabilities. Transitioning to a new data framework and accessing necessary data becomes effortless. This approach transcends conventional data catalogs by integrating various data sources, thereby ensuring a unified truth. With an engaging and automated documentation process, Castor simplifies the task of establishing trust in your data. Within minutes, users can visualize column-level, cross-system data lineage. Gain an overarching perspective of your data pipelines to enhance confidence in your data integrity. This tool enables users to address data challenges, conduct impact assessments, and ensure GDPR compliance all in one platform. Additionally, it helps in optimizing performance, costs, compliance, and security associated with your data management. By utilizing our automated infrastructure monitoring system, you can ensure the ongoing health of your data stack while streamlining data governance practices.
Learn more
IRI Voracity
IRI Voracity is an end-to-end software platform for fast, affordable, and ergonomic data lifecycle management. Voracity speeds, consolidates, and often combines the key activities of data discovery, integration, migration, governance, and analytics in a single pane of glass, built on Eclipse™.
Through its revolutionary convergence of capability and its wide range of job design and runtime options, Voracity bends the multi-tool cost, difficulty, and risk curves away from megavendor ETL packages, disjointed Apache projects, and specialized software. Voracity uniquely delivers the ability to perform data:
* profiling and classification
* searching and risk-scoring
* integration and federation
* migration and replication
* cleansing and enrichment
* validation and unification
* masking and encryption
* reporting and wrangling
* subsetting and testing
Voracity runs on-premise, or in the cloud, on physical or virtual machines, and its runtimes can also be containerized or called from real-time applications or batch jobs.
Learn more