Best Data Preparation Software for GitHub

Find and compare the best Data Preparation software for GitHub in 2026

Use the comparison tool below to compare the top Data Preparation software for GitHub on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Linx Reviews

    Linx

    Twenty57

    $599 per month
    1 Rating
    A powerful iPaaS platform for integration and business process automation. Linx is a powerful integration platform (iPaaS) that enables organizations to connect all their data sources, systems, and applications. The platform is known for its programming-like flexibility and the resulting ability to handle complex integrations at scale. It is a popular choice for growing businesses looking to embrace a unified integration strategy.
  • 2
    Domo Reviews
    Top Pick
    Domo puts data to work for everyone so they can multiply their impact on the business. Underpinned by a secure data foundation, our cloud-native data experience platform makes data visible and actionable with user-friendly dashboards and apps. Domo helps companies optimize critical business processes at scale and in record time to spark bold curiosity that powers exponential business results.
  • 3
    Gathr.ai Reviews
    Top Pick
    Gathr is a Data+AI fabric, helping enterprises rapidly deliver production-ready data and AI products. Data+AI fabric enables teams to effortlessly acquire, process, and harness data, leverage AI services to generate intelligence, and build consumer applications— all with unparalleled speed, scale, and confidence. Gathr’s self-service, AI-assisted, and collaborative approach enables data and AI leaders to achieve massive productivity gains by empowering their existing teams to deliver more valuable work in less time. With complete ownership and control over data and AI, flexibility and agility to experiment and innovate on an ongoing basis, and proven reliable performance at real-world scale, Gathr allows them to confidently accelerate POVs to production. Additionally, Gathr supports both cloud and air-gapped deployments, making it the ideal choice for diverse enterprise needs. Gathr, recognized by leading analysts like Gartner and Forrester, is a go-to-partner for Fortune 500 companies, such as United, Kroger, Philips, Truist, and many others.
  • 4
    Dataiku Reviews
    Dataiku is a comprehensive enterprise AI platform built to transform how organizations develop, deploy, and manage artificial intelligence at scale. It unifies data, analytics, and machine learning into a centralized environment where both technical and non-technical users can collaborate effectively. The platform enables teams to design and operationalize AI workflows, from data preparation to model deployment and monitoring. With its orchestration capabilities, Dataiku connects various data systems, applications, and processes to streamline operations across the enterprise. It also offers robust governance features that ensure transparency, compliance, and cost control throughout the AI lifecycle. Organizations can build intelligent agents, automate decision-making, and enhance analytics without disrupting existing workflows. Dataiku supports the transition from siloed models to production-ready machine learning systems that can be reused and scaled. Its flexibility allows businesses to modernize legacy analytics while preserving institutional knowledge. Companies across industries leverage the platform to accelerate innovation, improve efficiency, and unlock new revenue opportunities. By combining scalability, governance, and usability, Dataiku empowers enterprises to turn AI into a strategic advantage.
  • 5
    Telegraf Reviews
    Telegraf is an open-source server agent that helps you collect metrics from your sensors, stacks, and systems. Telegraf is a plugin-driven agent that collects and sends metrics and events from systems, databases, and IoT sensors. Telegraf is written in Go. It compiles to a single binary and has no external dependencies. It also requires very little memory. Telegraf can gather metrics from a wide variety of inputs and then write them into a wide range of outputs. It can be easily extended by being plugin-driven for both the collection and output data. It is written in Go and can be run on any system without external dependencies. It is easy to collect metrics from your endpoints with the 300+ plugins that have been created by data experts in the community.
  • 6
    Boomi Reviews

    Boomi

    Boomi

    $550.00/month
    Boomi's iPaaS platform empowers businesses to integrate, automate, and manage their data and workflows across multiple applications and systems. By leveraging AI agents, Boomi automates complex processes, improving speed and reducing errors. With a user-friendly interface and a library of pre-built connectors, the platform simplifies the integration of applications such as Salesforce, SAP, and AWS. Boomi helps organizations unlock their full potential by enabling rapid digital transformation, secure data management, and optimized business operations. Boomi Agentstudio is the solution for managing AI agents at scale, offering businesses a centralized platform to design, monitor, and deploy agents effectively. It includes powerful tools such as Agent Garden for lifecycle management, Agent Control Tower for visibility and governance, and AI-powered workflows that integrate seamlessly with other business systems. By providing easy-to-use tools for AI agent orchestration, Boomi allows organizations to achieve efficient, compliant automation while reducing operational complexities, all within a secure environment.
  • 7
    Fivetran Reviews
    Fivetran is a comprehensive data integration solution designed to centralize and streamline data movement for organizations of all sizes. With more than 700 pre-built connectors, it effortlessly transfers data from SaaS apps, databases, ERPs, and files into data warehouses and lakes, enabling real-time analytics and AI-driven insights. The platform’s scalable pipelines automatically adapt to growing data volumes and business complexity. Leading companies such as Dropbox, JetBlue, Pfizer, and National Australia Bank rely on Fivetran to reduce data ingestion time from weeks to minutes and improve operational efficiency. Fivetran offers strong security compliance with certifications including SOC 1 & 2, GDPR, HIPAA, ISO 27001, PCI DSS, and HITRUST. Users can programmatically create and manage pipelines through its REST API for seamless extensibility. The platform supports governance features like role-based access controls and integrates with transformation tools like dbt Labs. Fivetran helps organizations innovate by providing reliable, secure, and automated data pipelines tailored to their evolving needs.
  • 8
    Mozart Data Reviews
    Mozart Data is the all-in-one modern data platform for consolidating, organizing, and analyzing your data. Set up a modern data stack in an hour, without any engineering. Start getting more out of your data and making data-driven decisions today.
  • 9
    Astro by Astronomer Reviews
    Astronomer is the driving force behind Apache Airflow, the de facto standard for expressing data flows as code. Airflow is downloaded more than 4 million times each month and is used by hundreds of thousands of teams around the world. For data teams looking to increase the availability of trusted data, Astronomer provides Astro, the modern data orchestration platform, powered by Airflow. Astro enables data engineers, data scientists, and data analysts to build, run, and observe pipelines-as-code. Founded in 2018, Astronomer is a global remote-first company with hubs in Cincinnati, New York, San Francisco, and San Jose. Customers in more than 35 countries trust Astronomer as their partner for data orchestration.
  • 10
    Microsoft Power Query Reviews
    Power Query provides a user-friendly solution for connecting, extracting, transforming, and loading data from a variety of sources. Acting as a robust engine for data preparation and transformation, Power Query features a graphical interface that simplifies the data retrieval process and includes a Power Query Editor for implementing necessary changes. The versatility of the engine allows it to be integrated across numerous products and services, meaning the storage location of the data is determined by the specific application of Power Query. This tool enables users to efficiently carry out the extract, transform, and load (ETL) processes for their data needs. With Microsoft’s Data Connectivity and Data Preparation technology, users can easily access and manipulate data from hundreds of sources in a straightforward, no-code environment. Power Query is equipped with support for a multitude of data sources through built-in connectors, generic interfaces like REST APIs, ODBC, OLE, DB, and OData, and even offers a Power Query SDK for creating custom connectors tailored to individual requirements. This flexibility makes Power Query an indispensable asset for data professionals seeking to streamline their workflows.
  • 11
    TROCCO Reviews

    TROCCO

    primeNumber Inc

    TROCCO is an all-in-one modern data platform designed to help users seamlessly integrate, transform, orchestrate, and manage data through a unified interface. It boasts an extensive array of connectors that encompass advertising platforms such as Google Ads and Facebook Ads, cloud services like AWS Cost Explorer and Google Analytics 4, as well as various databases including MySQL and PostgreSQL, and data warehouses such as Amazon Redshift and Google BigQuery. One of its standout features is Managed ETL, which simplifies the data import process by allowing bulk ingestion of data sources and offers centralized management for ETL configurations, thereby removing the necessity for individual setup. Furthermore, TROCCO includes a data catalog that automatically collects metadata from data analysis infrastructure, creating a detailed catalog that enhances data accessibility and usage. Users have the ability to design workflows that enable them to organize a sequence of tasks, establishing an efficient order and combination to optimize data processing. This capability allows for increased productivity and ensures that users can better capitalize on their data resources.
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB