Best Big Data Software in China

Find and compare the best Big Data software in China in 2024

Use the comparison tool below to compare the top Big Data software in China on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Amazon QuickSight Reviews
    Amazon QuickSight allows everyone within your organization to access your data. This includes asking questions in natural language, exploring interactive dashboards, and automatically looking for patterns or outliers using machine learning. QuickSight powers millions upon millions of dashboard views per week for customers like the NFL, Expedia and Volvo. This allows their end-users make better data-driven decision making. To receive relevant visualizations, ask questions about your data using Q's ML-powered engine. This allows you to ask conversational questions without the need for data preparation by admins and authors. AWS' machine learning expertise allows you to uncover hidden insights in your data, forecast accurately and do what-if analysis. You can also add natural language narratives or easy-to-understand natural languages to your dashboards using AWS' machine learning expertise. You can embed interactive visualizations and dashboards in your applications, as well as sophisticated dashboard authoring and natural language query capabilities.
  • 2
    Ataccama ONE Reviews
    Ataccama is a revolutionary way to manage data and create enterprise value. Ataccama unifies Data Governance, Data Quality and Master Data Management into one AI-powered fabric that can be used in hybrid and cloud environments. This gives your business and data teams unprecedented speed and security while ensuring trust, security and governance of your data.
  • 3
    Iguazio Reviews

    Iguazio

    Iguazio (Acquired by McKinsey)

    The Iguazio AI Platform provides a complete AI workflow in a single ready-to-use platform that includes all the required building blocks for building, deploying, operationalizing, scaling and de-risking ML and GenAI applications in live business environments. Highlights: - From POC to production - Get your AI projects out of the lab and into production with full automation and auto-scaling capabilities. - LLM Customization - Responsibly fine-tune models with RAG, RAFT and more. Improve model accuracy and performance at minimal cost. - GPU Provisioning - Optimize GPU resources by scaling usage up and down as needed. - Hybrid Deployment - Including AWS cloud, AWS GovCloud and AWS Outposts. - Governance - Monitor AI applications, address regulation needs, keep PII secure, mitigate bias and more
  • 4
    Sesame Software Reviews
    When you have the expertise of an enterprise partner combined with a scalable, easy-to-use data management suite, you can take back control of your data, access it from anywhere, ensure security and compliance, and unlock its power to grow your business. Why Use Sesame Software? Relational Junction builds, populates, and incrementally refreshes your data automatically. Enhance Data Quality - Convert data from multiple sources into a consistent format – leading to more accurate data, which provides the basis for solid decisions. Gain Insights - Automate the update of information into a central location, you can use your in-house BI tools to build useful reports to avoid costly mistakes. Fixed Price - Avoid high consumption costs with yearly fixed prices and multi-year discounts no matter your data volume.
  • 5
    GeoSpock Reviews
    GeoSpock DB - The space-time analytics database - allows data fusion in the connected world. GeoSpockDB is a unique cloud-native database that can be used to query for real-world applications. It can combine multiple sources of Internet of Things data to unlock their full potential, while simultaneously reducing complexity, cost, and complexity. GeoSpock DB enables data fusion and efficient storage. It also allows you to run ANSI SQL query and connect to analytics tools using JDBC/ODBC connectors. Users can perform analysis and share insights with familiar toolsets. This includes support for common BI tools such as Tableau™, Amazon QuickSight™, and Microsoft Power BI™, as well as Data Science and Machine Learning environments (including Python Notebooks or Apache Spark). The database can be integrated with internal applications as well as web services, including compatibility with open-source visualisation libraries like Cesium.js and Kepler.
  • 6
    Hopsworks Reviews

    Hopsworks

    Logical Clocks

    $1 per month
    Hopsworks is an open source Enterprise platform that allows you to develop and operate Machine Learning (ML), pipelines at scale. It is built around the first Feature Store for ML in the industry. You can quickly move from data exploration and model building in Python with Jupyter notebooks. Conda is all you need to run production-quality end-to-end ML pipes. Hopsworks can access data from any datasources you choose. They can be in the cloud, on premise, IoT networks or from your Industry 4.0-solution. You can deploy on-premises using your hardware or your preferred cloud provider. Hopsworks will offer the same user experience in cloud deployments or the most secure air-gapped deployments.
  • 7
    Google Cloud Dataproc Reviews
    Dataproc makes it easy to process open source data and analytic processing in the cloud. Faster build custom OSS clusters for custom machines Dataproc can speed up your data and analytics processing, whether you need more memory for Presto or GPUs to run Apache Spark machine learning. It spins up a cluster in less than 90 seconds. Cluster management is easy and affordable Dataproc offers autoscaling, idle cluster deletion and per-second pricing. This allows you to focus your time and resources on other areas. Security built in by default Encryption by default ensures that no data is left unprotected. Component Gateway and JobsAPI allow you to define permissions for Cloud IAM clusters without the need to set up gateway or networking nodes.
  • 8
    Tengu Reviews
    TENGU is a Data orchestration platform that serves as a central workspace for all data profiles to work more efficiently and enhance collaboration. Allowing you to get the most out of your data, faster. It allows complete control over your data environment in an innovative graph view for intuitive monitoring. Connecting all necessary tools in one workspace. It enables self-service, monitoring and automation, supporting all data roles and operations from integration to transformation.
  • 9
    PHEMI Health DataLab Reviews
    Unlike most data management systems, PHEMI Health DataLab is built with Privacy-by-Design principles, not as an add-on. This means privacy and data governance are built-in from the ground up, providing you with distinct advantages: Lets analysts work with data without breaching privacy guidelines Includes a comprehensive, extensible library of de-identification algorithms to hide, mask, truncate, group, and anonymize data. Creates dataset-specific or system-wide pseudonyms enabling linking and sharing of data without risking data leakage. Collects audit logs concerning not only what changes were made to the PHEMI system, but also data access patterns. Automatically generates human and machine-readable de- identification reports to meet your enterprise governance risk and compliance guidelines. Rather than a policy per data access point, PHEMI gives you the advantage of one central policy for all access patterns, whether Spark, ODBC, REST, export, and more
  • 10
    GigaSpaces Reviews
    Smart DIH is a data management platform that quickly serves applications with accurate, fresh and complete data, delivering high performance, ultra-low latency, and an always-on digital experience. Smart DIH decouples APIs from SoRs, replicating critical data, and making it available using event-driven architecture. Smart DIH enables drastically shorter development cycles of new digital services, and rapidly scales to serve millions of concurrent users – no matter which IT infrastructure or cloud topologies it relies on. XAP Skyline is a distributed in-memory development platform that delivers transactional consistency, combined with extreme event-based processing and microsecond latency. The platform fuels core business solutions that rely on instantaneous data, including online trading, real-time risk management and data processing for AI and large language models.
  • 11
    Mozart Data Reviews
    Mozart Data is the all-in-one modern data platform for consolidating, organizing, and analyzing your data. Set up a modern data stack in an hour, without any engineering. Start getting more out of your data and making data-driven decisions today.
  • 12
    Conversionomics Reviews

    Conversionomics

    Conversionomics

    $250 per month
    No per-connection charges for setting up all the automated connections that you need. No per-connection fees for all the automated connections that you need. No technical expertise is required to set up and scale your cloud data warehouse or processing operations. Conversionomics allows you to make mistakes and ask hard questions about your data. You have the power to do whatever you want with your data. Conversionomics creates complex SQL to combine source data with lookups and table relationships. You can use preset joins and common SQL, or create your own SQL to customize your query. Conversionomics is a data aggregation tool with a simple interface that makes it quick and easy to create data API sources. You can create interactive dashboards and reports from these sources using our templates and your favorite data visualization tools.
  • 13
    IRI Voracity Reviews

    IRI Voracity

    IRI, The CoSort Company

    IRI Voracity is an end-to-end software platform for fast, affordable, and ergonomic data lifecycle management. Voracity speeds, consolidates, and often combines the key activities of data discovery, integration, migration, governance, and analytics in a single pane of glass, built on Eclipse™. Through its revolutionary convergence of capability and its wide range of job design and runtime options, Voracity bends the multi-tool cost, difficulty, and risk curves away from megavendor ETL packages, disjointed Apache projects, and specialized software. Voracity uniquely delivers the ability to perform data: * profiling and classification * searching and risk-scoring * integration and federation * migration and replication * cleansing and enrichment * validation and unification * masking and encryption * reporting and wrangling * subsetting and testing Voracity runs on-premise, or in the cloud, on physical or virtual machines, and its runtimes can also be containerized or called from real-time applications or batch jobs.
  • 14
    Dataleyk Reviews

    Dataleyk

    Dataleyk

    €0.1 per GB
    Dataleyk is a secure, fully-managed cloud platform for SMBs. Our mission is to make Big Data analytics accessible and easy for everyone. Dataleyk is the missing piece to achieving your data-driven goals. Our platform makes it easy to create a stable, flexible, and reliable cloud data lake without any technical knowledge. All of your company data can be brought together, explored with SQL, and visualized with your favorite BI tool. Dataleyk will modernize your data warehouse. Our cloud-based data platform is capable of handling both structured and unstructured data. Data is an asset. Dataleyk, a cloud-based data platform, encrypts all data and offers data warehousing on-demand. Zero maintenance may not be an easy goal. It can be a catalyst for significant delivery improvements, and transformative results.
  • 15
    Azure Data Share Reviews

    Azure Data Share

    Microsoft

    $0.05 per dataset-snapshot
    You can share data with other organizations in any format and size. You can easily control what data you share, who gets it, and the terms of your use. Data Share gives you full visibility into all data-sharing relationships through a user-friendly interface. You can share data with just a few clicks or create your own application using REST API. Serverless code-free data sharing service that doesn't require infrastructure setup or management. An intuitive interface to manage all data-sharing relationships. Automated data sharing for predictability and productivity. Secure data-sharing service that utilizes underlying Azure security measures. In just a few clicks, you can share structured and unstructured data from multiple Azure storages with other organizations. There is no infrastructure to create or manage, no SAS keys required, and sharing data is completely code-free. You can control data access and set terms that are consistent with your enterprise policies.
  • 16
    Indexima Data Hub Reviews

    Indexima Data Hub

    Indexima

    $3,290 per month
    Reframe your perception of time with data analytics. Instantly access the data of your business and work directly in your dashboard, without having to go back and forth with your IT team. Indexima DataHub is a new space where operational and functional users can instantly access their data. Indexima's unique indexing engine, combined with machine learning, allows businesses to quickly and easily access their data. The robust and scalable solution allows businesses to query their data directly from the source in volumes of up to tens billions of rows within milliseconds. With our Indexima platform, users can implement instant analytics for all their data with just one click. Indexima’s new ROI and TCO Calculator will help you determine the ROI of your data platform in just 30 seconds. Infrastructure costs, project deployment times, and data engineering cost, while boosting analytical performances.
  • 17
    Hydrolix Reviews

    Hydrolix

    Hydrolix

    $2,237 per month
    Hydrolix is a streaming lake of data that combines decoupled archiving, indexed searching, and stream processing for real-time query performance on terabyte scale at a dramatically lower cost. CFOs love that data retention costs are 4x lower. Product teams appreciate having 4x more data at their disposal. Scale up resources when needed and down when not. Control costs by fine-tuning resource consumption and performance based on workload. Imagine what you could build if you didn't have budget constraints. Log data from Kafka, Kinesis and HTTP can be ingested, enhanced and transformed. No matter how large your data, you will only get the data that you need. Reduce latency, costs, and eliminate timeouts and brute-force queries. Storage is decoupled with ingest and queries, allowing them to scale independently to meet performance and cost targets. Hydrolix's HDX (high-density compress) reduces 1TB to 55GB.
  • 18
    WarpStream Reviews

    WarpStream

    WarpStream

    $2,987 per month
    WarpStream, an Apache Kafka compatible data streaming platform, is built directly on object storage. It has no inter-AZ network costs, no disks that need to be managed, and it's infinitely scalable within your VPC. WarpStream is deployed in your VPC as a stateless, auto-scaling binary agent. No local disks are required to be managed. Agents stream data directly into and out of object storage without buffering on local drives and no data tiering. Instantly create new "virtual" clusters in our control plan. Support multiple environments, teams or projects without having to manage any dedicated infrastructure. WarpStream is Apache Kafka protocol compatible, so you can continue to use your favorite tools and applications. No need to rewrite or use a proprietary SDK. Simply change the URL of your favorite Kafka library in order to start streaming. Never again will you have to choose between budget and reliability.
  • 19
    AnswerRocket Reviews
    AnswerRocket, a US software company, has been innovating search based data discovery analytics via natural language since 2013. Their solution gives businesses the intelligence and analytics they need to run a data-driven organization in today's economy. The platform's elegant, top-quality engineering gives businesses an advantage over their competitors by providing a deeper look at how data is analyzed.
  • 20
    EspressReport ES Reviews
    EspressRepot ES (Enterprise Server), a web- and desktop-based software, allows users to create stunning interactive data visualizations and reports. The platform supports Java EE integration to draw data from data sources like Bid Data (Hadoop Spark and MongoDB), ad hoc queries and reports as well as online map support, mobile compatibility and alert monitor.
  • 21
    Octopai Reviews
    To have complete control over your data, harness the power of data discovery, data lineage and a data catalogue. It can quickly navigate through complex data landscapes. Access the most comprehensive automated data lineage and discovery system. This gives you unprecedented visibility and trust in the most complex data environments. Octopai extracts metadata from all data environments. Octopai can instantly analyze metadata in a fast, secure, and easy process. Octopai gives you access to data lineage, data discovery, and a data catalogue, all from one central platform. In seconds, trace any data from end to end through your entire data landscape. Find the data you need automatically from any place in your data landscape. A self-creating, self updating data catalog will help you create consistency across your company.
  • 22
    Alooma Reviews
    Alooma allows data teams visibility and control. It connects data from all your data silos into BigQuery in real-time. You can set up and flow data in minutes. Or, you can customize, enrich, or transform data before it hits the data warehouse. Never lose an event. Alooma's safety nets make it easy to handle errors without affecting your pipeline. Alooma infrastructure can handle any number of data sources, low or high volume.
  • 23
    Minit Reviews
    Transform the way you analyze and monitor processes to uncover opportunities for continuous process improvement. We help businesses find opportunities for continuous process improvement and greater operational efficiency. We believe every organization can achieve operational excellence and digital transformation. We are determined to make the world a place where every process can be easily and automatically analysed and improved. We encourage everyone to look into the potential benefits of Process Mining technology for their business. This is possible even if you are not a process or data expert. Our solution was created for anyone who is interested in understanding their processes and empowering them to make data-driven decision from the beginning.
  • 24
    Centralpoint Reviews
    Gartner's Magic Quadrant includes Centralpoint as a Digital Experience Platform. It is used by more than 350 clients around the world, and it goes beyond Enterprise Content Management. It securely authenticates (AD/SAML/OpenID, oAuth), all users for self-service interaction. Centralpoint automatically aggregates information from different sources and applies rich metadata against your rules to produce true Knowledge Management. This allows you to search for and relate disparate data sets from anywhere. Centralpoint's Module Gallery is the most robust and can be installed either on-premise or in the cloud. Check out our solutions for Automating Metadata and Automating Retention Policy Management. We also offer solutions to simplify the mashup of disparate data to benefit from AI (Artificial Intelligence). Centralpoint is often used to provide easy migration tools and an intelligent alternative to Sharepoint. It can be used to secure portal solutions for public sites, intranets, members, or extranets.
  • 25
    Astro Reviews
    Astronomer is the driving force behind Apache Airflow, the de facto standard for expressing data flows as code. Airflow is downloaded more than 4 million times each month and is used by hundreds of thousands of teams around the world. For data teams looking to increase the availability of trusted data, Astronomer provides Astro, the modern data orchestration platform, powered by Airflow. Astro enables data engineers, data scientists, and data analysts to build, run, and observe pipelines-as-code. Founded in 2018, Astronomer is a global remote-first company with hubs in Cincinnati, New York, San Francisco, and San Jose. Customers in more than 35 countries trust Astronomer as their partner for data orchestration.