Apache Doris
Apache Doris serves as an advanced data warehouse tailored for real-time analytics, providing exceptionally rapid insights into large-scale real-time data.
It features both push-based micro-batch and pull-based streaming data ingestion, achieving this within a second, along with a storage engine capable of real-time updates, appends, and pre-aggregations.
The platform is optimized for handling high-concurrency and high-throughput queries thanks to its columnar storage engine, MPP architecture, cost-based query optimizer, and vectorized execution engine.
Moreover, it supports federated querying across various data lakes like Hive, Iceberg, and Hudi, as well as traditional databases such as MySQL and PostgreSQL.
Doris also accommodates complex data types, including Array, Map, and JSON, and features a variant data type that allows for automatic inference of JSON data types.
Additionally, it employs advanced indexing techniques like NGram bloomfilter and inverted index to enhance text search capabilities.
With its distributed architecture, Doris enables linear scalability, incorporates workload isolation, and implements tiered storage to optimize resource management effectively.
Furthermore, it is designed to support both shared-nothing clusters and the separation of storage and compute resources, making it a versatile solution for diverse analytical needs.
Learn more
Striim
Data integration for hybrid clouds Modern, reliable data integration across both your private cloud and public cloud. All this in real-time, with change data capture and streams. Striim was developed by the executive and technical team at GoldenGate Software. They have decades of experience in mission critical enterprise workloads. Striim can be deployed in your environment as a distributed platform or in the cloud. Your team can easily adjust the scaleability of Striim. Striim is fully secured with HIPAA compliance and GDPR compliance. Built from the ground up to support modern enterprise workloads, whether they are hosted in the cloud or on-premise. Drag and drop to create data flows among your sources and targets. Real-time SQL queries allow you to process, enrich, and analyze streaming data.
Learn more
VeloDB
VeloDB, powered by Apache Doris is a modern database for real-time analytics at scale.
In seconds, micro-batch data can be ingested using a push-based system. Storage engine with upserts, appends and pre-aggregations in real-time. Unmatched performance in real-time data service and interactive ad hoc queries.
Not only structured data, but also semi-structured. Not only real-time analytics, but also batch processing. Not only run queries against internal data, but also work as an federated query engine to access external databases and data lakes.
Distributed design to support linear scalability. Resource usage can be adjusted flexibly to meet workload requirements, whether on-premise or cloud deployment, separation or integration.
Apache Doris is fully compatible and built on this open source software. Support MySQL functions, protocol, and SQL to allow easy integration with other tools.
Learn more
Rockset
Real-time analytics on raw data. Live ingest from S3, DynamoDB, DynamoDB and more. Raw data can be accessed as SQL tables. In minutes, you can create amazing data-driven apps and live dashboards. Rockset is a serverless analytics and search engine that powers real-time applications and live dashboards. You can directly work with raw data such as JSON, XML and CSV. Rockset can import data from real-time streams and data lakes, data warehouses, and databases. You can import real-time data without the need to build pipelines. Rockset syncs all new data as it arrives in your data sources, without the need to create a fixed schema. You can use familiar SQL, including filters, joins, and aggregations. Rockset automatically indexes every field in your data, making it lightning fast. Fast queries are used to power your apps, microservices and live dashboards. Scale without worrying too much about servers, shards or pagers.
Learn more