Apache Doris is an advanced data warehouse for real time analytics. It delivers lightning fast analytics on real-time, large-scale data.
Ingestion of micro-batch data and streaming data within a second. Storage engine with upserts, appends and pre-aggregations in real-time.
Optimize for high-concurrency, high-throughput queries using columnar storage engine, cost-based query optimizer, and vectorized execution engine.
Federated querying for data lakes like Hive, Iceberg, and Hudi and databases like MySQL and PostgreSQL.
Compound data types, such as Arrays, Maps and JSON. Variant data types to support auto datatype inference for JSON data. NGram bloomfilter for text search.
Distributed design for linear scaling. Workload isolation, tiered storage and efficient resource management. Supports shared-nothing as well as the separation of storage from compute.