Apache Parquet Description

Parquet was created to provide the Hadoop ecosystem with the benefits of columnar, compressed data representation. Parquet was built with complex nested data structures and uses the Dremel paper's record shredding/assemblage algorithm. This approach is better than flattening nested namespaces. Parquet is designed to support efficient compression and encoding strategies. Multiple projects have shown the positive impact of the right compression and encoding scheme on data performance. Parquet allows for compression schemes to be specified per-column. It is future-proofed to allow for more encodings to be added as they are developed and implemented. Parquet was designed to be used by everyone. We don't want to play favorites in the Hadoop ecosystem.

Integrations

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
The Apache Software Foundation
Year Founded:
1999
Headquarters:
United States
Website:
parquet.apache.org

Media

Apache Parquet Screenshot 1
Recommended Products
Secure your business by securing your people. Icon
Secure your business by securing your people.

Over 100,000 businesses trust 1Password

Take the guesswork out of password management, shadow IT, infrastructure, and secret sharing so you can keep your people safe and your business moving.

Product Details

Platforms
Windows
Mac
Linux
Type of Training
Documentation
Webinars
Videos
Customer Support
Online

Apache Parquet Features and Options