Apache Hive Description

Apache Hive™, a data warehouse software, facilitates the reading, writing and management of large datasets that are stored in distributed storage using SQL. Structure can be projected onto existing data. Hive provides a command line tool and a JDBC driver to allow users to connect to it. Apache Hive is an Apache Software Foundation open-source project. It was previously a subproject to Apache® Hadoop®, but it has now become a top-level project. We encourage you to read about the project and share your knowledge. To execute traditional SQL queries, you must use the MapReduce Java API. Hive provides the SQL abstraction needed to integrate SQL-like query (HiveQL), into the underlying Java. This is in addition to the Java API that implements queries.

Integrations

Reviews - 1 Verified Review

Total
ease
features
design
support

Company Details

Company:
Apache Software Foundation
Year Founded:
1999
Headquarters:
United States
Website:
hive.apache.org

Media

Apache Hive Screenshot 1
You Might Also Like
Data-Driven Innovation: The CDP Playbook for Eng Teams Icon
Data-Driven Innovation: The CDP Playbook for Eng Teams

Why your engineering team needs a CDP

In this playbook, you’ll learn…
- How engineering teams use real-time customer data to achieve business goals.
- How to elevate your business to a new level of engineering efficiency with AI.
- Strategies used by engineering teams at Instacart, Staples Canada, Televisa Univision, CrossFit, and ClearScore to improve KPIs and drive efficiencies.

Product Details

Platforms
SaaS
Type of Training
Documentation
In Person
Customer Support
Online

Apache Hive Features and Options

ETL Software

Data Analysis
Data Filtering
Data Quality Control
Job Scheduling
Match & Merge
Metadata Management
Non-Relational Transformations
Version Control

Apache Hive User Reviews

Write a Review
  • Name: Anonymous (Verified)
    Job Title: Software Developer
    Length of product use: 1-2 Years
    Used How Often?: Monthly
    Role: User
    Organization Size: 100 - 499
    Features
    Design
    Ease
    Pricing
    Support
    Likelihood to Recommend to Others
    1 2 3 4 5 6 7 8 9 10

    Great ETL Solution

    Date: Jul 09 2020

    Summary: Apache Hive is a good solution to query and analyze large amount of data. Its ease of use and good performance in handling large amount of data makes it an excellent ETL Solution.

    Positive: Open Source
    Easy to learn - similar to SQL
    Fast performance
    Various data structures supported
    Scalable to meet growing demands
    Integrates with various tools & databases

    Negative: Needs more SQL functionalities like subqueries & better optimization for advanced query like joins.

    Read More...