What are User Defined Functions(UDF) in Apache Hive?

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:July 5, 2022

Apache hive is a data warehousing tool in which we use a Structured Query Language(SQL) like language called Hive Query Language(HQL) to perform various ETL tasks on given data. Hive…

Hive Tutorials

Post author:nitendratech
Post category:Hive
Post comments:1 Comment
Post published:September 11, 2021

Apache Hive is a Data Warehousing Infrastructure built on top of Hadoop and provides table abstraction on top of data resident in HDFS(Hadoop Distributed File System) as explained on their official…

Types of Table in Apache Hive

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:June 15, 2021

Apache Hive has mainly two types of tables: Managed and External tables. Managed Table: When hive creates managed (default) tables, it follows the "schema on read" principle and loads the…

Hive Vectorization Technique

Post author:nitendratech
Post category:Hive
Post comments:1 Comment
Post published:October 17, 2019

The hive Vectorization technique is a feature (in both MapReduce and Tez Engine) that greatly reduces the CPU usage for typical query operations like scans, filters, aggregates, and joins. A…

Apache Hive 3 Architecture

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:May 21, 2019

With the introduction of Apache Hive 3, Apache Hadoop has introduced different new features to address the growing needs of enterprise data warehouse systems. This blog post talks about several…

Running Hive Queries using Shell scripts

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:November 4, 2018

When working with Big data platform like Apache Hive, certain task such as data ingestion/data preparation task needs to be scheduled daily using shell/bash scripts so that downstream application can get the new data everyday.In this blog post we will load movies data into hive tables using shell scripts

Installing Apache Hive on Ubuntu

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:March 25, 2018

In this blog post we will install Apache Hive in Ubuntu Machine.Once installation is complete we will run Hive queries using Hive Query Language(HQL) to Verify the installation.

Partitions in Apache Hive

Post author:nitendratech
Post category:Hive
Post comments:0 Comments
Post published:February 10, 2018

Partitions are fundamentally horizontal slices of data that allow large sets of data to be segmented into more manageable chunks. They are virtual columns acting as storage units but, not…

Introduction to Hive Query Language(HQL)

Post author:nitendratech
Post category:Hive
Post comments:3 Comments
Post published:January 28, 2018

Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data summarization, query and analysis. Hive Query Language is SQL like query language which is used to query data from hive tables.

Introduction to Apache Hive

Post author:nitendratech
Post category:Hive
Post comments:15 Comments
Post published:December 28, 2017

Apache Hive is a Data Warehousing Infrastructure built on top of Hadoop and provides table abstraction on top of data resident in HDFS as explained in their official page. It is used for providing data summarization ,query and analysis for large data sets.