What are User Defined Functions(UDF) in Apache Hive?
Apache hive is a data warehousing tool in which we use a Structured Query Language(SQL) like language called Hive Query Language(HQL) to perform various ETL tasks on given data. Hive…
Apache hive is a data warehousing tool in which we use a Structured Query Language(SQL) like language called Hive Query Language(HQL) to perform various ETL tasks on given data. Hive…
What is a Data Engineer? The general job of the engineer is to design and build things. In the field of software engineering, engineering design, and building software. When we…
In the earlier blog post, we looked into various interview questions that can come with the hive and its architecture. In this blog post, we will mainly focus on the…
In this blog post, we will look at some of the most frequent and important Apache Hive interview questions and answers that come up during an interview. If you are…
Apache Hive is a Data Warehousing Infrastructure built on top of Hadoop and provides table abstraction on top of data resident in HDFS(Hadoop Distributed File System) as explained on their official…
Apache Hive has mainly two types of tables: Managed and External tables. Managed Table: When hive creates managed (default) tables, it follows the "schema on read" principle and loads the…
The hive Vectorization technique is a feature (in both MapReduce and Tez Engine) that greatly reduces the CPU usage for typical query operations like scans, filters, aggregates, and joins. A…
With the introduction of Apache Hive 3, Apache Hadoop has introduced different new features to address the growing needs of enterprise data warehouse systems. This blog post talks about several…
When working with Big data platform like Apache Hive, certain task such as data ingestion/data preparation task needs to be scheduled daily using shell/bash scripts so that downstream application can get the new data everyday.In this blog post we will load movies data into hive tables using shell scripts
In this blog post we will install Apache Hive in Ubuntu Machine.Once installation is complete we will run Hive queries using Hive Query Language(HQL) to Verify the installation.