Hadoop Tutorials
Apache Hadoop is an open-source framework used to store and process large datasets ranging in size from gigabytes to petabytes. The framework runs on clusters of multiple commodity computers…
“Big data” refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze. What counts as “big” here is subjective…
A MAC (Media Access Control) address is a hardware address assigned to a network interface controller (NIC) for communication within a network segment. They are mostly used…
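As a small illustration (a generic sketch, not Hadoop-specific), a 48-bit MAC address stored as an integer can be rendered in the familiar colon-separated hexadecimal notation:

```python
def format_mac(mac_int):
    """Format a 48-bit integer as a colon-separated MAC address string."""
    # Walk the six bytes from most significant to least significant.
    return ":".join(f"{(mac_int >> shift) & 0xFF:02x}" for shift in range(40, -8, -8))

# Example value chosen for illustration only:
print(format_mac(0x001B44113AB7))  # -> 00:1b:44:11:3a:b7
```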
What is HeartBeat? In the Hadoop framework, a heartbeat is a signal sent by a DataNode to the NameNode, and likewise by a TaskTracker to the JobTracker. The DataNode sends the signal…
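The liveness-tracking idea behind heartbeats can be sketched as follows. This is a simplified illustration, not the real NameNode implementation; the class name, timestamps, and the 30-second timeout are all made up for the example:

```python
# Minimal sketch of a NameNode-style monitor: each heartbeat refreshes a
# node's "last seen" time, and nodes that stay silent past the timeout are
# reported as dead.
class NameNodeMonitor:
    def __init__(self, timeout):
        self.timeout = timeout   # seconds of silence before a node is considered dead
        self.last_seen = {}      # DataNode id -> time of last heartbeat

    def heartbeat(self, datanode_id, now):
        self.last_seen[datanode_id] = now

    def dead_nodes(self, now):
        return [dn for dn, t in self.last_seen.items() if now - t > self.timeout]

monitor = NameNodeMonitor(timeout=30)
monitor.heartbeat("dn1", now=0)
monitor.heartbeat("dn2", now=0)
monitor.heartbeat("dn1", now=25)   # dn1 keeps reporting; dn2 goes silent
print(monitor.dead_nodes(now=40))  # -> ['dn2']
```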
Speculative execution is a way of coping with slow individual machines. In large clusters of hundreds or thousands of machines, there may be some that are not performing…
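The core idea can be shown with a toy simulation (this is not Hadoop's scheduler; the runtimes and result name are invented): launch a duplicate attempt of a straggling task and keep whichever attempt finishes first.

```python
# Toy model of speculative execution: given the runtimes of the original
# attempt and its speculative duplicate, the attempt that finishes first
# "wins" and the other would be killed.
def run_with_speculation(attempt_times, result):
    """attempt_times: runtimes of each attempt of the same task (seconds)."""
    winner = min(range(len(attempt_times)), key=lambda i: attempt_times[i])
    return {"winning_attempt": winner, "runtime": attempt_times[winner], "result": result}

# Original attempt stuck on a slow node (120 s) vs. a duplicate on a
# healthy node (35 s): the duplicate finishes first.
outcome = run_with_speculation([120, 35], result="part-00000")
print(outcome["winning_attempt"], outcome["runtime"])  # -> 1 35
```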
Understanding ETL (Extract, Transform, and Load) ETL stands for Extract, Transform, and Load. It is a process used to extract data from various sources and…
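The three stages can be sketched end to end in a few lines. This is a minimal illustration; the record fields and sample values are made up, and a real pipeline would read from and write to external systems rather than in-memory lists:

```python
# Minimal ETL sketch: extract raw records, transform them (clean up names,
# convert types), then load them into a target store.
def extract():
    # Stand-in for reading from a source system (file, database, API).
    return [{"name": " Alice ", "sales": "120"}, {"name": "bob", "sales": "95"}]

def transform(rows):
    # Normalize whitespace and casing; convert sales figures to integers.
    return [{"name": r["name"].strip().title(), "sales": int(r["sales"])} for r in rows]

def load(rows, warehouse):
    # Stand-in for writing to a target warehouse.
    warehouse.extend(rows)

warehouse = []
load(transform(extract()), warehouse)
print(warehouse)  # -> [{'name': 'Alice', 'sales': 120}, {'name': 'Bob', 'sales': 95}]
```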
What does Edge Node Mean? An edge node is a computer that provides an interface between a Hadoop cluster and the outside network, allowing communication with the other nodes in the cluster. Edge nodes are also called…
Hadoop Block Size Configuration and Components A block is the smallest unit of storage on a hard drive that can be read or written. Data in HDFS (Hadoop Distributed File…
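A quick back-of-the-envelope calculation shows how a file maps onto blocks, assuming the common HDFS default block size of 128 MB (the `dfs.blocksize` setting); the 300 MB file size is just an example:

```python
import math

BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB in bytes (HDFS default dfs.blocksize)

def num_blocks(file_size_bytes):
    """Number of HDFS blocks a file of the given size occupies."""
    return math.ceil(file_size_bytes / BLOCK_SIZE)

# A 300 MB file spans 3 blocks: two full 128 MB blocks plus one 44 MB block.
# Note the last block only occupies as much disk as it actually contains.
print(num_blocks(300 * 1024 * 1024))  # -> 3
```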
What is a Rack? Before looking into rack awareness in Hadoop HDFS, let us understand the rack itself. A rack is a physical storage area in which all the data nodes are…
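The way rack awareness shapes replica placement can be sketched for the default replication factor of 3: the first replica goes on the local rack, and the second and third on two nodes of a single remote rack. This is a simplified illustration of the policy, not HDFS code; the rack and node names are hypothetical:

```python
# Simplified rack-aware placement for 3 replicas of one block.
def place_replicas(local_rack, racks):
    """racks: mapping of rack name -> list of DataNodes on that rack."""
    first = racks[local_rack][0]                       # replica 1: local rack
    remote_rack = next(r for r in racks if r != local_rack)
    second, third = racks[remote_rack][:2]             # replicas 2 and 3: one remote rack
    return [first, second, third]

racks = {"rack1": ["dn1", "dn2"], "rack2": ["dn3", "dn4"]}
print(place_replicas("rack1", racks))  # -> ['dn1', 'dn3', 'dn4']
```

Placing two replicas on a second rack keeps the block readable even if a whole rack fails, while limiting cross-rack write traffic to a single remote rack.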
Apache Hadoop 3 incorporates a number of enhancements over Hadoop 2.x. We will discuss the important enhancements introduced in Hadoop 3 over Hadoop 2 in…