Data Engineering User Guide

Learning Data Engineering can be daunting, but a step-by-step approach makes the field much easier to understand. In this blog post, we go through each step and link to the relevant tutorials, so you can follow along and learn Data Engineering and its related topics.

Concepts on Data

In this section, we will learn about data and its quality before starting on Data Engineering.

Tutorial Links
What is Data, its types and How is it different from knowledge?
What is Data Analysis and the different steps involved in it?
What is a Database and Why is it important?
What is Relational Database Management Systems?

Concepts on Computer Science and Programming Languages

In this section, we will learn about the basic programming language concepts one needs to work with data.

Tutorial Links
What is Computer Science?
How to Start Career in Computer Science?

Concepts on Data Pipelines

In this section, we will learn about data pipelines and the related concepts needed in Data Engineering.

Tutorial Links
What is Data Pipeline and its role in Data Engineering?
How to start career in Data Engineering?
What is a Data Platform?
Concept about Data Observability and Its Importance in Modern Data Tech Stack

Concepts on Big Data, Tools and Technologies

In this section, we will learn about various concepts on Big Data and its related tools.

Tutorial Links
What is Big Data and Why it is important to understand? Introduction and Properties
What is Apache Hadoop?
Learn about Hadoop Distributed File System (HDFS)
Learn about MapReduce
What is Apache Hive?
Learn about Apache Spark as a Distributed Processing Framework

Concepts on Files and Their Types in Data Engineering

Tutorial Links
What are Big Data File Storage Formats?
What is Columnar Data Storage and its Types?
What is a Flat File? And Why is It Important?
What are Log Files?

Concepts on Data Quality

Tutorial Links
What is Data Quality?
How to Improve Data Quality in an Organization?

Concepts on Database and Warehousing

Tutorial Links
What is a Database?
What is Relational Database Management Systems (RDBMS)?
What is Structured Query Language (SQL)?
NoSQL or Non-Relational Database
What is Enterprise Data Warehouse (EDW)?

Concepts on Cloud Computing

In this section, we will see various tutorials on cloud computing.

Tutorial Links
What is Cloud Computing?
Amazon Web Services (AWS)
What is Containerization?
Learn Kubernetes

Interview Questions

Tutorial Links
Big Data Interview Questions
Apache Hadoop Interview Questions
Apache Spark Interview Questions
Data Engineering Interview Questions
System Design Interview Questions
DevOps Interview Questions
Important Kubernetes Interview Questions
Amazon Web Service Interview Questions

Conclusion

In this blog post and user guide, we listed the topics and concepts that are important in the field of Data Engineering, along with their tutorial links.