Category: Big Data
Understanding NiFi and Airflow in Simple Terms: How They Help with Data Engineering and Data Flow
NiFi handles real-time data flow, while Airflow manages batch processing tasks. Learn how they differ and when to use each for data management.
Scala 3 Basics
Basics: how to declare a variable. Declare a constant variable: val x: Int = 42 or val x = 42. Declare a variable: var x: Int = 42 or var x = 42. Expressions & Instructions. Code Blocks. Functions: how to define functions: def aFunction(a: String, b: Int): String = { a + " "…
How to set up Single Node Hive on Tez Hadoop Cluster
Steps to set up Hadoop 2, Hive 2.1.1, Hive with MySQL, Tez, and Tez UI on a single node
What is High Availability in Hadoop
What NameNode High Availability and ResourceManager High Availability are, and how Hadoop achieves high availability in different versions
How to get the list of only files and not directories in Hadoop
List only directories or only regular files (those that are not directories) in Hadoop HDFS storage