July 2018

Requirement: In this post, we are going to explore windowing functions in Hive. These are the windowing functions: LEAD, LAG, FIRST_VALUE, LAST_VALUE, MIN/MAX/COUNT/AVG, OVER clause. Component Involved: Hive. Sample Data: ID | FIRST_NAME | LAST_NAME | DESIGNATION | DEPARTMENT | SALARY; 1001 | Jervis | Roll | Director of Sales | Sales | 30000; 1002 | Gordon | Mattster | Marketing Manager | Sales ... Read More →

Requirement: In this post, we are going to explore analytics functions in Hive. The following analytics functions are available in Hive: ROW_NUMBER, RANK, DENSE_RANK, CUME_DIST, PERCENT_RANK, NTILE. Component Involved: Hive. Sample Data: ID | FIRST_NAME | LAST_NAME | DESIGNATION | DEPARTMENT | SALARY; 1001 | Jervis | Roll | Director of Sales | Sales | 30000; 1002 | Gordon ... Read More →

Requirement: In this post, we will go through the concept of bucketing in Hive. This post covers the following points about bucketing: What is bucketing in Hive? How to load data into a bucketed table? What is its importance? Components Involved: Hive, HDFS. Sample Data: We will use the ... Read More →