+1 vote
in Hive by
How Hive distributes the rows into buckets?

1 Answer

0 votes
by

By using the formula: hash_function (bucketing_column) modulo (num_of_buckets) Hive determines the bucket number for a row. Basically, hash_function depends on the column data type. Although, hash_function for integer data type will be:

hash_function (int_type_column)= value of int_type_column

...