Jump to content

Hadooooop Sodharulaki .. Oka Sahayam Kaavalenu


Recommended Posts

Posted

To explain collect_set() function in hive 

 

hash_id | num_of_cats

=====================
ad3jkfk 4
ad3jkfk 4
ad3jkfk 2
fkjh43f 1
fkjh43f 8
fkjh43f 8
rjkhd93 7
rjkhd93 4
rjkhd93 7

 

should return:

 

hash_agg | cats_aggregate

===========================
ad3jkfk Array<int>(4,4,2)
fkjh43f Array<int>(1,8,8)
rjkhd93 Array<int>(7,4,7)

 

 

I can get this output using collect_set() in Hive.

 

But this collect_set() inbuilt function accepts only primitive data types. Is there anyway i can extend this function to accept struct datatype also.?

Posted

ltt stackoverflow hadoop forums lo essuko va.//

 

 

Eskunna bhaiya.. evaru respond kaaledhu.. andhukey ikkada vesa

Posted

http://stackoverflow...keep-duplicates

 

check this...

.//
 
 
thanks for your time bhaiya.. yes.. that in-built function in hive later evolved in 2013 is  solution for PRIMITIVE DATA TYPES int , string, etc. but i need the same function to work for non-primitive types like structs... iippudu existing unna function source code ki velli methods ni overwrite cheyyali... adhi naaku raaadhu.. anduke ee post
 
Posted

[email protected]    post cheyyi u r question...

 

with your sample source code...hope you get help...........

×
×
  • Create New...