Archive: November 2013

Get top k elements of the same key in Hive
Summary: Key points: 1. Group rows by key and sort them, using distribute by and sort by. 2. Get the top k elements with a UDF (user-defined function), RANK. Here is the source code: package com.example.hive.udf; import org.apache.hadoop.hive.ql.exec.UDF; public final class Rank extends UDF{ priv... Read more

posted @ 2013-11-14 19:17 Torstan Views (517) Comments (0) Recommendations (0)
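The Rank source in the summary above is cut off, so the following is only a minimal sketch of the general idea, not the author's code. It assumes the rows have already been delivered grouped by key and sorted within each key (which is what distribute by and sort by arrange in Hive), and that keys are non-null. The class name `RankSketch` and the helper `topK` are hypothetical, and the sketch deliberately avoids the Hive `UDF` base class so it runs on its own.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public final class RankSketch {
    // State carried from one row to the next, as a rank-style UDF would keep it.
    private int counter;
    private String lastKey;

    // Returns the 0-based position of this row within its run of equal keys.
    // Assumes rows with the same key arrive contiguously and key is non-null.
    public int evaluate(String key) {
        if (!key.equals(lastKey)) {
            counter = 0;      // new key: restart the count
            lastKey = key;
        }
        return counter++;
    }

    // Keeps the first k rows of every key. Each row is {key, value};
    // the input must already be grouped by key and sorted within each key.
    public static List<String> topK(List<String[]> sortedRows, int k) {
        RankSketch rank = new RankSketch();
        List<String> out = new ArrayList<>();
        for (String[] row : sortedRows) {
            if (rank.evaluate(row[0]) < k) {
                out.add(row[0] + "=" + row[1]);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical data: sorted by value, descending, within each key.
        List<String[]> rows = Arrays.asList(
            new String[]{"a", "9"}, new String[]{"a", "7"}, new String[]{"a", "3"},
            new String[]{"b", "8"}, new String[]{"b", "2"});
        System.out.println(topK(rows, 2));
    }
}
```

The counter only works because equal keys are adjacent; if the rows were not distributed and sorted first, the same key could restart the count several times and more than k rows per key would pass the filter.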

My advice to young people - Donald Knuth [video]
Summary: http://www.youtube.com/watch?v=75Ju0eM5T2c I took a note of what Knuth said in the video. 1. Don't believe that because something is trendy, it is good. I probably go to the other extreme: if I find too many people adopting an idea, I probably think it is wrong. If my work is too popular, I probably think I have to change it. Of course, it seems too ridi... Read more

posted @ 2013-11-06 13:08 Torstan Views (256) Comments (0) Recommendations (0)
