grunt> cat t.txt
kw1 2
kw3 1
kw2 4
kw1 5
kw2 2
cat test.pig
A = LOAD '/user/input/t.txt' as (k:chararray,c:int);
B = group A BY k;
C = foreach B generate group,SUM(A.c);
-- DUMP C;
store C into 'test.output';
$ pig -e 'illustrate -script test.pig'
20…
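To make the result of test.pig easier to check by hand, here is a minimal Python sketch of the same group-and-sum step applied to the sample rows from t.txt. It is only an in-memory illustration, not how Pig executes the script; the names `ROWS` and `sum_by_key` are hypothetical and do not come from the original post.

```python
from collections import defaultdict

# Sample rows copied from t.txt in the excerpt above, as (k, c) pairs.
ROWS = [("kw1", 2), ("kw3", 1), ("kw2", 4), ("kw1", 5), ("kw2", 2)]


def sum_by_key(rows):
    """Mimic `group A BY k` followed by `SUM(A.c)` for in-memory rows."""
    totals = defaultdict(int)
    for k, c in rows:
        totals[k] += c  # accumulate c per distinct key k
    return dict(totals)


if __name__ == "__main__":
    # Sorted only to make the printout deterministic; Pig does not
    # guarantee any ordering of the groups.
    for key, total in sorted(sum_by_key(ROWS).items()):
        print(key, total)
```

For the five sample rows this gives kw1 = 7, kw2 = 6 and kw3 = 1, which is what relation C should hold, assuming the file is parsed into the two declared fields.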
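The biggest differences between HiveQL (Hive SQL) and standard SQL
I have always used Pig, and now I need to work with Hive as well. I found some material online that seemed useful, so I am reproducing it here. My translation is probably not very accurate; I will write a proper summary once I am more familiar with Hive.
* No true date/time data types, no interval types, and many missing UDFs for manipulating dates (e.g. ADD_MONTH)
* Strict type matching without support for…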
TCPMon, a good tool for monitoring and debugging web services
http://ws.apache.org/commons/tcpmon/download.cgi
TCPMon Tutorial
Content: Introduction; Structure and Dependencies; Usage Patterns; As an Explicit Intermediate; As a Request Sender for Web Services; As a Proxy; Advanced Settings; Slo…