




[xingoo@localhost tmp]$ cat aa.txt
1 a 3
2 b 4
3 c 1
[xingoo@localhost tmp]$ cat bb.txt
1 xxx 2
2 yyy 3
3 zzz 5


hive> create table aa
> (a string,b string,c string)
> row format delimited
> fields terminated by ' ';
OK
Time taken: 0.19 seconds
hive> create table bb like aa;
OK
Time taken: 0.188 seconds


hive> describe aa;
OK
a string
b string
c string
Time taken: 0.068 seconds, Fetched: 3 row(s)
hive> describe bb;
OK
a string
b string
c string
Time taken: 0.045 seconds, Fetched: 3 row(s)


hive> load data local inpath '/usr/tmp/aa.txt' overwrite into table aa;
Loading data to table test.aa
OK
Time taken: 0.519 seconds
hive> load data local inpath '/usr/tmp/bb.txt' overwrite into table bb;
Loading data to table test.bb
OK
Time taken: 0.321 seconds



hive> select * from aa a join bb b on a.c=b.a;
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824161233_f9ecefa2-e5d7-416d-8d90-e191937e7313
Total jobs = 1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/apache-hive-2.1.0-bin/lib/log4j-slf4j-impl-2.4.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hadoop/hadoop-2.6.4/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2016-08-24 16:12:44 Starting to launch local task to process map join; maximum memory = 518979584
2016-08-24 16:12:47 Dump the side-table for tag: 0 with group count: 3 into file: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-12-33_145_337836390845333215-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile00--.hashtable
2016-08-24 16:12:47 Uploaded 1 File to: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-12-33_145_337836390845333215-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile00--.hashtable (332 bytes)
2016-08-24 16:12:47 End of local task; Time Taken: 3.425 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2016-08-24 16:12:50,222 Stage-3 map = 100%, reduce = 0%
Ended Job = job_local944389202_0007
MapReduce Jobs Launched:
Stage-Stage-3: HDFS Read: 1264 HDFS Write: 90 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
3 c 1 1 xxx 2
1 a 3 3 zzz 5
Time taken: 17.083 seconds, Fetched: 2 row(s)
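
The inner join above keeps only the row pairs where `a.c` equals `b.a`. As a rough illustration of that rule, here is a minimal Python sketch; it is not how Hive executes the query, and the helper name `inner_join` is made up for this example. It builds a lookup table on one side, loosely mirroring the hash table that the map-join log mentions.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

def inner_join(left, right):
    """Sketch of `aa a JOIN bb b ON a.c = b.a`: keep only pairs whose keys match."""
    by_key = {}
    for r in right:
        by_key.setdefault(r[0], []).append(r)  # index bb rows by column a
    return [l + r for l in left for r in by_key.get(l[2], [])]

inner_rows = inner_join(aa, bb)
for row in inner_rows:
    print(" ".join(row))
```

The sketch yields the same two rows as Hive, though not necessarily in the same order; without an `ORDER BY`, row order should not be relied on in either case.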



hive> select * from aa a left outer join bb b on a.c=b.a;
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824161637_6d540592-13fd-4f59-a2cf-0a91c0fc9533
Total jobs = 1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/apache-hive-2.1.0-bin/lib/log4j-slf4j-impl-2.4.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hadoop/hadoop-2.6.4/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2016-08-24 16:16:48 Starting to launch local task to process map join; maximum memory = 518979584
2016-08-24 16:16:51 Dump the side-table for tag: 1 with group count: 3 into file: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-16-37_813_4572869866822819707-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile11--.hashtable
2016-08-24 16:16:51 Uploaded 1 File to: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-16-37_813_4572869866822819707-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile11--.hashtable (338 bytes)
2016-08-24 16:16:51 End of local task; Time Taken: 2.634 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2016-08-24 16:16:53,843 Stage-3 map = 100%, reduce = 0%
Ended Job = job_local1670258961_0008
MapReduce Jobs Launched:
Stage-Stage-3: HDFS Read: 1282 HDFS Write: 90 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
1 a 3 3 zzz 5
2 b 4 NULL NULL NULL
3 c 1 1 xxx 2
Time taken: 16.048 seconds, Fetched: 3 row(s)
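
In the left outer join, every row of `aa` survives, and a row with no partner in `bb` (here `2 b 4`) is padded with `NULL`. The following Python sketch illustrates that behaviour under the same sample data; `left_outer_join` is a hypothetical helper written for this example, not part of Hive.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

def left_outer_join(left, right):
    """Sketch of `aa a LEFT OUTER JOIN bb b ON a.c = b.a`."""
    by_key = {}
    for r in right:
        by_key.setdefault(r[0], []).append(r)  # index bb rows by column a
    out = []
    for l in left:
        matches = by_key.get(l[2], [])
        if matches:
            out.extend(l + r for r in matches)
        else:
            out.append(l + (None, None, None))  # no partner: pad bb's columns
    return out

left_rows = left_outer_join(aa, bb)
for row in left_rows:
    print(" ".join("NULL" if v is None else v for v in row))
```

Because the sketch walks `aa` in file order, it prints the three rows in the same order as the Hive result above.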



hive> select * from aa a right outer join bb b on a.c=b.a;
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824162227_5d0f0090-1a9b-4a3f-9e82-e93c4d180f4b
Total jobs = 1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/apache-hive-2.1.0-bin/lib/log4j-slf4j-impl-2.4.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hadoop/hadoop-2.6.4/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2016-08-24 16:22:37 Starting to launch local task to process map join; maximum memory = 518979584
2016-08-24 16:22:40 Dump the side-table for tag: 0 with group count: 3 into file: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-22-27_619_7820027359528638029-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile20--.hashtable
2016-08-24 16:22:40 Uploaded 1 File to: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-22-27_619_7820027359528638029-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile20--.hashtable (332 bytes)
2016-08-24 16:22:40 End of local task; Time Taken: 2.368 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2016-08-24 16:22:43,060 Stage-3 map = 100%, reduce = 0%
Ended Job = job_local2001415675_0009
MapReduce Jobs Launched:
Stage-Stage-3: HDFS Read: 1306 HDFS Write: 90 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
3 c 1 1 xxx 2
NULL NULL NULL 2 yyy 3
1 a 3 3 zzz 5
Time taken: 15.483 seconds, Fetched: 3 row(s)
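
The right outer join is the mirror image: every row of `bb` survives, and `2 yyy 3`, which has no matching `a.c`, gets `NULL` for the `aa` columns. A small Python sketch of that rule follows, with `right_outer_join` as an illustrative name only.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

def right_outer_join(left, right):
    """Sketch of `aa a RIGHT OUTER JOIN bb b ON a.c = b.a`."""
    by_key = {}
    for l in left:
        by_key.setdefault(l[2], []).append(l)  # index aa rows by column c
    out = []
    for r in right:
        matches = by_key.get(r[0], [])
        if matches:
            out.extend(l + r for l in matches)
        else:
            out.append((None, None, None) + r)  # no partner: pad aa's columns
    return out

right_rows = right_outer_join(aa, bb)
for row in right_rows:
    print(" ".join("NULL" if v is None else v for v in row))
```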



hive> select * from aa a full outer join bb b on a.c=b.a;
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824162252_c71b2fae-9768-4b9a-b5ad-c06d7cdb60fb
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks not specified. Estimated from input data size: 1
In order to change the average load for a reducer (in bytes):
set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
set mapreduce.job.reduces=<number>
Job running in-process (local Hadoop)
2016-08-24 16:22:54,111 Stage-1 map = 100%, reduce = 100%
Ended Job = job_local1766586034_0010
MapReduce Jobs Launched:
Stage-Stage-1: HDFS Read: 4026 HDFS Write: 270 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
3 c 1 1 xxx 2
NULL NULL NULL 2 yyy 3
1 a 3 3 zzz 5
2 b 4 NULL NULL NULL
Time taken: 1.689 seconds, Fetched: 4 row(s)
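
The full outer join returns the matched pairs plus the unmatched rows from both sides, which gives four rows here. Unlike the other queries, the log for this one shows a reduce stage (`Stage-1 map = 100%, reduce = 100%`) and no local map-join task. The Python sketch below only illustrates the result set, not that execution plan; `full_outer_join` is a made-up helper.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

def full_outer_join(left, right):
    """Sketch of `aa a FULL OUTER JOIN bb b ON a.c = b.a`."""
    out, matched_right = [], set()
    for l in left:
        hits = [i for i, r in enumerate(right) if l[2] == r[0]]
        matched_right.update(hits)  # remember which bb rows found a partner
        if hits:
            out.extend(l + right[i] for i in hits)
        else:
            out.append(l + (None, None, None))  # unmatched aa row
    # Append the bb rows that never matched, padded on the aa side.
    out.extend((None, None, None) + r
               for i, r in enumerate(right) if i not in matched_right)
    return out

full_rows = full_outer_join(aa, bb)
for row in full_rows:
    print(" ".join("NULL" if v is None else v for v in row))
```

The four rows match the Hive result as a set; the order differs, and neither order is guaranteed without an `ORDER BY`.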



hive> select * from aa a left semi join bb b on a.c=b.a;
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824162327_e7fc72a7-ef91-4d39-83bc-ff8159ea8816
Total jobs = 1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/apache-hive-2.1.0-bin/lib/log4j-slf4j-impl-2.4.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hadoop/hadoop-2.6.4/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2016-08-24 16:23:37 Starting to launch local task to process map join; maximum memory = 518979584
2016-08-24 16:23:41 Dump the side-table for tag: 1 with group count: 3 into file: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-23-27_008_3026796648107813784-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile31--.hashtable
2016-08-24 16:23:41 Uploaded 1 File to: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-23-27_008_3026796648107813784-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile31--.hashtable (317 bytes)
2016-08-24 16:23:41 End of local task; Time Taken: 3.586 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2016-08-24 16:23:43,798 Stage-3 map = 100%, reduce = 0%
Ended Job = job_local521961878_0011
MapReduce Jobs Launched:
Stage-Stage-3: HDFS Read: 1366 HDFS Write: 90 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
1 a 3
3 c 1
Time taken: 16.811 seconds, Fetched: 2 row(s)
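
The left semi join returns only the columns of `aa`, and only for rows whose `a.c` appears somewhere in `b.a`; it behaves like an `IN` or `EXISTS` filter. The Python sketch below illustrates this with a hypothetical `left_semi_join` helper. The extra row `("1", "dup", "9")` is invented for the example to show that a second match on the right does not duplicate the left row.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

def left_semi_join(left, right):
    """Sketch of `aa a LEFT SEMI JOIN bb b ON a.c = b.a`."""
    keys = {r[0] for r in right}               # distinct values of b.a
    return [l for l in left if l[2] in keys]   # keep aa rows only, each at most once

semi_rows = left_semi_join(aa, bb)
for row in semi_rows:
    print(" ".join(row))

# Hypothetical extra bb row with a repeated key: the result is unchanged.
semi_rows_dup = left_semi_join(aa, bb + [("1", "dup", "9")])
```

An inner join over the same extended `bb` would instead return `3 c 1` twice, which is the practical difference between the two.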



hive> select * from aa join bb;
Warning: Map Join MAPJOIN[9][bigTable=?] in task 'Stage-3:MAPRED' is a cross product
WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases.
Query ID = root_20160824162449_20e4b5ec-768f-48cf-a840-7d9ff360975f
Total jobs = 1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/apache-hive-2.1.0-bin/lib/log4j-slf4j-impl-2.4.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hadoop/hadoop-2.6.4/share/hadoop/common/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.apache.logging.slf4j.Log4jLoggerFactory]
2016-08-24 16:25:00 Starting to launch local task to process map join; maximum memory = 518979584
2016-08-24 16:25:02 Dump the side-table for tag: 0 with group count: 1 into file: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-24-49_294_2706432574075169306-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile40--.hashtable
2016-08-24 16:25:02 Uploaded 1 File to: file:/usr/hive/tmp/xingoo/a69078ea-b7d5-4a78-9342-05a1695e9f98/hive_2016-08-24_16-24-49_294_2706432574075169306-1/-local-10004/HashTable-Stage-3/MapJoin-mapfile40--.hashtable (305 bytes)
2016-08-24 16:25:02 End of local task; Time Taken: 2.892 sec.
Execution completed successfully
MapredLocal task succeeded
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2016-08-24 16:25:05,677 Stage-3 map = 100%, reduce = 0%
Ended Job = job_local2068422373_0012
MapReduce Jobs Launched:
Stage-Stage-3: HDFS Read: 1390 HDFS Write: 90 SUCCESS
Total MapReduce CPU Time Spent: 0 msec
OK
1 a 3 1 xxx 2
2 b 4 1 xxx 2
3 c 1 1 xxx 2
1 a 3 2 yyy 3
2 b 4 2 yyy 3
3 c 1 2 yyy 3
1 a 3 3 zzz 5
2 b 4 3 zzz 5
3 c 1 3 zzz 5
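
A join with no `ON` condition pairs every row of `aa` with every row of `bb`, which is why Hive warns that the map join "is a cross product" and returns 3 x 3 = 9 rows. A short Python sketch of the same Cartesian product follows; the loop order is chosen only so the rows come out in the order shown above.

```python
# Rows copied from aa.txt and bb.txt; all three columns are strings, as in the Hive tables.
aa = [("1", "a", "3"), ("2", "b", "4"), ("3", "c", "1")]
bb = [("1", "xxx", "2"), ("2", "yyy", "3"), ("3", "zzz", "5")]

# Every aa row paired with every bb row; bb varies in the outer loop.
cross_rows = [l + r for r in bb for l in aa]

for row in cross_rows:
    print(" ".join(row))
```

The row count grows as the product of the two table sizes, so on real data an unconditioned join is usually a mistake rather than an intent.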



  1. Hive Study: Hive JOIN Use Cases Explained in Detail

