2015-05-21 Created By BaoXinjian

I. Summary


I used to think of MERGE INTO merely as a convenience for certain scenarios; only today did I realize how much faster MERGE INTO can be than UPDATE when updating data.

In fact, the UPDATE part of a MERGE INTO is no different from a plain UPDATE; what changes is the execution plan the optimizer chooses once MERGE INTO is used.

For this kind of table-driven bulk update, MERGE is the most concise and most efficient approach, so prefer it when updating large volumes of data.

1. Basic syntax

merge into test1 using test2
on (test1.id = test2.id)
when matched then update
set test1.name = nvl2(test1.name, test2.name, test1.name);  -- nvl2: if test1.name is not null take test2.name, otherwise keep test1.name (NULL names stay NULL)

The updatable inline-view alternative (see the sketch below) requires a primary key or unique constraint on test2.id. This is easy to understand: every test1.id must match exactly one row in test2; if test2 contained several matching rows, Oracle could not decide which one to use for the update.

Alternatively, use on (test1.id = test2.id and test1.name = test2.name ...) to match on several columns and pin down a unique row, much like a unique index.
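For reference, a minimal sketch of the updatable inline-view form mentioned above (assuming test2.id carries a primary key or unique index; without one Oracle raises ORA-01779 because test2 is not key-preserved):

-- Sketch only: update test1 through a join view; requires test2.id to be unique.
update (select t1.name as old_name,
               t2.name as new_name
          from test1 t1
          join test2 t2 on t1.id = t2.id)
   set old_name = new_name;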

2. Use parallel execution to speed up large updates:

merge /*+parallel(test1,4)*/ into test1 using test2
on (test1.id = test2.id)
when matched then update
set test1.name = nvl2(test1.name,test2.name,test1.name);
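One caveat worth stating as an assumption: the parallel hint by itself typically only parallelizes the read side of the statement. For the DML itself to run in parallel, the session must first enable parallel DML, roughly as sketched below:

alter session enable parallel dml;

merge /*+ parallel(test1, 4) */ into test1 using test2
on (test1.id = test2.id)
when matched then update
set test1.name = nvl2(test1.name, test2.name, test1.name);

commit;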

II. Test Case - Update / Merge Into


1. Create test data

create table test1 as select * from dba_objects where rownum <= 10000;  -- 10,000 rows

create table test2 as select * from dba_objects;  -- 73,056 rows
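Optionally (not part of the original test, which leaves the tables unanalyzed and relies on dynamic sampling, as the plan notes below show), statistics could be gathered on both tables so the optimizer sees accurate row counts:

begin
  dbms_stats.gather_table_stats(user, 'TEST1');
  dbms_stats.gather_table_stats(user, 'TEST2');
end;
/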

2. Time and cost of a direct UPDATE

SQL> alter system flush shared_pool;

System altered.

SQL> alter system flush buffer_cache;

System altered.

SQL> set linesize 400 pagesize 400
SQL> set autot trace
SQL> set timing on
SQL> update test1 t1
  2    set t1.object_name = (select t2.object_name
  3                            from test2 t2
  4                           where t2.object_id = t1.object_id);

10000 rows updated.

Elapsed: 00:06:33.35

Execution Plan
----------------------------------------------------------
0 UPDATE STATEMENT Optimizer=ALL_ROWS (Cost=2923252 Card=10011 Bytes=790869)
1 0 UPDATE OF 'TEST1'
2 1 TABLE ACCESS (FULL) OF 'TEST1' (TABLE) (Cost=40 Card=10011 Bytes=790869)
3 1 TABLE ACCESS (FULL) OF 'TEST2' (TABLE) (Cost=292 Card=772 Bytes=60988)

Statistics
----------------------------------------------------------
430 recursive calls
11122 db block gets
15275257 consistent gets
1175 physical reads
4058752 redo size
520 bytes sent via SQL*Net to client
668 bytes received via SQL*Net from client
3 SQL*Net roundtrips to/from client
7 sorts (memory)
0 sorts (disk)
10000 rows processed

3. Time and cost of MERGE INTO

SQL> alter system flush shared_pool;

System altered.

Elapsed: 00:00:00.45
SQL> alter system flush buffer_cache;

System altered.

Elapsed: 00:00:00.71
SQL> merge into test1 t1
  2    using test2 t2
  3    on (t1.object_id = t2.object_id)
  4    when matched then
  5      update set t1.object_name = t2.object_name;

10000 rows merged.

Elapsed: 00:00:00.92

Execution Plan
----------------------------------------------------------
0 MERGE STATEMENT Optimizer=ALL_ROWS (Cost=1243 Card=10011 Bytes=1321452)
1 0 MERGE OF 'TEST1'
2 1 VIEW
3 2 HASH JOIN (Cost=1243 Card=10011 Bytes=4264686)
4 3 TABLE ACCESS (FULL) OF 'TEST1' (TABLE) (Cost=40 Card=10011 Bytes=2192409)
5 3 TABLE ACCESS (FULL) OF 'TEST2' (TABLE) (Cost=292 Card=77163 Bytes=15972741)

Statistics
----------------------------------------------------------
1224 recursive calls
10279 db block gets
1586 consistent gets
1191 physical reads
2803872 redo size
526 bytes sent via SQL*Net to client
634 bytes received via SQL*Net from client
3 SQL*Net roundtrips to/from client
12 sorts (memory)
0 sorts (disk)
10000 rows processed

III. Explain Plans


1. Explain plan for UPDATE

SQL> set autot off
SQL> update /*+gather_plan_statistics*/ test1 t1
  2    set t1.object_name = (select t2.object_name
  3                            from test2 t2
  4                           where t2.object_id = t1.object_id);

10000 rows updated.

Elapsed: 00:04:32.81
SQL> select * from table(dbms_xplan.display_cursor(null,null,'iostats'));

PLAN_TABLE_OUTPUT
--------------------------------------------------------------------------------------------
SQL_ID c8qt9a54qgmqg, child number 0
-------------------------------------
update /*+gather_plan_statistics*/ test1 t1 set t1.object_name =
(select t2.object_name from test2 t2
where t2.object_id = t1.object_id)

Plan hash value: 3883393169

--------------------------------------------------------------------------------------
| Id | Operation | Name | Starts | E-Rows | A-Rows | A-Time | Buffers |
--------------------------------------------------------------------------------------
| 0 | UPDATE STATEMENT | | 1 | | 0 |00:04:32.73 | 10M|
| 1 | UPDATE | TEST1 | 1 | | 0 |00:04:32.73 | 10M|
| 2 | TABLE ACCESS FULL| TEST1 | 1 | 10011 | 10000 |00:00:00.17 | 133 |
|* 3 | TABLE ACCESS FULL| TEST2 | 10000 | 772 | 10000 |00:04:31.51 | 10M|
--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - filter("T2"."OBJECT_ID"=:B1)

Note
-----
   - dynamic sampling used for this statement (level=2)

26 rows selected.

Elapsed: 00:00:01.38

2. Explain plan for MERGE INTO

SQL> merge /*+gather_plan_statistics*/
  2    into test1 t1
  3    using test2 t2
  4    on (t1.object_id = t2.object_id)
  5    when matched then
  6      update set t1.object_name = t2.object_name;

10000 rows merged.

Elapsed: 00:00:00.52
SQL> select * from table(dbms_xplan.display_cursor(null,null,'iostats'));

PLAN_TABLE_OUTPUT
-------------------------------------------------------------------------------------------
SQL_ID 9n4tc6tvwaj9c, child number 0
-------------------------------------
merge /*+gather_plan_statistics*/ into test1 t1 using test2 t2 on
(t1.object_id = t2.object_id) when matched then update set
t1.object_name = t2.object_name

Plan hash value: 818823782

----------------------------------------------------------------------------------------
| Id | Operation | Name | Starts | E-Rows | A-Rows | A-Time | Buffers |
----------------------------------------------------------------------------------------
| 0 | MERGE STATEMENT | | 1 | | 0 |00:00:00.47 | 11458 |
| 1 | MERGE | TEST1 | 1 | | 0 |00:00:00.47 | 11458 |
| 2 | VIEW | | 1 | | 10000 |00:00:00.33 | 1179 |
|* 3 | HASH JOIN | | 1 | 10011 | 10000 |00:00:00.25 | 1179 |
| 4 | TABLE ACCESS FULL| TEST1 | 1 | 10011 | 10000 |00:00:00.08 | 133 |
| 5 | TABLE ACCESS FULL| TEST2 | 1 | 77163 | 73056 |00:00:00.26 | 1046 |
----------------------------------------------------------------------------------------

Predicate Information (identified by operation id):
---------------------------------------------------

   3 - access("T1"."OBJECT_ID"="T2"."OBJECT_ID")

Note
-----
   - dynamic sampling used for this statement (level=2)

28 rows selected.

Elapsed: 00:00:00.15

IV. Result Analysis


1. Comparing the results: both UPDATE and MERGE INTO updated 10,000 rows.

The UPDATE took about 6.5 minutes and consumed 15,275,257 consistent gets;

the MERGE INTO finished in under a second and consumed only 1,586 consistent gets — an enormous difference.

2. Looking at the execution plans, the result is actually easy to understand:

the correlated-subquery UPDATE behaves like a nested loop: for every row being updated, the inner query's table is scanned once;

MERGE INTO here chooses a hash join instead, so each table is full-scanned exactly once.

3. Oracle's own recommendation for large-volume updates is likewise to use MERGE INTO in place of UPDATE (a general MERGE skeleton is sketched below).
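As a reference point, a minimal MERGE skeleton against the same test tables (a sketch, not part of the original test): besides updating matched rows, the same statement can also insert rows that have no match, something a correlated UPDATE cannot do in a single pass.

merge into test1 t1
using test2 t2
on (t1.object_id = t2.object_id)
when matched then
  update set t1.object_name = t2.object_name
when not matched then
  insert (object_id, object_name)
  values (t2.object_id, t2.object_name);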

Thanks and Regards

Reference: http://blog.csdn.net/xiexbb/article/details/4242063
