11G新特性 -- Multicolumn Statistics (Column groups)

默认oracle会收集表中各个列的统计信息，但是会忽略列之间的关联关系。在大多情况下，优化器假设在复杂查询中的列之间是独立的。当where子句后指定了一个表的多个列条件时，优化器通常会将多个列的选择性（selectivity）相乘得到where语句的选择性，导致优化器做出错误判断！
Oracle 11g引入了多列统计信息概念，如果上面情况列关联性很好，可以做多列统计信息收集，让优化器做出正确判断。

在oracle 10g中，只有在一些特殊场合，优化器才会考虑列之间的关联关系：
-The optimizer used the number of distinct keys in an index to estimate selectivity provided all columns of a conjunctive predicate match all columns of a concatenated index key. In addition, the predicates must be equalities used in equijoins.
- If you set DYNAMIC_SAMPLING to level 4, the optimizer used dynamic sampling to estimate the selectivity of predicates involving multiple columns from a table. Because the sampling size is quite small, the results are dubious in most cases.

创建Column Groups:

DECLARE

  cg_name varchar2();

BEGIN

  cg_name := dbms_stats.create_extended_stats(null,'customers', '(cust_state_province,country_id)');

END;

/

查看Column Groups:

SQL> select extension_name, extension from dba_stat_extensions where table_name='CUSTOMERS';

EXTENSION_NAME                 EXTENSION

------------------------------ --------------------------------------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_ ("CUST_STATE_PROVINCE","COUNTRY_ID")

或者

SQL>  select sys.dbms_stats.show_extended_stats_name ('sh','customers','(cust_state_province,country_id)') col_group_name from dual;

COL_GROUP_NAME

--------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_

删除:

SQL> exec dbms_stats.drop_extended_stats('sh','customers','(cust_state_province, country_id)');

收集Column Groups的统计信息:

SQL> exec dbms_stats.gather_table_stats('sh','customers',method_opt =>'for all columns size skewonly for columns (cust_state_province,country_id) size skewonly');

监控Column Groups:

--查询多列统计信息

SQL> Select extension_name, extension from user_stat_extensions where table_name='CUSTOMERS';

EXTENSION_NAME                 EXTENSION

------------------------------ --------------------------------------------------------------------------------

SYS_STU#S#WF25Z#QAHIHE#MOFFMM_ ("CUST_STATE_PROVINCE","COUNTRY_ID")

SQL>

--查看distinct数和柱状图使用情况

SQL> select e.extension col_group, t.num_distinct, t.histogram from user_stat_extensions e, user_tab_col_statistics t where e.extension_name = t.column_name and e.table_name = t.table_name and t.table_name = 'CUSTOMERS';

COL_GROUP                                                                        NUM_DISTINCT HISTOGRAM

-------------------------------------------------------------------------------- ------------ ---------------

("CUST_STATE_PROVINCE","COUNTRY_ID")                                                       FREQUENCY

SQL>

实验：
1）当不使用多列统计信息时，真实结果是3341，执行计划是1132.

SQL> exec dbms_stats.drop_extended_stats('sh','customers','(cust_state_province,country_id)');

PL/SQL procedure successfully completed.

SQL> select count(*) from sh.customers where CUST_STATE_PROVINCE = 'CA' and country_id=;

  COUNT(*)

----------

Execution Plan

----------------------------------------------------------

Plan hash value: 

--------------------------------------------------------------------------------

| Id  | Operation          | Name      | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------

|    | SELECT STATEMENT   |           |      |     |      ()| :: |

|    |  SORT AGGREGATE    |           |      |     |            |          |

|*   |   TABLE ACCESS FULL| CUSTOMERS |   |  |      ()| :: |

--------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

    - filter("CUST_STATE_PROVINCE"='CA' AND "COUNTRY_ID"=)

Statistics

----------------------------------------------------------

          recursive calls

            db block gets

         consistent gets

            physical reads

            redo size

          bytes sent via SQL*Net to client

          bytes received via SQL*Net from client

            SQL*Net roundtrips to/from client

           sorts (memory)

            sorts (disk)

            rows processed

2）当使用多列统计信息时，真实结果是3341，执行计划是3437.

SQL> EXEC DBMS_STATS.GATHER_TABLE_STATS('SH','CUSTOMERS',METHOD_OPT =>'FOR ALL COLUMNS SIZE SKEWONLY FOR COLUMNS (CUST_STATE_PROVINCE,COUNTRY_ID) SIZE SKEWONLY');

PL/SQL procedure successfully completed.

SQL>  select count(*) from sh.customers where CUST_STATE_PROVINCE = 'CA' and country_id=;

  COUNT(*)

----------

Execution Plan

----------------------------------------------------------

Plan hash value: 

--------------------------------------------------------------------------------

| Id  | Operation          | Name      | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------

|    | SELECT STATEMENT   |           |      |     |      ()| :: |

|    |  SORT AGGREGATE    |           |      |     |            |          |

|*   |   TABLE ACCESS FULL| CUSTOMERS |   |  |      ()| :: |

--------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

    - filter("CUST_STATE_PROVINCE"='CA' AND "COUNTRY_ID"=)

Statistics

----------------------------------------------------------

            recursive calls

            db block gets

         consistent gets

            physical reads

            redo size

          bytes sent via SQL*Net to client

          bytes received via SQL*Net from client

            SQL*Net roundtrips to/from client

            sorts (memory)

            sorts (disk)

            rows processed

3）即以上情况，使用多列统计信息能让优化器得到更准确的判断！

11G新特性 -- Multicolumn Statistics (Column groups)的更多相关文章

11G新特性 -- Expression Statistics
当在查询中使用了function,返回值会受到影响. 比如: select count(*) from customers where lower(cust_state_province)='ca'; ...
11g新特性与12c新特性
1. 11g新特性概图管理新特性> 开发新特性> 2. 12c 新特性概图
11g新特性-自动sql调优(Automatic SQL Tuning)
11g新特性-自动sql调优(Automatic SQL Tuning) 在Oracle 10g中,引进了自动sql调优特性.此外,ADDM也会监控捕获高负载的sql语句. 在Oracle 11g中, ...
使用Oracle 11g新特性 Active Database Duplication 搭建Dataguard环境
Duplication Database 介绍 Duplicate database可以按照用途分为2种: duplicate database(复制出一个数据库) duplicate standby ...
Oracle 11g 新特性 --SQL Plan Management 说明
Oracle 11g 新特性 --SQL Plan Management 说明参见大神博主文章: http://blog.csdn.net/tianlesoftware/article/detail ...
Oracle 11g 新特性 – HM（Hang Manager）简介
在这篇文章中我们会对oracle 11g 新特性—hang 管理器(Hang Manager) 进行介绍.我们需要说明,HM 只在RAC 数据库中存在. 在我们诊断数据库问题的时候,经常会遇到一些数据 ...
11G 新特性之密码延迟认证
11G 新特性之密码延迟认证 11G 新特性之密码延迟认证 Table of Contents 1. 特性简述 2. 特性潜在引发问题 3. 关闭特性 1 特性简述为了防止用户密码的暴力破解,从 ...
11G新特性 -- Statistics Preferences
Statistics Preferences新特性可以实现对指定对象进行信息收集. 可以在table.schema.database.global级别设置statistics preference. ...
Oracle 11g新特性
文章转自网络 Oracle 11g于2007年7月11日美国东部时间11时(北京时间11日22时)正式发布,11g是甲骨文公司30年来发布的最重要的数据库版本,根据用户的需求实现了信息生命周期管理(I ...

随机推荐

Asp.Net Core 2.0 项目实战（1） NCMVC开源下载了
Asp.Net Core 2.0 项目实战(1) NCMVC开源下载了 Asp.Net Core 2.0 项目实战(2)NCMVC一个基于Net Core2.0搭建的角色权限管理开发框架 Asp.Ne ...
【Java】剑指offer(8) 用两个栈实现队列
本文参考自<剑指offer>一书,代码采用Java语言. 更多:<剑指Offer>Java实现合集题目用两个栈实现一个队列.队列的声明如下,请实现它的两个函数append ...
P1590 失踪的7
P1590 失踪的7进制转换的题目,如果把一个10进制的数当成9进制,相当于没有9这个数字,题目失踪了7,但是无所谓.如果当前的大于7,它就跳过了一个数字,向左移动1位. #include<io ...
在 Windows 10 中使用 OpenAI Spinning Up
前段时间,openAI 发布了 Spinning Up ,本篇博客将介绍如何在 Windows 系统中使用 Spinning Up. 什么是 Spinning Up 先来说说 Spinning Up, ...
python 搭建一个简单的搜索引擎
我把代码和爬好的数据放在了git上,欢迎大家来参考 https://github.com/linyi0604/linyiSearcher 我是在 manjaro linux下做的, 使用python3 ...
50 tips of JavaScript，这些坑你都知道吗？
1.在局部作用域中,使用var操作符定义的变量将成为定义该变量的作用域中的局部变量,省略var的会创建全局变量:在全局作用域中,不管是否使用var操作符定义的变量都会创建一个全局变量.但是,在全局作用 ...
Python图形编程探索系列-02-框架设计
跳转到我的博客设计任务在主窗口root中放置三个容器用于容纳组件,容器采用框架设计. 代码初步设计 import tkinter as tk root = tk.Tk() root.geometr ...
phpstorm破解
由于JetBrains系列新版本注册激活发生了变化,所以原来的激活方式已经不能在使用. 只能用新的方式来破解了.此方式支持所有系列的新版版.包括IDEA15,PHPSTORM10,WEBSTORM11 ...
vscode使用wsl调试代码
第一步在WSL中配好环境第二步安装CodeRunner即可,在用户配置中加入如下行: "terminal.integrated.shell.windows": "C:\ ...
asp.net通过distinct过滤集合(list)中重复项的办法
/// <summary> /// 权限Distinct比较器 /// </summary> public class PermissionIdComparer : IEqua ...

11G新特性 -- Multicolumn Statistics (Column groups)

11G新特性 -- Multicolumn Statistics (Column groups)的更多相关文章

随机推荐

热门专题