SSIS 阻塞,半阻塞和全阻塞 (Non-blocking, semi-blocking and Fully-blocking) transformations清单
三种Blocking类型,这里跟数据流的Buff关系很大:
■■ non-blocking transformations,每一行直接转换输出,没有等待.
■■ partial-blocking transformation,直到存储一定数量的好才输出。
■■ blocking transformation, 在输出前必须先读入所有行。
Logical Row-Level Transformations
| Data flow transformation | purpose | blocking type |
| Audit | Adds additional columns to each row based on sys- tem package variables such as ExecutionStartTime and PackageName. | N |
| Cache Transform | Allows you to write data to a cache with the Cache con- nection manager. The data can then be used by the Lookup transformation. This is useful if you are using multiple Lookup transformations against the same data, because SSIS will cache the needed data only once and not for each Lookup component. | N |
| Character Map | Performs common text operations such as Uppercase and allows advanced linguistic bit-conversion operations. | N |
| Copy Column | Duplicates column values in each row to a new named column. | N |
| Data Conversion | Creates a new column in each row based on a new data type converted from the existing column. An example is converting text to numeric data or text to Unicode text. | N |
| Data flow transformation | purpose | blocking type |
| Derived Column | Creates or replaces a column for each row based on a specified SSIS expression. This is the most often used logical row-level transformation because it enables the replacement of column values or the creation of new columns based on existing columns, variables, and parameters. |
N |
| Export Column | Exports binary large objects (BLOB) columns, one row at a time, to a file. | N |
| Import Column | Loads binary files such as images into the pipeline; intend- ed for a BLOB data type destination. | N |
| Row Count | Tracks the number of rows that flow through the transfor- mation and stores the number in a package variable after the final row. | N |
Multi-Input and Multi-Output Transformations
| Data flow transformation | purpose | blocking type |
| CDC Splitter | Splits a single flow of changed rows from the CDC source component into multiple data flows based on the type of the source data change (that is, whether it is an insert, update, or delete operation). CDC Splitter routes the data based on the __$operation column into three possible outputs. this transformation is like a specific version of the Conditional Split transformation that automatically handles the standard values of the __$operation column. |
N |
| Conditional Split | Routes or filters data based on a Boolean expression to one or more outputs, from which each row can be sent out only one output path. | N |
| Lookup | Performs a lookup operation between a current row and an external dataset on one or more columns. Additional columns can be added to the data flow from the external dataset. | N |
| Merge | Combines the rows of two similar sorted inputs, one on top of the other, based on a defined sort key. | P |
| Merge Join | Joins the rows of two sorted inputs based on a defined join column or columns, adding columns from each source. | P |
| Multicast | Generates one or mode identical outputs, from which every row is sent out every output. This transformation creates a logical copy of the data. | N |
| Union All | Combines one or more similar inputs, stacking rows one on top of another, based on matching columns. The number of rows in the output of Union All is the combined row counts of all the inputs. | P |
Multi-Row Transformations
| Data flow transformation | purpose | blocking type |
| Aggregate | Associates rows based on defined grouping and generates aggregations such as SUM, MAX, MIN, and COUNT. | B |
| Percent Sampling | Filters the input rows by allowing only a defined percent to be passed to the output path. | N |
| Pivot | Takes multiple input rows and pivots the rows to generate an output with more columns based on the original row values. | P |
| Row Sampling | Generates a fixed number of rows, sampling the data from the entire input, no matter how much larger than the defined output the input is. | B |
| Sort | Orders the input based on defined sort columns and sort directions. The Sort transformation also allows the removal of duplicates across the sort columns. | B |
| Unpivot | Takes a single row and generates multiple rows, moving column values to the new row based on defined columns. | P |
Advanced Data-Preparation Transformations
| Data flow transformation | purpose | blocking type |
| DQS Cleansing | Validates rows by automatically per- forming data cleansing using an exist- ing knowledge base in Data Quality Services (DQS). | P |
| OLE DB Command | Performs database operations such as updates or deletions, one row at a time, based on mapped parameters from input rows. | N |
| Slowly Changing Dimension | Generates transformations necessary to support loading dimension tables in data warehouse scenarios. This transformation handles SCD (Slowly Changing Dimension) Type 1 and Type 2 and also has support for inferred members. Chapter 7 focuses on this transformation. | N |
| Data Mining Query | Applies input rows against a data min- ing model for prediction. | P |
| Fuzzy Grouping | Performs de-duplication based on similarity of string values in selected columns. | B |
| Fuzzy Lookup | Joins a data flow input to a reference table based on column similarity. The Similarity Threshold setting specifies the closeness of allowed matches—a high setting means that matching val- ues are close in similarity. | B |
| Script Component | Applies custom .NET scripting capabilities against rows, columns, inputs, and outputs in the data flow pipeline. This is the most powerful component. Chapter 19, “Implementing Custom Code in SSIS Packages” looks at some of its possibilities. | N |
| Term Extraction | Analyzes text input columns for English-language nouns and noun phrases. | P |
| Term Lookup | Analyzes text input columns against a user-defined set of words for associa- tion. | P |
SSIS 阻塞,半阻塞和全阻塞 (Non-blocking, semi-blocking and Fully-blocking) transformations清单的更多相关文章
- 微软BI 之SSIS 系列 - 理解Data Flow Task 中的同步与异步, 阻塞,半阻塞和全阻塞以及Buffer 缓存概念
开篇介绍 在 SSIS Dataflow 数据流中的组件可以分为 Synchronous 同步和 Asynchronous 异步这两种类型. 同步与异步 Synchronous and Asynchr ...
- php使用flock阻塞写入文件和非阻塞写入文件的实例讲解
php使用flock阻塞写入文件和非阻塞写入文件的实例讲解: 阻塞写入代码:(所有程序会等待上次程序执行结束才会执行,30秒会超时) <?php $file = fopen("test ...
- 同步I/O、异步I/O与阻塞I/O、非阻塞I/O的区别
一.I/O I/O (Input/Output,输入/输出)即数据的读取(接收)或写入(发送)操作. 通常用户进程中的一个完整I/O分为两阶段:用户进程空间<-->内核空间.内核空间< ...
- (原创)JAVA阻塞队列LinkedBlockingQueue 以及非阻塞队列ConcurrentLinkedQueue 的区别
阻塞队列:线程安全 按 FIFO(先进先出)排序元素.队列的头部 是在队列中时间最长的元素.队列的尾部 是在队列中时间最短的元素.新元素插入到队列的尾部,并且队列检索操作会获得位于队列头部的元素.链接 ...
- 深入理解非阻塞同步IO和非阻塞异步IO
这两篇文章分析了Linux下的5种IO模型 http://blog.csdn.net/historyasamirror/article/details/5778378 http://blog.csdn ...
- java 多线程阻塞队列 与 阻塞方法与和非阻塞方法
Queue是什么 队列,是一种数据结构.除了优先级队列和LIFO队列外,队列都是以FIFO(先进先出)的方式对各个元素进行排序的.无论使用哪种排序方式,队列的头都是调用remove()或poll()移 ...
- 定时器详解和应用、js加载阻塞、css加载阻塞
1.setTimeout().setInterval()详解和应用 1.1 详解: setTimeout.setInterval执行时机 1.2 存在问题: setInterval重复定时器可能存在的 ...
- Python之阻塞IO模型与非阻塞IO模型
Python之阻塞IO模型与非阻塞IO模型 IO模型 1 阻塞IO: 全程阻塞 2 非阻塞IO: 发送多次系统调用: 优点:wait for data时无阻塞 缺点:1 系统调用太多 2 数据不是实时 ...
- IO阻塞模型、IO非阻塞模型、多路复用IO模型
IO操作主要包括两类: 本地IO 网络IO 本地IO:本地IO是指本地的文件读取等操作,本地IO的优化主要是在操作系统中进行,我们对于本地IO的优化作用十分有限 网络IO:网络IO指的是在进行网络操作 ...
- 阻塞I/O、非阻塞I/O和I/O多路复用、怎样理解阻塞非阻塞与同步异步的区别?
“阻塞”与"非阻塞"与"同步"与“异步"不能简单的从字面理解,提供一个从分布式系统角度的回答.1.同步与异步 同步和异步关注的是消息通信机制 (syn ...
随机推荐
- input 输入框获得/失去焦点时隐藏/显示文字(jquery版)
input输入框在获得或失去焦点时隐藏或显示文字,这样的焦点效果想必很多朋友在填写form表格的时候都曾见识过吧,本文使用jquery实现以下,感兴趣的朋友可以参考下哈 大家可以看效果图的搜索输入框, ...
- html5 input的type属性启动数字输入法
html5 input的type属性启动数字输入法 当文本框只能输入数字是一个很常见的需求,比如电话号码,身份证号,卡号, 数量....等等只允许数字输入,为了更好的用户体验性,直接写出 启动数字 ...
- HTML5 调用手机相册和摄像头的方法并上传微信下测试通过
<input type="file" capture="camera" accept="image/*" id="camer ...
- Centos6.5中 一键安装LNMP 安装Yii2.0 手工配置
1.一键安装LNMP cd /usr wget -c http://soft.vpser.net/lnmp/lnmp1.2-full.tar.gz tar zxf lnmp1.-full.tar.gz ...
- LBA(逻辑区块地址)及PBA(物理区块地址)
LBA,全称为Logical Block Address,是PC数据存储装置上用来表示数据所在位置的通用机制,我们最常见到使用它的装置就是硬盘.LBA可以指某个数据区块的地址或者某个地址上所指向的数据 ...
- 动态规划以及在leetcode中的应用
之前只是知道动态规划是通过组合子问题来解决原问题的,但是如何分析,如何应用一直都是一头雾水.最近在leetcode中发现有好几道题都可以用动态规划方法进行解决,就此做下笔录. 动态规划:应用于子问题重 ...
- uber司机如何注册 uber司机详细注册流程
注意:本文驾驶证行驶证图片为本人拍摄,请广大网友勿作它用 转载请修改! 详细的介绍注册优步uber司机端步骤; uber对司机的要求: 车辆为本地牌照 车龄在5年以内 裸车价格在10万以上 目 ...
- 一步一步学android之控件篇——ScrollView
一个手机的屏幕大小是有限的,那么我要显示的东西显示不下怎么办?这就会使用到ScrollView来进行滚动显示,他的定义如下: 可以看到ScrollView是继承于FrameLayout的,所以Scro ...
- 一步一步学数据结构之1--n(通用树)
今天来看大家介绍树,树是一种非线性的数据结构,树是由n个结点组成的有限集合,如果n=0,称为空树:如果n>0,则:有一个特定的称之为根的结点,它只有直接后继,但没有直接前驱:除根以外的其他结点划 ...
- css学习笔记四
广州天气变冷了,css学习笔记还是要总结. 总结: 1:几米页面静态页面主要是一列结构头部banner图,mainbody部分放文字内容和图书图片,底部是页面的版权信息 2:腾讯软件中心静态页面制作( ...