<转> Intel VTune分析结果中的名词释译

原文转自http://blog.chinaunix.net/uid-26000296-id-3369740.html

Elapsed Time（执行耗时）：

the total time your target ran, is calculated as follows:

Wall clock time at end of application – Wall clock time at start of application

应用程序的整个的运行时间，等于 ”程序结束时间减程序开始时间”。

CPU Time:

Active processor Self time spent in the function. For multiple threads, CPU time is summed up. By default, the Self time is provided in seconds. The blue bar is a visual indicator of the CPU time usage. The longer the bar, the higher the value.

处理器工作的时间。

对于多线程来说，CPU时间是累加的。默认地，它是以秒为单位。

分析结果中的蓝色条指示的是CPU的使用时间，条越长，值越大。

In the Summary window, CPU time is the overall time that all processors spent working for the application. If there are multiple cores then the times are added. For example, if core 1 spends 4 seconds working for the application and core 2 spends 7 seconds then the CPU time will be 11 seconds. The CPU time can be greater than the Elapsed time. The upper bound for CPU time is Elapsed time * number of logical cores.

在总结窗口中，CPU时间是所有处理器花费在程序处理上的时间。

如果是多核，这个时间是累加的。

如果一个核花费了4秒用于工作，而另一个核花了7秒用于这个工作，那么CPU时间就是11秒。

所以CPU时间可以大于耗时。

CPU时间的上限是耗时*逻辑核个数。

Instructions Retired:

Modern processors execute much more instructions that the program flow needs. This is

called "speculative execution". Then the instructions that were "proven" as indeed

needed by flow are "retired". You can think about "retired" instuctions as only

instructions needed by the program flow.

现代处理器是以”预测执行”的方式执行指令，也就是说会执行比程序流程本身要多很多的指令，

如分支预判，提前计算等。而最后程序流程真正执行了的指令就是”retired”。可以认为

”retired”的指令是程序流真正执行了的指令。

I guess "retired instructions" means those instructions that are acturally executed and completed by CPU. The CPU some kind of prediction about the instructions to be excuted and put them into some place like a "pool". But not all of these instructions will be excuted.

CPI Rate：

Clockticks per Instructions Retired (CPI) event ratio, also known as Cycles per Instructions, is one of the basic performance metrics for the hardware event-based samplingcollection. This ratio is calculated as Clockticks / Instructions Retired.

每条指令执行完成的时钟周期。

When you want to determine where to focus your performance tuning effort, the CPI is the first metric to check. A good CPI rate indicates that the code is executing optimally.

As a general guide these numbers have been derived from experienced performance engineers:

Good	Poor
0.75	4

A high value for this ratio indicates that over the current code region, instructions are taking a high number of processor clocks to execute. This could indicate a problem if most of the instructions are not predominately high latency instructions and/or coming from microcode ROM. In this case there may be opportunities to modify your code to improve the efficiency with which instructions are executed within the processor.

现在的处理器通常是一个周期内可以执行多条指令。

例如，一个时钟周期内执行四条指令，那么CPI的最佳理论值为0.25。

如果这个值很高，则表示当前的代码块的指令使用了多个时钟周期来执行指令，如果这些指令不是高延迟指令，则表明这段代码是有问题。这时可以通过优化这个地方来提高效率。

Synchronization Context Switches（同步上下文切换）:

Number of times a thread was switched off a processor because of making an explicit

call to thread synchronization API. For example, in case of trying to wait on a

synchronization object already occupied by another thread, the number of synchronization

context switches will characterize the level of contention between threads.

因为显示调用线程同步API而导致的线程关闭处理的次数。

例如，尝试等待被其它线程占用的同步对象，同步上下文切换的次数反映了线程间竞争的等级。

Wait Count:

Number of times the corresponding system wait API was called. For a lock, it is the number of times the lock was contended and caused a wait.

系统wait API被调用的次数。

对于一个锁来说，指的是因为竞争锁而导致等待的次数。

Wait Rate:

Average Wait time (in milliseconds) per synchronization context switch. Low metric

value may signal an increased contention between threads and inefficient use of

system API.

每次同步上下文切换的平均等待时间(以毫秒为单位)。

该值较小表示的是线程间竞争的增加和系统API的低效使用

Estimated Call Count:

Statistical estimation of call counts based on hardware events.

基于硬件事件的调用次数的统计估计值。

Wait Time:

Duration of a thread inactivity due to contended synchronization.

由于竞争同步导致的线程不活动的时间。

Inactive Time:

Time during which a thread remained preempted from execution. Note that many threads can be inactive at any given point in time, so the sum of Wait and Inactive times of those threads can be much greater than the Total time of program execution.

线程执行时保持被其它线程抢先占用的时间。

在一个给定的时间点，会有很多线程是不活动的，因此，线程等待和不活动的时间之和会远大于程序执行的总时长。

Overhead Time:

Duration that starts with the release of a shared resource and ends with the receipt of that resource. Ideally, the duration of overhead time is very short because it reduces the time a thread has to wait to acquire a resource.

从共享资源被释放开始，到收到这个资源的时间。

这个时间应该很短，因为它是减少线程获得资源的等待时间。

Spin Time(轮询时间):

Wait Time during which the CPU is busy. This often occurs when a synchronization API causes the CPU to poll while the software thread is waiting. Some Spin Time may be preferable to the alternative of increased thread context switches. Too much Spin Time, however, can reflect lost opportunity for productive work.

因为CPU忙而导致的等待时间。

当同步API造成CPU轮询(因为软件线程在等待)时，这时就会发生轮询。

有些轮询时间会有利于增加线程上下文切换的选择。

然而，太多的轮询时间，反映了有效工作的不充分。

Idle Time:

Duration while a thread remained inactive (for any reason) and the system did not have any other task to execute (was idle). The Idle time is always less than any of the Wait and Inactive time.

当一个线程保持不活动且系统没有其它任务在执行的时间。

闲置时间总是小于等待和不活动时间。

<转> Intel VTune分析结果中的名词释译的更多相关文章

R语言读入数据库的中英名词互译测试并计分脚本（考试用）
1. 分子生物学中英文.csv,输入文件,两列,以tab键分隔的txt文本,没有列名 2. 错误的名解.csv, 如果在测试中拼写错误,会写出到这个文件,可用这个容易犯错的名词进行新的测试 3. 注意 ...
【性能分析】使用Intel VTune Amplifier
本文转自 https://software.intel.com/zh-cn/blogs/2010/11/10/amplxe-cl/版权归原作者所有,如原作者有任何不允许转载之理由,本文将自行删除. I ...
Intel VTune Amplifier XE 使用
VTune <VTune 开发者手册> 1. 安装 1.1 软件安装下载: (安装包下载地址) 安装: # 1.解压 tar -zxvf filename.tar.gz # 2.安装 c ...
intel vtune 介绍、安装和使用
intel vtune 介绍 https://software.intel.com/en-us/vtune intel vtune 安装包下载地址 https://software.intel.com ...
【Win 10 应用开发】分析 URI 中的查询字符串
分析URI中的字符有K种方法(K >= 2),如果查询字符串中的参数比较简单,可以通过子字符串查找的方式来处理:如果查询字符串相对复杂,你可以使用正则表达式来匹配 key1=value1 , ...
[转]DllMain中不当操作导致死锁问题的分析——DllMain中要谨慎写代码（完结篇）
在CSDN中发现这篇文章,讲解的比较详细,所以在这里备份一个.原文链接:http://blog.csdn.net/breaksoftware/article/details/8167641 DllMa ...
一个使用C#的TPL Dataflow Library的例子：分析文本文件中词频
博客搬到了fresky.github.io - Dawei XU,请各位看官挪步.最新的一篇是:一个使用C#的TPL Dataflow Library的例子:分析文本文件中词频.
从虚拟机指令执行的角度分析JAVA中多态的实现原理
从虚拟机指令执行的角度分析JAVA中多态的实现原理前几天突然被一个"家伙"问了几个问题,其中一个是:JAVA中的多态的实现原理是什么? 我一想,这肯定不是从语法的角度来阐释多态吧 ...
Linux的nmon监控结果分析文件中网络分析NET
1.首先,使用# ifconfig查看Linux系统中的网卡名称,有的是eth0,有的是em1,以查看结果为准,下图为em1 2.先试试Linux系统中有没有安装ethtool工具,没有的话,下载et ...

随机推荐

Android学习笔记进阶20 之得到图片的缩略图
<1>简介之前往往是通过Bitmap.Drawable和Canvas配合完成,需要写一系列繁杂的逻辑去缩小原有图片,从而得到缩略图. 现在我给大家介绍一种比较简单的方法:(网上有) 在A ...
Linux下MySQL允许远程连接以及授权命令
--针对某个库做授权 GRANT ALL PRIVILEGES ON *.* TO 'myuser'@'%' IDENTIFIED BY 'mypassword' WITH GRANT OPTION; ...
linux系统下的/proc目录介绍
1. /proc目录 Linux 内核提供了一种通过 /proc 文件系统,在运行时访问内核内部数据结构.改变内核设置的机制.proc文件系统是一个伪文件系统,它只存在内存当中,而不占用外存空间.它以 ...
初识Django框架——环境搭建前你需要了解的几点
Django是一个开放源代码的Web应用框架,由Python写成. 采用了MVC的框架模式,即模型M,视图V和控制器C. 它最初是被开发来用于管理劳伦斯出版集团旗下的一些以新闻内容为主的网站的,即是C ...
洛谷——P2695 骑士的工作
https://www.luogu.org/problem/show?pid=2695 题目背景你作为一个村的村长,保卫村庄是理所当然的了.今天,村庄里来了一只恶龙,他有n个头,恶龙到处杀人放火.你 ...
关于我遇到的“LNK 2019无法解析的外部符号”的链接错误
昨天在调试程序的时候出现了"LNK 2019无法解析的外部符号"的问题(VS2008),依照网上说的方法都没有解决这个问题,最后在项目文件里发现有两个同名的可是不在同一个目录下的c ...
fragment-实例创建跟使用
今天接触到了fragment,在网上看到了它的使用方法.现在把它的创建跟使用记录下,方便以后查询. 1.创建一个自己的fragment,继承自fragment package com.android. ...
gridView -item 大小调节(dimen-代码引用)
今天在修改一个gridview的时候,发现里面的内容并不会自动适应,填满整个gridview,而是会产生滑动,尝试了很多的方法,包括在item文件中设定width和height,结果,宽度可调,高度却 ...
git 工具的使用总结（6）-提交合并处理
1.撤消修改 1)revert:反转提交,它就是把你的一个提交先撤消掉,但是,它跟reset不同的是,你的这次这小会留下记录,这样在你下次需要的时候,可以通过这个节点把撤消的提交恢复 zhangshu ...
vue使用marked.js实现markdown转html并提取标题生成目录
html: <template> <div class="wrapper"> <div class="container"> ...

<转> Intel VTune分析结果中的名词释译

<转> Intel VTune分析结果中的名词释译的更多相关文章

随机推荐

热门专题