cadvisor详解

一. cadvisor和k8s的耦合

cadvisor是一个谷歌开发的容器监控工具，它被内嵌到k8s中作为k8s的监控组件。现在将k8s中的cadvisor实现分析一下。

k8s中和cadvisor的代码主要在./pkg/kubelet/cadvisor目录下。在当前k8s版本(v1.13)中，kubelet主要调用的cadvisor方法如下：

MachineInfo
RootFsInfo
VersionInfo
GetDirFsInfo

GetFsInfo

---------------------------------------
ContainerInfoV2
SubcontainerInfo
ContainerInfo
WatchEvents

分割线之上的方法和cadvisor本身耦合较松，分割线之下的方法则和cadvisor耦合紧密。怎么样理解这里的耦合度呢？举例来说，对于分割线

之上的方法，例如MachineInfo，它的操作只是简单的读取本地文件以获取主机的信息。比如通过读取/proc/cpuinfo文件读取本地主机的cpu信息。

对于这种方法，我们可以非常轻松的移植他们。

而分割线之下的方法则很难从cadvisor中单独剥离出来，它们的实现是依赖于整个cadvisor的体系。下面分析一下cadvisor具体的实现

二. 事件监听层

cadvisor的架构简单来说就是一个event机制。它基本上可以分为两层，事件监听层和事件处理层。事件监听层负责监听linux系统发生的事件，而事件处理层

负责对这些事件进行处理。

首先说说事件监听层。事件监听层主要包含两个监听者，ContainerAdd事件和OOM事件。其对应的函数是watchForNewContainers, watchForNewOoms。

watchForNewContainers完成的事情是启动每一个watcher。代码如下，可以看到和watcher交互的是eventsChannel。目前cadvisor中包含两种wathcer, 一个是rawWatcher，另一个是rktWatcher。

    for _, watcher := range self.containerWatchers {

        err := watcher.Start(self.eventsChannel)

        if err != nil {

            return err

        }

    }

rawWatcher直接监控系统的cgroup根目录，而rktWatcher似乎是与rkt的client进行交互，由于rkt不是主流的技术，因此我们目前主要研究rawWatcher。这个watcher的代码在./manager/watcher/raw目录下。

稍作分析就可以看出这个watcher是调用了github.com/sigma/go-inotify库，这个库简单来说就是利用linux的inotify机制对cgroup根目录进行监听，如果根目录创建了新的目录，那么它就会触发一个ContainerAdd的事件。

然后将事件发送到上面代码中的self.eventsChannel中。注意linux的inotify机制会监听目录的增删改。而这里rawWatcher只对目录的增删感兴趣。也就是说它只对容器的创建和删除感兴趣，对容器本身状态的变化不感兴趣。

对函数rawContainerWatcher.watchDirectory的代码稍作分析不难发现，它是一个递归调用的结构。如果用户请求对任何目录进行监听，它会一并监听这个目录下的所有子目录。

watchForNewOoms是为了监控OOM事件，它的执行流程与container watcher类似，只不过调用的库是github.com/euank/go-kmsg-parser/，这个库的原理是读取linux系统的/dev/kmsg字符串设备。这个字符串设备的大概

意思是将系统的事件报告出来。其核心代码如下。

    outStream := make(chan *oomparser.OomInstance, )

    oomLog, err := oomparser.New()

    if err != nil {

        return err

    }

    go oomLog.StreamOoms(outStream)

    go func() {

        for oomInstance := range outStream {

            // Surface OOM and OOM kill events.

            newEvent := &info.Event{

                ContainerName: oomInstance.ContainerName,

                Timestamp:     oomInstance.TimeOfDeath,

                EventType:     info.EventOom,

            }

            err := self.eventHandler.AddEvent(newEvent)

            if err != nil {

                klog.Errorf("failed to add OOM event for %q: %v", oomInstance.ContainerName, err)

            }

三事件处理层

事件监听层将event发送到self.eventsChannel上，这些event包含了,ContainerAdd, ContainerDelete,EventOomKill三种。这三种事件分两类进行处理，对于ContainerAdd和ContainerDelete， Manager分别

调用CreateContainer和ContainerDestroy方法，然后调用self.eventHandler.AddEvent(event)方法。而EventOomkill事件则只调用self.eventHandler.AddEvent(event)方法，没有其他特殊的处理。

那么这个eventHandler是干啥的呢。这个东西实际上就是一个缓冲区，我们看一下这个evnetHandler的数据结构。它的核心数据结构就是events.watchers，它维护了一组watch，每一个watch存储了一个channel和一个

request。这个request其所在的watch想要监听的事件特性。evnetsHandler每当接收到新的事件的时候，它会根据这个事件的类型分发给各个watch。

// events provides an implementation for the EventManager interface.

type events struct {

    // eventStore holds the events by event type.

    eventStore map[info.EventType]*utils.TimedStore

    // map of registered watchers keyed by watch id.

    watchers map[int]*watch

    // lock guarding the eventStore.

    eventsLock sync.RWMutex

    // lock guarding watchers.

    watcherLock sync.RWMutex

    // last allocated watch id.

    lastId int

    // Event storage policy.

    storagePolicy StoragePolicy

}

// initialized by a call to WatchEvents(), a watch struct will then be added

// to the events slice of *watch objects. When AddEvent() finds an event that

// satisfies the request parameter of a watch object in events.watchers,

// it will send that event out over the watch object's channel. The caller that

// called WatchEvents will receive the event over the channel provided to

// WatchEvents

type watch struct {

    // request parameters passed in by the caller of WatchEvents()

    request *Request

    // a channel used to send event back to the caller.

    eventChannel *EventChannel

}

// Request holds a set of parameters by which Event objects may be screened.

// The caller may want events that occurred within a specific timeframe

// or of a certain type, which may be specified in the *Request object

// they pass to an EventManager function

type Request struct {

    // events falling before StartTime do not satisfy the request. StartTime

    // must be left blank in calls to WatchEvents

    StartTime time.Time

    // events falling after EndTime do not satisfy the request. EndTime

    // must be left blank in calls to WatchEvents

    EndTime time.Time

    // EventType is a map that specifies the type(s) of events wanted

    EventType map[info.EventType]bool

    // allows the caller to put a limit on how many

    // events to receive. If there are more events than MaxEventsReturned

    // then the most chronologically recent events in the time period

    // specified are returned. Must be >= 1

    MaxEventsReturned int

    // the absolute container name for which the event occurred

    ContainerName string

    // if IncludeSubcontainers is false, only events occurring in the specific

    // container, and not the subcontainers, will be returned

    IncludeSubcontainers bool

}

剩下的事就很简单了，对于任何ContainerAdd事件，manager维护了一组工厂类，每一个类对应一种container类型。这些工厂类定义在./container中。manager分析ContainerAdd事件中的相关信息，将它传递

给对应的工厂类，工厂类为container生成一个对应的handler并且存储起来，handler执行具体的监控任务。具体来说就是定期读取container对应的cgroup文件。从中获取信息。handler将读取到的数据存储到自己的缓存memoryCache中。

handler的包装类型是containerData

四. k8s中用到的几个关键函数

GetContainerV2(),直接获取它想要的container对应的handler，然后读取其中memoryCache的状态数据

WatchEvents(),这个函数主要是OOMWatcher在调用，它暴露出一个channel给OOMWatcher用以监听系统的OOMWatcher事件

cadvisor详解的更多相关文章

详解k8s一个完整的监控方案(Heapster+Grafana+InfluxDB) - kubernetes
1.浅析整个监控流程 heapster以k8s内置的cAdvisor作为数据源收集集群信息,并汇总出有价值的性能数据(Metrics):cpu.内存.网络流量等,然后将这些数据输出到外部存储,如Inf ...
详解k8s原生的集群监控方案(Heapster+InfluxDB+Grafana) - kubernetes
1.浅析监控方案 heapster是一个监控计算.存储.网络等集群资源的工具,以k8s内置的cAdvisor作为数据源收集集群信息,并汇总出有价值的性能数据(Metrics):cpu.内存.netwo ...
Kubernetes学习之路（二十）之K8S组件运行原理详解总结
目录一.看图说K8S 二.K8S的概念和术语三.K8S集群组件 1.Master组件 2.Node组件 3.核心附件四.K8S的网络模型五.Kubernetes的核心对象详解 1.Pod资源对 ...
Kubernetes Pod 驱逐详解
原文链接:Kubernetes Pod 驱逐详解在 Kubernetes 中,Pod 使用的资源最重要的是 CPU.内存和磁盘 IO,这些资源可以被分为可压缩资源(CPU)和不可压缩资源(内存,磁盘 ...
kubelet 参数详解
kubelet 参数详解基本参数 --allow-privileged=true #允许容器请求特权模式 --anonymous-auth=false #不允许匿名请求到 kubelet 服务(默认 ...
Kubernetes K8S之存储Volume详解
K8S之存储Volume概述与说明,并详解常用Volume示例主机配置规划服务器名称(hostname) 系统版本配置内网IP 外网IP(模拟) k8s-master CentOS7.7 2C ...
Linq之旅：Linq入门详解（Linq to Objects）
示例代码下载:Linq之旅:Linq入门详解(Linq to Objects) 本博文详细介绍 .NET 3.5 中引入的重要功能:Language Integrated Query(LINQ,语言集 ...
架构设计：远程调用服务架构设计及zookeeper技术详解（下篇）
一.下篇开头的废话终于开写下篇了,这也是我写远程调用框架的第三篇文章,前两篇都被博客园作为[编辑推荐]的文章,很兴奋哦,嘿嘿~~~~,本人是个很臭美的人,一定得要截图为证: 今天是2014年的第一天 ...
EntityFramework Core 1.1 Add、Attach、Update、Remove方法如何高效使用详解
前言我比较喜欢安静,大概和我喜欢研究和琢磨技术原因相关吧,刚好到了元旦节,这几天可以好好学习下EF Core,同时在项目当中用到EF Core,借此机会给予比较深入的理解,这里我们只讲解和EF 6. ...

随机推荐

[Leetcode Week6]Reorder List
Reorder List 题解原创文章,拒绝转载题目来源:https://leetcode.com/problems/reorder-list/description/ Description G ...
HDU2441 ACM(Array Complicated Manipulation)
ACM(Array Complicated Manipulation) Time Limit: 5000/1000 MS (Java/Others) Memory Limit: 32768/32 ...
MFC/Socket网络编程
转载: https://jingyan.baidu.com/article/676629974557c254d51b84da.html
myeclipse打断点进入后无法查看变量的值的解决方法
myeclipse打断点进入后无法查看变量的值,打开mycelipse菜单选项:“Window” - “Preferences” - “Java” - “Editor” - “Hovers" ...
ios iphone ipad上iframe的宽度会扩大的解决办法
这个问题,我从网上查了下,好像是属于ios的bug,android,windows都没有问题. 解决办法,就是在iframe加载完成后,设置 iframe里面body的宽度为多少PX. $(" ...
mysql管理和基本操作
进去mysql:mysql –uroot –p 重启数据库:[root@nanaLinux ~]# /etc/init.d/mysqld restart 1.Mysql忘记root密码 // 查看my ...
TS 数据流分析学习
TS 流.包结构以及同步 1. TS 流: 可以将TS流理解为一种单一码流.混合码流. 单一码流:TS流的基本组成单位是长度为188字节的TS包. 混合码流:TS流有多种数据组成,一个TS包中的数据可 ...
Windows和Ubuntu平台Android +JAVA 环境搭建
NOTE 测试的时候,尤其是移动端的测试,需要搭建JAVA和Andriod环境: appium和macaca都需要这两个环境: Q&A Macaca doctor 发现没有platforms这 ...
7/26 CSU-ACM2018暑期训练3-递归&递推-选讲
题目链接把M个同样的苹果放在N个同样的盘子里,允许有的盘子空着不放,问共有多少种不同的分法?(用K表示)5,1,1和1,5,1 是同一种分法. Input 第一行是测试数据的数目t(0 <= ...
HDU 2829 Lawrence
$dp$,斜率优化. 设$dp[i][j]$表示前$i$个数字切了$j$次的最小代价.$dp[i][j]=dp[k][j-1]+p[k+1][i]$.观察状态转移方程,可以发现是一列一列推导出来的.可 ...

cadvisor详解

cadvisor详解的更多相关文章

随机推荐

热门专题