Hadoop生态圈-Flume的组件之自定义拦截器（interceptor）

　　　　　　　　　　　　　　　　Hadoop生态圈-Flume的组件之自定义拦截器（interceptor）

　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　作者：尹正杰

　　本篇博客只是举例了一个自定义拦截器的方法，测试字节传输速度。

1>.自定义interceptor方法

 /*

 @author :yinzhengjie

 Blog:http://www.cnblogs.com/yinzhengjie/tag/Hadoop%E7%94%9F%E6%80%81%E5%9C%88/

 EMAIL:y1053419035@qq.com

 */

 package cn.org.yinzhengjie.interceptor;

 import org.apache.flume.Context;

 import org.apache.flume.Event;

 import org.apache.flume.interceptor.Interceptor;

 import java.util.List;

 /**

  * 设置限速拦截器

  * <p>

  * 当 字节/时间，即同一时刻，如果进入的字节过多

  * 则休眠一会

  */

 public class MyInterceptor implements Interceptor {

     private int speed;

     //构造

     private MyInterceptor(int speed) {

         this.speed = speed;

     }

     //do nothing

     public void initialize() {

     }

     /**

      * 1、拿出上一个event的时间，和当前时间进行相减，得出上一个event的时间间隔

      * 2、得到上一个event的body字节数

      * 3、相除得到上一个event的速度，并在此event中先进行停留，再返回event

      *

      * @param event

      * @return

      */

     long lastTime = -1;

     long lastBodySize = 0;

     public Event intercept(Event event) {

         byte[] body = event.getBody();

         int len = body.length;

         long current = System.nanoTime();

         //第一个event

         if (lastTime == -1) {

             lastTime = current;

             lastBodySize = len;

         }

         //非第一个event

         else {

             //计算上一个event停留的时间

             long interval = current - lastTime;

             System.out.println("=========================" + current + "/" + lastTime + "/" + interval + "=========================");

             //上一个event的速度

             int now_speed = (int) ((double) lastBodySize / interval * 1000);

             if (now_speed > speed) {

                 System.out.println("=========================" + now_speed + "=========================");

                 //计算需要停留多少秒 线程休眠，时间 = shouldTime - interval

                 try {

                     Thread.sleep((lastBodySize / speed) * 1000 - interval);

                 } catch (InterruptedException e) {

                     e.printStackTrace();

                 }

             }

             lastBodySize = len;

             lastTime = System.currentTimeMillis();

         }

         return event;

     }

     //迭代List<Event>，将所有Event交给intercept(Event)进行处理

     public List<Event> intercept(List<Event> events) {

         for (Event event : events) {

             intercept(event);

         }

         return events;

     }

     //do nothing

     public void close() {

     }

     public static class Builder implements Interceptor.Builder {

         private int speed;

         public void configure(Context context) {

             speed = context.getInteger(Constants.SPEED, Constants.DEFAULT_SPEED);

         }

         public Interceptor build() {

             return new MyInterceptor(speed);

         }

     }

     public static class Constants {

         public static String SPEED = "speed";

         public static int DEFAULT_SPEED = 1;

     }

 }

2>.打包并将其发送到 /soft/flume/lib下

[yinzhengjie@s101 ~]$ cd /soft/flume/lib/

[yinzhengjie@s101 lib]$

[yinzhengjie@s101 lib]$ ll | grep MyFlume

-rw-r--r--  1 yinzhengjie yinzhengjie    5231 Jun 20 18:53 MyFlume-1.0-SNAPSHOT.jar

[yinzhengjie@s101 lib]$

[yinzhengjie@s101 lib]$ rm -rf MyFlume-1.0-SNAPSHOT.jar

[yinzhengjie@s101 lib]$

[yinzhengjie@s101 lib]$ rz

[yinzhengjie@s101 lib]$

[yinzhengjie@s101 lib]$ ll | grep MyFlume

-rw-r--r--  1 yinzhengjie yinzhengjie    8667 Jun 20 21:02 MyFlume-1.0-SNAPSHOT.jar

[yinzhengjie@s101 lib]$

[yinzhengjie@s101 lib]$

3>.编写agent的配置文件

[yinzhengjie@s101 ~]$ more /soft/flume/conf/yinzhengjie_myInterceptor.conf

# Name the components on this agent

a1.sources = r1

a1.sinks = k1

a1.channels = c1

# 定义源: seq

a1.sources.r1.type = seq

# 定义一次RPC产生的批次数量

a1.sources.r1.batchSize = 

# 指定添加拦截器

a1.sources.r1.interceptors = i1

a1.sources.r1.interceptors.i1.type = cn.org.yinzhengjie.interceptor.MyInterceptor$Builder

a1.sources.r1.interceptors.i1.speed = 

# Describe the sink

a1.sinks.k1.type = logger

# Use a channel which buffers events in memory

a1.channels.c1.type = memory

a1.channels.c1.capacity =

a1.channels.c1.transactionCapacity = 

# Bind the source and sink to the channel

a1.sources.r1.channels = c1

a1.sinks.k1.channel = c1

[yinzhengjie@s101 ~]$

4>.启动flume并测试

[yinzhengjie@s101 ~]$ flume-ng agent -f /soft/flume/conf/yinzhengjie_myInterceptor.conf -n a1

　　下图是运行agent部分的输出内容

Hadoop生态圈-Flume的组件之自定义拦截器（interceptor）的更多相关文章

Hadoop生态圈-Flume的组件之自定义Sink
Hadoop生态圈-Flume的组件之自定义Sink 作者:尹正杰版权声明:原创作品,谢绝转载!否则将追究法律责任. 本篇博客主要介绍sink相关的API使用两个小案例,想要了解更多关于API的小技 ...
Hadoop生态圈-Flume的组件之拦截器与选择器
Hadoop生态圈-Flume的组件之拦截器与选择器作者:尹正杰版权声明:原创作品,谢绝转载!否则将追究法律责任. 本篇博客只是配置的是Flume主流的Interceptors,想要了解更详细 ...
Hadoop生态圈-Flume的组件之sink处理器
Hadoop生态圈-Flume的组件之sink处理器作者:尹正杰版权声明:原创作品,谢绝转载!否则将追究法律责任. 一. 二.
Flume（二） —— 自定义拦截器、Source、Sink
自定义拦截器自定义Source 自定义Sink 引入依赖 <dependency> <groupId>org.apache.flume</groupId> < ...
Struts2自定义拦截器Interceptor以及拦截器登录实例
1.在Struts2自定义拦截器有三种方式: -->实现Interceptor接口 public class QLInterceptorAction implements Interceptor ...
第1节 flume：15、flume案例二，通过自定义拦截器实现数据的脱敏
1.7.flume案例二案例需求: 在数据采集之后,通过flume的拦截器,实现不需要的数据过滤掉,并将指定的第一个字段进行加密,加密之后再往hdfs上面保存原始数据与处理之后的数据对比图一 ...
Struts2自定义拦截器
1. 需求自定义拦截器实现,用户登录的访问控制. 2. 定义拦截器类 public class LoginInterceptor extends AbstractInterceptor { @Ove ...
struts2--实现自定义拦截器
前言: 本篇文章,我想说清实现拦截器的办法,还有为什么要这样做: 目录: 1.需求目的 2.实现步骤及原理(文字怕描述不清,画图描述) 3.代码 4.总结一.需求目的规范或限制越级访问(例如:一个 ...
java：struts框架3（自定义拦截器，token令牌，文件上传和下载（单/多））
1.自定义拦截器: struts.xml: <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE ...

随机推荐

Jenkins控制台输出乱码
一.问题详情 jenkins构建mav任务,在控制台显示乱码: 二.原因分析 1. 查看系统编码和tomcat的编码都正常 # grep encoding /usr/local/tomcat/conf ...
win10系统安装web3js的正确方法
在安装web3的时候用npm install web3 –save-dev 在win10系统下会一直安装不成功.后来换用了 cnpm install web3 –save-dev 安装时候报出:C ...
链家鸟哥：从留级打架问题学生到PHP大神，他的人生驱动力竟然是？
链家鸟哥:从留级打架问题学生到PHP大神,他的人生驱动力竟然是?| 二叉树短视频 http://mp.weixin.qq.com/s/D4l_zOpKDakptCM__4hLrQ 从问题劝退学生到高考 ...
c#版flappybird 未完全实现
这些天开始在深圳找工作,想着把从前有些淡忘的技术再温故下.看到尊敬的<传智播客>有一期公开课,讲的是用c#编写flappybird小游戏,也就自己搜了下游戏资源,也来试试看. 其实用到的技 ...
LeetCode 167. 两数之和 II - 输入有序数组
题目: 给定一个已按照升序排列的有序数组,找到两个数使得它们相加之和等于目标数. 函数应该返回这两个下标值index1 和 index2,其中 index1 必须小于 index2. 说明: 返回的 ...
[搜狐科技]由浅入深理解Raft协议
由浅入深理解Raft协议 2017-10-16 12:12操作系统/设计 0 - Raft协议和Paxos的因缘读过Raft论文<In Search of an Understandable ...
转帖 OKR
什么是OKR OKR全称是Objectives and Key Results,即目标与关键成果法.OKR是一套定义和跟踪目标及其完成情况的管理工具和方法.1999年Intel公司发明了这种方法,后来 ...
移动的调试工具vConsole
在PC端写代码调试的时候,直接console.log()即可,但是在手机端怎么调试??最近发现一个很有用的插件vConsole 首先引入插件: 然后再文件中使用即可: 这样再手机中就会出现下面的标识, ...
Vue项目部署问题及解决方案
Vue项目部署问题及解决方案 Vue-Router 有两种模式,默认是 hash 模式,另外一种是 history 模式. hash:也就是地址栏里的 # 符号.比如 http://www.examp ...
一本通1645Fibonacci
1645:Fibonacci 时间限制: 1000 ms 内存限制: 524288 KB [题目描述] 原题来自:POJ 3070 我们知道斐波那契数列 F0=0,F1=1,Fn=Fn ...

Hadoop生态圈-Flume的组件之自定义拦截器（interceptor）

Hadoop生态圈-Flume的组件之自定义拦截器（interceptor）的更多相关文章

随机推荐

热门专题