Flume的安装与配置

一、 资源下载

资源地址：http://flume.apache.org/download.html

程序地址：http://apache.fayea.com/flume/1.6.0/apache-flume-1.6.0-bin.tar.gz

源码地址：http://mirrors.hust.edu.cn/apache/flume/1.6.0/apache-flume-1.6.0-src.tar.gz

二、 安装搭建

(1)编译好的包：

直接在安装目录解压即可(重命名可选)

cd /usr/local/

tar –zxvf apache-flume-1.6.0-bin.tar.gz

mv apache-flume-1.6.0-bin flume

(2)源码编译安装：

这种方法比较麻烦，要把需要的包都下载全，然后用以下命令编译：

只进行编译：mvn clean compile
编译并且执行单元测试：mvn clean test
单独运行单元测试： mvn clean test -Dtest=<Test1>,<Test2>,... -DfailIfNoTests=false
创建压缩包: mvn clean install
跳过单元测试创建压缩包: mvn clean install –DskipTests

编译完成之后，和直接运行可执行包的

三、 运行与配置

(1)flume的配置

# example.conf: A single-node Flume configuration

# Name the components on this agent

a1.sources = r1

a1.sinks = k1

a1.channels = c1

# Describe/configure the source

a1.sources.r1.type = exec

a1.sources.r1.command = tail -F /flume/test.log

# Describe the sink

a1.sinks.k1.type = hdfs

# Use a channel which buffers events in memory

a1.channels.c1.type = memory

a1.channels.c1.capacity = 1000

a1.channels.c1.transactionCapacity = 100

a1.sinks.k1.hdfs.path=hdfs://192.168.15.135:9000/flume/events/%y-%m-%d/%H%M/%S

a1.sinks.k1.hdfs.filePrefix = events-

a1.sinks.k1.hdfs.round = true

a1.sinks.k1.hdfs.roundValue = 10

a1.sinks.k1.hdfs.roundUnit = minute

a1.sinks.k1.hdfs.useLocalTimeStamp = true

# Bind the source and sink to the channel

a1.sources.r1.channels = c1

a1.sinks.k1.channel = c1

配置文件分为四个部分source、sink、channel和它们之间的关联关系；flume之间模块的关系如下图：

如图：source是负责从WebServer收集数据信息，Sink负责将收集和格式化后的日志写入到磁盘、其他文件系统或其他日志系统，channel是负责连接source和sink。因为有channel的存在，所以source和sink是多对多的关系。

# example.conf: A single-node Flume configuration
# Name the components on this agent	a1是代理的名字
a1.sources = r1	定义一个source：r1
a1.sinks = k1	定义一个sink：k1
a1.channels = c1	定义一个channel：c1
# Describe/configure the source
a1.sources.r1.type = exec	a1的r1的类型为exec(执行类型)
a1.sources.r1.command = tail -F /flume/test.log	a1的r1要执行的命令为tail一个test.log
# Describe the sink
a1.sinks.k1.type = hdfs	a1的sink类型为hdfs
# Use a channel which buffers events in memory
a1.channels.c1.type = memory	a1的channel的类型为存在内存
a1.channels.c1.capacity = 1000	a1的容量为1000
a1.channels.c1.transactionCapacity = 100	a1的交互容量为100
a1.sinks.k1.hdfs.path=hdfs://192.168.15.135:9000/flume/events/%y-%m-%d/%H%M/%S	a1的叫k1的sink的最终存储的文件系统的路径是：hdfs://……
a1.sinks.k1.hdfs.filePrefix = events-	sink在存储文件的时候的前缀为event-
a1.sinks.k1.hdfs.round = true	hdfs配置项
a1.sinks.k1.hdfs.roundValue = 10	hdfs配置项
a1.sinks.k1.hdfs.roundUnit = minute	hdfs配置项
a1.sinks.k1.hdfs.useLocalTimeStamp = true	将用本地时间戳设置为true
# Bind the source and sink to the channel
a1.sources.r1.channels = c1	把source-r1绑定到channel-c1
a1.sinks.k1.channel = c1	把sink-k1绑定到channel-c1

(2)flume的运行方法为：

$ bin/flume-ng agent -n $agent_name -c conf -f conf/flume-conf.properties

-n 指定代理(agent)名字；

-c conf指定配置文件的目录(主要是日志等其他配置文件的目录)；

-f 本次运行的flume的配置文件，需要添加路径（模式是在工程的根路径flume/）

执行命令例如：

$ bin/flume-ng agent -n a1 -c conf -f conf/example.conf

执行成功之后，我们可以在logs的flume.log中看到日志。

另外，还可以用以下方式启动，来指定日志输出：

$ bin/flume-ng agent --conf conf --conf-file example.conf --name a1 -Dflume.root.logger=INFO,console

--conf ：与-c相同；

--conf-file ：与-f相同；

--name：与-n相同；

flume.root.logger：指定日志级别和显示方式，上述命令为INFO，输出到终端；如果没有此项，像之前的命令一样，默认的级别是INFO，输出到LOGFILE。

四、备注

(1)可选的source有：

§ Avro Source
§ Thrift Source
§ Exec Source
§ JMS Source
§ Spooling Directory Source
§ Twitter 1% firehose Source (experimental)
§ Kafka Source
§ NetCat Source
§ Sequence Generator Source
§ Syslog Sources
§ HTTP Source
§ Stress Source
§ Legacy Sources
§ Custom Source
§ Scribe Source

(2)可选的sink有：

§ HDFS Sink
§ Hive Sink
§ Logger Sink
§ Avro Sink
§ Thrift Sink
§ IRC Sink
§ File Roll Sink
§ Null Sink
§ HBaseSinks
§ MorphlineSolrSink
§ ElasticSearchSink
§ Kite Dataset Sink
§ Kafka Sink
§ Custom Sink
§ (2)可选的channel有：
- § Memory Channel
- § JDBC Channel
- § Kafka Channel
- § File Channel
- § Spillable Memory Channel
- § Pseudo Transaction Channel
- § Custom Channel

详细配置参考：http://flume.apache.org/FlumeUserGuide.html#flume-sources

Flume的安装与配置的更多相关文章

Flume的安装，配置及使用
1,上传jar包 2,解压 3,改名 4,更改配置文件将template文件重镜像 root@Ubuntu-1:/usr/local/apache-flume/conf# cat flume-env ...
Flume简介与使用（一）——Flume安装与配置
Flume简介与使用(一)——Flume安装与配置 Flume简介 Flume是一个分布式的.可靠的.实用的服务——从不同的数据源高效的采集.整合.移动海量数据. 分布式:可以多台机器同时运行采集数据 ...
CentOS6安装各种大数据软件第七章：Flume安装与配置
相关文章链接 CentOS6安装各种大数据软件第一章:各个软件版本介绍 CentOS6安装各种大数据软件第二章:Linux各个软件启动命令 CentOS6安装各种大数据软件第三章:Linux基础 ...
使用Windows Azure的VM安装和配置CDH搭建Hadoop集群
本文主要内容是使用Windows Azure的VIRTUAL MACHINES和NETWORKS服务安装CDH (Cloudera Distribution Including Apache Hado ...
Flume NG简介及配置
Flume下载地址:http://apache.fayea.com/flume/ 常用的分布式日志收集系统: Apache Flume. Facebook Scribe. Apache Chukwa ...
浅谈 zookeeper 原理,安装和配置
当前云计算流行, 单一机器额的处理能力已经不能满足我们的需求,不得不采用大量的服务集群.服务集群对外提供服务的过程中,有很多的配置需要随时更新,服务间需要协调工作,那么这些信息如何推送到各个节点?并且 ...
一脸懵逼学习基于CentOs的Hadoop集群安装与配置
1:Hadoop分布式计算平台是由Apache软件基金会开发的一个开源分布式计算平台.以Hadoop分布式文件系统(HDFS)和MapReduce(Google MapReduce的开源实现)为核心的 ...
日志采集框架Flume以及Flume的安装部署（一个分布式、可靠、和高可用的海量日志采集、聚合和传输的系统）
Flume支持众多的source和sink类型,详细手册可参考官方文档,更多source和sink组件 http://flume.apache.org/FlumeUserGuide.html Flum ...
一脸懵逼学习基于CentOs的Hadoop集群安装与配置（三台机器跑集群）
1:Hadoop分布式计算平台是由Apache软件基金会开发的一个开源分布式计算平台.以Hadoop分布式文件系统(HDFS)和MapReduce(Google MapReduce的开源实现)为核心的 ...

随机推荐

C语言错误之--初始值(低级错误)
今天犯了一个低级错误,虽然低级,但是也不能忽视,一个低级错误以后可能小则浪费时间和精力,大则酿成整个app的项目bug.
标准sql语句，学习
标准SQL语句总结标准SQL语句总结,标准SQL语言基本上适用于下面所列出的数据库软件 -------------------------------------------------------- ...
PL/SQL之--函数
一.函数函数是作为数据库对象存储在oracle数据库中,函数又被称为PL/SQL子程序.oracle处理使用系统提供的函数之外,用户还可以自己定义函数.函数通常被作为一个表达式来调用或存储过程的一个 ...
Eclipse 快捷键篇
1. Ctrl+Shift+R:打开资源这可能是所有快捷键组合中最省时间的了.这组快捷键可以让你打开你的工作区中任何一个文件,而你只需要按下文件名或mask名中的前几个字母,比如applic*.xml ...
一个自定义 HBase Filter -“通过RowKeys来高性能获取数据”
摘要: 大家在使用HBase和Solr搭建系统中经常遇到的一个问题就是:“我通过SOLR得到了RowKeys后,该怎样去HBase上取数据”.使用现有的Filter性能差劲,网上也没有现成的自定义Fi ...
session实现购物系统的简例和application实现统计页面访问次数的简例
login.jsp <body> <form action="checkLogin.jsp"> <table> <tr>< ...
linux sed命令
一.初识sed 在部署openstack的过程中,会接触到大量的sed命令,比如 # Bind MySQL service to all network interfaces. sed -i 's/1 ...
[转]Ionic – Mobile UI Framework for PhoneGap/Cordova Developers
本文转自:http://devgirl.org/2014/01/20/ionic-mobile-ui-framework-for-phonegapcordova-developers/ Ionic i ...
关于volatile和synchronized
这个可能是最好的对比volatile和synchronized作用的文章了.volatile是一个变量修饰符,而synchronized是一个方法或块的修饰符.所以我们使用这两种关键字来指定三种简单的 ...
【问题&解决】解决 Android SDK下载和更新失败“Connection to https://dl-ssl.google.com refused”的问题
缘由: 更新sdk,遇到了更新下载失败问题: Fetching https://dl-ssl.google.com/android/repository/addons_list-2.xmlFetche ...

Flume的安装与配置

Flume的安装与配置

Flume的安装与配置的更多相关文章

随机推荐

热门专题