一、什么是Monstache

Monstache 是Golang语言实现的基于MongoDB的oplog实现实时数据同步及订阅的插件，支持MongoDB与ES之间的数据同步。其中MongoDB需要搭建副本集。

二、安装过程

1、安装Go，并配置环境变量

(1) 下载Go并解压

wget https://dl.google.com/go/go1.14.4.linux-amd64.tar.gz  

tar -C /usr/local -xzf go1.14.4.linux-amd64.tar.gz  

(2) 建立go的工作空间，即GOAPTH所指向的目录

创建三个目录 mkdir bin、mkdir src、mkdir pkg

(3) 配置环境变量

使用 vim /etc/profile命令打开环境变量配置文件，并将如下内容写入该文件中。其中GOPROXY用来指定阿里云Go模块代理

export GOROOT=/usr/local/go  

export GOPATH=/home/go/  

export PATH=$PATH:$GOROOT/bin:$GOPATH/bin  

export GOPROXY=https://mirrors.aliyun.com/goproxy/  

(4) 应用环境变量配置

source /etc/profile

2、Monstache说明

（1）Monstache-Mongodb-Es 版本关系:

Monstache version	Git branch (used to build plugin)	Docker tag	Description	Elasticsearch	MongoDB	Status
6	rel6	rel6, latest	MongoDB, Inc. go driver	Version 7+	Version 2.6+	Supported
5	rel5	rel5	MongoDB, Inc. go driver	Version 6	Version 2.6+	Supported
4	master	rel4	mgo community go driver	Version 6	Version 3	Deprecated
3	rel3	rel3	mgo community go driver	Versions 2 and 5	Version 3	Deprecated

（2）Monstache配置

参数	说明
mongo-url	MongoDB实例的主节点访问地址。可在实例的基本信息页面获取，获取前需配置MongoDB实例的白名单，即在白名单中添加安装Monstache的ECS实例的内网IP地址。
elasticsearch-urls	Elasticsearch实例的访问地址，格式为http://<Elasticsearch实例的内网地址>:9200。
direct-read-namespaces	指定待同步的集合
change-stream-namespaces	如果要使用MongoDB变更流功能，需要指定此参数。启用此参数后，oplog追踪会被设置为无效
namespace-regex	通过正则表达式指定需要监听的集合。此设置可以用来监控符合正则表达式的集合中数据的变化。
elasticsearch-user	访问Elasticsearch实例的用户名，默认为elastic。注意实际业务中不建议使用elastic用户，这样会降低系统安全性。建议使用自建用户，并给予自建用户分配相应的角色和权限。
elasticsearch-password	对应用户的密码。elastic用户的密码在创建实例时指定，如果忘记可进行重置
elasticsearch-max-conns	定义连接Elasticsearch的线程数。默认为4，即使用4个Go线程同时将数据同步到Elasticsearch。
dropped-collections	默认为true，表示当删除MongoDB集合时，会同时删除Elasticsearch中对应的索引。
dropped-databases	默认为true，表示当删除MongoDB数据库时，会同时删除Elasticsearch中对应的索引。
resume	默认为false。设置为true，Monstache会将已成功同步到Elasticsearch的MongoDB操作的时间戳写入monstache.monstache集合中。当Monstache因为意外停止时，可通过该时间戳恢复同步任务，避免数据丢失。如果指定了cluster-name，该参数将自动开启。
resume-strategy	指定恢复策略。仅当resume为true时生效。
verbose	默认为false，表示不启用调试日志。
cluster-name	指定集群名称。指定后，Monstache将进入高可用模式，集群名称相同的进程将进行协调
mapping	指定Elasticsearch索引映射。默认情况下，数据从MongoDB同步到Elasticsearch时，索引会自动映射为数据库名.集合名。如果需要修改索引名称，可通过该参数设置。

3、Monstache安装

（1）进入安装路径

cd /usr/local

（2）使用命令下载或手动上传安装包

1）从git库下载

git clone https://github.com/rwynn/monstache.git，若提示下载失败需先下载git ,使用yum install -y git命令

2）手动上传

资源下载地址: https://github.com/rwynn/monstache

进入安装目录，安装

cd /usr/local/monstache-6.7.11

go install

查看版本

安装完成

4、配置实时同步任务

（1）进入Monstache安装目录，创建并编辑配置文件

#创建monstache的存储目录

cd /usr/local/monstache-6.7.11

mkdir config

mkdir logs

进入config目录，编辑创建配置文件

cd config

vim config.toml

文件内容如下

# connection settings  

# connect to MongoDB using the following URL

mongo-url = "mongodb://192.168.8.79:27017/pns"

# connect to the Elasticsearch REST API at the following node URLs

elasticsearch-urls = ["http://192.168.8.153:9200"]  

# frequently required settings  

# if you need to seed an index from a collection and not just listen and sync changes events

# you can copy entire collections or views from MongoDB to Elasticsearch

direct-read-namespaces = ["库名.集合名"]  

# if you want to use MongoDB change streams instead of legacy oplog tailing use change-stream-namespaces

# change streams require at least MongoDB API 3.6+

# if you have MongoDB 4+ you can listen for changes to an entire database or entire deployment

# in this case you usually don't need regexes in your config to filter collections unless you target the deployment.

# to listen to an entire db use only the database name.  For a deployment use an empty string.

#change-stream-namespaces = ["mydb.col"]  

# additional settings  

# if you don't want to listen for changes to all collections in MongoDB but only a few

# e.g. only listen for inserts, updates, deletes, and drops from mydb.mycollection

# this setting does not initiate a copy, it is only a filter on the change event listener

namespace-regex = '^库名\.集合名$'

# compress requests to Elasticsearch

gzip = true

# generate indexing statistics

#stats = true

# index statistics into Elasticsearch

#index-stats = true

# use the following PEM file for connections to MongoDB

#mongo-pem-file = "/path/to/mongoCert.pem"

# disable PEM validation

#mongo-validate-pem-file = false

# use the following user name for Elasticsearch basic auth

elasticsearch-user = "elastic"

# use the following password for Elasticsearch basic auth

elasticsearch-password = "<your_es_password>"

# use 4 go routines concurrently pushing documents to Elasticsearch

elasticsearch-max-conns = 4

# use the following PEM file to connections to Elasticsearch

#elasticsearch-pem-file = "/path/to/elasticCert.pem"

# validate connections to Elasticsearch

#elastic-validate-pem-file = true

# propogate dropped collections in MongoDB as index deletes in Elasticsearch

dropped-collections = true

# propogate dropped databases in MongoDB as index deletes in Elasticsearch

dropped-databases = true

# do not start processing at the beginning of the MongoDB oplog

# if you set the replay to true you may see version conflict messages

# in the log if you had synced previously. This just means that you are replaying old docs which are already

# in Elasticsearch with a newer version. Elasticsearch is preventing the old docs from overwriting new ones.

#replay = false

# resume processing from a timestamp saved in a previous run

resume = true

# do not validate that progress timestamps have been saved

#resume-write-unsafe = false

# override the name under which resume state is saved

#resume-name = "default"

# use a custom resume strategy (tokens) instead of the default strategy (timestamps)

# tokens work with MongoDB API 3.6+ while timestamps work only with MongoDB API 4.0+

resume-strategy = 0

# exclude documents whose namespace matches the following pattern

#namespace-exclude-regex = '^mydb\.ignorecollection$'

# turn on indexing of GridFS file content

#index-files = true

# turn on search result highlighting of GridFS content

#file-highlighting = true

# index GridFS files inserted into the following collections

#file-namespaces = ["users.fs.files"]

# print detailed information including request traces

verbose = true

# enable clustering mode

#cluster-name = 'es-cn-mp91kzb8m00******'

# do not exit after full-sync, rather continue tailing the oplog

#exit-after-direct-reads = false

[[mapping]]

namespace = "pns.user_device_table"

index = "pns.user_device_table"

#type = "device"

注：1）如果不设置mapping，那么同步到es的索引默认为mongoDB的数据库名.集合名

2）monstache是根据mongoDB的oplog来同步更新的，因此需要开启mongo的副本集才可以

3）当mongo有密码认证时mongoUrl需要改为

mongodb://用户：密码@IP:PORT/数据库名称?authSource=数据库名称&authMechanism=SCRAM-SHA-1，同时这个地址要为mongoDB主节点访问地址

4）direct-read-namespaces表示初始化会全量同步到es，而change-stream-namespaces只会同步有更改变化的

5）namespace-regex不设置时默认同步mongoDB中所有数据的更改变化，可以指定只同步哪个或哪些集合的增删改

（2）配置自启

#切换目录

cd /etc/init.d  

#编辑命令

vim monstache

脚本内容

#!/bin/bash

#

# chkconfig: 2345 10 90

#

# description: MONSTACHE RUN  

#程序名

MONSTACHE_NAME="monstache-rel6"  

#资源位置

MONSTACHE_OPTS=/home/go/bin/monstache

MONSTACHE_CONF=/usr/local/monstache-6.7.11/config/config.toml  

#日志位置

MONSTACHE_LOGS=/usr/local/monstache-6.7.11/logs/monstache.log  

#开始方法

start() {

nohup $MONSTACHE_OPTS -f $MONSTACHE_CONF > $MONSTACHE_LOGS  2>&1 &

echo "$MONSTACHE_NAME started success."

}  

#结束方法

stop() {

echo "stopping $MONSTACHE_NAME ..."

kill -9 `ps -ef|grep $MONSTACHE_OPTS|grep -v grep|grep -v stop|awk '{print $2}'`

}  

case "$1" in

start)

start

;;

stop)

stop

;;

restart)

stop

start

;;

*)

echo "Userage: $0 {start|stop|restart}"

exit 1

esac

#授权

chmod +x monstache   

#添加系统服务

chkconfig --add monstache

chkconfig monstache on

chkconfig --list

注：MONSTACHE_OPTS位置是在GO的安装目录下

（3）启停相关

#启停命令

systemctl start|stop|restart monstache  

#查看进程

ps -ef|grep monstache

注：

　　1、使用版本

　　　　mongo 4.4.15

　　　　monstache 6.7.11

　　　　elasticsearch 7.9.2

　　2、mongo单机创建副本集

　　　　参考文档：http://www.kaotop.com/it/14307.html

　　3、集群下创建副本集

　　　　参考文档：https://blog.csdn.net/qq_42428264/article/details/124227050

参考原文地址：https://blog.csdn.net/weixin_44187730/article/details/117198268

Monstache的安装和使用的更多相关文章

docker——容器安装tomcat
写在前面: 继续docker的学习,学习了docker的基本常用命令之后,我在docker上安装jdk,tomcat两个基本的java web工具,这里对操作流程记录一下. 软件准备: 1.jdk-7 ...
网络原因导致 npm 软件包 node-sass / gulp-sass 安装失败的处理办法
如果你正在构建一个基于 gulp 的前端自动化开发环境,那么极有可能会用到 gulp-sass ,由于网络原因你可能会安装失败,因为安装过程中部分细节会到亚马逊云服务器上获取文件.本文主要讨论在不变更 ...
Sublime Text3安装JsHint
介绍 Sublime Text3使用jshint依赖Nodejs,SublimeLinter和Sublimelinter-jshint. NodeJs的安装省略. 安装SublimeLinter Su ...
Fabio 安装和简单使用
Fabio(Go 语言):https://github.com/eBay/fabio Fabio 是一个快速.现代.zero-conf 负载均衡 HTTP(S) 路由器,用于部署 Consul 管理的 ...
gentoo 安装
加载完光驱后 1进行ping命令查看网络是否通畅 2设置硬盘的标识为GPT(主要用于64位且启动模式为UEFI,还有一个是MBR,主要用于32位且启动模式为bois) parted -a optima ...
Linux平台 Oracle 10gR2（10.2.0.5）RAC安装 Part3：db安装和升级
Linux平台 Oracle 10gR2(10.2.0.5)RAC安装 Part3:db安装和升级环境:OEL 5.7 + Oracle 10.2.0.5 RAC 5.安装Database软件 5. ...
Linux平台 Oracle 10gR2（10.2.0.5）RAC安装 Part1：准备工作
Linux平台 Oracle 10gR2(10.2.0.5)RAC安装 Part1:准备工作环境:OEL 5.7 + Oracle 10.2.0.5 RAC 1.实施前准备工作 1.1 服务器安装操 ...
【原】nodejs全局安装和本地安装的区别
来微信支付有2年多了,从2年前的互联网模式转变为O2O模式,主要的场景是跟线下的商户去打交道,不像以往的互联网模式,有产品经理提需求,我们帮忙去解决问题. 转型后是这样的,团队成员更多需要去寻找业务的 ...
tLinux 2.2下安装Mono 4.8
Tlinux2.2发行版基于CentOS 7.2.1511研发而成,内核版本与Tlinux2.0发行版保持完全一致,更加稳定,并保持对Tlinux2.0的完全兼容.Mono 4版本要求CentOS 7 ...
工欲善其事，必先利其器之 VS2013全攻略(安装，技巧，快捷键，插件）！
如有需要WPF工具的朋友可以移步工欲善其事,必先利其器之 WPF篇: 随着开发轨迹来看高效WPF开发的工具和技巧之前一篇<c++的性能, c#的产能?!鱼和熊掌可以兼得,.NET NATI ...

随机推荐

分布式拒绝服务攻击(DDoS)和僵尸网络(Botnet)
DDos和僵尸网络是相辅相成的两种攻击手段,本文仅介绍基本概念,详细请查看文末参考资料. 分布式拒绝服务攻击(DDoS) 分布式拒绝服务攻击DDoS是一种基于DoS的特殊形式的拒绝服务攻击,是一种分布 ...
超级容易理解的Three.js中的物体rotation
假设模特头朝着自己现在躺在地上那么改变Y就是等于躺着转圈圈 mesh.rotation.set( 0,Math.PI/1.2,0); 改变X就意味着,这个是本来模特头朝着自己躺着,然后站起来了,后脑 ...
UEditor采坑指南
1.版本选择 GitHub上的最新版(1.5)没有提供后端代码.想要后端代码可以下载历史版本(1.4.3.3). 1.5版本:https://github.com/fex-team/ueditor 1 ...
sun.security.validator.ValidatorException: PKIXpath building failed: sun.security.provider,javax.net.ssT.SSLHandshakeExceptions.certpath.SunCertPathBuilderException
报错信息: sun.security.validator.ValidatorException: PKIXpath building failed: sun.security.provider,jav ...
JavaScript所有内部属性列表 [[Configurable]] 等
简介据MDN文档所说内部属性是由 [[···]] 包裹的内容,于是我们去复制 ECMA-262 标准的所有文字部分,然后用正则统计 [[···]] 的出现并打印效果代码经老大提醒,这里的正则并 ...
K3S 系列文章-5G IoT 网关设备 POD 访问报错 DNS 'i/o timeout'分析与解决
开篇 <K3s 系列文章> <Rancher 系列文章> 问题概述 20220606 5G IoT 网关设备同时安装 K3S Server, 但是 POD 却无法访问互联网地址 ...
Spring（AOP的认识、实现）
2:Spring AOP AOP (Aspect Oriented Programming) 面向切面编程 OOP(Object Oriented Programming)面向对象编程,用对象的思想来 ...
vue3语法糖+ts组件传值
在开发中有些功能是通用的,而且逻辑大致相同,像这种东西可以封成一个组件,比较常用的就是函数封装,组件封装,组件封装是需要引入到页面使用的,所以通常它会有一些自己的方法,父子组件可以通过一些值来进行关联 ...
Charles基本操作指南
安装: 百度"charles",闭着眼睛安装. charles依赖于java,还需要安装java,怎么安装java,见我的另一篇博客. 破解注册码:Registered Name ...
Spark系列 - (3) Spark SQL
3. Spark SQL 3.1 Hive.Shark和Sparksql Hive:Hadoop刚开始出来的时候,使用的是hadoop自带的分布式计算系统 MapReduce,但是MapReduce的 ...

Monstache的安装和使用