consul cluster setup

  • 介绍和指南:
    consul用于服务发现.当底层服务发生变化时,能及时更新正确的mysql服务IP. 并提供给业务查询.但需要自行编写脚本,监测数据库状态和切断故障服务器的对外提供服务.
    https://www.consul.io/intro/getting-started/install.html

环境:

consul cluster node
node1:192.168.99.183
node1:192.168.99.184
node1:192.168.99.185

agent(client)/安装在mysql服务器上
agent1:192.168.99.210(mysql master)
agent2:192.168.99.211(mysql slave1)
agent3:192.168.99.212(mysql slave2)

安装:

cd /opt
wget https://releases.hashicorp.com/consul/1.2.2/consul_1.2.2_linux_amd64.zip
unzip consul_1.2.2_linux_amd64.zip
cd /usr/bin/
ln -s /opt/consul consul
mkdir /data/consul/ /etc/consul.d/ \\安装完成:
#consul
Usage: consul [--version] [--help] <command> [<args>] Available commands are:
agent Runs a Consul agent
catalog Interact with the catalog
connect Interact with Consul Connect
event Fire a new event
exec Executes a command on Consul nodes
force-leave Forces a member of the cluster to enter the "left" state
info Provides debugging information for operators.
intention Interact with Connect service intentions
join Tell Consul agent to join cluster
keygen Generates a new encryption key
keyring Manages gossip layer encryption keys
kv Interact with the key-value store
leave Gracefully leaves the Consul cluster and shuts down
lock Execute a command holding a lock
maint Controls node or service maintenance mode
members Lists the members of a Consul cluster
monitor Stream logs from a Consul agent
operator Provides cluster-level tools for Consul operators
reload Triggers the agent to reload configuration files
rtt Estimates network round trip time between nodes
snapshot Saves, restores and inspects snapshots of Consul server state
validate Validate config files/directories
version Prints the Consul version
watch Watch for changes in Consul [root@db210_09:38:25 /data/consul/script]
#consul --version
Consul v1.2.2
Protocol 2 spoken by default, understands 2 to 3 (agent will automatically use protocol >2 when speaking to compatible agents)

集群配置

三个节点配置基本一样,只需要修改bind_addr、client_addr为当前服务器IP.
重要参数:
-bootstrap-expect:在一个datacenter中期望提供的server节点数目,当大于或等这个数量的server成功上线群集才对外提供服务,该标记不能和bootstrap同时存在,推荐使用bootstrap-expect方式.

#cd /etc/consul.d/
#ll
-rwxr-xr-x. 1 root root 403 Aug 29 10:21 server.json \\如果没有x属性,使用:chmod +x server.json
[root@db5_09:44:18 /etc/consul.d]
#cat server.json
{
"data_dir": "/data/consul",
"datacenter": "dc1",
"log_level": "INFO",
"server": true,
"bootstrap_expect": 2, \\第一个节点需要此配置
"bind_addr": "192.168.99.185",
"client_addr": "192.168.99.185",
"ports":{
"dns":53
},
"ui":true,
"retry_join": ["192.168.99.183","192.168.99.184","192.168.99.185"],
"retry_interval":"3s",
"raft_protocol":3,
"rejoin_after_leave":true
}

集群启动

先启动带 "bootstrap_expect": 2,标签的server。启动命令:

[root@db210_11:10:21 /etc/consul.d]
#consul agent --config-dir=/etc/consul.d >/data/consul/consul.log 2>&1 &

查看集群成员

[root@db3_09:52:27 /etc/consul.d]                              用执着守候成功
#consul members --http-addr=192.168.99.183:8500
Node Address Status Type Build Protocol DC Segment
db3 192.168.99.183:8301 alive server 1.2.2 2 dc1 <all>
db4 192.168.99.184:8301 alive server 1.2.2 2 dc1 <all>
db5 192.168.99.185:8301 alive server 1.2.2 2 dc1 <all>

此种状态就可以添加服务器的注册了.

Client节点配置

配置分为三部分:
1.注册client配置,用于与consul集群serrver通信,配置文件:/etc/consul.d/agent.json
2.注册应用服务(mysql)的配置,配置文件:/etc/consul.d/r_db3308.json(slave的只读服务),w_db3308.json(master 服务)
3.检测master或slave是否健康的python脚本,配置文件:/data/consul/script/check_mysql.py

  • agent.json部分:
#cd /etc/consul.d/
[root@db210_10:16:22 /etc/consul.d]
#ll
total 12
-rwxr-xr-x 1 root root 316 Aug 29 10:41 agent.json
-rwxr-xr-x 1 root root 349 Sep 1 22:10 r_db3308.json
-rwxr-xr-x 1 root root 350 Sep 1 22:13 w_db3308.json
[root@db210_10:16:23 /etc/consul.d]
#cat agent.json
{
"data_dir": "/data/consul",
"enable_script_checks": true,
"bind_addr": "192.168.99.210",
"retry_join": ["192.168.99.183","192.168.99.184","192.168.99.185"],
"retry_interval": "30s",
"rejoin_after_leave": true,
"start_join": ["192.168.99.183","192.168.99.184","192.168.99.185"]
}
[root@db210_10:19:50 /etc/consul.d]
此部分内容和群集server注册内容差不多.
  • r_db3308.json和cat w_db3308.json部分 :
[root@db210_10:16:22 /etc/consul.d]
#ll
total 12
-rwxr-xr-x 1 root root 316 Aug 29 10:41 agent.json
-rwxr-xr-x 1 root root 349 Sep 1 22:10 r_db3308.json
-rwxr-xr-x 1 root root 350 Sep 1 22:13 w_db3308.json
[root@db210_10:19:50 /etc/consul.d]
#cat r_db3308.json
{
"service":
{
"name":"r_db3508",
"tags":[
"zstdb3508"
],
"address":"192.168.99.210",
"port":3508,
"check":
{
"args":[
"/data/consul/script/check_mysql.py",
"slave" ],
"interval":"5s" }
} }
[root@db210_10:22:18 /etc/consul.d]
#cat w_db3308.json
{
"service":
{
"name":"w_db3508",
"tags":[
"zstdb3508"
],
"address":"192.168.99.210",
"port":3508,
"check":
{
"args":[
"/data/consul/script/check_mysql.py",
"master" ],
"interval":"5s" }
} }
  • check_mysql.py(检查主从节点状态脚本)
[root@db212_11:40:41 /data/consul/script]
#cat check_mysql.py
#!/usr/bin/env python2
#-*- coding: utf-8 -*-
# Script Name: mysql_check.py
# Description: check mysql servers status
# Author: Wenyz
# Create Date: 2018/08/29
import os,sys
import time
import datetime
import MySQLdb
import getpass
check_item=sys.argv[1]
print (datetime.datetime.now().strftime('%Y-%m-%d %H:%M:%S'))
print "check_item:%s"%check_item
#check_item="master"
host='127.0.0.1'
user='wyz'
password='*****'
port = 3508
database='wenyz'
def mysql_connect():
try:
conn = MySQLdb.connect(host = host, user = user ,passwd = password,port = port, db = database)
return conn.cursor()
except MySQLdb.Error,e:
try:
print "Error %d:%s"%(e.args[0],e.args[1])
except IndexError:
print "MySQL Error:%s" % str(e)
sys.exit(1)
def validate_select():
try:
cursor = mysql_connect()
cursor.execute('use wenyz;')
cursor.execute('select * from t2 limit 2;')
result_set=cursor.fetchall()
if result_set[0][0] == 4079861: #id
print "the frist rowid is %s"%result_set[0][0]
print 'Successfully query data!'
else:
print "the vaule is %s"%result_set[0][0]
return 'successfully'
except MySQLdb.Error,e:
print "Error %d:%s"%(e.args[0],e.args[1])
print "MySQL Error:%s" % str(e)
print "Query data failed"
return 'failed'
cursor.close()
conn.close()
#validate_select() def check_mysql_variable(var_key):
try:
cursor = mysql_connect()
sql="show global variables like \'%s\'"%var_key
cursor.execute(sql)
variable_set=cursor.fetchall()
show_var={}
for r in variable_set:
show_var[r[0]]=r[1]
return show_var[var_key]
except MySQLdb.Error,e:
try:
print "Error %d:%s"%(e.args[0],e.args[1])
except IndexError:
print "MySQL Error:%s" % str(e)
sys.exit(1)
cursor.close()
conn.close()
def set_var(key_name,key_value):
try:
cursor = mysql_connect()
sql="set global %s=%s"%(key_name,key_value)
cursor.execute(sql)
except MySQLdb.Error,e:
try:
print "Error %d:%s"%(e.args[0],e.args[1])
except IndexError:
print "MySQL Error:%s" % str(e)
sys.exit(2)
cursor.close() def isslave():
try:
conn = MySQLdb.connect(host = host, user = user ,passwd = password,port = port, db = database)
cursor=conn.cursor(cursorclass=MySQLdb.cursors.DictCursor)
cursor.execute("show slave status")
slave_status = cursor.fetchone()
va_slave_status={} if slave_status !=None:
if slave_status['Slave_IO_Running']=='Yes' and slave_status['Slave_SQL_Running']=='Yes':
print "This is a slave & Slave_SQL_Running and Slave_IO_Running are both YES"
return 'Yes'
else:
print "This is a slave,but Slave_SQL_Running or Slave_IO_Running is not YES"
return 'yes'
else:
# print type(slave_status)
print "show slave status:%s"%slave_status
print "Slave replication setup error or this is master"
return 'no'
except IndexError:
print "show slave failed,%s"%str(e)
sys.exit(2)
if check_item=='master':
try:
vs=validate_select()
isslave=isslave()
c_read_only=check_mysql_variable("read_only")
c_super_read_only=check_mysql_variable("super_read_only")
if vs=='successfully' and isslave=='no' and c_read_only=='OFF' and c_super_read_only=='OFF':
print "It's a healthy master,select t2:%s,isslave:%s ,read_only:%s,super_read_only:%s"%(vs,isslave,c_read_only,c_super_read_only)
sys.exit(0)
else:
print "It's not a healthy master,select t2:%s,isslave:%s ,read_only:%s,super_read_only:%s"%(vs,isslave,c_read_only,c_super_read_only)
sys.exit(2)
except IndexError:
print "show slave failed,%s"%str(e)
sys.exit(2)
elif check_item=='slave':
try:
vs=validate_select()
isslave=isslave()
c_read_only=check_mysql_variable("read_only")
c_super_read_only=check_mysql_variable("super_read_only")
# print slave_status,type(slave_status)
#print "Isslave is yes but read_only is off,will set to on %s,%s,%s"%(isslave,c_read_only,c_super_read_only)
if isslave=="Yes" and (c_read_only=="OFF" or c_super_read_only=="OFF"):
print "Isslave () is true, but read_only or super_read_only is OFF, seting this parameter to on.read_only:%s,super_read_only:%s"%(c_read_only,c_super_read_only)
set_var("read_only","ON")
set_var("super_read_only","ON")
if vs=='successfully' and isslave=='Yes' and c_read_only=='ON' and c_super_read_only=='ON':
print "It's a healthy slave,select t2:%s,isslave:%s ,read_only:%s,super_read_only:%s"%(vs,isslave,c_read_only,c_super_read_only)
sys.exit(0)
else:
print "It's not a healthy slave,select t2:%s,isslave:%s ,read_only:%s,super_read_only:%s"%(vs,isslave,c_read_only,c_super_read_only)
sys.exit(2)
except IndexError:
print "show slave failed,%s"% str(e)
sys.exit(2)
else:
print 'The parameter is not "master" or "slave"'
sys.exit(2)
#isslave()

Client节点启动并查看状态

consul agent --config-dir=/etc/consul.d/ > /data/consul/consul.log 2>&1 &
[root@db212_11:52:35 /data/consul/script]
#consul members --http-addr=192.168.99.183:8500
Node Address Status Type Build Protocol DC Segment
db3 192.168.99.183:8301 alive server 1.2.2 2 dc1 <all>
db4 192.168.99.184:8301 alive server 1.2.2 2 dc1 <all>
db5 192.168.99.185:8301 alive server 1.2.2 2 dc1 <all>
db210 192.168.99.210:8301 alive client 1.2.2 2 dc1 <default>
db211 192.168.99.211:8301 alive client 1.2.2 2 dc1 <default>
db212 192.168.99.212:8301 alive client 1.2.2 2 dc1 <default>

启动后可以通过看web方式查看服务,并通过DNS查询核实服务是否注册成功:

[root@db212_11:52:56 /data/consul/script]
#nslookup
> server 192.168.99.183
Default server: 192.168.99.183
Address: 192.168.99.183#53
> w_db3508.service.consul.
Server: 192.168.99.183
Address: 192.168.99.183#53 Name: w_db3508.service.consul
Address: 192.168.99.210
> r_db3508.service.consul.
Server: 192.168.99.183
Address: 192.168.99.183#53 Name: r_db3508.service.consul
Address: 192.168.99.211
Name: r_db3508.service.consul
Address: 192.168.99.212
>


转自

服务发现 consul cluster 的搭建 - 2森林 - 博客园 https://www.cnblogs.com/2woods/p/9573313.html

服务发现 consul cluster 的搭建【转】的更多相关文章

  1. 服务发现 consul cluster 的搭建

    consul cluster setup 介绍和指南: consul用于服务发现.当底层服务发生变化时,能及时更新正确的mysql服务IP. 并提供给业务查询.但需要自行编写脚本,监测数据库状态和切断 ...

  2. 服务发现 - consul 的介绍、部署和使用

    什么是服务发现 相关源码: spring cloud demo 微服务的框架体系中,服务发现是不能不提的一个模块.我相信了解或者熟悉微服务的童鞋应该都知道它的重要性.这里我只是简单的提一下,毕竟这不是 ...

  3. 服务发现 - consul 的介绍、部署和使用(转)

    什么是服务发现 相关源码: spring cloud demo 微服务的框架体系中,服务发现是不能不提的一个模块.我相信了解或者熟悉微服务的童鞋应该都知道它的重要性.这里我只是简单的提一下,毕竟这不是 ...

  4. .Net微服务实践(五)[服务发现]:Consul介绍和环境搭建

    目录 介绍 服务发现 健康检查.键值存储和数据中心 架构 Consul模式 环境安装 HTTP API 和Command CLI 示例API介绍 最后 在上篇.Net微服务实践(四)[网关]:Ocel ...

  5. 8分钟学会Consul集群搭建及微服务概念

    Consul介绍: Consul 是由 HashiCorp 公司推出的开源软件,用于实现分布式系统的服务发现与配置.与其他分布式服务注册与发现的方案,Consul 的方案更“一站式”,内置了服务注册与 ...

  6. 服务注册发现consul之一:consul介绍、安装、及功能介绍

    Consul 是一套开源的分布式服务发现和配置管理系统,由 HashiCorp 公司用 Go 语言开发.它具有很多优点.包括:基于 raft 协议,比较简洁: 支持健康检查, 同时支持 HTTP 和 ...

  7. 基于 Consul 的 Docker Swarm 服务发现

    Docker 是一种新型的虚拟化技术,它的目标在于实现轻量级操作系统的虚拟化.相比传统的虚拟化方案,Docker 虚拟化技术有一些很明显的优势:启动容器的速度明显快于传统虚拟化技术,同时创建一台虚拟机 ...

  8. 微服务Consul系列之服务注册与服务发现

    在进行服务注册之前先确认集群是否建立,关于服务注册可以看上篇微服务Consul系列之集群搭建的介绍,两种注册方式:一种是注册HTTP API.另一种是通过配置文件定义,下面讲解的是基于后者配置文件定义 ...

  9. Consul 服务发现与配置

    Consule 是什么 Consul包含多个组件,但是作为一个整体,为你的基础设施提供服务发现和服务配置的工具.他提供以下关键特性: 服务发现 Consul 的客户端可用提供一个服务,比如 api 或 ...

随机推荐

  1. mycat入门--数据库分片

    配置mycat的用户名和密码: 连接mycat,就像连接mysql一样:

  2. 低版本Flume兼容高版本elasticsearch

    Flume更新比较慢,而elasticsearch更新非常快所以当涉及更换elasticsearch版本时会出现不兼容问题. apache-flume-1.6.0+elasticsearch1.5.1 ...

  3. LeetCode_p150_逆波兰表达式计算/后缀表达式计算

    有效的运算符包括 +, -, *, / .每个运算对象可以是整数,也可以是另一个逆波兰表达式. 说明: 整数除法只保留整数部分. 给定逆波兰表达式总是有效的.换句话说,表达式总会得出有效数值且不存在除 ...

  4. XML fragments parsed from previous mappers already contains value for xxxxx

    错误信息: Caused by: org.springframework.core.NestedIOException: Failed to parse mapping resource: 'file ...

  5. Java的String和StringBuilder

    一.String 1.创建String对象的方法: String s1="haha"; String s2=new String(); String s3=new String(& ...

  6. CSS高度塌陷问题与解决办法

    问题描述: 在文档流中,父元素默认被子元素撑开(父多高子多高),一旦子浮动,脱离文档流,父无撑起元素便塌陷,父下的所有元素会上移. (不推荐)可以将父高度写死避免塌陷,但高度写死后,父高度不能自动适应 ...

  7. [ffmpeg] h.264解码所用的主要缓冲区介绍

    在进行h264解码过程中,有两个最重要的结构体,分别为H264Picture.H264SliceContext. H264Picture H264Picture用于维护一帧图像以及与该图像相关的语法元 ...

  8. JDK源代码学习-基础类

    一.概述 1.Java,是一套语言规范,例如规定了变量如何定义.控制语句如何写等,提供基本的语法规范.JDK是java自带的一套调用组件,是对基本java语法规范的进一步封装,jdk中都是使用java ...

  9. Python实现FTP文件的上传和下载

    # coding: utf-8 import os from ftplib import FTP def ftp_connect(host, username, password): ftp = FT ...

  10. 自动化测试框架【windows版】:JMeter + Ant + Jenkins

    前提条件:windows安装了jmeter.ant.jenkins 安装方法参考汇总目录中对应的博文 截图看不清的,可以调大浏览器倍数看 jenkins驱动ant执行,ant驱动jmeter执行 an ...