DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
安装greenplum集群出现以下错误:
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, please wait...
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Reading Greenplum configuration file init_config
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale has not been set in init_config, will set to default value
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-Locale set to en_US.utf8
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-No DATABASE_NAME set, will exit following template1 updates
20160315:13:49:16:025696 gpinitsystem:h95:jason-[INFO]:-MASTER_MAX_CONNECT not set, will set to default value 250
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Checking configuration parameters, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, please wait...
..
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Configuring build for standard array
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Commencing multi-home checks, Completed
20160315:13:49:17:025696 gpinitsystem:h95:jason-[INFO]:-Building primary segment instance array, please wait...
..................
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking Master host
20160315:13:49:24:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, please wait...
..................
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Checking new segment hosts, Completed
20160315:13:49:39:025696 gpinitsystem:h95:jason-[INFO]:-Building the Master instance database, please wait...
20160315:13:49:49:025696 gpinitsystem:h95:jason-[INFO]:-Starting the Master in admin mode
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Commencing parallel build of primary segment instances
20160315:13:51:35:025696 gpinitsystem:h95:jason-[INFO]:-Spawning parallel processes batch [1], please wait...
..................
20160315:13:51:36:025696 gpinitsystem:h95:jason-[INFO]:-Waiting for parallel processes batch [1], please wait...
..................................................
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Parallel process exit status
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as completed = 18
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as killed = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:-Total processes marked as failed = 0
20160315:13:52:26:025696 gpinitsystem:h95:jason-[INFO]:------------------------------------------------
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Deleting distributed backout files
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Removing back out file
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-No errors generated from parallel processes
20160315:13:52:27:025696 gpinitsystem:h95:jason-[INFO]:-Restarting the Greenplum instance in production mode
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Starting gpstop with args: -a -i -m -d /home/jason/gpdata/gpseg-1
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Greenplum Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-There are 0 connections to the database
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode='immediate'
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master host=h95
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Commencing Master instance shutdown with mode=immediate
20160315:13:52:27:011100 gpstop:h95:jason-[INFO]:-Master segment instance directory=/home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Attempting forceful termination of any leftover master process
20160315:13:52:28:011100 gpstop:h95:jason-[INFO]:-Terminating processes for segment /home/jason/gpdata/gpseg-1
20160315:13:52:28:011100 gpstop:h95:jason-[ERROR]:-Failed to kill processes for segment /home/jason/gpdata/gpseg-1: ([Errno 3] No such process)
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting gpstart with args: -a -d /home/jason/gpdata/gpseg-1
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Gathering information and validating the environment...
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Binary Version: 'greenplum (Greenplum Database) 4.3.99.00 build dev'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Greenplum Catalog Version: '201310150'
20160315:13:52:28:011187 gpstart:h95:jason-[INFO]:-Starting Master instance in admin mode
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Greenplum Master catalog information
20160315:13:52:29:011187 gpstart:h95:jason-[INFO]:-Obtaining Segment details from master...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Setting new master era
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Master Started...
20160315:13:52:30:011187 gpstart:h95:jason-[INFO]:-Shutting down master
20160315:13:52:31:011187 gpstart:h95:jason-[INFO]:-Commencing parallel segment instance startup, please wait...
.......
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Process results...
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Successful segment starts = 18
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Failed segment starts = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:- Skipped segment starts (segments are marked down in configuration) = 0
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Successfully started 18 of 18 segment instances
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-----------------------------------------------------
20160315:13:52:38:011187 gpstart:h95:jason-[INFO]:-Starting Master instance h95 directory /home/jason/gpdata/gpseg-1
20160315:13:52:39:011187 gpstart:h95:jason-[INFO]:-Command sys_ctl reports Master h95 instance active
20160315:13:54:33:011187 gpstart:h95:jason-[WARNING]:-FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603) 20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-No standby master configured. skipping...
20160315:13:54:33:011187 gpstart:h95:jason-[INFO]:-Check status of database with gpstate utility
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Completed restart of Greenplum instance in production mode
20160315:13:54:37:025696 gpinitsystem:h95:jason-[INFO]:-Loading gp_toolkit...
psql: FATAL: DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
20160315:13:56:26:gpinitsystem:h95:jason-[FATAL]:-Failed to retrieve rolname. Script Exiting!
我的集群配置:两台机器,32g内存16g交换分区。每台机器9个节点。集群按照完成之后,显示segment启动的18个,但是通过psql连接不上,报错!
主要错误信息:
DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)
去官网看了很多人遇到此类的问题,错误原因有很多,今天特地总结以下:
Q&A1:系统环境变量没有设置正确,这个需要根据自己安装版本的greenplum去设置一下环境变量,可以去官网相对应的版本install guide 那里设置一下!
Q&A2:shared_buffers设置太大,对于如何根据自己内存和segment节点个数分配shared_buffers,可以去官网找一下,通常出去2g的other,以及statement_mem * segment 个数,剩下的除以segment的个数即可。这种情况通常出现中安装过程中就设置了shared_buffers,一般默认的125MB
Q&A3:防火墙是否关闭,这个情况最容易忽略,也是最容易出现的,通常有些人重启机器之后就忘记了关闭,我就是这样的,嘿嘿。你可以设置防火墙重启后一样生效!
。。。还有其他的原因欢迎来补充!谢谢,分享是一种美,希望能帮到你!
DTM initialization: failure during startup recovery, retry failed, check segment status (cdbtm.c:1603)的更多相关文章
- 运行QQ出现initialization failure 0x0000000c错误和浏览器上不了网
出现QQ出现initialization failure 0x0000000c错误和浏览器上不了网的问题,原因是关机的时候没有正常关闭导致的. 解决方法: 1.我们在开始菜单栏中的附件中找到“命令提示 ...
- Fast Failure Detection and Recovery in SDN with Stateful Data Plane
文章名称:Fast Failure Detection and Recovery in SDN with Stateful Data Plane 利用SDN的带状态数据平面进行快速故障检测和恢复 发表 ...
- java.lang.NoClassDefFoundError: com.sap.conn.jco.JCo (initialization failure) java.lang.UnsatisfiedLinkError: no sapjco3 in java.library.path
java.lang.NoClassDefFoundError: com.sap.conn.jco.JCo (initialization failure) at java.lang.J9VMInter ...
- java执行spark查询hbase的jar包出现错误提示:ob aborted due to stage failure: Master removed our application: FAILED
执行java调用scala 打包后的jar时候出现异常 /14 23:57:08 WARN TaskSchedulerImpl: Initial job has not accepted any re ...
- ”initialization failure:0x0000000C“错误,何解?
今天开机后打开软件,报出这样的警告”initialization failure:0x0000000C“. 我问了度娘,看了很多回答,答案参差不齐.其中,有个回答还是很不错的(刚好我的是win10系统 ...
- “error: command 'x86_64-linux-gnu-gcc' failed with exit status 1” in virtualenv
Most of the time these are dependency-issues. Following the stack-trace of the gcc compiler one ca ...
- 10.Execution failed with exit status: 3
错误信息: insert overwrite table t_mobile_mid_use_p_tmp4_rcf select '201411' as month_id, a.prov_id, a.c ...
- command 'x86_64-linux-gnu-gcc' failed with exit status 1错误及解决方案
Ubuntu16.04安装Scrapy(pip install Scrapy)时提示错误如下: Failed building wheel for cryptography Running setup ...
- error: command 'cc' failed with exit status 1
报错: Complete output from command /usr/bin/python -u -c "import setuptools, tokenize;__file__='/ ...
随机推荐
- Vector(容器)
vector(容器)就像数组一样,但比数组强大很多,下面介绍一下vector常用的几种方法: 一.对于vector自身的处理,包括赋初始值,复制等等: vector<int> v1 ; v ...
- Media Queries for Standard Devices
/* Smartphones (portrait and landscape) ----------- */ @media only screen and (min-device-width : 32 ...
- 密码输入模块getpass
getpass模块用于命令行输入密码,它提供了两个函数. getpass.getpass([prompt[, stream]]) 提示用户输入密码,同时不显示输入的密码 ...
- Yii2.0中文开发向导——rules常用规则
public function rules(){ return [ //必须填写 ['email, username, password,agree,verifyPassword,verifyCode ...
- [LeetCode]题解(python):097-Interleaving String
题目来源: https://leetcode.com/problems/interleaving-string/ 题意分析: 给定字符串s1,s2,s3,判断s3是否由s1和s2穿插组成.如“abc” ...
- python自学笔记(四)python基本数据类型之元组、集合、字典
一.元组tuple 特性 1.有序集合 2.通过偏移来取数据 3.不可变对象,不能在原地修改内存,没有排序.修改等操作 元组不可变的好处:保证数据的安全,比如我们传给一个不熟悉的方法,确保不会改变我们 ...
- IPv6地址的ping、telnet等操作
最近在研究https协议是如何传输数据的,用wireshark抓包分析,发现客户机和google网站在传输数据时使用了IPv6地址,于是相对ipv6地址测试下基本的功能. ping功能,直接使用pin ...
- 转: sublime text常用插件和快捷键
Sublime Text 2是一个轻量.简洁.高效.跨平台的编辑器.博主之前一直用notepdd++写前端代码,用得也挺顺手了,早就听说sublime的大名,一直也懒得去试试看,认为都是工具用着顺手就 ...
- linux技术框架
编程语言 一般使用c或者c++ linux使用 鸟哥私房菜 工具使用 代码编辑source insight,代码编译gcc,代码调试gdb,代码编译组织makefile,命令执行shell,文本编辑n ...
- Hat’s Words(字典树)
Hat’s Words Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 65536/32768 K (Java/Others)Total ...