最近几天的push:user:req数据写不到集群,报错如下:

[--13T09::,][DEBUG][o.e.a.b.TransportShardBulkAction] [yunva_etl_es8] [push:user:req][] failed to execute bulk item (index) index {[push:user:req][/push/user/req][AVyfE_JChh2DyKB0G1Kj], source[{"appId":"","yunvaId":"362b103623aa4f75911b374615c6a682","providerId":"","proviceId":"","mac":"B0:C4:E7:61:E6:E7","imsi":"","imei":"","factory":"samsung","model":"GT-S7568","osType":"android","osVersion":,"networkType":"wifi","ip":"14.152.80.239","sessionId":"","time":"2017-06-13 09:11:39.061.+0800"}]}
java.lang.IllegalArgumentException: number of documents in the index cannot exceed
at org.apache.lucene.index.DocumentsWriterPerThread.reserveOneDoc(DocumentsWriterPerThread.java:) ~[lucene-core-6.2..jar:6.2. 43ab70147eb494324a1410f7a9f16a896a59bc6f - shalin - -- ::]
at org.apache.lucene.index.DocumentsWriterPerThread.updateDocument(DocumentsWriterPerThread.java:) ~[lucene-core-6.2..jar:6.2. 43ab70147eb494324a1410f7a9f16a896a59bc6f - shalin - -- ::]
at org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:) ~[lucene-core-6.2..jar:6.2. 43ab70147eb494324a1410f7a9f16a896a59bc6f - shalin - -- ::]
at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:) ~[lucene-core-6.2..jar:6.2. 43ab70147eb494324a1410f7a9f16a896a59bc6f - shalin - -- ::]
at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:) ~[lucene-core-6.2..jar:6.2. 43ab70147eb494324a1410f7a9f16a896a59bc6f - shalin - -- ::]
at org.elasticsearch.index.engine.InternalEngine.index(InternalEngine.java:) ~[elasticsearch-5.0..jar:5.0.]

搜索了下:
是lucene的文档数限制,每个分片最大支持2的31次方个文档数量
https://discuss.elastic.co/t/lucene-max-documents-limit/34761
Lucene max documents limit,Yep, there is a 2^31 limit per shard in Lucene.

查询当前es分片文档:

# curl 10.26.241.237:/_cat/shards?v
index shard prirep state docs store ip node push:user:req p STARTED .7gb 10.30.136.143 yunva_etl_es8
push:user:req r STARTED .7gb 10.25.135.215 yunva_etl_es2
push:user:req p STARTED .7gb 10.45.150.115 yunva_etl_es9
push:user:req r STARTED .7gb 10.30.136.143 yunva_etl_es8
push:user:req r STARTED .9gb 10.30.136.143 yunva_etl_es8
push:user:req p STARTED .9gb 10.174.12.230 yunva_etl_es10
push:user:req p STARTED 884gb 10.174.12.230 yunva_etl_es10
push:user:req r STARTED 884gb 10.27.78.228 yunva_etl_es5
push:user:req r STARTED 883gb 10.45.150.115 yunva_etl_es9
push:user:req p STARTED 883gb 10.174.12.230 yunva_etl_es10

查询出来的2147483519和2的31次方2147483648接近,所以新增加的数据无法插入

临时的解决办法:先把kafka的磁盘加到数据多保留一段时间20天

原来的集群数据抽取到hbase中作为冷数据处理,再把kafka里面的数据导入数据到新的集群中

生产环境elasticsearch5.0报错IllegalArgumentException: number of documents in the index cannot exceed 2147483519的处理的更多相关文章

  1. 生产环境elasticsearch5.0.1和6.3.2集群的部署配置详解

    线上环境elasticsearch5.0.1集群的配置部署 es集群的规划: 硬件: 7台8核.64G内存.2T ssd硬盘加1台8核16G的阿里云服务器 其中一台作为kibana+kafka连接查询 ...

  2. JDK1.8环境下依然报错 Unsupported major.minor version 52.0

    JDK1.8环境下依然报错 Unsupported major.minor version 52.0 在配置elasticsearch-rtf全文搜索引擎时,按照Github上项目readme.md来 ...

  3. centos7.2环境elasticsearch-5.0.1+kibana-5.0.1+zookeeper3.4.6+kafka_2.9.2-0.8.2.1部署详解

    centos7.2环境elasticsearch-5.0.1+kibana-5.0.1+zookeeper3.4.6+kafka_2.9.2-0.8.2.1部署详解 环境准备: 操作系统:centos ...

  4. Tomcat6环境JBPM4.4报错:java.lang.ClassNotFoundException: de.odysseus.el.util.SimpleResolver

    Tomcat6环境JBPM4.4报错:java.lang.ClassNotFoundException: de.odysseus.el.util.SimpleResolver 报错信息:

  5. [转]happybase1.0 报错:ThriftPy does not support generating module with path in protocol 'f'

    happybase1.0 报错:ThriftPy does not support generating module with path in protocol 'f' 2016-10-12 14: ...

  6. jmeter Linux环境执行总报错 cannot allocate memory

    1.windows环境写好的测试用例,执行没有问题,在Linux环境跑总是报错,提示如下 cannot allocate memory 2.一开始以为是哪块设置有问题,因为脚本里边有设置邮件自动发送, ...

  7. jQuery3.0+报错Uncaught TypeError: e.indexOf is not a function

    jQuery3.0+报错Uncaught TypeError: e.indexOf is not a function 使用.load()绑定事件时报错,Uncaught TypeError: e.i ...

  8. MySQL8.0报错Can't connect to MySQL server on 'localhost' (10061)的解决办法

    MySQL8.0报错Can't connect to MySQL server on 'localhost' (10061)的解决办法 事情的起因     今天课堂上要展示小组项目,需要用一个软件叫W ...

  9. nginx报错:403 Forbidden 并且访问首页index.php是下载文件的状态

    nginx报错:403 Forbidden 并且访问首页index.php是下载文件的状态,不能正常解析php 系统有其他两个站访问是正常的 看日志没有看到明显的错误 搜索了下: 答案如下: php的 ...

随机推荐

  1. PHP7 网络编程(二)daemon守护进程

    前言 在一个多任务的计算机操作系统中,守护进程(英语:daemon,/ˈdiːmən/或/ˈdeɪmən/)是一种在后台执行的计算机程序.此类程序会被以进程的形式初始化.守护进程程序的名称通常以字母“ ...

  2. 细说java平台日志组件

    1. java.util.logging JDK自带日志组件,使用方式简单,不需要依赖第三方日志组件.支持将日志打印到控制台,文件,甚至可以将日志通过网络打印到指定主机.相对于第三方独立日志框架来说, ...

  3. SQL Server进阶(四):联接-cross join、inner join、left join、right jion、union、union all

    测试数据脚本 CREATE TABLE Atable ( S# INT, Sname ), Sage INT, Sfrom ) ) insert into Atable ,N,N'A' union a ...

  4. 海南医院帆软报表 最终版本SQL

    1. 挂号微信收入明细 -- WANTED-1283 门诊查询统计报表-挂号微信收入明细 select p.name as patientName, -- 患者姓名 p.birthday, -- 出生 ...

  5. TCP时间获取程序

    一.服务器程序 #include <stdio.h> #include <stdlib.h> #include <time.h> #include <sys/ ...

  6. 通过修改配置文件修改MySQL的时区设置

    一.找到my.ini文件 二.将时区改为东八区 添加:default-time-zone='+08:00'

  7. 关于AJAX的基础操作

    AJAX开发 AJAX即“Asynchronous Javascript And XML”(异步JavaScript和XML),是指一种创建交互式网页应用的网页开发技术. AJAX = 异步 Java ...

  8. html页面设置<span>的高度和宽度

    <span>标签属于行内元素(inline),所以无法设置高度和宽度:如果需要改变其宽高,就需要将其转变为块体元素(block)或行内块体元素(inle-block): 1 span{di ...

  9. Coursera, Machine Learning, Unsupervised Learning, K-means, Dimentionality Reduction

    Clustering  K-means: 基本思想是先随机选择要分类数目的点,然后找出距离这些点最近的training data 着色,距离哪个点近就算哪种类型,再对每种分类算出平均值,把中心点移动到 ...

  10. Coursera, Deep Learning 4, Convolutional Neural Networks, week3, Object detection

    学习目标 Understand the challenges of Object Localization, Object Detection and Landmark Finding Underst ...