elasticsearch index 之 engine

elasticsearch对于索引中的数据操作如读写get等接口都封装在engine中，同时engine还封装了索引的读写控制，如流量、错误处理等。engine是离lucene最近的一部分。

engine的实现结构如下所示：

engine接口有三个实现类，主要逻辑都在InternalEngine中。ShadowEngine之实现了engine接口的部分读方法，主要用于对于索引的读操作。shardFSEngine在InternalEngine的基础上实现了recovery方法，它的功能跟InternalEngine基本相同只是它的recovery过程有区别，不会对Translog和index进行快照存储。

Engine类定义了一些index操作的主要方法和内部类，方法如create，index等。内部类如index，delete等。这些方法的实现是在子类中，这些方法的参数是这些内部类。首先看一下它的方法：

 public abstract void create(Create create) throws EngineException;

    public abstract void index(Index index) throws EngineException;

    public abstract void delete(Delete delete) throws EngineException;

    public abstract void delete(DeleteByQuery delete) throws EngineException;

这些抽象方法都在子类中实现，它们的参数都是一类，这些都是Engine的内部类，这些内部类类似于实体类，没有相关逻辑只是由很多filed及get方法构成。如Create和Index都继承自IndexOperation，它们所有信息都存储到IndexOperation的相关Field中，IndexOperation如下所示：

 public static abstract class IndexingOperation implements Operation {

        private final DocumentMapper docMapper;

        private final Term uid;

        private final ParsedDocument doc;

        private long version;

        private final VersionType versionType;

        private final Origin origin;

        private final boolean canHaveDuplicates;

        private final long startTime;

        private long endTime;

    ………………

}

无论是Index还是Create，相关数据和配置都在doc中，根据doc和docMapper就能够获取本次操作的所有信息，另外的一些字段如version，uid都是在类初始化时构建。这样传给实际方法的是一个class，在方法内部根据需求获取到相应的数据，如index方法的实现：

    private void innerIndex(Index index) throws IOException {

        synchronized (dirtyLock(index.uid())) {

            final long currentVersion;

            VersionValue versionValue = versionMap.getUnderLock(index.uid().bytes());

            if (versionValue == null) {

                currentVersion = loadCurrentVersionFromIndex(index.uid());

            } else {

                if (engineConfig.isEnableGcDeletes() && versionValue.delete() && (engineConfig.getThreadPool().estimatedTimeInMillis() - versionValue.time()) > engineConfig.getGcDeletesInMillis()) {

                    currentVersion = Versions.NOT_FOUND; // deleted, and GC

                } else {

                    currentVersion = versionValue.version();

                }

            }

            long updatedVersion;

            long expectedVersion = index.version();

            if (index.versionType().isVersionConflictForWrites(currentVersion, expectedVersion)) {

                if (index.origin() == Operation.Origin.RECOVERY) {

                    return;

                } else {

                    throw new VersionConflictEngineException(shardId, index.type(), index.id(), currentVersion, expectedVersion);

                }

            }

            updatedVersion = index.versionType().updateVersion(currentVersion, expectedVersion);

            index.updateVersion(updatedVersion);

            if (currentVersion == Versions.NOT_FOUND) {

                // document does not exists, we can optimize for create

                index.created(true);

                if (index.docs().size() > 1) {

                    indexWriter.addDocuments(index.docs(), index.analyzer());

                } else {

                    indexWriter.addDocument(index.docs().get(0), index.analyzer());

                }

            } else {

                if (versionValue != null) {

                    index.created(versionValue.delete()); // we have a delete which is not GC'ed...

                }

                if (index.docs().size() > 1) {

                    indexWriter.updateDocuments(index.uid(), index.docs(), index.analyzer());//获取IndexOperation中doc中字段更新索引

                } else {

                    indexWriter.updateDocument(index.uid(), index.docs().get(0), index.analyzer());

                }

            }

            Translog.Location translogLocation = translog.add(new Translog.Index(index));//写translog

            versionMap.putUnderLock(index.uid().bytes(), new VersionValue(updatedVersion, translogLocation));

            indexingService.postIndexUnderLock(index);

        }

    }

这就是Engine中create、index这些方法的实现方式。后面分析索引过程中会有更加详细说明。Engine中还有获取索引状态（元数据）及索引操作的方法如merge。这些方法也是在子类中调用lucene的相关接口，跟create，index，get很类似。因为没有深入Engine的方法实现，因此这里的分析比较简单，后面的分析会涉及这里面很多方法。

总结：这里只是从结构上对indexEngine进行了简单说明，它里面的方法是es对lucene索引操作方法的封装，只是增加了一下处理方面的逻辑如写translog，异常处理等。它的操作对象是shard，es所有对shard的写操作都是通过Engine来实现，后面的分析会有所体现。

elasticsearch index 之 engine的更多相关文章

ElasticSearch Index操作源码分析
ElasticSearch Index操作源码分析本文记录ElasticSearch创建索引执行源码流程.从执行流程角度看一下创建索引会涉及到哪些服务(比如AllocationService.Mas ...
elasticsearch index 之 put mapping
elasticsearch index 之 put mapping mapping机制使得elasticsearch索引数据变的更加灵活,近乎于no schema.mapping可以在建立索引时设 ...
elasticsearch index 功能源码概述
从本篇开始,对elasticsearch的介绍将进入数据功能部分(index),这一部分包括索引的创建,管理,数据索引及搜索等相关功能.对于这一部分的介绍,首先对各个功能模块的分析,然后详细分析数据索 ...
Add mappings to an Elasticsearch index in realtime
Changing mapping on existing index is not an easy task. You may find the reason and possible solutio ...
ElasticSearch Index API && Mapping
ElasticSearch NEST Client 操作Index var indexName="twitter"; var deleteIndexResponse = clie ...
Elasticsearch Index模块
1. Index Setting(索引设置) 每个索引都可以设置索引级别.可选值有: static :只能在索引创建的时候,或者在一个关闭的索引上设置 dynamic:可以动态设置 1.1. S ...
Elasticsearch index fields 重命名
reindex数据复制,重索引 POST _reindex { "source": { "index": "twitter" }, &quo ...
elasticsearch index tuning
一.扩容 tag_server当前使用ElasticSearch版本为5.6,此版本单个index的分片是固定的,一旦创建后不能更改. 1.扩容方法1,不适 ES6.1支持split index功能, ...
elasticsearch index 之 create index（-）
从本篇开始,就进入了Index的核心代码部分.这里首先分析一下索引的创建过程.elasticsearch中的索引是多个分片的集合,它只是逻辑上的索引,并不具备实际的索引功能,所有对数据的操作最终还是由 ...

随机推荐

《剑指offer》二维数组中的查找
一.题目描述在一个二维数组中,每一行都按照从左到右递增的顺序排序,每一列都按照从上到下递增的顺序排序.请完成一个函数,输入这样的一个二维数组和一个整数,判断数组中是否含有该整数. 二.输入描述 ar ...
mutt发邮件
在 /etc/Muttrc 文件添加以下内容: set from="laughingliang@chaincar.com" set use_from=yes set envel ...
mongodb 的查询深入剖析
db.表名.find({goods_id:3}); //查询出 goods_id 为 3 的数据 db.表名.find({cat_i ...
kubernetes学习与实践篇（一）主要概念介绍
什么是kubernetes Kubernetes是Google开源的容器集群管理系统,实现基于Docker构建容器,利用Kubernetes能很方面管理多台Docker主机中的容器. 主要功能将多台 ...
wpf convert png to xaml
原文:wpf convert png to xaml 把png图片转化成xaml资源 <ResourceDictionary xmlns="http://schemas.microso ...
ARM官方《CMSIS-RTOS教程》之线程Threads
创建线程Creating Threads 一旦RTOS开始运行,就会有很多系统调用来管理和控制活跃的线程.默认情况下,main()函数自动被创建为第一个可运行的线程.在第一个例子里我们使用main() ...
Jeff Dean专访，有不少干货
<专访Jeff Dean:我们要推动机器学习再上一层楼> 文件链接如下: Link https://arxiv.org/ 有意思的是,里面提到的 arXiv网站,一个能够用来证明论文上传时 ...
Cisco交换机端口安全
Cisco交换机端口安全通过端口设置,可以限制允许访问交换机上某个端口的MAC地址以及IP(可选)来实现严格控制对该端口的输入,最终确保网络接入安全.配置网络安全时应该注意如下问题: 1 ...
tomcat加载web.xml
这几天看tomcat的源码,疑问很多,比如之一“ tomcat 怎么加载 web.xml”,下面是跟踪的过程,其中事件监听器有一个观察者模式,比较好.记录下来以供参考 >>>> ...
c# 读取 excel文件内容，写入txt文档
1 winform 读取excel文档 1)点击button按钮,弹出上传excel窗口 private void button_headcompany_Click(object sender, Ev ...

elasticsearch index 之 engine

elasticsearch index 之 engine的更多相关文章

随机推荐

热门专题