bleve Search Engine Source Code Analysis: Indexing (the mapping, like Lucene's, also has _all)

Example:

package main

import (
    "fmt"

    "github.com/blevesearch/bleve"
)

func main() {
    // open a new index
    mapping := bleve.NewIndexMapping()
    index, err := bleve.New("example.bleve", mapping)
    if err != nil {
        fmt.Println(err)
        return
    }

    data := struct {
        Name string
        Des  string
    }{
        Name: "hello world this is bone",
        Des:  "this is a good time",
    }

    // index some data, checking the returned error
    if err := index.Index("id", data); err != nil {
        fmt.Println(err)
        return
    }

    // search for some text
    query := bleve.NewMatchQuery("this is bone")
    search := bleve.NewSearchRequest(query)
    searchResults, err := index.Search(search)
    if err != nil {
        fmt.Println(err)
        return
    }
    fmt.Println(searchResults)
}

The mapping is created here:

// NewIndexMapping creates a new IndexMapping that will use all the default indexing rules
func NewIndexMapping() *mapping.IndexMappingImpl {
    return mapping.NewIndexMapping()
}

Could this be the same approach that Lucene uses? The call delegates to the mapping package:

// NewIndexMapping creates a new IndexMapping that will use all the default indexing rules
func NewIndexMapping() *IndexMappingImpl {
    return &IndexMappingImpl{
        TypeMapping:           make(map[string]*DocumentMapping),
        DefaultMapping:        NewDocumentMapping(),
        TypeField:             defaultTypeField,
        DefaultType:           defaultType,
        DefaultAnalyzer:       defaultAnalyzer,
        DefaultDateTimeParser: defaultDateTimeParser,
        DefaultField:          defaultField,
        IndexDynamic:          IndexDynamic,
        StoreDynamic:          StoreDynamic,
        CustomAnalysis:        newCustomAnalysis(),
        cache:                 registry.NewCache(),
    }
}
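The constructor above pairs a per-type map (TypeMapping) with a single DefaultMapping. A minimal sketch of how such a pair is typically consulted is shown below. It uses only the standard library, and the names docMapping, indexMapping, and mappingFor are hypothetical stand-ins, not bleve's types; it is a guess at the general pattern, not a copy of bleve's lookup code.

```go
package main

import "fmt"

// docMapping and indexMapping are simplified, hypothetical stand-ins
// for a document mapping and an index mapping.
type docMapping struct {
	Enabled bool
	Label   string
}

type indexMapping struct {
	TypeMapping    map[string]*docMapping
	DefaultMapping *docMapping
}

// newIndexMapping mirrors the shape of the constructor in the post:
// an empty per-type map plus one default mapping.
func newIndexMapping() *indexMapping {
	return &indexMapping{
		TypeMapping:    make(map[string]*docMapping),
		DefaultMapping: &docMapping{Enabled: true, Label: "default"},
	}
}

// mappingFor returns the mapping registered for docType and falls back
// to the default mapping when none is registered.
func (im *indexMapping) mappingFor(docType string) *docMapping {
	if m, ok := im.TypeMapping[docType]; ok && m != nil {
		return m
	}
	return im.DefaultMapping
}

func main() {
	im := newIndexMapping()
	im.TypeMapping["person"] = &docMapping{Enabled: true, Label: "person"}
	fmt.Println(im.mappingFor("person").Label)
	fmt.Println(im.mappingFor("unknown").Label)
}
```

With a fallback like this, a document whose type has no registered mapping is still indexed under the default rules, which is why the example at the top works without configuring anything.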

New simply sets the index directory and the mapping.

// New index at the specified path, must not exist.
// The provided mapping will be used for all
// Index/Search operations.
func New(path string, mapping mapping.IndexMapping) (Index, error) {
    return newIndexUsing(path, mapping, Config.DefaultIndexType, Config.DefaultKVStore, nil)
}

Implementation of indexing a document (the version shown is the Index method on Batch):

// Index adds the specified index operation to the
// batch.  NOTE: the bleve Index is not updated
// until the batch is executed.
func (b *Batch) Index(id string, data interface{}) error {
    if id == "" {
        return ErrorEmptyID
    }
    doc := document.NewDocument(id)
    err := b.index.Mapping().MapDocument(doc, data)
    if err != nil {
        return err
    }
    b.internal.Update(doc)
    return nil
}
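The comment on Batch.Index says the index is not updated until the batch is executed. The following toy model, written with the standard library only, illustrates that deferred behavior. The types toyIndex and toyBatch and the method Execute are hypothetical and exist only for this sketch; they are not bleve's implementation.

```go
package main

import (
	"errors"
	"fmt"
)

// errEmptyID plays the role of an "empty ID" error in this sketch.
var errEmptyID = errors.New("document identifier cannot be empty")

// toyIndex and toyBatch are hypothetical types for illustration only.
type toyIndex struct {
	docs map[string]string
}

type toyBatch struct {
	index   *toyIndex
	pending map[string]string
}

func newToyIndex() *toyIndex {
	return &toyIndex{docs: make(map[string]string)}
}

// NewBatch creates an empty batch bound to this index.
func (idx *toyIndex) NewBatch() *toyBatch {
	return &toyBatch{index: idx, pending: make(map[string]string)}
}

// Index only queues the document; the index itself is left untouched.
func (b *toyBatch) Index(id, body string) error {
	if id == "" {
		return errEmptyID
	}
	b.pending[id] = body
	return nil
}

// Execute applies every queued document to the index and resets the batch.
func (b *toyBatch) Execute() {
	for id, body := range b.pending {
		b.index.docs[id] = body
	}
	b.pending = make(map[string]string)
}

func main() {
	idx := newToyIndex()
	batch := idx.NewBatch()
	if err := batch.Index("id", "hello world this is bone"); err != nil {
		fmt.Println(err)
		return
	}
	fmt.Println("documents before execute:", len(idx.docs))
	batch.Execute()
	fmt.Println("documents after execute:", len(idx.docs))
}
```

In bleve itself, a batch is obtained with index.NewBatch() and applied with index.Batch(batch); the sketch folds the second step into a method on the batch purely to keep the example short.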

Within it, NewDocument is implemented as follows:

type Document struct {
    ID              string  `json:"id"`
    Fields          []Field `json:"fields"`
    CompositeFields []*CompositeField
    Number          uint64 `json:"-"`
}

func NewDocument(id string) *Document {
    return &Document{
        ID:              id,
        Fields:          make([]Field, 0),
        CompositeFields: make([]*CompositeField, 0),
    }
}

MapDocument is implemented as follows:

func (im *IndexMappingImpl) MapDocument(doc *document.Document, data interface{}) error {
    docType := im.determineType(data)
    docMapping := im.mappingForType(docType)
    walkContext := im.newWalkContext(doc, docMapping)
    if docMapping.Enabled {
        docMapping.walkDocument(data, []string{}, []uint64{}, walkContext)

        // see if the _all field was disabled
        allMapping := docMapping.documentMappingForPath("_all")
        if allMapping == nil || (allMapping.Enabled != false) {
            field := document.NewCompositeFieldWithIndexingOptions("_all", true, []string{}, walkContext.excludedFromAll, document.IndexField|document.IncludeTermVectors)
            doc.AddField(field)
        }
    }
    return nil
}
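MapDocument walks the data and then, unless _all is disabled, adds a composite _all field that covers everything not listed in excludedFromAll. The sketch below models that flow with the standard library only. The names field, walkStruct, and buildAll are hypothetical, and joining raw text is a simplification chosen for readability; the excerpt does not show how bleve's composite field is populated internally.

```go
package main

import (
	"fmt"
	"reflect"
	"strings"
)

// field is a simplified, hypothetical stand-in for an indexed field.
type field struct {
	Name  string
	Value string
}

// walkStruct collects the exported string fields of a struct value.
// It is a toy document walk, not bleve's walkDocument.
func walkStruct(data interface{}) []field {
	var fields []field
	v := reflect.ValueOf(data)
	if v.Kind() == reflect.Ptr {
		v = v.Elem()
	}
	if v.Kind() != reflect.Struct {
		return fields
	}
	t := v.Type()
	for i := 0; i < v.NumField(); i++ {
		sf := t.Field(i)
		if sf.PkgPath != "" { // skip unexported fields
			continue
		}
		if v.Field(i).Kind() == reflect.String {
			fields = append(fields, field{Name: sf.Name, Value: v.Field(i).String()})
		}
	}
	return fields
}

// buildAll joins every field value except the excluded ones into a
// single catch-all field named "_all".
func buildAll(fields []field, excluded map[string]bool) field {
	var parts []string
	for _, f := range fields {
		if !excluded[f.Name] {
			parts = append(parts, f.Value)
		}
	}
	return field{Name: "_all", Value: strings.Join(parts, " ")}
}

func main() {
	data := struct {
		Name string
		Des  string
	}{
		Name: "hello world this is bone",
		Des:  "this is a good time",
	}

	fields := walkStruct(data)
	all := buildAll(fields, nil)
	fmt.Printf("%d fields, %s = %q\n", len(fields), all.Name, all.Value)
}
```

A catch-all field of this kind is what lets the opening example search for "this is bone" without naming a field: the query can be matched against the combined content rather than against Name or Des individually.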

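Wow, it looks like bleve's design really does follow the Lucene ecosystem: it also has an _all field (the catch-all field familiar from Elasticsearch, which is built on Lucene).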

Will the posting lists later on use skip lists as well?
