lucene中Field简介

Lucene 6.1.0中存在的field种类如下（后缀是Field）：

下面介绍几个常用的Field类型：

TextField

A field that is indexed and tokenized, without term vectors. For example this would be used on a 'body' field, that contains the bulk of a document's text.
是一个会自动被索引和分词的字段。一般被用在文章的正文部分。

StringField

A field that is indexed but not tokenized: the entire String value is indexed as a single token. For example this might be used for a 'country' field or an 'id' field. If you also need to sort on this field, separately add a SortedDocValuesField to your document.
StringField会被索引，但是不会被分词，即会被当作一个完整的token处理，一般用在“国家”或者“ID”.

StoredField

A field whose value is stored so that IndexSearcher.doc(int) and IndexReader.document() will return the field and its value.
也就是一个默认会被存储的Field。

举个例子
（下面是对新闻数据进行索引的过程，数据存储在MySQL数据库中，title列存文章标题，content存正文，url存文章所在的链接，author是文章的作者）：

Field field = null;

if (rs.getString("title") != null) {

    field = new TextField("title", rs.getString("title"), Field.Store.YES);

    document.add(field);

}

if (rs.getString("content") != null) {

    field = new TextField("content", rs.getString("content"), Field.Store.NO);

    document.add(field);

}

if (rs.getString("url") != null) {

    field = new StringField("url", rs.getString("url"), Field.Store.YES);

    document.add(field);

}

if (rs.getString("author") != null) {

    field = new TextField("author", rs.getString("author"), Field.Store.YES);

    document.add(field);

}

    writer.addDocument(document);

第一个参数是设置field的name，第二个是value，第三个是选择是否存储，如果存储的话在检索的时候可以返回值。
一般对于文章正文都不需要存储，在检索的时候只需要返回文章的标题和url即可。

lucene中Field简介的更多相关文章

lucene中Field简析
http://blog.csdn.net/zhaoxiao2008/article/details/14180019 先看一段lucene3代码 Document doc = new Document ...
lucene中Field.Index,Field.Store详解
lucene在doc.add(new Field("content",curArt.getContent(),Field.Store.NO,Field.Index.TOKENIZE ...
【转载】lucene中Field.Index,Field.Store详解
lucene在doc.add(new Field("content",curArt.getContent(),Field.Store.NO,Field.Index.TOKENIZE ...
lucene中Field.Index,Field.Store的一些设置
lucene在doc.add(new Field("content",curArt.getContent(),Field.Store.NO,Field.Index.TOKENIZE ...
Lucene 的 Field 域和索引维护
一.Field 域 1.Field 属性 Field 是文档中的域,包括 Field 名和 Field 值两部分,一个文档可以包括多个 Field,Document 只是 Field 的一个承载体,F ...
lucene中FSDirectory、RAMDirectory的用法
package com.ljq.one; import java.io.BufferedReader;import java.io.File;import java.io.FileInputStrea ...
【Lucene3.6.2入门系列】第03节_简述Lucene中常见的搜索功能
package com.jadyer.lucene; import java.io.File; import java.io.IOException; import java.text.SimpleD ...
Lucene中的 Query对象
"Lucene中的 Query对象": 检索前,需要对检索字符串进行分析,这是由queryparser来完成的.为了保证查询的正确性,最好用创建索引文件时同样的分析器. quer ...
lucene 中关于Store.YES 关于Store.NO的解释
总算搞明白 lucene 中关于Store.YES 关于Store.NO的解释了一直对Lucene Store.YES不太理解,网上多数的说法是存储字段,NO为不存储. 这样的解释有点郁闷:字面意 ...

随机推荐

基础线程机制--Daemon，sleep()，yield()
Daemon 守护线程是程序运行时在后台提供服务的线程,不属于程序中不可或缺的部分,当所有非守护进程执行完成时,程序也就终止,同时会杀死所有的守护进程.main()属于非守护线程.可以使用setD ...
hdu 6512 Triangle
Problem Description After Xiaoteng took a math class, he learned a lot of different shapes, but Xiao ...
jenkins在windows系统下部署安装,使用
首先需要从官网上下载下来war包,让进入tomcat中启动tomcat,然后可以看一堆日志再在网站输入 localhost:8080/jenkins就会进去下面界面: 会出现上面状况: 需要进入: ...
Unity 动画系统目录之 Animation
返回 Unity 动画系统目录官方文档 Animation:https://docs.unity3d.com/ScriptReference/Animation.html Animator:http ...
C#校验手机端或客户端
以下代码用来检查,客户端是手机端还是PC端 string strUserAgent = Request.UserAgent.ToString().ToLower(); bool isMobile = ...
vs快捷键（SharePoint项目）
1.ctrl+c,alt+c,shift+ctrl+c: ========== Copying to SharePoint Root =========={ProjectRoot}\pkg\Debug ...
d题
#include<iostream>#include<algorithm>using namespace std;int a[200005];int b[200005];int ...
Win10家庭版组策略gpedit.msc的问题
大家都认为,Windows家庭版中并不包含组策略,其实不然,它是有相关文件的,只是不让你使用而已.那么我们让系统允许你使用就好了.首先你需要在桌面上新建一个txt文本文档.然后将以下代码复制到这个新建 ...
工作ui（2）
做完整个小Demo整理的一些方法和踩过的miniUI的坑,分享出来希望大家批评指正,共同进步. 1.动态创建列:尽量不要直接在html文件里创建列,动态设置在js文件里方面添加.修改等. 首先把列定义 ...
c++11 move构造函数和move operator 函数学习
先看个代码吧!!!!!!!!!! #include <iostream> using namespace std; class A { public: A(){cout<<&q ...

lucene中Field简介

lucene中Field简介的更多相关文章

随机推荐

热门专题