lucene 中关于Store.YES 关于Store.NO的解释

总算搞明白 lucene 中关于Store.YES 关于Store.NO的解释了

一直对Lucene Store.YES不太理解，网上多数的说法是存储字段，NO为不存储。

这样的解释有点郁闷：字面意思一看就明白，但是不解。

之前我的理解是：如果字段可以不存储，那要怎么搜索这个不存储的字段呢？

原来Lucene就是这样，可以设置某些字段为不存储，但是可以用来检索。

终于在一篇文章里看到这几句话，突然间就明白了。

//Store.YES 保存可以查询可以打印内容
Field storeYes = new Field("storeyes","storeyes",Store.YES,Index.TOKENIZED);
//Store.NO 不保存可以查询不可打印内容由于不保存内容所以节省空间，但是这个索引是存在的，可以通过这个索引去检索
Field storeNo = new Field("storeno","storeno",Store.NO,Index.TOKENIZED);
//Store.COMPRESS 压缩保存可以查询可以打印内容可以节省生成索引文件的空间，Field storeCompress = new Field("storecompress","storecompress",Store.COMPRESS,Index.TOKENIZED);

至此，对于理解Store.YES，Store.NO 就是不存储就不能直接获取此字段的内容，存储了就可以。但是两者都可以用于检索。

字段是否能被搜索，还与Index有关。

package luxun.lucene.base;

import java.io.File;
import java.io.IOException;
import java.util.Arrays;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.Field.Index;
import org.apache.lucene.document.Field.Store;
import org.apache.lucene.index.CorruptIndexException;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.index.Term;
import org.apache.lucene.search.FieldDoc;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.Sort;
import org.apache.lucene.search.SortField;
import org.apache.lucene.search.SortField.Type;
import org.apache.lucene.search.TermQuery;
import org.apache.lucene.search.TopFieldDocs;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;
import org.apache.lucene.store.LockObtainFailedException;
import org.apache.lucene.store.SimpleFSDirectory;
import org.apache.lucene.util.Version;
public class SortFieldValueTest {

   @SuppressWarnings("deprecation")
   public void buildIndex() throws CorruptIndexException, LockObtainFailedException, IOException {
       File indexDir = new File("/home/cristo/luxun_test/luxunlucenetest/04/index");
       // dataDir is the directory that hosts the text files that to be indexed
       Directory directory = new SimpleFSDirectory(indexDir);
       Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_47);
       IndexWriter writer = new IndexWriter(directory,    new IndexWriterConfig(Version.LUCENE_47, analyzer));
       Document doc = new Document();
       doc.add(new Field("id", "1", Store.YES, Index.NO));
       doc.add(new Field("text", "lucene", Store.NO, Index.ANALYZED));
       doc.add(new Field("time", "2010", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       doc.add(new Field("tide", "149", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       writer.addDocument(doc);
       doc = new Document();
       doc.add(new Field("id", "3", Store.YES, Index.NO));
       doc.add(new Field("text", "lucene", Store.NO, Index.ANALYZED));
       doc.add(new Field("time", "2011", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       doc.add(new Field("tide", "14", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       writer.addDocument(doc);
       doc = new Document();
       doc.add(new Field("id", "2", Store.YES, Index.NO));
       doc.add(new Field("text", "lucene", Store.NO, Index.ANALYZED));
       doc.add(new Field("time", "2001", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       doc.add(new Field("tide", "13", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       writer.addDocument(doc);
       doc = new Document();
       doc.add(new Field("id", "5", Store.YES, Index.NO));
       doc.add(new Field("text", "lucene", Store.NO, Index.ANALYZED));
       doc.add(new Field("time", "2001", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       doc.add(new Field("tide", "19", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       writer.addDocument(doc);
       doc = new Document();
       doc.add(new Field("id", "9", Store.YES, Index.NO));
       doc.add(new Field("text", "lucene", Store.NO, Index.ANALYZED));
       doc.add(new Field("time", "2171", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       doc.add(new Field("tide", "19", Store.NO, Index.NOT_ANALYZED_NO_NORMS));
       writer.addDocument(doc);
       writer.commit();
       writer.close();
   }
   public void searchWithOneSortField() throws CorruptIndexException, IOException {//可以通过对time属性进行索引，排序，获取对应的id
       File indexDir = new File("/home/cristo/luxun_test/luxunlucenetest/04/index");
       @SuppressWarnings("deprecation")
       IndexSearcher searcher = new IndexSearcher(IndexReader.open(FSDirectory.open( indexDir)));
       TermQuery termQuery = new TermQuery(new Term("time", "2001"));
       TopFieldDocs topFieldDocs = searcher.search(termQuery, null, 10, new Sort(new SortField("time",Type.STRING, true)));
       ScoreDoc[] sorDocs = topFieldDocs.scoreDocs;
       for (ScoreDoc doc : sorDocs) {
       //   FieldDoc fieldDoc = (FieldDoc) doc;
           System.out.println(searcher.doc(doc.doc).get("id"));
       }
   }

   public static void main(String[] args) throws CorruptIndexException, IOException {
       SortFieldValueTest SortFieldValueTest1=new SortFieldValueTest();
       SortFieldValueTest1.buildIndex();
       SortFieldValueTest1.searchWithOneSortField();

}
}

参考http://blog.csdn.net/telnetor/article/details/6187378

http://www.codeweblog.com/%E6%80%BB%E7%AE%97%E6%89%BE%E5%88%B0lucene-%E5%85%B3%E4%BA%8Estore-yes%E7%9A%84%E8%A7%A3%E9%87%8A%E4%BA%86/

lucene 中关于Store.YES 关于Store.NO的解释的更多相关文章

SQL Server中TempDB管理（version store的逻辑结构）
原文:SQL Server中TempDB管理(version store的逻辑结构) 原文来自: http://blogs.msdn.com/b/sqlserverstorageengine/arch ...
vue-learning：41 - Vuex - 第二篇：const store = new Vue.Store(option)中option选项、store实例对象的属性和方法
vuex 第二篇:const store = new Vue.Store(option)中option选项.store实例对象的属性和方法 import Vuex from 'vuex' const ...
【Lucene3.6.2入门系列】第03节_简述Lucene中常见的搜索功能
package com.jadyer.lucene; import java.io.File; import java.io.IOException; import java.text.SimpleD ...
Lucene 中自定义排序的实现
使用Lucene来搜索内容,搜索结果的显示顺序当然是比较重要的.Lucene中Build-in的几个排序定义在大多数情况下是不适合我们使用的.要适合自己的应用程序的场景,就只能自定义排序功能,本节我们 ...
lucene中的IndexWriter.setMaxFieldLength()
lucene中的IndexWriter.setMaxFieldLength() 老版本的Lucene中,IndexWriter的maxFieldLength是指一个索引中的最大的Field个数. 这个 ...
《Lucene in Action 第二版》第4章节学习总结 -- Lucene中的分析
通过第四章的学习,可以了解lucene的分析过程是怎样的,并且可以学会如何使用lucene内置分析器,以及自定义分析器.下面是具体总结 1. 分析(Analysis)是什么? 在lucene中,分析就 ...
lucene中Field简析
http://blog.csdn.net/zhaoxiao2008/article/details/14180019 先看一段lucene3代码 Document doc = new Document ...
lucene中TOKENIZED,UN_TOKENIZED 解釋
Field("content",curArt.getContent(),Field.Store.NO,Field.Index.TOKENIZED)); 這些地方與舊版本有很大的區別 ...
Lucene中的 Query对象
"Lucene中的 Query对象": 检索前,需要对检索字符串进行分析,这是由queryparser来完成的.为了保证查询的正确性,最好用创建索引文件时同样的分析器. quer ...

随机推荐

SQL 列提取组成字符串
SELECT BussinessNo = STUFF(REPLACE(REPLACE((SELECT N.business_no FROM T_delegate_list N WHERE N.g_mo ...
python读取bin文件并下发串口
# coding:utf-8import time, serialfrom struct import *import binascii file = open('E:\\1.bin', 'rb')i ...
python [1:3]
Python下标是以0开始的x[1:3]表示返回集合中下标1至3(不包括3)的元素集合x[:3] 表示返回从开始到下标3(不包括3)的元素集合x[3:]表示返回从下标3到结束的元素集合X[:]表示返回 ...
网页 JavaScript
今天,我主要学习了JavaScript的部分内容. JavaScript在html中的位置主要有三个地方:head里面.body里面.</html>之后,我们一般写在</html&g ...
NGUI等比缩放
/// <summary> /// UI 等比缩放 /// </summary> static private void AdaptiveUI() { ; ; UIRoot u ...
将数组写入plist文件
data 加载plist [NSBundle mainBundle] [arr writeToURL:<#(NSURL *)#> atomically:<#(BOOL)#>]
BJFU 1440 fudq的ACM
矩阵快速幂 #include<cstdio> #include<algorithm> using namespace std; ; const int INF =1e9; ; ...
js 鸭式辨型法
无意中看到arr.length === +arr.length;这句代码,然后就去了解了下这是一种鸭式辨型的判断方法. 鸭式辨型:像鸭子一样走路.游泳和嘎嘎叫的鸟就是鸭子这句话表示: a.arr有 ...
C语言中结构体定义实际上相当于变量入栈
struct context { int edi; int esi; int ebx; int ebp; int eip;}; 对应的入栈顺序是 pushl %esp pushl %eip pushl ...
zookeeper入门知识
ZooKeeper 是什么? ZooKeeper 顾名思义动物园管理员,他是拿来管大象(Hadoop) . 蜜蜂(Hive) .小猪(Pig) 的管理员, Apache Hbase和 Apache ...

lucene 中关于Store.YES 关于Store.NO的解释

lucene 中关于Store.YES 关于Store.NO的解释的更多相关文章

随机推荐

热门专题