Lucene4.6至 Lucene6.6的每个迭代对API的改动

由于项目需求，需要将Lucene4.6升级到Lucene6.6，因此我对这之间的所有重要的API改动做了搜集；特别重要的改变加粗显示。

Lucene4.7改动：

LUCENE-5405: Make ShingleAnalzyerWrapper.getWrappedAnalyzer() public final
(gsingers)

LUCENE-5395: The
SpatialArgsParser now only reads WKT, no more "lat, lon" etc. but
it's easy to override the parseShape method if you wish.

Lucene4.8改动：

LUCENE-5516: MergeScheduler#merge()
now accepts a MergeTrigger as well as a boolean that indicates if a new merge
was found in the caller thread before the scheduler was called.
(Simon Willnauer)
LUCENE-5487: Separated bulk scorer (new Weight.bulkScorer method)
from normal scoring (Weight.scorer) for those queries that can do bulk scoring
more efficiently, e.g. BooleanQuery in some cases. This also simplified the
Weight.scorer API by removing the two confusing booleans.

Lucene4.9改动：

·LUCENE-5725:
MoreLikeThis#like now accepts multiple values per field. The pre-existing
method has been deprecated in favor of a variable arguments for the like text.
(Alex Ksikes via Simon Willnauer)
·LUCENE-5711: MergePolicy
accepts an IndexWriter instance on each method rather than holding state
against a single IndexWriter instance.
·LUCENE-5640:
The Token class was deprecated. Since Lucene 2.9, TokenStreams are using
Attributes, Token is no longer used.
(Uwe Schindler, Robert Muir)
·LUCENE-5678:
IndexOutput no longer allows seeking, so it is no longer required to use
RandomAccessFile to write Indexes. Lucene now uses standard FileOutputStream
wrapped with OutputStreamIndexOutput to write index data. BufferedIndexOutput
was removed, because buffering and checksumming is provided by
FilterOutputStreams, provided by the JDK.

Lucene5.0改动：

LUCENE-4924: DocIdSetIterator.docID() must now return -1 when the iterator is not
positioned. This change affects all classes that inherit from DocIdSetIterator,
including DocsEnum and DocsAndPositionsEnum.
(Adrien Grand)

LUCENE-5388: Remove Reader from
Tokenizer's constructor and from Analyzer's createComponents. TokenStreams now
always get their input via setReader.
(Benson Margulies via Robert Muir - pull request #16)

LUCENE-5527: The Collector API has been refactored to use a
dedicated（专用的） Collector per leaf.
(Shikhar Bhushan, Adrien Grand)

LUCENE-5702: The FieldComparator API has been refactor to a
per-leaf API, just like Collectors.
(Adrien Grand)

LUCENE-4246: IndexWriter.close now always closes, even if it
throws an exception. The new IndexWriterConfig.setCommitOnClose (default true)
determines whether close() should commit before closing.

LUCENE-5569: *AtomicReader/AtomicReaderContext have been renamed to *LeafReader/LeafReaderContext.

LUCENE-6021:FixedBitSet.nextSetBit
now returns DocIdSetIterator.NO_MORE_DOCS instead of -1 when there are no more
bits which are set.
(Adrien Grand)

LUCENE-6084: IndexOutput's
constructor now requires a String resourceDescription so its toString is sane
(Robert Muir, Mike McCandless)

LUCENE-6121:
CachingTokenFilter.reset() now propagates to its input if called before
incrementToken(). You must call reset() now on this filter instead of doing it
a-priori on the input(), which previously didn't work.
(David Smiley, Robert Muir)

LUCENE-6165: IndexWriter.addIndexes(IndexReader...) changed to
addIndexes(CodecReader...)
(Robert Muir)

Lucene5.1改动：

LUCENE-6218, LUCENE-6220: Add
Collector.needsScores() and needsScores parameter to Query.createWeight().
(Robert Muir, Adrien Grand)

LUCENE-4524, LUCENE-6246, LUCENE-6256, LUCENE-6271:

Merge
DocsEnum and DocsAndPositionsEnum into a single PostingsEnum iterator.
TermsEnum.docs() and TermsEnum.docsAndPositions() are replaced by
TermsEnum.postings().

LUCENE-6270:
Replaced TermsFilter with TermsQuery, use a QueryWrapperFilter(TermsQuery)
instead.
(Adrien Grand)

LUCENE-6272: Scorer
extends DocSetIdIterator rather than DocsEnum
(Alan Woodward)

LUCENE-6286: Removed IndexSearcher methods that take a
Filter object. A BooleanQuery with a filter clause must be used instead.
(Adrien Grand)

LUCENE-6300:
PrefixFilter, TermRangeFilter and NumericRangeFilter have been removed. Use
PrefixQuery, TermRangeQuery and NumericRangeQuery instead.
(Adrien Grand)

LUCENE-6303:
Replaced FilterCache with QueryCache and CachingWrapperFilter with
CachingWrapperQuery.
(Adrien Grand)

Lucene5.2改动：

LUCENE-6377:
SearcherFactory#newSearcher now accepts the previous reader to simplify warming
logic during opening new searchers.
(Simon Willnauer)
LUCENE-6410:
Removed unused "reuse" parameter to Terms.iterator.
(Robert Muir, Mike McCandless)

Lucene5.3改动：

LUCENE-6552:
Add MergePolicy.OneMerge.getMergeInfo and rename setInfo to setMergeInfo
(Simon Willnauer, Mike McCandless)
LUCENE-6583:
FilteredQuery is deprecated and will be removed in 6.0. It should be replaced
with a BooleanQuery which handle the
query as a MUST clause and the filter as a FILTER clause.
(Adrien Grand)
LUCENE-6553: The postings, spans and scorer APIs no
longer take an acceptDocs parameter. Live docs are now always checked on top of
these APIs.
(Adrien Grand)
LUCENE-6643:
GroupingSearch from lucene/grouping was changed to take a Query object to
define groups instead of a Filter.
(Adrien Grand)
LUCENE-6648:
All lucene/facet APIs now take Query
objects where they used to take Filter objects.
(Adrien Grand)
LUCENE-6531:
PhraseQuery is now immutable and can be built using the PhraseQuery.Builder
class.
(Adrien Grand)
LUCENE-6570:
BooleanQuery is now immutable and can be
built using the BooleanQuery.Builder class.
(Adrien Grand)

Lucene5.4改动：

LUCENE-6855:
CachingWrapperQuery is deprecated and will be removed in 6.0.
(Adrien Grand)
LUCENE-6849:
Expose IndexWriter.flush() method, to move all in-memory segments to disk
without opening a near-real-time reader nor calling fsync
(Robert Muir, Simon Willnauer, Mike McCandless)

Lucene5.5改动：

LUCENE-6919:
The Scorer class has been refactored to
expose an iterator instead of extending DocIdSetIterator. asTwoPhaseIterator()
has been renamed to twoPhaseIterator() for consistency.
(Adrien Grand)
LUCENE-6980:
Default applyAllDeletes to true when opening near-real-time readers
(Mike McCandless)
LUCENE-6932:
IndexInput.seek implementations now throw EOFException if you seek beyond the
end of the file
(Adrien Grand, Mike McCandless)
LUCENE-6988:
IndexableField.tokenStream() no longer throws IOException
(Alan Woodward)

Lucene6.0改动：

LUCENE-6583:
FilteredQuery has been removed. Instead, you can construct a BooleanQuery with
one MUST clause for the query, and one FILTER clause for the filter.
(Adrien Grand)

LUCENE-6706:
PayloadTermQuery and PayloadNearQuery have been removed. Instead, use
PayloadScoreQuery to wrap any SpanQuery.
(Alan Woodward)

LUCENE-6947:
SortField.missingValue is now protected. You can read its value using the new
SortField.getMissingValue getter.
(Adrien Grand)

LUCENE-7052, LUCENE-7053: Remove
custom comparators from BytesRef class and solely use natural byte[] comparator
throughout codebase. This also simplifies API of BytesRefHash. It also replaces
the natural comparator in ArrayUtil by Java 8's Comparator#naturalOrder().
(Mike McCandless, Uwe Schindler, Robert Muir)

LUCENE-7058: Add
getters to various Query implementations
(Guillaume Smet via Alan
Woodward)

LUCENE-7064:
MultiPhraseQuery is now immutable and should be constructed with
MultiPhraseQuery.Builder.
(Luc Vanlerberghe via Adrien Grand)

LUCENE-6952: These
classes are now abstract: FilterCodecReader, FilterLeafReader, FilterCollector,
FilterDirectory. And some Filter* classes in lucene-test-framework too.
(David Smiley)

SOLR-8867:
FunctionValues.getRangeScorer now takes a LeafReaderContext instead of an
IndexReader, and avoids matching documents without a value in the field for
numeric fields.
(yonik)

Lucene6.1改动：

LUCENE-7243:Removed
the LeafReaderContext parameter from QueryCachingPolicy#shouldCache.
(Adrien Grand)

Lucene6.2改动：

ScoringWrapperSpans was removed since it
had no purpose or effect as of Lucene 5.5.

Lucene6.3改动：

修复了一些BUG无API改动！

Lucene6.4改动：

修复了一些BUG无API改动！

Lucene6.5改动：

· LUCENE-7624:
TermsQuery has been renamed as TermInSetQuery and moved to core.
(Alan Woodward)
LUCENE-7637:
TermInSetQuery requires that all terms come from the same field.
(Adrien Grand)
LUCENE-7644:
FieldComparatorSource.newComparator() and SortField.getComparator() no longer
throw IOException
(Alan Woodward)
LUCENE-7628:
Scorer.getChildren() now only returns Scorers that are positioned on the
current document, and can throw an IOException. AssertingScorer checks that
getChildren() is not called on an unpositioned Scorer.
(Alan Woodward, Adrien Grand)
LUCENE-7707:
TopDocs.merge now takes a boolean option telling it when to use the incoming
shard index versus when to assign the shard index itself, allowing users to
merge shard responses incrementally instead of once all shard responses are
present.
(Simon Willnauer, Mike McCandless)

Lucene6.6改动：

修复了一些BUG无API改动！

Lucene4.6至 Lucene6.6的每个迭代对API的改动的更多相关文章

打造属于自己的支持版本迭代的Asp.Net Web Api Route
在目前的主流架构中,我们越来越多的看到web Api的存在,小巧,灵活,基于Http协议,使它在越来越多的微服务项目或者移动项目充当很好的service endpoint. 问题以Asp.Net W ...
细说java中Map的两种迭代方式
曾经对java中迭代方式总是迷迷糊糊的,今天总算弄懂了.特意的总结了一下.基本是算是理解透彻了. 1.再说Map之前先说下Iterator: Iterator主要用于遍历(即迭代訪问)Collecti ...
Android 中的mvvm
我们来了解一下MVVM模式与Databinding ,MVVM是一种模式,Databinding 是一种框架.DataBinding是一个实现数据和UI绑定的框架.而ViewModel和View可以通 ...
iOS组件化思路 <转>
随着应用需求逐步迭代,应用的代码体积将会越来越大,为了更好的管理应用工程,我们开始借助CocoaPods版本管理工具对原有应用工程进行拆分.但是仅仅完成代码拆分还不足以解决业务之间的代码耦合,为了更好 ...
python基础之面向对象高级编程
面向对象基本知识: 面向对象是一种编程方式,此编程方式的实现是基于对类和对象的使用类是一个模板,模板中包装了多个"函数"供使用(可以讲多函数中公用的变量封装到对象中) ...
Python学习笔记 for windows 三
多重继承继承是面向对象编程的一个重要的方式,因为通过继承,子类就可以扩展父类的功能. 哺乳类:能跑的哺乳类,能飞的哺乳类: 鸟类:能跑的鸟类,能飞的鸟类. class Animal(object): ...
python基础——定制类
python基础——定制类看到类似__slots__这种形如__xxx__的变量或者函数名就要注意,这些在Python中是有特殊用途的. __slots__我们已经知道怎么用了,__len__()方 ...
python 定制类
看到类似__slots__这种形如__xxx__的变量或者函数名就要注意,这些在Python中是有特殊用途的. __slots__我们已经知道怎么用了,__len__()方法我们也知道是为了能让cla ...
Python3 面向对象高级编程
正常情况下,当我们定义了一个class,创建了一个class的实例后,我们可以给该实例绑定任何属性和方法,这就是动态语言的灵活性. class Student(object): pass 然后,尝试 ...

随机推荐

在 CentOS 下手工安装 Docker v1.1x
Docker在 centos 6.x 下面默认最新的版本是1.7, 然而这个并不符合我的实际需求, 尤其我需要 docker-compose 来作为编配工具部署swarm, 所以我只有手工安装了. 首 ...
C#通过gridview导出excel
[CustomAuthorize] public FileResult ExportQuestionCenterExcel(SearchBaseQuestion search) ...
一些容易记混的c++相关知识点
一些容易记混的c++相关知识. 截图自:<王道程序员面试宝典>
定点数(fixed-point number)
定义定点数(fixed-point number)就是小数点位置固定的数,也就是说,小数点后面的位数是固定的,比如要记录一笔账目,这些账目的数字都不会超过100,就可以使用2位小数位定点数来记录,比 ...
Python练习——循环2
1.求1~100之间能被7整除,但不能同时被5整除的所有整数 . for i in range(1,101): if i%7 == 0 and i%5 !=0: print(i) 2.输出“水仙花数” ...
Ubuntu使用时遇到的问题
启动时显示System program problem detected 解决办法: 打开命令行窗口:Ctrl+Alt+T 执行命令:sudo gedit /etc/default/apport 把e ...
从1到n的阶乘的和（python）
今天在百度上逛一些ctf的平台,偶然发现一道编程题,于是乎,便用我刚刚学的python知识解了这道题题目的描述是这样的: 计算1!+2!+3!+...+6666!后五位. 这个计算量很大啊,我还是用 ...
mac mysql连接报错ERROR 1045 (28000): Access denied for user 'root'@'localhost' (using password: YES)
找了半天又是kill进程,又是改设置文件,又是重启电脑,都不管用翻到stackoverflow上的解决方案,实施成功: 原文链接:https://stackoverflow.com/questio ...
exception = {"元数据集合中已存在具有标识“xxx”的项。\r\n参数名: item"}
vs提示:exception = {"元数据集合中已存在具有标识"xxx"的项.\r\n参数名: item"} 出现这个错误说明有重复的字段,有可能是继承的类里 ...
python Django框架接入微信公众平台
1.在接入微信公众平台之前,需要在微信公众平台配置好基本信息,如下: 这个时候点击“提交”按钮,会提示“Token校验失败”,不要着急,这是必然会出现的现象,先不要退出页面,保留各项输入的数据,按第二 ...

Lucene4.6至 Lucene6.6的每个迭代对API的改动

Lucene4.6至 Lucene6.6的每个迭代对API的改动的更多相关文章

随机推荐

热门专题