Lucene4.6至 Lucene6.6的每个迭代对API的改动
由于项目需求,需要将Lucene4.6升级到Lucene6.6,因此我对这之间的所有重要的API改动做了搜集;特别重要的改变加粗显示。
Lucene4.7改动:
LUCENE-5405: Make ShingleAnalzyerWrapper.getWrappedAnalyzer() public final
(gsingers)
LUCENE-5395: The
SpatialArgsParser now only reads WKT, no more "lat, lon" etc. but
it's easy to override the parseShape method if you wish.
Lucene4.8改动:
- LUCENE-5516: MergeScheduler#merge()
now accepts a MergeTrigger as well as a boolean that indicates if a new merge
was found in the caller thread before the scheduler was called.
(Simon Willnauer) - LUCENE-5487: Separated bulk scorer (new Weight.bulkScorer method)
from normal scoring (Weight.scorer) for those queries that can do bulk scoring
more efficiently, e.g. BooleanQuery in some cases. This also simplified the
Weight.scorer API by removing the two confusing booleans.
Lucene4.9改动:
- ·LUCENE-5725:
MoreLikeThis#like now accepts multiple values per field. The pre-existing
method has been deprecated in favor of a variable arguments for the like text.
(Alex Ksikes via Simon Willnauer) - ·LUCENE-5711: MergePolicy accepts an IndexWriter instance on each method rather than holding state against a single IndexWriter instance.
- ·LUCENE-5640:
The Token class was deprecated. Since Lucene 2.9, TokenStreams are using
Attributes, Token is no longer used.
(Uwe Schindler, Robert Muir) - ·LUCENE-5678: IndexOutput no longer allows seeking, so it is no longer required to use RandomAccessFile to write Indexes. Lucene now uses standard FileOutputStream wrapped with OutputStreamIndexOutput to write index data. BufferedIndexOutput was removed, because buffering and checksumming is provided by FilterOutputStreams, provided by the JDK.
Lucene5.0改动:
LUCENE-4924: DocIdSetIterator.docID() must now return -1 when the iterator is not
positioned. This change affects all classes that inherit from DocIdSetIterator,
including DocsEnum and DocsAndPositionsEnum.
(Adrien Grand)
LUCENE-5388: Remove Reader from
Tokenizer's constructor and from Analyzer's createComponents. TokenStreams now
always get their input via setReader.
(Benson Margulies via Robert Muir - pull request #16)
LUCENE-5527: The Collector API has been refactored to use a
dedicated(专用的) Collector per leaf.
(Shikhar Bhushan, Adrien Grand)
LUCENE-5702: The FieldComparator API has been refactor to a
per-leaf API, just like Collectors.
(Adrien Grand)
LUCENE-4246: IndexWriter.close now always closes, even if it throws an exception. The new IndexWriterConfig.setCommitOnClose (default true) determines whether close() should commit before closing.
LUCENE-5569: *AtomicReader/AtomicReaderContext have been renamed to *LeafReader/LeafReaderContext.
LUCENE-6021:FixedBitSet.nextSetBit
now returns DocIdSetIterator.NO_MORE_DOCS instead of -1 when there are no more
bits which are set.
(Adrien Grand)
LUCENE-6084: IndexOutput's
constructor now requires a String resourceDescription so its toString is sane
(Robert Muir, Mike McCandless)
LUCENE-6121:
CachingTokenFilter.reset() now propagates to its input if called before
incrementToken(). You must call reset() now on this filter instead of doing it
a-priori on the input(), which previously didn't work.
(David Smiley, Robert Muir)
LUCENE-6165: IndexWriter.addIndexes(IndexReader...) changed to
addIndexes(CodecReader...)
(Robert Muir)
Lucene5.1改动:
LUCENE-6218, LUCENE-6220: Add
Collector.needsScores() and needsScores parameter to Query.createWeight().
(Robert Muir, Adrien Grand)
LUCENE-4524, LUCENE-6246, LUCENE-6256, LUCENE-6271:
Merge DocsEnum and DocsAndPositionsEnum into a single PostingsEnum iterator. TermsEnum.docs() and TermsEnum.docsAndPositions() are replaced by TermsEnum.postings().
LUCENE-6270:
Replaced TermsFilter with TermsQuery, use a QueryWrapperFilter(TermsQuery)
instead.
(Adrien Grand)
LUCENE-6272: Scorer
extends DocSetIdIterator rather than DocsEnum
(Alan Woodward)
LUCENE-6286: Removed IndexSearcher methods that take a
Filter object. A BooleanQuery with a filter clause must be used instead.
(Adrien Grand)
LUCENE-6300:
PrefixFilter, TermRangeFilter and NumericRangeFilter have been removed. Use
PrefixQuery, TermRangeQuery and NumericRangeQuery instead.
(Adrien Grand)
LUCENE-6303:
Replaced FilterCache with QueryCache and CachingWrapperFilter with
CachingWrapperQuery.
(Adrien Grand)
Lucene5.2改动:
- LUCENE-6377:
SearcherFactory#newSearcher now accepts the previous reader to simplify warming
logic during opening new searchers.
(Simon Willnauer) - LUCENE-6410:
Removed unused "reuse" parameter to Terms.iterator.
(Robert Muir, Mike McCandless)
Lucene5.3改动:
- LUCENE-6552:
Add MergePolicy.OneMerge.getMergeInfo and rename setInfo to setMergeInfo
(Simon Willnauer, Mike McCandless) - LUCENE-6583:
FilteredQuery is deprecated and will be removed in 6.0. It should be replaced
with a BooleanQuery which handle the
query as a MUST clause and the filter as a FILTER clause.
(Adrien Grand) - LUCENE-6553: The postings, spans and scorer APIs no
longer take an acceptDocs parameter. Live docs are now always checked on top of
these APIs.
(Adrien Grand) - LUCENE-6643:
GroupingSearch from lucene/grouping was changed to take a Query object to
define groups instead of a Filter.
(Adrien Grand) - LUCENE-6648:
All lucene/facet APIs now take Query
objects where they used to take Filter objects.
(Adrien Grand) - LUCENE-6531:
PhraseQuery is now immutable and can be built using the PhraseQuery.Builder
class.
(Adrien Grand) - LUCENE-6570:
BooleanQuery is now immutable and can be
built using the BooleanQuery.Builder class.
(Adrien Grand)
Lucene5.4改动:
- LUCENE-6855:
CachingWrapperQuery is deprecated and will be removed in 6.0.
(Adrien Grand) - LUCENE-6849:
Expose IndexWriter.flush() method, to move all in-memory segments to disk
without opening a near-real-time reader nor calling fsync
(Robert Muir, Simon Willnauer, Mike McCandless)
Lucene5.5改动:
- LUCENE-6919:
The Scorer class has been refactored to
expose an iterator instead of extending DocIdSetIterator. asTwoPhaseIterator()
has been renamed to twoPhaseIterator() for consistency.
(Adrien Grand) - LUCENE-6980:
Default applyAllDeletes to true when opening near-real-time readers
(Mike McCandless) - LUCENE-6932:
IndexInput.seek implementations now throw EOFException if you seek beyond the
end of the file
(Adrien Grand, Mike McCandless) - LUCENE-6988:
IndexableField.tokenStream() no longer throws IOException
(Alan Woodward)
Lucene6.0改动:
LUCENE-6583:
FilteredQuery has been removed. Instead, you can construct a BooleanQuery with
one MUST clause for the query, and one FILTER clause for the filter.
(Adrien Grand)
LUCENE-6706:
PayloadTermQuery and PayloadNearQuery have been removed. Instead, use
PayloadScoreQuery to wrap any SpanQuery.
(Alan Woodward)
LUCENE-6947:
SortField.missingValue is now protected. You can read its value using the new
SortField.getMissingValue getter.
(Adrien Grand)
LUCENE-7052, LUCENE-7053: Remove
custom comparators from BytesRef class and solely use natural byte[] comparator
throughout codebase. This also simplifies API of BytesRefHash. It also replaces
the natural comparator in ArrayUtil by Java 8's Comparator#naturalOrder().
(Mike McCandless, Uwe Schindler, Robert Muir)
LUCENE-7058: Add
getters to various Query implementations
(Guillaume Smet via Alan
Woodward)
LUCENE-7064:
MultiPhraseQuery is now immutable and should be constructed with
MultiPhraseQuery.Builder.
(Luc Vanlerberghe via Adrien Grand)
LUCENE-6952: These
classes are now abstract: FilterCodecReader, FilterLeafReader, FilterCollector,
FilterDirectory. And some Filter* classes in lucene-test-framework too.
(David Smiley)
SOLR-8867:
FunctionValues.getRangeScorer now takes a LeafReaderContext instead of an
IndexReader, and avoids matching documents without a value in the field for
numeric fields.
(yonik)
Lucene6.1改动:
LUCENE-7243:Removed
the LeafReaderContext parameter from QueryCachingPolicy#shouldCache.
(Adrien Grand)
Lucene6.2改动:
ScoringWrapperSpans was removed since it had no purpose or effect as of Lucene 5.5.
Lucene6.3改动:
修复了一些BUG无API改动!
Lucene6.4改动:
修复了一些BUG无API改动!
Lucene6.5改动:
- · LUCENE-7624:
TermsQuery has been renamed as TermInSetQuery and moved to core.
(Alan Woodward) - LUCENE-7637:
TermInSetQuery requires that all terms come from the same field.
(Adrien Grand) - LUCENE-7644:
FieldComparatorSource.newComparator() and SortField.getComparator() no longer
throw IOException
(Alan Woodward) - LUCENE-7628:
Scorer.getChildren() now only returns Scorers that are positioned on the
current document, and can throw an IOException. AssertingScorer checks that
getChildren() is not called on an unpositioned Scorer.
(Alan Woodward, Adrien Grand) - LUCENE-7707:
TopDocs.merge now takes a boolean option telling it when to use the incoming
shard index versus when to assign the shard index itself, allowing users to
merge shard responses incrementally instead of once all shard responses are
present.
(Simon Willnauer, Mike McCandless)
Lucene6.6改动:
修复了一些BUG无API改动!