Class Summary | |
---|---|
AbstractBenchmarker | |
AbstractField | |
AbstractQueryMaker | Abstract base query maker. |
AddDocTask | Add a document, optionally with of a certain size. |
Algorithm | Test algorithm, as read from file |
AlreadyClosedException | This exception is thrown when there is an attempt to access something that has already been closed. |
Among | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Analyzer | An Analyzer builds TokenStreams, which analyze text. |
AnalyzerUtil | Various fulltext analysis utilities avoiding redundant code in several classes. |
AnalyzingQueryParser |
Overrides Lucene's default QueryParser so that Fuzzy-, Prefix-, Range-, and WildcardQuerys
are also passed through the given analyzer, but wild card characters (like * )
don't get removed from the search terms.
|
AndQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
BasicDocMaker | Create documents for the test. |
BasicQueryFactory | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Benchmark | Run the benchmark algorithm. |
Benchmarker | |
BenchmarkOptions | Marker Interface defining some common options. |
BitVector | Optimized implementation of a vector of bits. |
Block | Port of Andi Vajda's DbDirectory to Java Edition of Berkeley Database |
Block | |
BooleanClause | A clause in a BooleanQuery. |
BooleanClause.Occur | Specifies how clauses are to occur in matching documents. |
BooleanFilterBuilder | |
BooleanQuery | A Query that matches documents matching boolean combinations of other queries, e.g. TermQuerys, PhraseQuerys or other BooleanQuerys. |
BooleanQueryBuilder | |
BooleanQuery.TooManyClauses | Thrown when an attempt is made to add more than TooManyClauses clauses. |
BoostingQueryBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
BoostingTermQuery | The BoostingTermQuery is very similar to the SpanTermQuery except that it factors in the value of the payload located at each of the positions where the Term occurs. |
BoostingTermQuery.BoostingTermWeight | |
BrazilianAnalyzer | Analyzer for Brazilian language. |
BrazilianStemFilter | Based on GermanStemFilter |
BrazilianStemmer | A stemmer for Brazilian words. |
BufferedIndexInput | Base implementation class for buffered IndexInput. |
BufferedIndexOutput | Base implementation class for buffered IndexOutput. |
ByteFieldSource |
Expert: obtains single byte field values from the
FieldCache
using getBytes() and makes those values
available as other numeric types, casting as needed.
|
CachedFilterBuilder | Filters are cached in an LRU Cache keyed on the contained query or filter object. |
CachingTokenFilter | This class can be used if the Tokens of a TokenStream are intended to be consumed more than once. |
CachingWrapperFilter | Wraps another filter's result and caches it. |
ChainedFilter |
Allows multiple Filters to be chained. |
CharStream | This interface describes a character stream that maintains line and column number positions of the characters. |
CharStream | This interface describes a character stream that maintains line and column number positions of the characters. |
CharStream | This interface describes a character stream that maintains line and column number positions of the characters. |
CharStream | This interface describes a character stream that maintains line and column number positions of the characters. |
CharTokenizer | An abstract base class for simple, character-oriented tokenizers. |
ChineseAnalyzer | Title: ChineseAnalyzer Description: Subclass of org.apache.lucene.analysis.Analyzer build from a ChineseTokenizer, filtered with ChineseFilter. |
ChineseFilter | Title: ChineseFilter Description: Filter with a stop word table Rule: No digital is allowed. |
ChineseTokenizer | Title: ChineseTokenizer Description: Extract tokens from the Stream using Character.getType() Rule: A Chinese character as a single token Copyright: Copyright (c) 2001 Company: The difference between thr ChineseTokenizer and the CJKTokenizer (id=23545) is that they have different token parsing logic. |
CJKAnalyzer | Filters CJKTokenizer with StopFilter. |
CJKTokenizer | CJKTokenizer was modified from StopTokenizer which does a decent job for most European languages. |
ClearStatsTask | Clear statistics data. |
CloseIndexTask | Close index writer. |
CloseReaderTask | Close index reader. |
ComplexExplanation | Expert: Describes the score computation for document and query, andcan distinguish a match independent of a positive value. |
ComposedQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Config | Perf run configuration properties. |
ConfigurableDocumentHandler | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Constants | Some useful constants. |
Constants | |
ConstantScoreQuery | A query that wraps a filter and simply returns a constant score equal to the query boost for every document in the filter. |
ConstantScoreQueryBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
ConstantScoreQuery.ConstantScorer | |
ConstantScoreQuery.ConstantWeight | |
ConstantScoreRangeQuery | A range query that returns a constant score equal to its boost for all documents in the range. |
CoreParser | Assembles a QueryBuilder which uses only core Lucene Query objects |
CorePlusExtensionsParser | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
CorruptIndexException | This exception is thrown when Lucene detects an inconsistency in the index. |
CreateIndexTask | Create an index. |
CustomScoreQuery | Query that sets document score as a programmatic function of (up to) two (sub) scores. |
CzechAnalyzer | Analyzer for Czech language. |
DanishStemmer | Generated class implementing code defined by a snowball script. |
DateField | Provides support for converting dates to strings and vice-versa. |
DateTools | Provides support for converting dates to strings and vice-versa. |
DateTools.Resolution | Specifies the time granularity. |
DbDirectory |
A DbDirectory is a Berkeley DB 4.3 based implementation of
Directory . |
DbHandleExtractor | This class is a hack to workaround the need to rewrite the entire org.apache.lucene.store.db package after Sleepycat radically changed its Java API from version 4.2.52 to version 4.3.21. |
DbIndexInput | |
DbIndexOutput | |
DbLock |
This implementation of Lock is
trivial as DbDirectory operations are managed by the Berkeley DB
locking system.
|
DefaultEncoder | Simple Encoder implementation that does not modify the output |
DefaultSimilarity | Expert: Default scoring implementation. |
DeleteDocTask | Delete a document by docid. |
DeleteFiles | Deletes documents from an index that do not contain a term. |
DemoHTMLParser | HTML Parser that is based on Lucene's demo HTML parser. |
Dictionary | A simple interface representing a Dictionary. |
Directory | A Directory is a flat list of files. |
DisjunctionMaxQuery | A query that generates the union of documents produced by its subqueries, and that scores each document with the maximum score for that document as produced by any subquery, plus a tie breaking increment for any additional matching subqueries. |
DistanceQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
DistanceSubQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
DocData | Output of parsing (e.g. |
DocMaker | Create documents for the test. |
Document | Documents are the unit of indexing and search. |
DocumentHandler | Allows a class to act as a Lucene document handler |
DocumentHandlerException | |
DocValues | Expert: represents field values as different types. |
DOMUtils | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Driver | Sets up the benchmark driver. |
DutchAnalyzer | Analyzer for Dutch language. |
DutchStemFilter | A filter that stems Dutch words. |
DutchStemmer | A stemmer for Dutch words. |
DutchStemmer | Generated class implementing code defined by a snowball script. |
EdgeNGramTokenFilter | Tokenizes the given token into n-grams of given size(s). |
EdgeNGramTokenFilter.Side | Specifies which side of the input the n-gram should be generated from |
EdgeNGramTokenizer | Tokenizes the input from an edge into n-grams of given size(s). |
EdgeNGramTokenizer.Side | Specifies which side of the input the n-gram should be generated from |
Encoder | Encodes original text. |
EnglishStemmer | Generated class implementing code defined by a snowball script. |
Entities | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Explanation | Expert: Describes the score computation for document and query. |
ExtractReuters | Split the Reuters SGML documents into Simple Text files containing: Title, Date, Dateline, Body |
FastCharStream | An efficient implementation of JavaCC's CharStream interface. |
FastCharStream | An efficient implementation of JavaCC's CharStream interface. |
FastCharStream | An efficient implementation of JavaCC's CharStream interface. |
FastCharStream | An efficient implementation of JavaCC's CharStream interface. |
Field | A field is a section of a Document. |
Fieldable | Synonymous with Field. |
FieldCache | Expert: Maintains caches of term values. |
FieldCacheSource |
Expert: A base class for ValueSource implementations that retrieve values for
a single field from the FieldCache .
|
FieldCache.ByteParser | Interface to parse bytes from document fields. |
FieldCache.FloatParser | Interface to parse floats from document fields. |
FieldCache.IntParser | Interface to parse ints from document fields. |
FieldCache.ShortParser | Interface to parse shorts from document fields. |
FieldCache.StringIndex | Expert: Stores term text values and document ordering data. |
FieldDoc | Expert: A ScoreDoc which also contains information about how to sort the referenced document. |
FieldReaderException | |
FieldsQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
FieldScoreQuery | A query that scores each document as the value of the numeric input field. |
FieldScoreQuery.Type | Type of score field, indicating how field values are interpreted/parsed. |
FieldSelector | Similar to a java.io.FileFilter, the FieldSelector allows one to make decisions about what Fields get loaded on a Document by IndexReader |
FieldSelectorResult | Provides information about what should be done with this Field |
FieldSortedHitQueue | Expert: A hit queue for sorting by hits by terms in more than one field. |
Field.Index | Specifies whether and how a field should be indexed. |
Field.Store | Specifies whether and how a field should be stored. |
Field.TermVector | Specifies whether and how a field should have term vectors. |
File | Port of Andi Vajda's DbDirectory to Java Edition of Berkeley Database |
File | |
FileBasedQueryMaker | Create queries from a FileReader. |
FileDocument | A utility for making Lucene Documents from a File. |
FileExtensionDocumentHandler | A DocumentHandler implementation to delegate responsibility to based on a files extension. |
FileUtils | File utilities. |
Filter | Abstract base class providing a mechanism to restrict searches to a subset of an index. |
FilterBuilder | |
FilterBuilderFactory | |
FilteredQuery | A query that applies a filter to the results of another query. |
FilteredQueryBuilder | |
FilteredTermEnum | Abstract class for enumerating a subset of all terms. |
FilterIndexReader | A FilterIndexReader contains another IndexReader, which it
uses as its basic source of data, possibly transforming the data along the
way or providing additional functionality. |
FilterIndexReader.FilterTermDocs | Base class for filtering TermDocs implementations. |
FilterIndexReader.FilterTermEnum | Base class for filtering TermEnum implementations. |
FilterIndexReader.FilterTermPositions | Base class for filtering TermPositions implementations. |
FilterManager | Filter caching singleton. |
FilterManager.FilterCleaner | Keeps the cache from getting too big. |
FilterManager.FilterItem | Holds the filter and the last time the filter was used, to make LRU-based cache cleaning possible. |
FinnishStemmer | Generated class implementing code defined by a snowball script. |
FloatFieldSource |
Expert: obtains float field values from the
FieldCache
using getFloats() and makes those values
available as other numeric types, casting as needed.
|
Format | Formatting utilities (for reports). |
Formatter | Processes terms found in the original text, typically by applying some form of mark-up to highlight terms in HTML search results pages. |
Fragmenter | Implements the policy for breaking text into multiple fragments for consideration by the Highlighter class. |
FrenchAnalyzer | Analyzer for French language. |
FrenchStemFilter | A filter that stemms french words. |
FrenchStemmer | A stemmer for French words. |
FrenchStemmer | Generated class implementing code defined by a snowball script. |
FSDirectory | Straightforward implementation of Directory as a directory of files. |
FSDirectory.FSIndexInput | |
FSDirectory.FSIndexOutput | |
FuzzyLikeThisQueryBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
FuzzyQuery | Implements the fuzzy search query. |
FuzzyQuery.ScoreTerm | |
FuzzyQuery.ScoreTermQueue | |
FuzzyTermEnum | Subclass of FilteredTermEnum for enumerating all terms that are similiar to the specified filter term. |
German2Stemmer | Generated class implementing code defined by a snowball script. |
GermanStemmer | Generated class implementing code defined by a snowball script. |
GradientFormatter | Formats text with different color intensity depending on the score of the term. |
GreekAnalyzer | Analyzer for the Greek language. |
GreekCharsets | GreekCharsets class contains encodings schemes (charsets) and toLowerCase() method implementation for greek characters in Unicode, ISO-8859-7 and Microsoft Windows CP1253. |
GreekLowerCaseFilter | Normalizes token text to lower case, analyzing given ("greek") charset. |
HighFreqTerms |
HighFreqTerms class extracts terms and their frequencies out
of an existing Lucene index.
|
Highlighter | Class used to markup highlighted terms found in the best sections of a text, using configurable Fragmenter, Scorer, Formatter, Encoder and tokenizers. |
Hit | Wrapper used by HitIterator to provide a lazily loaded hit from Hits. |
HitCollector | Lower-level search API. |
HitIterator | An iterator over Hits that provides lazy fetching of each document. iterator returns an instance of this class. |
Hits | A ranked list of documents, used to hold search results. |
HtmlDocument |
The HtmlDocument class creates a Lucene Document from an HTML document. |
HTMLDocument | A utility for making Lucene Documents for HTML documents. |
HTMLParser | |
HTMLParser | HTML Parsing Interfacew for test purposes |
HTMLParserConstants | |
HTMLParserTokenManager | |
IndexCommitPoint |
Expert: represents a single commit into an index as seen by the IndexDeletionPolicy. |
IndexDeletionPolicy |
Expert: policy for deletion of stale |
IndexFileNameFilter | Filename filter that accept filenames and extensions only created by Lucene. |
IndexFiles | Index all text files under a directory. |
IndexHTML | Indexer for HTML files. |
IndexInput | Abstract base class for input from a file in a Directory. |
IndexMergeTool | Merges indices specified on the command line into the index specified as the first command line argument. |
IndexModifier |
[Note that as of 2.1, all but one of the methods in this class are available via IndexWriter. |
IndexOutput | Abstract base class for output to a file in a Directory. |
IndexReader | IndexReader is an abstract class, providing an interface for accessing an index. |
IndexReader.FieldOption | |
IndexSearcher | Implements search over a single IndexReader. |
IndexTask | Ant task to index files with Lucene |
IndexTask.AnalyzerType | |
IndexTask.HandlerConfig | |
IndexWriter |
An IndexWriter creates and maintains an index.
|
IntFieldSource |
Expert: obtains int field values from the
FieldCache
using getInts() and makes those values
available as other numeric types, casting as needed.
|
ISOLatin1AccentFilter | A filter that replaces accented characters in the ISO Latin 1 character set (ISO-8859-1) by their unaccented equivalent. |
ItalianStemmer | Generated class implementing code defined by a snowball script. |
JakartaRegexpCapabilities | Implementation tying Jakarta Regexp to RegexQuery. |
JavaUtilRegexCapabilities | An implementation tying Java's built-in java.util.regex to RegexQuery. |
JEDirectory |
Port of Andi Vajda's DbDirectory to to Java Edition of Berkeley Database
A JEDirectory is a Berkeley DB JE based implementation of
Directory . |
JEIndexInput | Port of Andi Vajda's DbDirectory to Java Edition of Berkeley Database |
JEIndexOutput | Port of Andi Vajda's DbDirectory to Java Edition of Berkeley Database |
JELock | Port of Andi Vajda's DbDirectory to Java Edition of Berkeley Database |
KeepOnlyLastCommitDeletionPolicy | This IndexDeletionPolicy implementation that keeps only the most recent commit and immediately removes all prior commits after a new commit is done. |
KeywordAnalyzer | "Tokenizes" the entire stream as a single token. |
KeywordTokenizer | Emits the entire input as a single token. |
KpStemmer | Generated class implementing code defined by a snowball script. |
LengthFilter | Removes words that are too long and too short from the stream. |
LengthNormModifier | Given a directory, a Similarity, and a list of fields, updates the fieldNorms in place for every document using the Similarity.lengthNorm. |
LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters. |
LikeThisQueryBuilder | |
ListSearcher | See table searcher explanation. |
LoadFirstFieldSelector | Load the First field and break. |
Lock | An interprocess mutex lock. |
LockFactory | Base class for Locking implementation. Directory uses instances of this class to implement locking. |
LockObtainFailedException |
This exception is thrown when the write.lock
could not be acquired. |
Lock.With | Utility class for executing code with exclusive access. |
LovinsStemmer | Generated class implementing code defined by a snowball script. |
LowerCaseFilter | Normalizes token text to lower case. |
LowerCaseTokenizer | LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. |
LuceneDictionary | Lucene Dictionary: terms taken from the given field of a Lucene index. |
LucenePackage | Lucene's package information, including version. |
Lucli | Main class for lucli: the Lucene Command Line Interface. |
MapFieldSelector | A FieldSelector based on a Map of field names to FieldSelectorResults |
MatchAllDocsQuery | A query that matches all documents. |
MatchAllDocsQueryBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
MemoryIndex | High-performance single-document main memory Apache Lucene fulltext search index. |
MemUsage | This class holds a set of memory usage values. |
MMapDirectory | File-based Directory implementation that uses mmap for input. |
MoreLikeThis | Generate "more like this" similarity queries. |
MoreLikeThisQuery | A simple wrapper for MoreLikeThis for use in scenarios where a Query object is required eg in custom QueryParser extensions. |
MultiFieldQueryParser | A QueryParser which constructs queries to search multiple fields. |
MultipleTermPositions |
Describe class MultipleTermPositions here.
|
MultiPhraseQuery |
MultiPhraseQuery is a generalized version of PhraseQuery, with an added
method (Term[]) .
|
MultiReader | An IndexReader which reads multiple indexes, appending their content. |
MultiSearcher | Implements search over a set of Searchables .
|
MultiTermQuery | A Query that matches documents containing a subset of terms provided by a FilteredTermEnum enumeration. |
NativeFSLockFactory | Implements LockFactory using native OS file locks (available through java.nio.*). |
NewRoundTask | Increment the counter for properties maintained by Round Number. |
NGramTokenFilter | Tokenizes the input into n-grams of the given size(s). |
NGramTokenizer | Tokenizes the input into n-grams of the given size(s). |
NoLockFactory | Use this LockFactory to disable locking entirely. |
NoMoreDataException | Exception indicating there is no more data. |
NorwegianStemmer | Generated class implementing code defined by a snowball script. |
NotQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
NullFragmenter | Fragmenter implementation which does not fragment the text. |
NumberTools | Provides support for converting longs to Strings, and back again. |
OpenIndexTask | Open an index writer. |
OpenReaderTask | Open an index reader. |
OptimizeTask | Optimize the index. |
OrdFieldSource |
Expert: obtains the ordinal of the field value from the default Lucene
Fieldcache using getStringIndex().
|
OrQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
ParallelMultiSearcher | Implements parallel search over a set of Searchables .
|
ParallelReader | An IndexReader which reads multiple, parallel indexes. |
Parameter | A serializable Enum class. |
ParseException | This exception is thrown when parse errors are encountered. |
ParseException | This exception is thrown when parse errors are encountered. |
ParseException | This exception is thrown when parse errors are encountered. |
ParseException | This exception is thrown when parse errors are encountered. |
ParseException | This exception is thrown when parse errors are encountered. |
ParserException | |
PatternAnalyzer | Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a java.io.Reader, that can flexibly separate text into terms via a regular expression Pattern (with behaviour identical to String#split(String)), and that combines the functionality of LetterTokenizer, LowerCaseTokenizer, WhitespaceTokenizer, StopFilter into a single efficient multi-purpose class. |
Payload | A Payload is metadata that can be stored together with each occurrence of a term. |
PerfRunData | Data maintained by a performance test run. |
PerfTask | A (abstract) task to be tested for performance. |
PerFieldAnalyzerWrapper | This analyzer is used to facilitate scenarios where different fields require different analysis techniques. |
PhraseQuery | A Query that matches documents containing a particular sequence of terms. |
PlainTextDictionary | Dictionary represented by a text file. |
Points | Test run data points collected as the test proceeds. |
PorterStemFilter | Transforms the token stream as per the Porter stemming algorithm. |
PorterStemmer | Generated class implementing code defined by a snowball script. |
PortugueseStemmer | Generated class implementing code defined by a snowball script. |
PrecedenceQueryParser | Experimental query parser variant designed to handle operator precedence in a more sensible fashion than QueryParser. |
PrecedenceQueryParserConstants | |
PrecedenceQueryParserTokenManager | |
PrefixFilter | |
PrefixQuery | A Query that matches documents containing terms with a specified prefix. |
PriorityQueue | A PriorityQueue maintains a partial ordering of its elements such that the least element can always be found in constant time. |
Query | The abstract base class for queries. |
QueryBuilder | Implemented by objects that produce Lucene Query objects from XML streams. |
QueryBuilderFactory | |
QueryData | This class holds parameters for a query benchmark. |
QueryFilter | Constrains search results to only match those which also match a provided query. |
QueryMaker | Create queries for the test. |
QueryParser | This class is generated by JavaCC. |
QueryParser | This class is generated by JavaCC. |
QueryParserConstants | |
QueryParserConstants | |
QueryParserTokenManager | |
QueryParserTokenManager | |
QueryParser.Operator | The default operator for parsing queries. |
QueryScorer | Scorer implementation which scores text fragments by the number of unique query terms found. |
QueryTemplateManager | Provides utilities for turning query form input (such as from a web page or Swing gui) into Lucene XML queries by using XSL templates. |
QueryTermExtractor | Utility class used to extract the terms used in a query, plus any weights. |
QueryTermVector | |
QueryWrapperFilter | Constrains search results to only match those which also match a provided query. |
RAMDirectory | A memory-resident Directory implementation. |
RAMOutputStream | A memory-resident IndexOutput implementation. |
RangeFilter | A Filter that restricts search results to a range of values in a given field. |
RangeFilterBuilder | |
RangeQuery | A Query that matches documents within an exclusive range. |
ReadTask | Read index (abstract) task. |
RegexCapabilities | Defines basic operations needed by RegexQuery for a regular expression implementation. |
RegexpTunnel | This class exists as a gateway to access useful Jakarta Regexp package protected data. |
RegexQuery | Implements the regular expression term search query. |
RegexQueryCapable | Defines methods for regular expression supporting Querys to use. |
RegexTermEnum | Subclass of FilteredTermEnum for enumerating all terms that match the specified regular expression term using the specified regular expression implementation. |
RemoteCachingWrapperFilter | Provides caching of Filters themselves on the remote end of an RMI connection. |
RemoteSearchable | A remote searchable implementation. |
RepAllTask | Report all statistics with no aggregations. |
Report | Textual report of current statistics. |
ReportTask | Report (abstract) task - all report tasks extend this task. |
RepSelectByPrefTask | Report by-name-prefix statistics with no aggregations. |
RepSumByNameRoundTask | Report all statistics grouped/aggregated by name and round. |
RepSumByNameTask | Report all statistics aggregated by name. |
RepSumByPrefRoundTask | Report all prefix matching statistics grouped/aggregated by name and round. |
RepSumByPrefTask | Report by-name-prefix statistics aggregated by name. |
ReqExclScorer | A Scorer for queries with a required subscorer and an excluding (prohibited) subscorer. |
ReqOptSumScorer | A Scorer for queries with a required part and an optional part. |
ResetInputsTask | Reset inputs so that the test run would behave, input wise, as if it just started. |
ResetSystemEraseTask | Reset all index and input data and call gc, erase index and dir, does NOT clear statistics. |
ResetSystemSoftTask | Reset all index and input data and call gc, does NOT erase index/dir, does NOT clear statistics. |
ReutersDocMaker | A DocMaker using the Reuters collection for its input. |
ReutersQueries | |
ReutersQueryMaker | A QueryMaker that makes queries devised manually (by Grant Ingersoll) for searching in the Reuters collection. |
ReverseOrdFieldSource |
Expert: obtains the ordinal of the field value from the default Lucene
FieldCache using getStringIndex()
and reverses the order.
|
RussianAnalyzer | Analyzer for Russian language. |
RussianCharsets | RussianCharsets class contains encodings schemes (charsets) and toLowerCase() method implementation for russian characters in Unicode, KOI8 and CP1252. |
RussianLetterTokenizer | A RussianLetterTokenizer is a tokenizer that extends LetterTokenizer by additionally looking up letters in a given "russian charset". |
RussianLowerCaseFilter | Normalizes token text to lower case, analyzing given ("russian") charset. |
RussianStemFilter | A filter that stems Russian words. |
RussianStemmer | Generated class implementing code defined by a snowball script. |
Sample | Sample performance test written programatically - no algorithm file is needed here. |
ScoreDoc | Expert: Returned by low-level search implementations. |
ScoreDocComparator | Expert: Compares two ScoreDoc objects for sorting. |
Scorer | Expert: Common scoring functionality for different types of queries. |
Scorer | Adds to the score for a fragment based on its tokens |
ScorerDocQueue | A ScorerDocQueue maintains a partial ordering of its Scorers such that the least Scorer can always be found in constant time. |
Searchable | The interface for search implementations. |
Searcher | An abstract base class for search implementations. |
SearchFiles | Simple command-line based search demo. |
SearchTask | Search task. |
SearchTravRetLoadFieldSelectorTask | Search and Traverse and Retrieve docs task using a SetBasedFieldSelector. |
SearchTravRetTask | Search and Traverse and Retrieve docs task. |
SearchTravTask | Search and Traverse task. |
SegmentInfos.FindSegmentsFile | Utility class for executing code that needs to do something with the current segments file. |
SetBasedFieldSelector | Declare what fields to load normally and what fields to load lazily |
SetPropTask | Set a performance test configuration property. |
ShortFieldSource |
Expert: obtains short field values from the
FieldCache
using getShorts() and makes those values
available as other numeric types, casting as needed.
|
Similarity | Expert: Scoring API. |
SimilarityDelegator | Expert: Delegating scoring implementation. |
SimilarityQueries | Simple similarity measures. |
SimpleAnalyzer | An Analyzer that filters LetterTokenizer with LowerCaseFilter. |
SimpleCharStream | An implementation of interface CharStream, where the stream is assumed to contain only ASCII characters (without unicode processing). |
SimpleDocMaker | Create documents for the test. |
SimpleFragmenter | Fragmenter implementation which breaks text up into same-size fragments with no concerns over spotting sentence boundaries. |
SimpleFSLockFactory | Implements LockFactory using File#createNewFile(). |
SimpleHTMLEncoder | Simple Encoder implementation to escape text for HTML output |
SimpleHTMLFormatter | Simple Formatter implementation to highlight terms with a pre and post tag |
SimpleQueryMaker | A QueryMaker that makes queries for a collection created using SimpleDocMaker. |
SimpleSloppyPhraseQueryMaker | Create sloppy phrase queries for performance test, in an index created using simple doc maker. |
SimpleTerm | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SimpleTerm.MatchingTermVisitor | |
SingleInstanceLockFactory | Implements LockFactory for a single in-process instance, meaning all locking will take place through this one instance. |
SmallFloat | Floating point numbers smaller than 32 bits. |
SnowballAnalyzer | Filters StandardTokenizer with StandardFilter, LowerCaseFilter, StopFilter and SnowballFilter. |
SnowballFilter | A filter that stems words using a Snowball-generated stemmer. |
SnowballProgram | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
Sort | Encapsulates sort criteria for returned hits. |
SortComparator | Abstract base class for sorting hits returned by a Query. |
SortComparatorSource | Expert: returns a comparator for sorting ScoreDocs. |
SortField | Stores information about how to sort documents by terms in an individual field. |
SpanBuilderBase | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanFirstBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanFirstQuery | Matches spans near the beginning of a field. |
SpanGradientFormatter | Formats text with different color intensity depending on the score of the term using the span tag. |
SpanishStemmer | Generated class implementing code defined by a snowball script. |
SpanNearBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanNearClauseFactory | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanNearQuery | Matches spans which are near one another. |
SpanNotBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanNotQuery | Removes matches which overlap with another SpanQuery. |
SpanOrBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanOrQuery | Matches the union of its clauses. |
SpanOrTermsBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanQuery | Base class for span-based queries. |
SpanQueryBuilder | |
SpanQueryBuilderFactory | |
SpanRegexQuery | A SpanQuery version of RegexQuery allowing regular expression queries to be nested within other SpanQuery subclasses. |
Spans | Expert: an enumeration of span matches. |
SpanScorer | Public for extension only. |
SpanTermBuilder | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SpanTermQuery | Matches spans containing a term. |
SpanWeight | Expert-only. |
SpellChecker |
Spell Checker class (Main class) |
SrndPrefixQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SrndQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SrndTermQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
SrndTruncQuery | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
StaleReaderException | This exception is thrown when an IndexReader tries to make changes to the index (via IndexReader, IndexReader or IndexReader) but changes have already been committed to the index since this reader was instantiated. |
StandardAnalyzer | Filters StandardTokenizer with StandardFilter, LowerCaseFilter and StopFilter, using a list of English stop words. |
StandardBenchmarker | Reads in the Reuters Collection, downloaded from http://www.daviddlewis.com/resources/testcollections/reuters21578/reuters21578.tar.gz in the workingDir/reuters and indexes them using the StandardAnalyzer Runs a standard set of documents through an Indexer and then runs a standard set of queries against the index. |
StandardFilter | Normalizes tokens extracted with StandardTokenizer. |
StandardOptions | |
StandardTokenizer | A grammar-based tokenizer constructed with JavaCC. |
StandardTokenizerConstants | |
StandardTokenizerTokenManager | |
StopAnalyzer | Filters LetterTokenizer with LowerCaseFilter and StopFilter. |
StopFilter | Removes stop words from a token stream. |
StringHelper | Methods for manipulating strings. |
SwedishStemmer | Generated class implementing code defined by a snowball script. |
SweetSpotSimilarity | A similarity with a lengthNorm that provides for a "platuea" of equally good lengths, and tf helper functions. |
SynExpand | Expand a query by looking up synonyms for every term. |
SynLookup | Test program to look up synonyms. |
SynonymMap | Loads the WordNet prolog file wn_s.pl into a thread-safe main-memory hash map that can be used for fast high-frequency lookups of synonyms for any given (lowercase) word string. |
SynonymTokenFilter | Injects additional tokens for synonyms of token terms fetched from the underlying child stream; the child stream must deliver lowercase tokens for synonyms to be found. |
Syns2Index |
Convert the prolog file wn_s.pl from the WordNet prolog download
into a Lucene index suitable for looking up synonyms and performing query expansion (SynExpand.expand(...) ).
|
TableSearcher | This is a TableModel that encapsulates Lucene search logic within a TableModel implementation. |
Tags | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
TaskSequence | Sequence of parallel or sequential tasks. |
TaskStats | Statistics for a task run. |
Term | A Term represents a word from text. |
TermDocs | TermDocs provides an interface for enumerating <document, frequency> pairs for a term. |
TermEnum | Abstract class for enumerating terms. |
TermFreqVector | Provides access to stored term vector of a document field. |
TermPositions | TermPositions provides an interface for enumerating the <document, frequency, <position>* > tuples for a term. |
TermPositionVector | Extends TermFreqVector to provide additional information about
positions in which each of the terms is found. |
TermQuery | A Query that matches documents containing a term. |
TermQueryBuilder | |
TermsFilterBuilder | |
TermsQueryBuilder | Builds a BooleanQuery from all of the terms found in the XML element using the choice of analyzer |
TermSpans | Expert: Public for extension only |
TermVectorOffsetInfo | The TermVectorOffsetInfo class holds information pertaining to a Term in a TermPositionVector's offset information. |
TestApp | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
TestData | This class holds together all parameters related to a test. |
TestRunData | This class holds series of TimeData related to a single test run. |
TextDocument | A utility for making Lucene Documents from a File. |
TextFragment | Low-level class used to record information about a section of a document with a score. |
ThaiAnalyzer | Analyzer for Thai language. |
ThaiWordFilter | TokenFilter that use java.text.BreakIterator to break each Token that is Thai into separate Token(s) for each Thai word. |
TimeData | This class holds a data point measuring speed of processing. |
Token | A Token is an occurence of a term from the text of a field. |
Token | Describes the input token stream. |
Token | Describes the input token stream. |
Token | Describes the input token stream. |
Token | Describes the input token stream. |
Token | Describes the input token stream. |
TokenFilter | A TokenFilter is a TokenStream whose input is another token stream. |
TokenGroup | One, or several overlapping tokens, along with the score(s) and the scope of the original text |
Tokenizer | A Tokenizer is a TokenStream whose input is a Reader. |
TokenMgrError | |
TokenMgrError | |
TokenMgrError | |
TokenMgrError | |
TokenMgrError | |
TokenSources | Hides implementation issues associated with obtaining a TokenStream for use with the higlighter - can obtain from TermFreqVectors with offsets and (optionally) positions or from Analyzer class reparsing the stored content. |
TokenStream | A TokenStream enumerates the sequence of tokens, either from fields of a document or from query text. |
TooManyBasicQueries | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
TopDocCollector | A HitCollector implementation that collects the top-scoring documents, returning them as a TopDocs. |
TopDocs | Expert: Returned by low-level search implementations. |
TopFieldDocCollector | A HitCollector implementation that collects the top-sorting documents, returning them as a TopFieldDocs. |
TopFieldDocs | Expert: Returned by low-level sorted search implementations. |
ToStringUtils | Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. |
TrecDocMaker | A DocMaker using the (compressed) Trec collection for its input. |
UserInputQueryBuilder | |
ValueSource | Expert: source of values for basic function queries. |
ValueSourceQuery |
Expert: A Query that sets the scores of document to the
values obtained from a ValueSource .
|
WarmTask | Warm reader task: retrieve all reader documents. |
Weight | Expert: Calculate query weights and build query scorers. |
WeightedTerm | Lightweight class to hold term and a weight value used for scoring this term |
WhitespaceAnalyzer | An Analyzer that uses WhitespaceTokenizer. |
WhitespaceTokenizer | A WhitespaceTokenizer is a tokenizer that divides text at whitespace. |
WildcardQuery | Implements the wildcard search query. |
WildcardTermEnum | Subclass of FilteredTermEnum for enumerating all terms that match the specified wildcard filter term. |
WordlistLoader | Loader for text files that represent a list of stopwords. |
WordlistLoader | Loads a text file and adds every line as an entry to a Hashtable. |