org.apache.lucene.analysis.ru

Class RussianAnalyzer

public final class RussianAnalyzer extends Analyzer

Analyzer for Russian language. Supports an external list of stopwords (words that will not be indexed at all). A default set of stopwords is used unless an alternative list is specified.

Version: $Id: RussianAnalyzer.java 472959 2006-11-09 16:21:50Z yonik $

Author: Boris Okner, b.okner@rogers.com

Constructor Summary
RussianAnalyzer()
RussianAnalyzer(char[] charset)
Builds an analyzer.
RussianAnalyzer(char[] charset, String[] stopwords)
Builds an analyzer with the given stop words.
RussianAnalyzer(char[] charset, Hashtable stopwords)
Builds an analyzer with the given stop words.
Method Summary
TokenStreamtokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.

Constructor Detail

RussianAnalyzer

public RussianAnalyzer()

RussianAnalyzer

public RussianAnalyzer(char[] charset)
Builds an analyzer.

RussianAnalyzer

public RussianAnalyzer(char[] charset, String[] stopwords)
Builds an analyzer with the given stop words.

RussianAnalyzer

public RussianAnalyzer(char[] charset, Hashtable stopwords)
Builds an analyzer with the given stop words.

UNKNOWN: create a Set version of this ctor

Method Detail

tokenStream

public TokenStream tokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.

Returns: A TokenStream build from a RussianLetterTokenizer filtered with RussianLowerCaseFilter, StopFilter, and RussianStemFilter

Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.