org.apache.lucene.analysis.th

Class ThaiAnalyzer


public class ThaiAnalyzer
extends Analyzer

Analyzer for the Thai language. It uses java.text.BreakIterator to find word boundaries.
Version:
0.2
Author:
Samphan Raruenrom for To-Be-One Technology Co., Ltd.
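
The class description says word breaking relies on java.text.BreakIterator. The sketch below illustrates that mechanism using only the JDK; it is not the ThaiAnalyzer source. The class name BreakIteratorWords, the helper words, and the choice to drop segments without a letter or digit are assumptions made for illustration. How Thai text is segmented depends on the locale data shipped with the running JDK, so no Thai output is shown.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Lucene: splits text at the word
// boundaries reported by java.text.BreakIterator.
final class BreakIteratorWords {

    static List<String> words(String text, Locale locale) {
        BreakIterator it = BreakIterator.getWordInstance(locale);
        it.setText(text);

        List<String> out = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end);
            // Assumption: keep only segments that contain a letter or digit,
            // which drops whitespace and punctuation segments.
            if (piece.codePoints().anyMatch(Character::isLetterOrDigit)) {
                out.add(piece);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(words("hello, world", Locale.ENGLISH));
        // Thai is written without spaces between words; the segments printed
        // here depend on the JDK's Thai locale data.
        System.out.println(words("ภาษาไทย", Locale.forLanguageTag("th")));
    }
}
```

An analyzer built on this idea would emit each returned segment as a token; filtering, offsets, and any further normalization are left out of the sketch.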

Method Summary

TokenStream
tokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.

Methods inherited from class org.apache.lucene.analysis.Analyzer

getPositionIncrementGap, tokenStream

Method Details

tokenStream

public TokenStream tokenStream(String fieldName,
                               Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader. The default implementation forwards to tokenStream(Reader) for compatibility with older versions. Override it to let the Analyzer choose a strategy based on the document and/or field. Implementations must handle a null field name for backward compatibility.
Overrides:
tokenStream in class Analyzer

Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.