Package com.ibm.icu.impl.breakiter
Class CjkBreakEngine
- java.lang.Object
-
- com.ibm.icu.impl.breakiter.DictionaryBreakEngine
-
- com.ibm.icu.impl.breakiter.CjkBreakEngine
-
- All Implemented Interfaces:
LanguageBreakEngine
public class CjkBreakEngine extends DictionaryBreakEngine
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class com.ibm.icu.impl.breakiter.DictionaryBreakEngine
DictionaryBreakEngine.DequeI, DictionaryBreakEngine.PossibleWord
-
-
Field Summary
Fields Modifier and Type Field Description private UnicodeSet
fClosePunctuationSet
private DictionaryMatcher
fDictionary
private UnicodeSet
fDigitOrOpenPunctuationOrAlphabetSet
private UnicodeSet
fHangulWordSet
private MlBreakEngine
fMlBreakEngine
private java.util.HashSet<java.lang.String>
fSkipSet
private boolean
isCj
private static int
kint32max
private static int
kMaxKatakanaGroupLength
private static int
kMaxKatakanaLength
private static int
maxSnlp
-
Fields inherited from class com.ibm.icu.impl.breakiter.DictionaryBreakEngine
fSet
-
-
Constructor Summary
Constructors Constructor Description CjkBreakEngine(boolean korean)
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description int
divideUpDictionaryRange(java.text.CharacterIterator inText, int startPos, int endPos, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking)
Divide up a range of known dictionary characters handled by this break engine.boolean
equals(java.lang.Object obj)
private static int
getKatakanaCost(int wordlength)
private java.lang.String
getPatternFromText(java.text.CharacterIterator text, java.lang.StringBuffer sb, int start, int length)
int
hashCode()
private void
initializeJapanesePhraseParamater()
private static boolean
isKatakana(int value)
private void
loadHiragana()
private void
loadJapaneseExtensions()
-
Methods inherited from class com.ibm.icu.impl.breakiter.DictionaryBreakEngine
findBreaks, handles, setCharacters
-
-
-
-
Field Detail
-
fHangulWordSet
private UnicodeSet fHangulWordSet
-
fDigitOrOpenPunctuationOrAlphabetSet
private UnicodeSet fDigitOrOpenPunctuationOrAlphabetSet
-
fClosePunctuationSet
private UnicodeSet fClosePunctuationSet
-
fDictionary
private DictionaryMatcher fDictionary
-
fSkipSet
private java.util.HashSet<java.lang.String> fSkipSet
-
fMlBreakEngine
private MlBreakEngine fMlBreakEngine
-
isCj
private boolean isCj
-
kMaxKatakanaLength
private static final int kMaxKatakanaLength
- See Also:
- Constant Field Values
-
kMaxKatakanaGroupLength
private static final int kMaxKatakanaGroupLength
- See Also:
- Constant Field Values
-
maxSnlp
private static final int maxSnlp
- See Also:
- Constant Field Values
-
kint32max
private static final int kint32max
- See Also:
- Constant Field Values
-
-
Method Detail
-
initializeJapanesePhraseParamater
private void initializeJapanesePhraseParamater()
-
loadJapaneseExtensions
private void loadJapaneseExtensions()
-
loadHiragana
private void loadHiragana()
-
equals
public boolean equals(java.lang.Object obj)
- Overrides:
equals
in classjava.lang.Object
-
hashCode
public int hashCode()
- Overrides:
hashCode
in classjava.lang.Object
-
getKatakanaCost
private static int getKatakanaCost(int wordlength)
-
isKatakana
private static boolean isKatakana(int value)
-
divideUpDictionaryRange
public int divideUpDictionaryRange(java.text.CharacterIterator inText, int startPos, int endPos, DictionaryBreakEngine.DequeI foundBreaks, boolean isPhraseBreaking)
Description copied from class:DictionaryBreakEngine
Divide up a range of known dictionary characters handled by this break engine.
- Specified by:
divideUpDictionaryRange
in classDictionaryBreakEngine
- Parameters:
inText
- A UText representing the textstartPos
- The start of the range of dictionary charactersendPos
- The end of the range of dictionary charactersfoundBreaks
- Output of break positions. Positions are pushed. Pre-existing contents of the output stack are unaltered.- Returns:
- The number of breaks found
-
getPatternFromText
private java.lang.String getPatternFromText(java.text.CharacterIterator text, java.lang.StringBuffer sb, int start, int length)
-
-