Package org.languagetool.tagging.de
Class GermanTagger
java.lang.Object
org.languagetool.tagging.BaseTagger
org.languagetool.tagging.de.GermanTagger
- All Implemented Interfaces:
org.languagetool.tagging.Tagger
- Direct Known Subclasses:
SwissGermanTagger
public class GermanTagger
extends org.languagetool.tagging.BaseTagger
German part-of-speech tagger, requires data file in
de/german.dict
in the classpath.
The POS tagset is described in
tagset.txt-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate GermanCompoundTokenizer
private final org.languagetool.tagging.ManualTagger
Fields inherited from class org.languagetool.tagging.BaseTagger
conversionLocale, wordTagger
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate List
<org.languagetool.tagging.TaggedWord> private List
<org.languagetool.AnalyzedToken> getAnalyzedTokens
(List<org.languagetool.tagging.TaggedWord> taggedWords, String word) private List
<org.languagetool.AnalyzedToken> getAnalyzedTokens
(List<org.languagetool.tagging.TaggedWord> taggedWords, String word, List<String> compoundParts) private List
<org.languagetool.AnalyzedToken> getImperativeForm
(String word, List<String> sentenceTokens, int pos) private org.languagetool.AnalyzedToken
getNoInfoToken
(String word) private List
<org.languagetool.AnalyzedToken> getSubstantivatedForms
(String word, List<String> sentenceTokens, int pos) private void
@Nullable org.languagetool.AnalyzedTokenReadings
Return only the first reading of the given word ornull
.private boolean
matchesUppercaseAdjective
(String unknownUppercaseToken) private String
sanitizeWord
(String word) List
<org.languagetool.tagging.TaggedWord> List
<org.languagetool.AnalyzedTokenReadings> List
<org.languagetool.AnalyzedTokenReadings> Methods inherited from class org.languagetool.tagging.BaseTagger
additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getWordTagger, overwriteWithManualTagger
-
Field Details
-
removalTagger
private final org.languagetool.tagging.ManualTagger removalTagger -
compoundTokenizer
-
-
Constructor Details
-
GermanTagger
public GermanTagger()
-
-
Method Details
-
addStem
-
sanitizeWord
-
getManualAdditionsFileName
- Specified by:
getManualAdditionsFileName
in classorg.languagetool.tagging.BaseTagger
-
getManualRemovalsFileName
- Overrides:
getManualRemovalsFileName
in classorg.languagetool.tagging.BaseTagger
-
lookup
@Nullable public @Nullable org.languagetool.AnalyzedTokenReadings lookup(String word) throws IOException Return only the first reading of the given word ornull
.- Throws:
IOException
-
tag
-
matchesUppercaseAdjective
-
tag
public List<org.languagetool.AnalyzedTokenReadings> tag(List<String> sentenceTokens) throws IOException - Specified by:
tag
in interfaceorg.languagetool.tagging.Tagger
- Overrides:
tag
in classorg.languagetool.tagging.BaseTagger
- Throws:
IOException
-
tag
public List<org.languagetool.AnalyzedTokenReadings> tag(List<String> sentenceTokens, boolean ignoreCase) throws IOException - Throws:
IOException
-
getImperativeForm
-
getSubstantivatedForms
-
initializeIfRequired
- Throws:
IOException
-
getNoInfoToken
-
getAnalyzedTokens
-
getAnalyzedTokens
-