Package org.lionsoul.jcseg.dic
Interface ILexicon
-
public interface ILexiconlexicon configuration class.- Author:
- chenxin
-
-
Field Summary
Fields Modifier and Type Field Description static intCJK_CHARCJK single wordstatic intCJK_SYNCJK synonymsstatic intCJK_UNITChinese single unitsstatic intCJK_WORDChinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here.static intCN_DNAME_1first word of Chinese double namestatic intCN_DNAME_2second word of Chinese double namestatic intCN_LNAMEChinese last namestatic intCN_LNAME_ADORNthe adorn(修饰) char before the last name like word "老陈", "小陈"static intCN_SNAMEChinese single namestatic intDOMAIN_SUFFIXdomain name suffix dictionary for the URL recognitionstatic intMIX_ASSIST_WORDspecial lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognitionstatic intNUMBER_UNITstatic intSTOP_WORDstop wordsstatic intT_LENstatic intUNMATCH_CJK_WORDunmatched word
-
-
-
Field Detail
-
T_LEN
static final int T_LEN
- See Also:
- Constant Field Values
-
CJK_WORD
static final int CJK_WORD
Chinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here. implemented at 2016/11/09- See Also:
- Constant Field Values
-
CJK_UNIT
static final int CJK_UNIT
Chinese single units- See Also:
- Constant Field Values
-
CN_LNAME
static final int CN_LNAME
Chinese last name- See Also:
- Constant Field Values
-
CN_SNAME
static final int CN_SNAME
Chinese single name- See Also:
- Constant Field Values
-
CN_DNAME_1
static final int CN_DNAME_1
first word of Chinese double name- See Also:
- Constant Field Values
-
CN_DNAME_2
static final int CN_DNAME_2
second word of Chinese double name- See Also:
- Constant Field Values
-
CN_LNAME_ADORN
static final int CN_LNAME_ADORN
the adorn(修饰) char before the last name like word "老陈", "小陈"- See Also:
- Constant Field Values
-
STOP_WORD
static final int STOP_WORD
stop words- See Also:
- Constant Field Values
-
MIX_ASSIST_WORD
static final int MIX_ASSIST_WORD
special lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognition- See Also:
- Constant Field Values
-
DOMAIN_SUFFIX
static final int DOMAIN_SUFFIX
domain name suffix dictionary for the URL recognition- See Also:
- Constant Field Values
-
NUMBER_UNIT
static final int NUMBER_UNIT
- See Also:
- Constant Field Values
-
CJK_CHAR
static final int CJK_CHAR
CJK single word- See Also:
- Constant Field Values
-
CJK_SYN
static final int CJK_SYN
CJK synonyms- See Also:
- Constant Field Values
-
UNMATCH_CJK_WORD
static final int UNMATCH_CJK_WORD
unmatched word- See Also:
- Constant Field Values
-
-