Interface ILexicon


  • public interface ILexicon
    lexicon configuration class.
    Author:
    chenxin
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static int CJK_CHAR
      CJK single word
      static int CJK_SYN
      CJK synonyms
      static int CJK_UNIT
      Chinese single units
      static int CJK_WORD
      Chinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here.
      static int CN_DNAME_1
      first word of Chinese double name
      static int CN_DNAME_2
      second word of Chinese double name
      static int CN_LNAME
      Chinese last name
      static int CN_LNAME_ADORN
      the adorn(修饰) char before the last name like word "老陈", "小陈"
      static int CN_SNAME
      Chinese single name
      static int DOMAIN_SUFFIX
      domain name suffix dictionary for the URL recognition
      static int MIX_ASSIST_WORD
      special lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognition
      static int NUMBER_UNIT  
      static int STOP_WORD
      stop words
      static int T_LEN  
      static int UNMATCH_CJK_WORD
      unmatched word
    • Field Detail

      • CJK_WORD

        static final int CJK_WORD
        Chinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here. implemented at 2016/11/09
        See Also:
        Constant Field Values
      • CN_DNAME_1

        static final int CN_DNAME_1
        first word of Chinese double name
        See Also:
        Constant Field Values
      • CN_DNAME_2

        static final int CN_DNAME_2
        second word of Chinese double name
        See Also:
        Constant Field Values
      • CN_LNAME_ADORN

        static final int CN_LNAME_ADORN
        the adorn(修饰) char before the last name like word "老陈", "小陈"
        See Also:
        Constant Field Values
      • MIX_ASSIST_WORD

        static final int MIX_ASSIST_WORD
        special lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognition
        See Also:
        Constant Field Values
      • DOMAIN_SUFFIX

        static final int DOMAIN_SUFFIX
        domain name suffix dictionary for the URL recognition
        See Also:
        Constant Field Values