A B C D E F G H I J K L M N O P Q R S T U V W _
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- add(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
-
Append a new Integer to the end.
- add(int, String, int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- add(int, String, int) - Method in interface org.lionsoul.jcseg.IDictionary
-
add a new word to the dictionary
- add(int, String, int, int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- add(int, String, int, int) - Method in interface org.lionsoul.jcseg.IDictionary
-
add a new word to the dictionary
- add(int, String, int, int, String[]) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- add(int, String, int, int, String[]) - Method in interface org.lionsoul.jcseg.IDictionary
-
add a new word to the dictionary with its statistics frequency
- add(int, String, int, String[]) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- add(int, String, int, String[]) - Method in interface org.lionsoul.jcseg.IDictionary
-
add a new word to the dictionary
- add(int, IWord) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- add(int, IWord) - Method in interface org.lionsoul.jcseg.IDictionary
-
directly add a IWord item to the dictionary
- add(IWord) - Method in class org.lionsoul.jcseg.SynonymsEntry
-
add a new synonyms word and the newly added word will extend the part of speech and the entity from the base word if there are not set
- add(T) - Method in class org.lionsoul.jcseg.util.IHashQueue
-
append a item from the tail
- addEntity(String) - Method in interface org.lionsoul.jcseg.IWord
-
append a new entity
- addEntity(String) - Method in class org.lionsoul.jcseg.segmenter.Word
- addPartSpeech(String) - Method in interface org.lionsoul.jcseg.IWord
-
add a new part to speech to the word.
- addPartSpeech(String) - Method in class org.lionsoul.jcseg.segmenter.Word
- ADictionary - Class in org.lionsoul.jcseg.dic
-
Dictionary abstract super class
- ADictionary(SegmenterConfig, Boolean) - Constructor for class org.lionsoul.jcseg.dic.ADictionary
-
initialize the ADictionary
- AL_TODO_FILE - Static variable in class org.lionsoul.jcseg.dic.ADictionary
-
the default autoload task file name
- append(boolean) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a boolean value
- append(char) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a char to the buffer
- append(char[]) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append some chars to the buffer
- append(char[], int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append the rest of the chars to the buffer
- append(char[], int, int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append parts of the chars to the buffer
- append(double) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a double value
- append(float) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a float value
- append(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a int value
- append(long) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a long value
- append(short) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a short value
- append(String) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
append a string to the buffer
- APPEND_CJK_ENTITY - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
do the entity recognition ?
- APPEND_CJK_PINYIN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
append the Pinyin to the result
- APPEND_CJK_SYN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
append the syn word to the result.
- APPEND_PART_OF_SPEECH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
append the part of speech.
- appendCJKPinyin() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- appendCJKSyn() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- appendCJKWordFeatures(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
check and append the pinyin and the synonyms words of the specified word
- appendLatinWordFeatures(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
Check and append the synonyms/pinyin words of specified word included the CJK and basic Latin words All the synonyms words share the same position part of speech, word type with the primitive word
- appendPinyin(SegmenterConfig, LinkedList<IWord>, IWord) - Static method in class org.lionsoul.jcseg.segmenter.SegKit
-
Quick interface to do the pinyin word append.
- appendSynonyms(SegmenterConfig, LinkedList<IWord>, IWord) - Static method in class org.lionsoul.jcseg.segmenter.SegKit
-
quick interface to do the synonyms word append You got check if the specified word has any synonyms first
- ArrayUtil - Class in org.lionsoul.jcseg.util
-
Array util class
- ArrayUtil() - Constructor for class org.lionsoul.jcseg.util.ArrayUtil
- autoFilter - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
-
auto filter the words with low score
- autoLoad() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
initialize the value of its options by auto searching the jcesg.properties file:
- AutoLoadFile - Class in org.lionsoul.jcseg.dic
-
AutoLoad file to describle the autoload configuration files
- AutoLoadFile(String) - Constructor for class org.lionsoul.jcseg.dic.AutoLoadFile
- autoMinLength - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
-
auto append the words with a length over the specified value as a phrase
B
- B - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- behindLatin - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
-
global behind Latin word after the CJK word added at 2016/11/22 for better mixed word implementation
- bucketSort(int[], int) - Static method in class org.lionsoul.jcseg.util.Sort
-
bucket sort algorithm
- bucketSort(Integer[], int) - Static method in class org.lionsoul.jcseg.util.Sort
-
bucket sort algorithm
- buffer() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
return the chars of the buffer
- ByteCharCounter - Class in org.lionsoul.jcseg.util
-
All Basic printable Latin char counter class include all the English punctuation and the letters
- ByteCharCounter() - Constructor for class org.lionsoul.jcseg.util.ByteCharCounter
C
- charAt(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
get the char at a specified position in the buffer
- CharTypeFunction - Interface in org.lionsoul.jcseg.fi
- CHECK_CE_MASk - Static variable in interface org.lionsoul.jcseg.ISegment
-
Whether to check the Chinese and English mixed word.
- CHECK_CF_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
-
Whether to check the Chinese fraction.
- CHECK_EC_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
-
Whether to check the English Chinese mixed suffix For the new implementation of the mixed word recognition Added at 2016/11/22
- Chunk - Class in org.lionsoul.jcseg.segmenter
-
chunk concept for the mmseg word segmentation algorithm
- Chunk(IWord[]) - Constructor for class org.lionsoul.jcseg.segmenter.Chunk
- CJK_CHAR - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
CJK single word
- CJK_SYN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
CJK synonyms
- CJK_UNIT - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
Chinese single units
- CJK_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
Chinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here.
- CJKIndexOf(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- CJKIndexOf(String, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
get the index of the first CJK char of the specified string
- clear() - Method in class org.lionsoul.jcseg.util.IntArrayList
- clear() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
clear the buffer by reset the count to 0
- CLEAR_STOPWORD - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
clear away the stop word.
- clearStopwords() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- clone() - Method in interface org.lionsoul.jcseg.IWord
-
make clone available
- clone() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
rewrite the clone method
- clone() - Method in class org.lionsoul.jcseg.segmenter.Word
-
Interface to clone the current object
- CN_DNAME_1 - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
first word of Chinese double name
- CN_DNAME_2 - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
second word of Chinese double name
- CN_LNAME - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
Chinese last name
- CN_LNAME_ADORN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
the adorn(修饰) char before the last name like word "老陈", "小陈"
- CN_SNAME - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
Chinese single name
- CNFRA_TO_ARABIC - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
Chinese fraction to Arabic fraction .
- cnFractionToArabic() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- CNNUM_TO_ARABIC - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
Chinese numeric to Arabic .
- cnNumericToArabic(String, boolean) - Static method in class org.lionsoul.jcseg.util.NumericUtil
-
a static method to turn the Chinese numeric to Arabic numbers
- cnNumToArabic() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- compareTo(TextRankSummaryExtractor.Document) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
-
override the compareTo method compare document with its relevance score
- COMPLEX - org.lionsoul.jcseg.ISegment.Type
- COMPLEX - Static variable in interface org.lionsoul.jcseg.ISegment
- COMPLEX_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- ComplexSeg - Class in org.lionsoul.jcseg.segmenter
-
Jcseg complex segmentation implementation based on the filter works of MMSeg rules:
- ComplexSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.ComplexSeg
- config - Variable in class org.lionsoul.jcseg.dic.ADictionary
- config - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
- config - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
- config - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
- config - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
- contains(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
check if there is an element that contains the specified string
- contains(T) - Method in class org.lionsoul.jcseg.util.IHashQueue
-
check the specifield T is aleady exists in the queue or not
- create(SegmenterConfig, boolean) - Method in interface org.lionsoul.jcseg.fi.DictionaryFunction
- create(SegmenterConfig, ADictionary) - Method in interface org.lionsoul.jcseg.fi.SegmenterFunction
- createDateTimePool() - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
create and return a date-time pool
- createDefaultDictionary(SegmenterConfig) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
-
create the ADictionary according to the JcsegTaskConfig check and load the lexicon by default
- createDefaultDictionary(SegmenterConfig, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
-
create the ADictionary according to the JcsegTaskConfig
- createDefaultDictionary(SegmenterConfig, boolean, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
-
create a default ADictionary instance: 1.
- createSingletonDictionary(SegmenterConfig) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
-
create a singleton ADictionary object according to the JcsegTaskConfig check and load the lexicon by default
- createSingletonDictionary(SegmenterConfig, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
-
create a singleton ADictionary object according to the SegmentConfig
- ctrlMask - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
-
segmentation runtime function control mask
D
- D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- data - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
- data - Variable in class org.lionsoul.jcseg.util.IIntFIFO.Entry
- data - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
- DATETIME_A - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_D - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_H - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_I - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_M - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_NONE - Static variable in class org.lionsoul.jcseg.util.TimeUtil
-
date-time part index constants we consider a date-time as the following seven parts: +------+-------+-----+---------------+------+--------+--------+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | +------+-------+-----+---------------+------+--------+--------+ | year | month | day | timing method | hour | minute | second | +------+-------+-----+---------------+------+--------+--------+ and the numeric value before every part.
- DATETIME_S - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- DATETIME_Y - Static variable in class org.lionsoul.jcseg.util.TimeUtil
- decrease(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- decrease(char, int) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- deleteCharAt(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
delete the char at the specified position
- DELIMITER - org.lionsoul.jcseg.ISegment.Type
- DELIMITER - Static variable in interface org.lionsoul.jcseg.ISegment
- DELIMITER_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- DelimiterSeg - Class in org.lionsoul.jcseg.segmenter
-
delimiter segment algorithm implementation extended from common segment interface ISegment
- DelimiterSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
method to create a new ISegment
- deQueue() - Method in class org.lionsoul.jcseg.util.IIntFIFO
-
remove the first item from the queue
- deQueue() - Method in class org.lionsoul.jcseg.util.IIntQueue
-
remove the node from the head and you should make sure the size is larger than 0 by calling size() before you invoke the method or you will just get -1
- DETECT - org.lionsoul.jcseg.ISegment.Type
- DETECT - Static variable in interface org.lionsoul.jcseg.ISegment
- DETECT_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- DetectSeg - Class in org.lionsoul.jcseg.segmenter
-
Detect segmentation mode return words only in the loaded dictionary yat, when matched a word and return it or continue to find the next word in the dictionary
- DetectSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.DetectSeg
-
method to create a new ISegment
- dic - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
the dictionary and task configuration
- dic - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
-
the dictionary and task configuration
- dic - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
-
the dictionary and task configuration
- dic - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
-
the dictionary and task configuration instance
- DictionaryFactory - Class in org.lionsoul.jcseg.dic
-
Dictionary Factory to create Dictionary instance a path of the class that has extends the ADictionary class must be given first
- DictionaryFunction - Interface in org.lionsoul.jcseg.fi
- dictMap - Variable in class org.lionsoul.jcseg.dic.HashMapDictionary
-
hash table for the words
- Document(int, Sentence, List<IWord>, double) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
-
construct method
- DOMAIN_SUFFIX - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
domain name suffix dictionary for the URL recognition
E
- E_ANGLE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_360 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_90 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_DU - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_FEN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_GON - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_MRAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_ANGLE_RAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_ACRE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_ARE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_CM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_DM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_FT2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_HA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_IN2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_KM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_M2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_MM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_MU - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_NM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_QING - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_SQ_FT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_SQ_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_SQ_MI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_SQ_RD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_SQ_YD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_AREA_UM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_AH - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_AHI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_AHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_D - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_H - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_HI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_HI_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_HIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_HIS_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_I - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_M - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_P - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_S - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_Y - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YM_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YMD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YMD_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YMDHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DATETIME_YMDZHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE_KM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE_LI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE_LY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE_MI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_DISTANCE_NMI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_EMAIL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_EMAIL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_GF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_KGF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_KIP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_KN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_LBF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_N - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_FORCE_TF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_IP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_IP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_CFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_CIN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_CM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_DM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_FM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_FT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_FUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_KM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_M - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_NM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_TFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_UM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_LENGTH_YD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MOBILE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MOBILE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_AUD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_BUK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_CAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_CNY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_CSK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_CUP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_DEM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_DKK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_EGP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_EUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_FRF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_GBP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_HKD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_INR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_ISK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_ITL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_JPY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_KRW - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_KWP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_MOP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_MXP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_MYR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_NOK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_NZD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_PHP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_SEK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_SGD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_SKK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_SUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_THB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_TWD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_USD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_VND - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_MONEY_ZAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_CN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_CN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_FOREIGN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_FOREIGN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_NICKNAME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NAME_NICKNAME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUC_PREFIX - Static variable in class org.lionsoul.jcseg.segmenter.Entity
-
prefix for numeric and unit composed word entity
- E_NUMERIC - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_ARABIC - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_ARABIC_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_CN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_CN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_CN_FRACTION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_CN_FRACTION_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_DECIMAL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_DECIMAL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_FRACTION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_FRACTION_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_PERCENTAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_NUMERIC_PERCENTAGE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_CITY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_CONTINENT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_DISTRICT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_NATION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_PROVINCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_TOWNSHIP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_VIEWPOINT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PLACE_VILLAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_ATM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_BAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_HG_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_HG_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_HPA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_KPA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_MBAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_PA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_PRESSURE_WG_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_BP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_CT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_DAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_DAN_UK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_DAN_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_DR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_G - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_GR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_JIN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_KG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_LB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_LIANG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_LT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_MG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_OZ - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_Q - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_QIAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_ST - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_QUALITY_T - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_B - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_BIT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_EB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_GB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_KB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_MB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_PB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_STORAGE_TB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE_C - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE_F - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE_K - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE_R - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TEMPERATURE_RE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_CARD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_CARD_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_EPISODE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_EPISODE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_NUMBER - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_THE_NUMBER_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_D - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_H - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_H_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_I - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_I_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_MON - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_MS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_NS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_P - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_S - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_S_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_WEEK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_TIME_YEAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_BAG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_BOTTLE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_BOX - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_DAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_DISCOUNT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_UNIT_ITEM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL_FTP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL_FTP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL_HTTP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_URL_HTTP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_CL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_CM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_DL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_DM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_FT3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_GAL_UK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_GAL_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_HL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_IN3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_L - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_M3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_MFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_ML - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_MM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_NL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_UL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- E_VOLUME_YD3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- EN_LETTER - Static variable in class org.lionsoul.jcseg.util.StringUtil
- EN_MAX_LEN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
maximum/minimum match length for English word extract
- EN_NUMERIC - Static variable in class org.lionsoul.jcseg.util.StringUtil
- EN_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- EN_PUNCTUATION - Static variable in class org.lionsoul.jcseg.util.StringUtil
- EN_SEC_MIN_LEN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
minimum length for the secondary segmentation word
- EN_SECOND_SEG - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to do the secondary split for complex Latin compose by the type of the chars
- EN_UNKNOW - Static variable in class org.lionsoul.jcseg.util.StringUtil
- EN_WHITESPACE - Static variable in class org.lionsoul.jcseg.util.StringUtil
- EN_WORD_SEG - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
do the English word extract
- endsWith(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
check if there is an element that ends with the specified string
- enQueue(int) - Method in class org.lionsoul.jcseg.util.IIntFIFO
-
add a new item to the queue
- enQueue(int) - Method in class org.lionsoul.jcseg.util.IIntQueue
-
append a int from the tail
- enSecondSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
Do the secondary split for the specified complex Latin word This will split a complex English, Arabic, punctuation compose word to multiple simple parts Like 'qq2013' will split to 'qq' and '2013'
- enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
- enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
- enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
interface to check and do the English secondary segmentation.
- Entity - Class in org.lionsoul.jcseg.segmenter
-
word item entity class
- Entity() - Constructor for class org.lionsoul.jcseg.segmenter.Entity
- EntityFormat - Class in org.lionsoul.jcseg.util
-
Entity format manager class
- EntityFormat() - Constructor for class org.lionsoul.jcseg.util.EntityFormat
- Entry(int, IIntFIFO.Entry) - Constructor for class org.lionsoul.jcseg.util.IIntFIFO.Entry
- Entry(int, IIntQueue.Entry, IIntQueue.Entry) - Constructor for class org.lionsoul.jcseg.util.IIntQueue.Entry
- Entry(T, IHashQueue.Entry<T>, IHashQueue.Entry<T>) - Constructor for class org.lionsoul.jcseg.util.IHashQueue.Entry
- enWordSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
-
Latin word lexicon based English word segmentation for search mode
- enWordSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
Latin word lexicon based English word segmentation.
- equals(Object) - Method in interface org.lionsoul.jcseg.IWord
-
you have to rewrite the #equals method cause the Jcseg require it
- equals(Object) - Method in class org.lionsoul.jcseg.segmenter.Word
F
- factory - Variable in enum org.lionsoul.jcseg.IDictionary.Type
- factory - Variable in enum org.lionsoul.jcseg.ISegment.Type
- fieldsArr - Static variable in class org.lionsoul.jcseg.segmenter.Entity
- fillDateTimePool(IWord[], int, IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
fill the date-time pool specified part with part index constant
- fillDateTimePool(IWord[], IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
fill the date-time pool specified part through the specified time entity string.
- fillTimeToPool(IWord[], String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
fill a date-time time part with a standard time format like '15:45:36' to the specified time pool
- filter(IWord) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
word item filter
- filter(IWord) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
word item filter
- findCHName(char[], int, IChunk) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find an Chinese name from the current position of the input chars
- first() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
always return the first char
- fromIndex(int) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
- fromIndex(int) - Static method in enum org.lionsoul.jcseg.ISegment.Type
- fromString(String) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
- fromString(String) - Static method in enum org.lionsoul.jcseg.ISegment.Type
- fromString(String, IDictionary.Type) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
-
get the Type with the specified string name
- fromString(String, ISegment.Type) - Static method in enum org.lionsoul.jcseg.ISegment.Type
-
get the Type with the specified string name
- fwsTohws(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
a static method to replace the full-width char to the half-width char in a given string (65281-65374 for full-width char)
G
- get(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- get(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
- get(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- get(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
-
return the IWord associate with the given key.
- get(String) - Static method in class org.lionsoul.jcseg.segmenter.Entity
-
get the entity string by the specified key.
- getAutoMinLength() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getAverageWordsLength() - Method in interface org.lionsoul.jcseg.IChunk
-
return the average word length for all the chunks.
- getAverageWordsLength() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.ComplexSeg
- getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
an abstract method to get word from the current position with MMSEG algorithm.
- getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.SimpleSeg
- getConfig() - Method in class org.lionsoul.jcseg.dic.ADictionary
- getConfig() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
get the current Segmenter Config instance
- getConfig() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
-
get the current task config instance
- getConfig() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- getConfig() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the current task configuration instance.
- getDateTimeIndex(String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
get and return the time part index of the specified IWord#entity
- getDelimiter() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
get the current delimiter
- getDELIMITER() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getDic() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
get the current dictionary instance
- getDic() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- getDict() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
-
get the current dictionary instance
- getDict() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the current dictionary instance.
- getEnCharType(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
get the type of the English char defined in this class and start with EN_.
- getEnMaxLen() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getEnSecondMinLen() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getEnSecondSeg() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getEntity() - Method in interface org.lionsoul.jcseg.IWord
-
get the entity name array of the word
- getEntity() - Method in class org.lionsoul.jcseg.segmenter.Word
- getEntity(int) - Method in interface org.lionsoul.jcseg.IWord
-
get the specified entity by index
- getEntity(int) - Method in class org.lionsoul.jcseg.segmenter.Word
- getFile() - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
- getFrequency() - Method in interface org.lionsoul.jcseg.IWord
-
return the frequency of the word, use only when the word's length is one.
- getFrequency() - Method in class org.lionsoul.jcseg.segmenter.Word
- getGRAM() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getIndex() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- getIndex(String) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
get the key's type index located in ILexicon interface
- getJarHome(Object) - Static method in class org.lionsoul.jcseg.util.Util
-
get the absolute parent path for the jar file.
- getKeyphrase(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getKeyphrase(Reader) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
get the keyphrase list from a reader
- getKeyphraseFromFile(String) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
get the keyphrase list from a file
- getKeyphraseFromString(String) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
get the keyphrase list from a string
- getKeySentence(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- getKeySentence(Reader) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get the key sentence from a reader
- getKeySentenceFromFile(String) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get key sentence from a file path
- getKeySentenceFromString(String) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get key sentence from a string
- getKeywords(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- getKeywords(Reader) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
get the keywords list from a reader
- getKeywordsFromFile(String) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
get the keywords list from a file
- getKeywordsFromString(String) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
get the keywords list from a string
- getKeywordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getKeywordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- getLargestAverageWordLengthChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
-
2.
- getLargestSingleMorphemicFreedomChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
-
the largest sum of degree of morphemic freedom of one-character words this rule will return the chunks that own the largest sum of degree of morphemic freedom of one-character
- getLastUpdateTime() - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
- getLength() - Method in interface org.lionsoul.jcseg.IChunk
-
return the length of the chunk(the number of the word)
- getLength() - Method in interface org.lionsoul.jcseg.IWord
-
return the length of the word
- getLength() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- getLength() - Method in class org.lionsoul.jcseg.segmenter.Word
- getLength() - Method in class org.lionsoul.jcseg.sentence.Sentence
- getLexiconPath() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
return the lexicon directory path
- getList() - Method in class org.lionsoul.jcseg.SynonymsEntry
-
return the synonyms list
- getMaxCnLnadron() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getMaximumMatchChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
-
1.
- getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- getMaxLength() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getMaxWordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getN() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- getNameSingleThreshold() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
-
get the next CJK word from the current position of the input stream and this function is the core part the most segmentation implements
- getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
- getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the next CJK word from the current position of the input stream
- getNextDatetimeWord(IWord, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
-
get and return the next date-time word
- getNextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the next Latin word from the current position of the input stream
- getNextMatch(int, char[], int, List<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
match the next CJK word in the dictionary
- getNextMixedWord(char[], int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the next mixed word, CJK-English or CJK-English-CJK or whatever
- getNextPunctuationPairWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
get the next punctuation pair word from the current position of the input stream.
- getNextTheWord(IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
-
get the next the_xxx word like '第x个', '第x集' EG ...
- getNextTimeMergedWord(IWord, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
-
get and return the next time merged date-time word
- getNextType(int, int, CharTypeFunction) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
-
common interface to get the next n-gram word for the specified char type.
- getNumericUnitComposedWord(int, IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
- getPairPunctuationText(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find pair punctuation of the given punctuation char the purpose is to get the text between them
- getParameter() - Method in interface org.lionsoul.jcseg.IWord
-
return the additional parameters
- getParameter() - Method in class org.lionsoul.jcseg.segmenter.Word
- getPartSpeech() - Method in interface org.lionsoul.jcseg.IWord
-
return the part of speech of the word.
- getPartSpeech() - Method in class org.lionsoul.jcseg.segmenter.Word
- getPinyin() - Method in interface org.lionsoul.jcseg.IWord
-
return the pinyin of the word
- getPinyin() - Method in class org.lionsoul.jcseg.segmenter.Word
- getPollTime() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getPosition() - Method in interface org.lionsoul.jcseg.IWord
-
return the start position of the word.
- getPosition() - Method in class org.lionsoul.jcseg.segmenter.Word
- getPosition() - Method in class org.lionsoul.jcseg.sentence.Sentence
- getPPTMaxLength() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getPropertieFile() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- getPunctuationPair(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
get the pair punctuation' pair
- getQueueSize() - Method in class org.lionsoul.jcseg.util.IPushbackReader
-
get the buffer size - the number of buffered data
- getRootWord() - Method in class org.lionsoul.jcseg.SynonymsEntry
-
return the base word
- getScore() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- getSeg() - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
- getSeg() - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
- getSentence() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- getSentenceNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- getSentenceSeg() - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
- getSingleWordsFrequency() - Method in interface org.lionsoul.jcseg.IChunk
- getSingleWordsFrequency() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- getSingleWordsMorphemicFreedom() - Method in interface org.lionsoul.jcseg.IChunk
-
return the degree of morphemic freedom for all the single words.
- getSingleWordsMorphemicFreedom() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- getSmallestVarianceWordLengthChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
-
the smallest variance word length this rule will the chunks that one the smallest variance word length
- getStreamPosition() - Method in interface org.lionsoul.jcseg.ISegment
-
get the current length of the stream
- getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
- getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
- getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
- getSummary(Reader, int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- getSummary(Reader, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get summary from a reader
- getSummaryFromFile(String, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get document summary from a file
- getSummaryFromString(String, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
get document summary from a string
- getSyn() - Method in interface org.lionsoul.jcseg.IWord
-
return the Synonyms' entry of the word.
- getSyn() - Method in class org.lionsoul.jcseg.segmenter.Word
- getTimeKey(int) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
get and return the time key part with the part index value
- getTimeKey(String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
-
get and return the time key part of the specified entity string
- getTimeKey(IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
- getType() - Method in interface org.lionsoul.jcseg.IWord
-
return the type of the word
- getType() - Method in class org.lionsoul.jcseg.segmenter.Word
- getValue() - Method in interface org.lionsoul.jcseg.IWord
-
return the value of the word
- getValue() - Method in class org.lionsoul.jcseg.segmenter.Word
- getValue() - Method in class org.lionsoul.jcseg.sentence.Sentence
- getWindowSize() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- getWindowSize() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- getWords() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- getWords() - Method in interface org.lionsoul.jcseg.IChunk
-
get the all the words in the chunk.
- getWords() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- getWordSeg() - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
- getWordsVariance() - Method in interface org.lionsoul.jcseg.IChunk
-
return the variance of all the words in all the chunks.
- getWordsVariance() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- gIsb - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
-
global string buffer
H
- hashCode() - Method in class org.lionsoul.jcseg.segmenter.Word
-
rewrite the hash code generate algorithm take the value as the main factor
- HASHMAP - org.lionsoul.jcseg.IDictionary.Type
- HASHMAP - Static variable in interface org.lionsoul.jcseg.IDictionary
-
Segmentation type constants
- HashMapDictionary - Class in org.lionsoul.jcseg.dic
-
Dictionary class
- HashMapDictionary(SegmenterConfig, Boolean) - Constructor for class org.lionsoul.jcseg.dic.HashMapDictionary
- hwsTofws(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
a static method to replace the half-width char to the full-width char in a given string
I
- I_CN_NAME - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
identify the Chinese name?
- iaList - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
- IChunk - Interface in org.lionsoul.jcseg
-
chunk interface for Jcseg.
- identifyCnName() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- IDictionary - Interface in org.lionsoul.jcseg
-
Common interface for Dictionary
- IDictionary.Type - Enum in org.lionsoul.jcseg
- idx - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
-
the index value of the current input stream mainly for track the start position of the token
- idx - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
-
the index value of the current input stream mainly for track the start position of the token
- idx - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
- IHashQueue<T extends IWord> - Class in org.lionsoul.jcseg.util
-
A normal queue base one single link but with hash index, so, it is fast for searching
- IHashQueue() - Constructor for class org.lionsoul.jcseg.util.IHashQueue
- IHashQueue.Entry<T> - Class in org.lionsoul.jcseg.util
-
innner Entry node class
- IIntFIFO - Class in org.lionsoul.jcseg.util
-
int first in first out queue base on single link
- IIntFIFO() - Constructor for class org.lionsoul.jcseg.util.IIntFIFO
- IIntFIFO.Entry - Class in org.lionsoul.jcseg.util
-
Item Entry inner class
- IIntQueue - Class in org.lionsoul.jcseg.util
-
char queue class base on double link Not thread safe
- IIntQueue() - Constructor for class org.lionsoul.jcseg.util.IIntQueue
- IIntQueue.Entry - Class in org.lionsoul.jcseg.util
-
innner Entry node class
- ILexicon - Interface in org.lionsoul.jcseg.dic
-
lexicon configuration class.
- implode(String, Object[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
String array implode internal method
- increase(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- increase(char, int) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- index - Variable in enum org.lionsoul.jcseg.IDictionary.Type
- index - Variable in enum org.lionsoul.jcseg.ISegment.Type
- indexOf(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
check and search the specified element in the Array
- insertionSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
-
insert sort method
- insertionSort(T[], int, int) - Static method in class org.lionsoul.jcseg.util.Sort
-
method to sort an subarray from start to end with insertion sort algorithm
- IntArrayList - Class in org.lionsoul.jcseg.util
-
array list for basic int data type to instead of ArrayList Well, this will save a lot work to Reopened and Unpacking
- IntArrayList() - Constructor for class org.lionsoul.jcseg.util.IntArrayList
- IntArrayList(int) - Constructor for class org.lionsoul.jcseg.util.IntArrayList
- IPushbackReader - Class in org.lionsoul.jcseg.util
-
IPushBackReader based on Reader Not thread safe support unlimited unread operation
- IPushbackReader(Reader) - Constructor for class org.lionsoul.jcseg.util.IPushbackReader
- is(int) - Method in interface org.lionsoul.jcseg.fi.CharTypeFunction
- isAutoFilter() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- isAutoload() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
about lexicon autoload
- isb - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
- isb - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
- isCJK(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isCJK(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the specified string is all CJK chars
- isCJKChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is CJK, Thai...
- isCNNumeric(char) - Static method in class org.lionsoul.jcseg.util.NumericUtil
-
check if the given char is a Chinese numeric or not
- isCNNumericString(String, int, int) - Static method in class org.lionsoul.jcseg.util.NumericUtil
-
check if the specified string is a Chinese numeric string
- isCnPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isDate(String, char) - Static method in class org.lionsoul.jcseg.util.EntityFormat
-
check if the specified string is an valid Latin Date string like "2017/02/22", "2017-02-22" or "2017.02.22"
- isDecimal(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isDecimal(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is a decimal including the full-width char
- isDigit(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isDigit(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is a digit or not true will return if it is or return false this method can recognize full-with char
- ISegment - Interface in org.lionsoul.jcseg
-
Jcseg segmentation interface
- ISegment.Type - Enum in org.lionsoul.jcseg
- isEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is a basic Latin and Russia and Greece letter.
- isENKeepPunctuaton(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the given char is English keep punctuation
- isEnLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
include the full-width and half-width char
- isEnNumeric(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is an English numeric(48-57) including the full-width char
- isEnPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the given char is half-width punctuation
- isEnWordSeg() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- isFWEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the given char is a full-width char AT+reader: the full-width punctuation is not included here
- isHWEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the given char is a half-width char or not
- isIpAddress(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
- isKeepEnSecOriginalWord() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- isKeepEnSegOriginalWord() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- isKeepPunctuation(char) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- isLatin(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isLatin(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the specified string is all Latin chars
- isLetter(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isLetter(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the specified string is Latin letter
- isLetterNumber(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is Letter number like 'ⅠⅡ' true will be return if it is, or return false
- isLetterOrNumeric(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isLetterOrNumeric(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the specified string is Latin numeric or letter
- isLowerCaseLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isMailAddress(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
-
check if the specified string is an email address or not
- isMobileNumber(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
-
check if the specified string is a mobile number
- isNoTailingPunctuation(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the given punctuation is the one that need to be cleared
- isNumeric(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isNumeric(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the specified string is Latin numeric
- isOtherNumber(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the specified char is other number like '①⑩⑽㈩' true will be return if it is, or return false
- isPairPunctuation(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the given char is pair punctuation or not
- isPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check if the given char is a punctuation
- isPunctuation(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isPunctuation(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
Check if the specified string is all punctuation chars (English and Chinese punctuation)
- isSync() - Method in class org.lionsoul.jcseg.dic.ADictionary
- isTime(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
-
check if the specified string is a valid time string like '12:45', '12:45:12'
- IStringBuffer - Class in org.lionsoul.jcseg.util
-
string buffer class
- IStringBuffer() - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
-
create a buffer with a default length 16
- IStringBuffer(int) - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
-
create a buffer with a specified length
- IStringBuffer(String) - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
-
create a buffer with a specified string
- isUpperCaseLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
- isUrlAddress(String, ADictionary) - Static method in class org.lionsoul.jcseg.util.EntityFormat
-
check if the specified string is an URL address or not
- isWhitespace(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
check the given string is a whitespace
- IWord - Interface in org.lionsoul.jcseg
-
Word Token interface
J
- JcsegTest - Class in org.lionsoul.jcseg.test
-
Jcseg test program.
- JcsegTest() - Constructor for class org.lionsoul.jcseg.test.JcsegTest
K
- K1 - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- KEEP_UNREG_WORDS - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- keepUnregWords() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- keyphrase(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
-
keyphrase extractor
- KeyphraseExtractor - Class in org.lionsoul.jcseg.extractor
-
key phrase extractor
- KeyphraseExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
construct method
- keywords(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
-
keywords extractor
- KeywordsExtractor - Class in org.lionsoul.jcseg.extractor
-
document keywords extractor
- KeywordsExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
construct method
- keywordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- keywordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
L
- ladCJKPos() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- last() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
always return the last char
- latinIndexOf(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
- latinIndexOf(String, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
-
get the index of the first Latin char of the specified string
- length() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
return the length of the buffer
- LEX_PROPERTY_FILE - Static variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
default lexicon property file name
- load(File) - Method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words from a specified lexicon file
- load(InputStream) - Method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words from a specified lexicon input stream
- load(InputStream) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
initialize the value of its options from a InputStream of a jcseg.properties file
- load(String) - Method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words from a specified lexicon path
- load(String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
initialize the value of its options from a specified jcseg.properties propertie file
- LOAD_CJK_ENTITY - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to load the entity define
- LOAD_CJK_PINYIN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to load the Pinyin of the CJK_WORDS
- LOAD_CJK_POS - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to load the word's part of speech
- LOAD_CJK_SYN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to load the synonym word of the CJK_WORDS.
- LOAD_PARAMETER - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
whether to load the self-define parameter
- loadCJKEntity() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- loadCJKPinyin() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- loadCJKSyn() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- loadClassPath() - Method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words from all the files under the specified class path.
- loadDirectory(String) - Method in class org.lionsoul.jcseg.dic.ADictionary
-
load the all the words from all the files under a specified lexicon directory
- loadWords(SegmenterConfig, ADictionary, File, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words in the specified lexicon file into the dictionary
- loadWords(SegmenterConfig, ADictionary, InputStream, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
load words from a InputStream
- loadWords(SegmenterConfig, ADictionary, String, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
load all the words from a specified lexicon file path
M
- main(String[]) - Static method in class org.lionsoul.jcseg.test.JcsegTest
- MAPPING - Static variable in enum org.lionsoul.jcseg.IDictionary.Type
-
the type index and type mapping for quick get the type by type index number.
- MAPPING - Static variable in enum org.lionsoul.jcseg.ISegment.Type
-
the type index and type mapping for quick get the type by type index number.
- match(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- match(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
-
loop up the dictionary, check the given key is in the dictionary or not
- MAX_CN_LNADRON - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
the max length for the adron of the Chinese last name.like 老陈 “老”
- MAX_LATIN_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
maximum length for Latin words
- MAX_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
maximum length for maximum match(5-7)
- MAX_UNIT_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
maximum length for unit words for the NLP algorithm added at 2016/11/18
- maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- maxWordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
-
max phrase length
- mergeSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
-
merge sort algorithm
- MIX_ASSIST_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
special lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognition
- MIX_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- mixPrefixLength - Variable in class org.lionsoul.jcseg.dic.ADictionary
- mixSuffixLength - Variable in class org.lionsoul.jcseg.dic.ADictionary
-
maximum length for the Chinese words after the LATIN word or the one before it used to match Chinese and English mix word, like 'B超,AA制...' or style compose style like '卡拉ok'.
- MMSegFilter - Class in org.lionsoul.jcseg.segmenter
-
MMSeg default filter class
- MMSegFilter() - Constructor for class org.lionsoul.jcseg.segmenter.MMSegFilter
- MOST - org.lionsoul.jcseg.ISegment.Type
- MOST - Static variable in interface org.lionsoul.jcseg.ISegment
- MOST_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- MostSeg - Class in org.lionsoul.jcseg.segmenter
-
Most mode implementation which all the possible combinations will be returned, and build it for information retrieval (better for index) of course.
- MostSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.MostSeg
N
- N - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
-
The N for n-gram, default to 1 and that is uni-gram
- name - Variable in enum org.lionsoul.jcseg.IDictionary.Type
- name - Variable in enum org.lionsoul.jcseg.ISegment.Type
- NAME_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- NAME_SINGLE_THRESHOLD - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
the threshold of the single word that is a single word when it and the last char of the name make up a word.
- next - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
- next - Variable in class org.lionsoul.jcseg.util.IIntFIFO.Entry
- next - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
- next() - Method in interface org.lionsoul.jcseg.ISegment
-
segment a word from a char array from a specified position.
- next() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
- next() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
- next() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- next() - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
-
Override the next method to add the date-time entity recognition And we also invoke the parent.next method to get the next token
- next() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
- next() - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
-
get the next sentence
- nextCJKSentence(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
load a CJK char list from the stream start from the current position till the char is not a CJK char
- nextCNNumeric(char[], int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find the Chinese number from the current position count until the char in the specified position is not a other number or whitespace
- nextLatinString(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
the simple version of the next basic Latin fetch logic Just return the next Latin string with the keep punctuation after it
- nextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
-
find the letter or digit word from the current position count until the char is whitespace or not letter_digit
- nextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find the letter or digit word from the current position count until the char is whitespace or not letter_digit
- nextLetterNumber(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find the next other letter from the current position find the letter number from the current position count until the char in the specified position is not a letter number or whitespace
- nextOtherNumber(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
find the other number from the current position count until the char in the specified position is not a other number or whitespace
- NGRAM - org.lionsoul.jcseg.ISegment.Type
- NGRAM - Static variable in interface org.lionsoul.jcseg.ISegment
- NGRAM_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- NGramSeg - Class in org.lionsoul.jcseg.segmenter
-
Jcseg n-gram tokenizer implementation
- NGramSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.NGramSeg
-
method to create a new ISegment
- NLP - org.lionsoul.jcseg.ISegment.Type
- NLP - Static variable in interface org.lionsoul.jcseg.ISegment
- NLP_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
- NLPSeg - Class in org.lionsoul.jcseg.segmenter
-
NLP segmentation implementation And this extends all the properties of the Complex one the rest of them are build for NLP only
- NLPSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.NLPSeg
- NUMBER_UNIT - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
- NUMERIC_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- NumericUtil - Class in org.lionsoul.jcseg.util
-
a class to deal with Chinese numeric
- NumericUtil() - Constructor for class org.lionsoul.jcseg.util.NumericUtil
O
- org.lionsoul.jcseg - package org.lionsoul.jcseg
- org.lionsoul.jcseg.dic - package org.lionsoul.jcseg.dic
- org.lionsoul.jcseg.extractor - package org.lionsoul.jcseg.extractor
- org.lionsoul.jcseg.extractor.impl - package org.lionsoul.jcseg.extractor.impl
- org.lionsoul.jcseg.fi - package org.lionsoul.jcseg.fi
- org.lionsoul.jcseg.segmenter - package org.lionsoul.jcseg.segmenter
- org.lionsoul.jcseg.sentence - package org.lionsoul.jcseg.sentence
- org.lionsoul.jcseg.test - package org.lionsoul.jcseg.test
- org.lionsoul.jcseg.util - package org.lionsoul.jcseg.util
P
- PPT_MAX_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
the maximum length for the text between the pair punctuation.
- PPT_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- prev - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
- prev - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
- printChunks(String, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.ComplexSeg
- printMatrix(double[][]) - Static method in class org.lionsoul.jcseg.util.Util
-
print the specified matrix
- PUNCTUATION - Static variable in interface org.lionsoul.jcseg.IWord
- pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
push back the data to the stream
- pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
-
push back the data to the stream
- pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
-
push back the data to the stream
- pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
push back the data to the stream.
- pushBack(String) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
push back a string to the stream
Q
- qCNNumericToArabic(String) - Static method in class org.lionsoul.jcseg.util.NumericUtil
- QUANTIFIER - Static variable in interface org.lionsoul.jcseg.IWord
- quickSelect(T[], int) - Static method in class org.lionsoul.jcseg.util.Sort
-
quick select algorithm
- quicksort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
-
quick sort algorithm
R
- read() - Method in class org.lionsoul.jcseg.util.IPushbackReader
-
read the next int from the stream this will check the buffer queue first and take the first item of the buffer as the result
- read(char[], int, int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
-
read the specified block from the stream
- reader - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
- reader - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
- reader - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
- readNext() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
read the next char from the current position
- readNext() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
-
read the next char from the current position
- readNext() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
-
read the next char from the current position
- readNext() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
read the next char from the current position
- readNext() - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
-
read the next char from the current position
- readUntil(char) - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
-
loop the reader until the specified char is found.
- remove() - Method in class org.lionsoul.jcseg.util.IHashQueue
-
remove the node from the head and you should make sure the size is larger than 0 by calling size() before you invoke the method or you will just get null.
- remove(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
-
remove the element at the specified position use System.arraycopy instead of a loop may be more efficient
- remove(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- remove(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
-
remove the mapping associate with the given key
- reset(Reader) - Method in interface org.lionsoul.jcseg.ISegment
-
reset the reader
- reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
- reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
- reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
input stream and reader reset.
- reset(Reader) - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
-
stream/reader reset.
- resetMode(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
- resetPrefixLength(SegmenterConfig, ADictionary, int) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
check and reset the value of
ADictionary.mixPrefixLength - resetSuffixLength(SegmenterConfig, ADictionary, int) - Static method in class org.lionsoul.jcseg.dic.ADictionary
-
check and reset the value of the
ADictionary.mixSuffixLength - resetSynonymsNet() - Method in class org.lionsoul.jcseg.dic.ADictionary
-
1, synonyms words to synonyms entry 2, loop each synonyms word and set the IWord#synEntry 3, clear the synonyms buffer
- rootWord - Variable in class org.lionsoul.jcseg.SynonymsEntry
-
the Synonyms root word
S
- seg - Variable in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
-
the ISegment object
- seg - Variable in class org.lionsoul.jcseg.extractor.KeywordsExtractor
-
the ISegment object
- SegKit - Class in org.lionsoul.jcseg.segmenter
-
abstract segmentor kit class
- SegKit() - Constructor for class org.lionsoul.jcseg.segmenter.SegKit
- Segmenter - Class in org.lionsoul.jcseg.segmenter
-
abstract segmentation super class: 1.
- Segmenter(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.Segmenter
-
initialize the segment
- SegmenterConfig - Class in org.lionsoul.jcseg.segmenter
-
Jcseg segmenter configuration class
- SegmenterConfig() - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
create the config and do nothing about initialize Note: this may cuz Incompatibility problems for the old version that has use this construct method
- SegmenterConfig(boolean) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
create and initialize the config by autoload
- SegmenterConfig(InputStream) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
create and initialize the task config from a InputStream
- SegmenterConfig(String) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
create and initialize the task config from a properties file
- SegmenterFunction - Interface in org.lionsoul.jcseg.fi
- sentence(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
-
key sentence extractor
- Sentence - Class in org.lionsoul.jcseg.sentence
-
sentence desc class
- Sentence(String) - Constructor for class org.lionsoul.jcseg.sentence.Sentence
- Sentence(String, int) - Constructor for class org.lionsoul.jcseg.sentence.Sentence
-
construct method
- sentenceNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- sentenceSeg - Variable in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
sentence splitter object
- SentenceSeg - Class in org.lionsoul.jcseg.sentence
-
document sentence splitter
- SentenceSeg() - Constructor for class org.lionsoul.jcseg.sentence.SentenceSeg
- SentenceSeg(Reader) - Constructor for class org.lionsoul.jcseg.sentence.SentenceSeg
-
construct method
- set(int, char) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
set the char at the specified index
- set(int, int) - Method in class org.lionsoul.jcseg.util.IntArrayList
- set(String, String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
-
set the option value from a specified key and value define in jcseg.properties
- setAppendCJKPinyin(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setAppendCJKSyn(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setAppendPartOfSpeech(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setAutoFilter(boolean) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- setAutoload(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setAutoMinLength(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- setClearStopwords(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setCnFactionToArabic(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setCnNumToArabic(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setDelimiter(char) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
-
set the delimiter default to whitespace
- setDELIMITER(char) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setEnMaxLen(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setEnSecondMinLen(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setEnSecondSeg(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setEntity(String[]) - Method in interface org.lionsoul.jcseg.IWord
-
set the entity name array of the word
- setEntity(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
- setEntityForNull(String[]) - Method in interface org.lionsoul.jcseg.IWord
- setEntityForNull(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
- setEnWordSeg(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setFrequency(int) - Method in interface org.lionsoul.jcseg.IWord
-
set the frequency of the word
- setFrequency(int) - Method in class org.lionsoul.jcseg.segmenter.Word
- setGRAM(byte) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setICnName(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setIndex(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- setKeepEnSecOriginalWord(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setKeepEnSegOriginalWord(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setKeepPunctuations(String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setKeepUnregWords(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setKeywordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- setKeywordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- setLastUpdateTime(long) - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
- setLength(int) - Method in interface org.lionsoul.jcseg.IWord
-
self define the length
- setLength(int) - Method in class org.lionsoul.jcseg.segmenter.Word
- setLength(int) - Method in class org.lionsoul.jcseg.sentence.Sentence
- setLength(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
set the length of the buffer actually it just override the count and the actual buffer has nothing changed
- setLexiconPath(String[]) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setLoadCJKPinyin(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setLoadCJKPos(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setLoadCJKSyn(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setLoadEntity(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setMaxCnLnadron(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- setMaxLength(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setMaxWordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- setN(byte) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
- setNameSingleThreshold(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setParameter(String) - Method in interface org.lionsoul.jcseg.IWord
-
set the string parameters
- setParameter(String) - Method in class org.lionsoul.jcseg.segmenter.Word
- setPartSpeech(String[]) - Method in interface org.lionsoul.jcseg.IWord
- setPartSpeech(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
- setPartSpeechForNull(String[]) - Method in interface org.lionsoul.jcseg.IWord
- setPartSpeechForNull(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
- setPinyin(String) - Method in interface org.lionsoul.jcseg.IWord
-
set the pinyin of the word
- setPinyin(String) - Method in class org.lionsoul.jcseg.segmenter.Word
- setPollTime(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setPosition(int) - Method in interface org.lionsoul.jcseg.IWord
-
set the position of the word
- setPosition(int) - Method in class org.lionsoul.jcseg.segmenter.Word
- setPosition(int) - Method in class org.lionsoul.jcseg.sentence.Sentence
- setPPT_MAX_LENGTH(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
- setRootWord(IWord) - Method in class org.lionsoul.jcseg.SynonymsEntry
- setScore(double) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- setSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
- setSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
- setSentence(Sentence) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- setSentenceNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- setSentenceSeg(SentenceSeg) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
- setSyn(SynonymsEntry) - Method in interface org.lionsoul.jcseg.IWord
- setSyn(SynonymsEntry) - Method in class org.lionsoul.jcseg.segmenter.Word
- setType(int) - Method in interface org.lionsoul.jcseg.IWord
-
set the type of the word
- setType(int) - Method in class org.lionsoul.jcseg.segmenter.Word
- setValue(String) - Method in class org.lionsoul.jcseg.sentence.Sentence
- setWindowSize(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- setWindowSize(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- setWords(List<IWord>) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
- setWordSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
- shellSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
-
shell sort algorithm
- SIMPLE - org.lionsoul.jcseg.ISegment.Type
- SIMPLE - Static variable in interface org.lionsoul.jcseg.ISegment
-
Segmentation type constants
- SIMPLE_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
-
Segmentation type index
- SimpleSeg - Class in org.lionsoul.jcseg.segmenter
-
Jcseg simple segmentation implementation
- SimpleSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.SimpleSeg
- SIMSTR - Static variable in class org.lionsoul.jcseg.util.STConverter
- SimToTraditional(String) - Static method in class org.lionsoul.jcseg.util.STConverter
-
convert the simplified words to traditional words of the specified string.
- SimToTraditional(String, IStringBuffer) - Static method in class org.lionsoul.jcseg.util.STConverter
- size() - Method in class org.lionsoul.jcseg.SynonymsEntry
- size() - Method in class org.lionsoul.jcseg.util.IHashQueue
-
get the size of the queue
- size() - Method in class org.lionsoul.jcseg.util.IIntFIFO
-
get the size of the queue
- size() - Method in class org.lionsoul.jcseg.util.IIntQueue
-
get the size of the queue
- size() - Method in class org.lionsoul.jcseg.util.IntArrayList
- size(int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
- size(int) - Method in interface org.lionsoul.jcseg.IDictionary
-
return the size of the dictionary
- Sort - Class in org.lionsoul.jcseg.util
-
All kind of Sort algorithm implemented method use the default compare method
- Sort() - Constructor for class org.lionsoul.jcseg.util.Sort
- START_SS_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
-
Whether to start the Latin secondary segmentation.
- startAutoload() - Method in class org.lionsoul.jcseg.dic.ADictionary
-
start the lexicon autoload thread
- startsWith(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
check if there is an element that starts with the specified string
- STConverter - Class in org.lionsoul.jcseg.util
-
Simplified and traditional chinese convert class all the search work base on
String.indexOf(int)you may store all the words in a HashMap for the purpuse of a faster fetch - STConverter() - Constructor for class org.lionsoul.jcseg.util.STConverter
- STOP_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
stop words
- stopAutoload() - Method in class org.lionsoul.jcseg.dic.ADictionary
- streamResetTo(String, int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
-
reset the data back from the specified position
- StringUtil - Class in org.lionsoul.jcseg.util
-
a class to deal with the English stop char like the English punctuation
- StringUtil() - Constructor for class org.lionsoul.jcseg.util.StringUtil
- subWordPool - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
- summary(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
-
summary extractor
- SummaryExtractor - Class in org.lionsoul.jcseg.extractor
-
document summary extractor
- SummaryExtractor(ISegment, SentenceSeg) - Constructor for class org.lionsoul.jcseg.extractor.SummaryExtractor
-
construct method
- sync - Variable in class org.lionsoul.jcseg.dic.ADictionary
- SynonymsEntry - Class in org.lionsoul.jcseg
-
Synonyms words entry class
- SynonymsEntry() - Constructor for class org.lionsoul.jcseg.SynonymsEntry
- SynonymsEntry(IWord) - Constructor for class org.lionsoul.jcseg.SynonymsEntry
T
- T_BASIC_LATIN - Static variable in interface org.lionsoul.jcseg.IWord
-
Latin series including the Arabic numbers.
- T_CJK_PINYIN - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese Pinyin
- T_CJK_WORD - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese,Japanese,Korean words
- T_CN_NAME - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese last name.
- T_CN_NICKNAME - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese nickname like: 老陈
- T_CN_NUMERIC - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese numeric
- T_LEN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
- T_LETTER_NUMBER - Static variable in interface org.lionsoul.jcseg.IWord
-
letter number like 'ⅠⅡ'
- T_MIXED_WORD - Static variable in interface org.lionsoul.jcseg.IWord
-
Chinese and English mix word like B超,SIM卡.
- T_OTHER_NUMBER - Static variable in interface org.lionsoul.jcseg.IWord
-
other number like '①⑩⑽㈩'
- T_PUNCTUATION - Static variable in interface org.lionsoul.jcseg.IWord
- T_UNRECOGNIZE_WORD - Static variable in interface org.lionsoul.jcseg.IWord
-
useless chars like the CJK punctuation
- TextRankKeyphraseExtractor - Class in org.lionsoul.jcseg.extractor.impl
-
document key phrase extractor base on textRank algorithm
- TextRankKeyphraseExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- TextRankKeywordsExtractor - Class in org.lionsoul.jcseg.extractor.impl
-
document keywords extractor base on textRank algorithm
- TextRankKeywordsExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- textRankSortedDocuments(List<Sentence>, List<List<IWord>>) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
-
get the documents order by relevance score.
- TextRankSummaryExtractor - Class in org.lionsoul.jcseg.extractor.impl
-
TextRank summary extractor base on textRank algorithm
- TextRankSummaryExtractor(ISegment, SentenceSeg) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
- TextRankSummaryExtractor.Document - Class in org.lionsoul.jcseg.extractor.impl
-
summary document inner class
- TIME_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
- TimeUtil - Class in org.lionsoul.jcseg.util
-
Time Util class
- TimeUtil() - Constructor for class org.lionsoul.jcseg.util.TimeUtil
- toJsonArray(String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
- toJsonObject(String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
-
implode the array elements as a Json array string
- tokenize(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
-
string tokenize handler
- toLowerCase(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
- toString() - Method in class org.lionsoul.jcseg.segmenter.Chunk
- toString() - Method in class org.lionsoul.jcseg.segmenter.Word
- toString() - Method in class org.lionsoul.jcseg.sentence.Sentence
-
rewrite the toString method
- toString() - Method in class org.lionsoul.jcseg.util.ByteCharCounter
- toString() - Method in class org.lionsoul.jcseg.util.IStringBuffer
-
return the string of the current buffer
- toUpperCase(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
- TRASTR - Static variable in class org.lionsoul.jcseg.util.STConverter
- TraToSimplified(String) - Static method in class org.lionsoul.jcseg.util.STConverter
-
convert the traditional words to simplified words of the specified string.
- TraToSimplified(String, IStringBuffer) - Static method in class org.lionsoul.jcseg.util.STConverter
U
- UNMATCH_CJK_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
-
unmatched word
- unread(char[], int, int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
-
unread a block from a char array to the stream
- unread(int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
-
unread the specified data to the stream push the data back to the queue in fact, you know
- UNRECOGNIZE - Static variable in interface org.lionsoul.jcseg.IWord
- Util - Class in org.lionsoul.jcseg.util
-
static method for jcseg.
- Util() - Constructor for class org.lionsoul.jcseg.util.Util
V
- valueOf(String) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.lionsoul.jcseg.ISegment.Type
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.lionsoul.jcseg.IDictionary.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.lionsoul.jcseg.ISegment.Type
-
Returns an array containing the constants of this enum type, in the order they are declared.
W
- windowSize - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
- windowSize - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
- Word - Class in org.lionsoul.jcseg.segmenter
-
word class for Jcseg with the
IWordinterface implemented at 2017/03/29: make the synonym series methodWord.getSyn()Word.setSyn(SynonymsEntry)and the part of speech series methodWord.getPartSpeech()Word.setPartSpeech(String[])Word.addPartSpeech(String)and theWord.clone()method synchronized for might happen concurrent access. - Word(String, int) - Constructor for class org.lionsoul.jcseg.segmenter.Word
- Word(String, int, int) - Constructor for class org.lionsoul.jcseg.segmenter.Word
- Word(String, int, int, String[]) - Constructor for class org.lionsoul.jcseg.segmenter.Word
-
construct method to initialize the newly created Word instance
- Word(String, int, String[]) - Constructor for class org.lionsoul.jcseg.segmenter.Word
- wordNewOrClone(int, String, int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
-
check if the specified word is existed in a specified dictionary and if does clone it or create a new one.
- wordNewOrClone(int, String, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
-
check if the specified word is existed in a specified dictionary and if it does clone it or create a new one.
- wordPool - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
- wordPool - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
- wordPool - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
-
CJK word cache pool, Reusable string buffer
- wordPool - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
-
CJK word cache pool, Reusable string buffer and the array list for basic integer
- wordSeg - Variable in class org.lionsoul.jcseg.extractor.SummaryExtractor
-
ISegment word tokenizer object
_
- __toString() - Method in class org.lionsoul.jcseg.segmenter.Word
-
for debug only
All Classes All Packages