A B C D E F G H I J K L M N O P Q R S T U V W _ 
All Classes All Packages

A

add(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
Append a new Integer to the end.
add(int, String, int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
add(int, String, int) - Method in interface org.lionsoul.jcseg.IDictionary
add a new word to the dictionary
add(int, String, int, int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
add(int, String, int, int) - Method in interface org.lionsoul.jcseg.IDictionary
add a new word to the dictionary
add(int, String, int, int, String[]) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
add(int, String, int, int, String[]) - Method in interface org.lionsoul.jcseg.IDictionary
add a new word to the dictionary with its statistics frequency
add(int, String, int, String[]) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
add(int, String, int, String[]) - Method in interface org.lionsoul.jcseg.IDictionary
add a new word to the dictionary
add(int, IWord) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
add(int, IWord) - Method in interface org.lionsoul.jcseg.IDictionary
directly add a IWord item to the dictionary
add(IWord) - Method in class org.lionsoul.jcseg.SynonymsEntry
add a new synonyms word and the newly added word will extend the part of speech and the entity from the base word if there are not set
add(T) - Method in class org.lionsoul.jcseg.util.IHashQueue
append a item from the tail
addEntity(String) - Method in interface org.lionsoul.jcseg.IWord
append a new entity
addEntity(String) - Method in class org.lionsoul.jcseg.segmenter.Word
 
addPartSpeech(String) - Method in interface org.lionsoul.jcseg.IWord
add a new part to speech to the word.
addPartSpeech(String) - Method in class org.lionsoul.jcseg.segmenter.Word
 
ADictionary - Class in org.lionsoul.jcseg.dic
Dictionary abstract super class
ADictionary(SegmenterConfig, Boolean) - Constructor for class org.lionsoul.jcseg.dic.ADictionary
initialize the ADictionary
AL_TODO_FILE - Static variable in class org.lionsoul.jcseg.dic.ADictionary
the default autoload task file name
append(boolean) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a boolean value
append(char) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a char to the buffer
append(char[]) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append some chars to the buffer
append(char[], int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append the rest of the chars to the buffer
append(char[], int, int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append parts of the chars to the buffer
append(double) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a double value
append(float) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a float value
append(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a int value
append(long) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a long value
append(short) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a short value
append(String) - Method in class org.lionsoul.jcseg.util.IStringBuffer
append a string to the buffer
APPEND_CJK_ENTITY - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
do the entity recognition ?
APPEND_CJK_PINYIN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
append the Pinyin to the result
APPEND_CJK_SYN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
append the syn word to the result.
APPEND_PART_OF_SPEECH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
append the part of speech.
appendCJKPinyin() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
appendCJKSyn() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
appendCJKWordFeatures(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
check and append the pinyin and the synonyms words of the specified word
appendLatinWordFeatures(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
Check and append the synonyms/pinyin words of specified word included the CJK and basic Latin words All the synonyms words share the same position part of speech, word type with the primitive word
appendPinyin(SegmenterConfig, LinkedList<IWord>, IWord) - Static method in class org.lionsoul.jcseg.segmenter.SegKit
Quick interface to do the pinyin word append.
appendSynonyms(SegmenterConfig, LinkedList<IWord>, IWord) - Static method in class org.lionsoul.jcseg.segmenter.SegKit
quick interface to do the synonyms word append You got check if the specified word has any synonyms first
ArrayUtil - Class in org.lionsoul.jcseg.util
Array util class
ArrayUtil() - Constructor for class org.lionsoul.jcseg.util.ArrayUtil
 
autoFilter - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
auto filter the words with low score
autoLoad() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
initialize the value of its options by auto searching the jcesg.properties file:
AutoLoadFile - Class in org.lionsoul.jcseg.dic
AutoLoad file to describle the autoload configuration files
AutoLoadFile(String) - Constructor for class org.lionsoul.jcseg.dic.AutoLoadFile
 
autoMinLength - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
auto append the words with a length over the specified value as a phrase

B

B - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
behindLatin - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
global behind Latin word after the CJK word added at 2016/11/22 for better mixed word implementation
bucketSort(int[], int) - Static method in class org.lionsoul.jcseg.util.Sort
bucket sort algorithm
bucketSort(Integer[], int) - Static method in class org.lionsoul.jcseg.util.Sort
bucket sort algorithm
buffer() - Method in class org.lionsoul.jcseg.util.IStringBuffer
return the chars of the buffer
ByteCharCounter - Class in org.lionsoul.jcseg.util
All Basic printable Latin char counter class include all the English punctuation and the letters
ByteCharCounter() - Constructor for class org.lionsoul.jcseg.util.ByteCharCounter
 

C

charAt(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
get the char at a specified position in the buffer
CharTypeFunction - Interface in org.lionsoul.jcseg.fi
 
CHECK_CE_MASk - Static variable in interface org.lionsoul.jcseg.ISegment
Whether to check the Chinese and English mixed word.
CHECK_CF_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
Whether to check the Chinese fraction.
CHECK_EC_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
Whether to check the English Chinese mixed suffix For the new implementation of the mixed word recognition Added at 2016/11/22
Chunk - Class in org.lionsoul.jcseg.segmenter
chunk concept for the mmseg word segmentation algorithm
Chunk(IWord[]) - Constructor for class org.lionsoul.jcseg.segmenter.Chunk
 
CJK_CHAR - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
CJK single word
CJK_SYN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
CJK synonyms
CJK_UNIT - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
Chinese single units
CJK_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
Chinese, Japanese, Korean words Note: since version 2.0.1 the Chinese English mixed words and the English punctuation mixed words are all included here.
CJKIndexOf(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
CJKIndexOf(String, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
get the index of the first CJK char of the specified string
clear() - Method in class org.lionsoul.jcseg.util.IntArrayList
 
clear() - Method in class org.lionsoul.jcseg.util.IStringBuffer
clear the buffer by reset the count to 0
CLEAR_STOPWORD - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
clear away the stop word.
clearStopwords() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
clone() - Method in interface org.lionsoul.jcseg.IWord
make clone available
clone() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
rewrite the clone method
clone() - Method in class org.lionsoul.jcseg.segmenter.Word
Interface to clone the current object
CN_DNAME_1 - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
first word of Chinese double name
CN_DNAME_2 - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
second word of Chinese double name
CN_LNAME - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
Chinese last name
CN_LNAME_ADORN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
the adorn(修饰) char before the last name like word "老陈", "小陈"
CN_SNAME - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
Chinese single name
CNFRA_TO_ARABIC - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
Chinese fraction to Arabic fraction .
cnFractionToArabic() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
CNNUM_TO_ARABIC - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
Chinese numeric to Arabic .
cnNumericToArabic(String, boolean) - Static method in class org.lionsoul.jcseg.util.NumericUtil
a static method to turn the Chinese numeric to Arabic numbers
cnNumToArabic() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
compareTo(TextRankSummaryExtractor.Document) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
override the compareTo method compare document with its relevance score
COMPLEX - org.lionsoul.jcseg.ISegment.Type
 
COMPLEX - Static variable in interface org.lionsoul.jcseg.ISegment
 
COMPLEX_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
ComplexSeg - Class in org.lionsoul.jcseg.segmenter
Jcseg complex segmentation implementation based on the filter works of MMSeg rules:
ComplexSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.ComplexSeg
 
config - Variable in class org.lionsoul.jcseg.dic.ADictionary
 
config - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
 
config - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
 
config - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
 
config - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
 
contains(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
check if there is an element that contains the specified string
contains(T) - Method in class org.lionsoul.jcseg.util.IHashQueue
check the specifield T is aleady exists in the queue or not
create(SegmenterConfig, boolean) - Method in interface org.lionsoul.jcseg.fi.DictionaryFunction
 
create(SegmenterConfig, ADictionary) - Method in interface org.lionsoul.jcseg.fi.SegmenterFunction
 
createDateTimePool() - Static method in class org.lionsoul.jcseg.util.TimeUtil
create and return a date-time pool
createDefaultDictionary(SegmenterConfig) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
create the ADictionary according to the JcsegTaskConfig check and load the lexicon by default
createDefaultDictionary(SegmenterConfig, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
create the ADictionary according to the JcsegTaskConfig
createDefaultDictionary(SegmenterConfig, boolean, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
create a default ADictionary instance: 1.
createSingletonDictionary(SegmenterConfig) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
create a singleton ADictionary object according to the JcsegTaskConfig check and load the lexicon by default
createSingletonDictionary(SegmenterConfig, boolean) - Static method in class org.lionsoul.jcseg.dic.DictionaryFactory
create a singleton ADictionary object according to the SegmentConfig
ctrlMask - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
segmentation runtime function control mask

D

D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
D - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
data - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
 
data - Variable in class org.lionsoul.jcseg.util.IIntFIFO.Entry
 
data - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
 
DATETIME_A - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_D - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_H - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_I - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_M - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_NONE - Static variable in class org.lionsoul.jcseg.util.TimeUtil
date-time part index constants we consider a date-time as the following seven parts: +------+-------+-----+---------------+------+--------+--------+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | +------+-------+-----+---------------+------+--------+--------+ | year | month | day | timing method | hour | minute | second | +------+-------+-----+---------------+------+--------+--------+ and the numeric value before every part.
DATETIME_S - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
DATETIME_Y - Static variable in class org.lionsoul.jcseg.util.TimeUtil
 
decrease(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
decrease(char, int) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
deleteCharAt(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
delete the char at the specified position
DELIMITER - org.lionsoul.jcseg.ISegment.Type
 
DELIMITER - Static variable in interface org.lionsoul.jcseg.ISegment
 
DELIMITER_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
DelimiterSeg - Class in org.lionsoul.jcseg.segmenter
delimiter segment algorithm implementation extended from common segment interface ISegment
DelimiterSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.DelimiterSeg
method to create a new ISegment
deQueue() - Method in class org.lionsoul.jcseg.util.IIntFIFO
remove the first item from the queue
deQueue() - Method in class org.lionsoul.jcseg.util.IIntQueue
remove the node from the head and you should make sure the size is larger than 0 by calling size() before you invoke the method or you will just get -1
DETECT - org.lionsoul.jcseg.ISegment.Type
 
DETECT - Static variable in interface org.lionsoul.jcseg.ISegment
 
DETECT_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
DetectSeg - Class in org.lionsoul.jcseg.segmenter
Detect segmentation mode return words only in the loaded dictionary yat, when matched a word and return it or continue to find the next word in the dictionary
DetectSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.DetectSeg
method to create a new ISegment
dic - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
the dictionary and task configuration
dic - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
the dictionary and task configuration
dic - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
the dictionary and task configuration
dic - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
the dictionary and task configuration instance
DictionaryFactory - Class in org.lionsoul.jcseg.dic
Dictionary Factory to create Dictionary instance a path of the class that has extends the ADictionary class must be given first
DictionaryFunction - Interface in org.lionsoul.jcseg.fi
 
dictMap - Variable in class org.lionsoul.jcseg.dic.HashMapDictionary
hash table for the words
Document(int, Sentence, List<IWord>, double) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
construct method
DOMAIN_SUFFIX - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
domain name suffix dictionary for the URL recognition

E

E_ANGLE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_360 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_90 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_DU - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_FEN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_GON - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_MRAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_ANGLE_RAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_ACRE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_ARE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_CM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_DM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_FT2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_HA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_IN2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_KM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_M2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_MM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_MU - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_NM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_QING - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_SQ_FT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_SQ_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_SQ_MI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_SQ_RD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_SQ_YD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_AREA_UM2 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_AH - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_AHI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_AHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_D - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_H - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_HI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_HI_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_HIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_HIS_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_I - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_M - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_P - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_S - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_Y - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YM_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YMD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YMD_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YMDHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DATETIME_YMDZHIS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE_KM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE_LI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE_LY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE_MI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_DISTANCE_NMI - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_EMAIL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_EMAIL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_GF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_KGF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_KIP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_KN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_LBF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_N - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_FORCE_TF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_IP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_IP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_CFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_CIN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_CM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_DM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_FM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_FT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_FUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_KM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_M - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_NM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_TFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_UM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_LENGTH_YD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MOBILE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MOBILE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_AUD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_BUK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_CAD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_CNY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_CSK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_CUP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_DEM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_DKK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_EGP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_EUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_FRF - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_GBP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_HKD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_INR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_ISK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_ITL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_JPY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_KRW - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_KWP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_MOP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_MXP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_MYR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_NOK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_NZD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_PHP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_SEK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_SGD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_SKK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_SUR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_THB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_TWD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_USD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_VND - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_MONEY_ZAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_CN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_CN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_FOREIGN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_FOREIGN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_NICKNAME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NAME_NICKNAME_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUC_PREFIX - Static variable in class org.lionsoul.jcseg.segmenter.Entity
prefix for numeric and unit composed word entity
E_NUMERIC - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_ARABIC - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_ARABIC_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_CN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_CN_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_CN_FRACTION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_CN_FRACTION_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_DECIMAL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_DECIMAL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_FRACTION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_FRACTION_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_PERCENTAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_NUMERIC_PERCENTAGE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_CITY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_CONTINENT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_DISTRICT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_NATION - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_PROVINCE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_TOWNSHIP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_VIEWPOINT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PLACE_VILLAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_ATM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_BAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_HG_IN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_HG_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_HPA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_KPA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_MBAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_PA - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_PRESSURE_WG_MM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_BP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_CT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_DAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_DAN_UK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_DAN_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_DR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_G - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_GR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_JIN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_KG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_LB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_LIANG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_LT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_MG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_OZ - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_Q - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_QIAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_ST - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_QUALITY_T - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_B - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_BIT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_EB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_GB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_KB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_MB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_PB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_STORAGE_TB - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE_C - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE_F - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE_K - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE_R - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TEMPERATURE_RE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_CARD - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_CARD_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_EPISODE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_EPISODE_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_NUMBER - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_THE_NUMBER_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_D - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_H - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_H_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_I - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_I_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_MON - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_MS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_NS - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_P - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_S - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_S_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_WEEK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_TIME_YEAR - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_BAG - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_BOTTLE - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_BOX - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_DAN - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_DISCOUNT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_UNIT_ITEM - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL_FTP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL_FTP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL_HTTP - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_URL_HTTP_A - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_CL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_CM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_DL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_DM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_FT3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_GAL_UK - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_GAL_US - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_HL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_IN3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_L - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_M3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_MFT - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_ML - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_MM3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_NL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_UL - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
E_VOLUME_YD3 - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
EN_LETTER - Static variable in class org.lionsoul.jcseg.util.StringUtil
 
EN_MAX_LEN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
maximum/minimum match length for English word extract
EN_NUMERIC - Static variable in class org.lionsoul.jcseg.util.StringUtil
 
EN_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
EN_PUNCTUATION - Static variable in class org.lionsoul.jcseg.util.StringUtil
 
EN_SEC_MIN_LEN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
minimum length for the secondary segmentation word
EN_SECOND_SEG - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to do the secondary split for complex Latin compose by the type of the chars
EN_UNKNOW - Static variable in class org.lionsoul.jcseg.util.StringUtil
 
EN_WHITESPACE - Static variable in class org.lionsoul.jcseg.util.StringUtil
 
EN_WORD_SEG - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
do the English word extract
endsWith(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
check if there is an element that ends with the specified string
enQueue(int) - Method in class org.lionsoul.jcseg.util.IIntFIFO
add a new item to the queue
enQueue(int) - Method in class org.lionsoul.jcseg.util.IIntQueue
append a int from the tail
enSecondSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
Do the secondary split for the specified complex Latin word This will split a complex English, Arabic, punctuation compose word to multiple simple parts Like 'qq2013' will split to 'qq' and '2013'
enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
 
enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
 
enSecondSegFilter(IWord) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
interface to check and do the English secondary segmentation.
Entity - Class in org.lionsoul.jcseg.segmenter
word item entity class
Entity() - Constructor for class org.lionsoul.jcseg.segmenter.Entity
 
EntityFormat - Class in org.lionsoul.jcseg.util
Entity format manager class
EntityFormat() - Constructor for class org.lionsoul.jcseg.util.EntityFormat
 
Entry(int, IIntFIFO.Entry) - Constructor for class org.lionsoul.jcseg.util.IIntFIFO.Entry
 
Entry(int, IIntQueue.Entry, IIntQueue.Entry) - Constructor for class org.lionsoul.jcseg.util.IIntQueue.Entry
 
Entry(T, IHashQueue.Entry<T>, IHashQueue.Entry<T>) - Constructor for class org.lionsoul.jcseg.util.IHashQueue.Entry
 
enWordSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
Latin word lexicon based English word segmentation for search mode
enWordSeg(IWord, LinkedList<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
Latin word lexicon based English word segmentation.
equals(Object) - Method in interface org.lionsoul.jcseg.IWord
you have to rewrite the #equals method cause the Jcseg require it
equals(Object) - Method in class org.lionsoul.jcseg.segmenter.Word
 

F

factory - Variable in enum org.lionsoul.jcseg.IDictionary.Type
 
factory - Variable in enum org.lionsoul.jcseg.ISegment.Type
 
fieldsArr - Static variable in class org.lionsoul.jcseg.segmenter.Entity
 
fillDateTimePool(IWord[], int, IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
fill the date-time pool specified part with part index constant
fillDateTimePool(IWord[], IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
fill the date-time pool specified part through the specified time entity string.
fillTimeToPool(IWord[], String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
fill a date-time time part with a standard time format like '15:45:36' to the specified time pool
filter(IWord) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
word item filter
filter(IWord) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
word item filter
findCHName(char[], int, IChunk) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find an Chinese name from the current position of the input chars
first() - Method in class org.lionsoul.jcseg.util.IStringBuffer
always return the first char
fromIndex(int) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
 
fromIndex(int) - Static method in enum org.lionsoul.jcseg.ISegment.Type
 
fromString(String) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
 
fromString(String) - Static method in enum org.lionsoul.jcseg.ISegment.Type
 
fromString(String, IDictionary.Type) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
get the Type with the specified string name
fromString(String, ISegment.Type) - Static method in enum org.lionsoul.jcseg.ISegment.Type
get the Type with the specified string name
fwsTohws(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
a static method to replace the full-width char to the half-width char in a given string (65281-65374 for full-width char)

G

get(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
get(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
 
get(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
get(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
return the IWord associate with the given key.
get(String) - Static method in class org.lionsoul.jcseg.segmenter.Entity
get the entity string by the specified key.
getAutoMinLength() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getAverageWordsLength() - Method in interface org.lionsoul.jcseg.IChunk
return the average word length for all the chunks.
getAverageWordsLength() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.ComplexSeg
 
getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
an abstract method to get word from the current position with MMSEG algorithm.
getBestChunk(char[], int, int) - Method in class org.lionsoul.jcseg.segmenter.SimpleSeg
 
getConfig() - Method in class org.lionsoul.jcseg.dic.ADictionary
 
getConfig() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
get the current Segmenter Config instance
getConfig() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
get the current task config instance
getConfig() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
getConfig() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the current task configuration instance.
getDateTimeIndex(String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
get and return the time part index of the specified IWord#entity
getDelimiter() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
get the current delimiter
getDELIMITER() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getDic() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
get the current dictionary instance
getDic() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
getDict() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
get the current dictionary instance
getDict() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the current dictionary instance.
getEnCharType(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
get the type of the English char defined in this class and start with EN_.
getEnMaxLen() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getEnSecondMinLen() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getEnSecondSeg() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getEntity() - Method in interface org.lionsoul.jcseg.IWord
get the entity name array of the word
getEntity() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getEntity(int) - Method in interface org.lionsoul.jcseg.IWord
get the specified entity by index
getEntity(int) - Method in class org.lionsoul.jcseg.segmenter.Word
 
getFile() - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
 
getFrequency() - Method in interface org.lionsoul.jcseg.IWord
return the frequency of the word, use only when the word's length is one.
getFrequency() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getGRAM() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getIndex() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
getIndex(String) - Static method in class org.lionsoul.jcseg.dic.ADictionary
get the key's type index located in ILexicon interface
getJarHome(Object) - Static method in class org.lionsoul.jcseg.util.Util
get the absolute parent path for the jar file.
getKeyphrase(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getKeyphrase(Reader) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
get the keyphrase list from a reader
getKeyphraseFromFile(String) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
get the keyphrase list from a file
getKeyphraseFromString(String) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
get the keyphrase list from a string
getKeySentence(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
getKeySentence(Reader) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get the key sentence from a reader
getKeySentenceFromFile(String) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get key sentence from a file path
getKeySentenceFromString(String) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get key sentence from a string
getKeywords(Reader) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
getKeywords(Reader) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
get the keywords list from a reader
getKeywordsFromFile(String) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
get the keywords list from a file
getKeywordsFromString(String) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
get the keywords list from a string
getKeywordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getKeywordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
getLargestAverageWordLengthChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
2.
getLargestSingleMorphemicFreedomChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
the largest sum of degree of morphemic freedom of one-character words this rule will return the chunks that own the largest sum of degree of morphemic freedom of one-character
getLastUpdateTime() - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
 
getLength() - Method in interface org.lionsoul.jcseg.IChunk
return the length of the chunk(the number of the word)
getLength() - Method in interface org.lionsoul.jcseg.IWord
return the length of the word
getLength() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
getLength() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getLength() - Method in class org.lionsoul.jcseg.sentence.Sentence
 
getLexiconPath() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
return the lexicon directory path
getList() - Method in class org.lionsoul.jcseg.SynonymsEntry
return the synonyms list
getMaxCnLnadron() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getMaximumMatchChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
1.
getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
getMaxIterateNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
getMaxLength() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getMaxWordsNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getN() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
getNameSingleThreshold() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.MostSeg
get the next CJK word from the current position of the input stream and this function is the core part the most segmentation implements
getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
 
getNextCJKWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the next CJK word from the current position of the input stream
getNextDatetimeWord(IWord, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
get and return the next date-time word
getNextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the next Latin word from the current position of the input stream
getNextMatch(int, char[], int, List<IWord>) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
match the next CJK word in the dictionary
getNextMixedWord(char[], int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the next mixed word, CJK-English or CJK-English-CJK or whatever
getNextPunctuationPairWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
get the next punctuation pair word from the current position of the input stream.
getNextTheWord(IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
get the next the_xxx word like '第x个', '第x集' EG ...
getNextTimeMergedWord(IWord, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
get and return the next time merged date-time word
getNextType(int, int, CharTypeFunction) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
common interface to get the next n-gram word for the specified char type.
getNumericUnitComposedWord(int, IWord) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
 
getPairPunctuationText(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find pair punctuation of the given punctuation char the purpose is to get the text between them
getParameter() - Method in interface org.lionsoul.jcseg.IWord
return the additional parameters
getParameter() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getPartSpeech() - Method in interface org.lionsoul.jcseg.IWord
return the part of speech of the word.
getPartSpeech() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getPinyin() - Method in interface org.lionsoul.jcseg.IWord
return the pinyin of the word
getPinyin() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getPollTime() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getPosition() - Method in interface org.lionsoul.jcseg.IWord
return the start position of the word.
getPosition() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getPosition() - Method in class org.lionsoul.jcseg.sentence.Sentence
 
getPPTMaxLength() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getPropertieFile() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
getPunctuationPair(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
get the pair punctuation' pair
getQueueSize() - Method in class org.lionsoul.jcseg.util.IPushbackReader
get the buffer size - the number of buffered data
getRootWord() - Method in class org.lionsoul.jcseg.SynonymsEntry
return the base word
getScore() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
getSeg() - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
 
getSeg() - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
 
getSentence() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
getSentenceNum() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
getSentenceSeg() - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
 
getSingleWordsFrequency() - Method in interface org.lionsoul.jcseg.IChunk
 
getSingleWordsFrequency() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
getSingleWordsMorphemicFreedom() - Method in interface org.lionsoul.jcseg.IChunk
return the degree of morphemic freedom for all the single words.
getSingleWordsMorphemicFreedom() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
getSmallestVarianceWordLengthChunks(ArrayList<IChunk>, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.MMSegFilter
the smallest variance word length this rule will the chunks that one the smallest variance word length
getStreamPosition() - Method in interface org.lionsoul.jcseg.ISegment
get the current length of the stream
getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
 
getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
 
getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
getStreamPosition() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
 
getSummary(Reader, int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
getSummary(Reader, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get summary from a reader
getSummaryFromFile(String, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get document summary from a file
getSummaryFromString(String, int) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
get document summary from a string
getSyn() - Method in interface org.lionsoul.jcseg.IWord
return the Synonyms' entry of the word.
getSyn() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getTimeKey(int) - Static method in class org.lionsoul.jcseg.util.TimeUtil
get and return the time key part with the part index value
getTimeKey(String) - Static method in class org.lionsoul.jcseg.util.TimeUtil
get and return the time key part of the specified entity string
getTimeKey(IWord) - Static method in class org.lionsoul.jcseg.util.TimeUtil
 
getType() - Method in interface org.lionsoul.jcseg.IWord
return the type of the word
getType() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getValue() - Method in interface org.lionsoul.jcseg.IWord
return the value of the word
getValue() - Method in class org.lionsoul.jcseg.segmenter.Word
 
getValue() - Method in class org.lionsoul.jcseg.sentence.Sentence
 
getWindowSize() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
getWindowSize() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
getWords() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
getWords() - Method in interface org.lionsoul.jcseg.IChunk
get the all the words in the chunk.
getWords() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
getWordSeg() - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
 
getWordsVariance() - Method in interface org.lionsoul.jcseg.IChunk
return the variance of all the words in all the chunks.
getWordsVariance() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
gIsb - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
global string buffer

H

hashCode() - Method in class org.lionsoul.jcseg.segmenter.Word
rewrite the hash code generate algorithm take the value as the main factor
HASHMAP - org.lionsoul.jcseg.IDictionary.Type
 
HASHMAP - Static variable in interface org.lionsoul.jcseg.IDictionary
Segmentation type constants
HashMapDictionary - Class in org.lionsoul.jcseg.dic
Dictionary class
HashMapDictionary(SegmenterConfig, Boolean) - Constructor for class org.lionsoul.jcseg.dic.HashMapDictionary
 
hwsTofws(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
a static method to replace the half-width char to the full-width char in a given string

I

I_CN_NAME - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
identify the Chinese name?
iaList - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
 
IChunk - Interface in org.lionsoul.jcseg
chunk interface for Jcseg.
identifyCnName() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
IDictionary - Interface in org.lionsoul.jcseg
Common interface for Dictionary
IDictionary.Type - Enum in org.lionsoul.jcseg
 
idx - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
the index value of the current input stream mainly for track the start position of the token
idx - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
the index value of the current input stream mainly for track the start position of the token
idx - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
 
IHashQueue<T extends IWord> - Class in org.lionsoul.jcseg.util
A normal queue base one single link but with hash index, so, it is fast for searching
IHashQueue() - Constructor for class org.lionsoul.jcseg.util.IHashQueue
 
IHashQueue.Entry<T> - Class in org.lionsoul.jcseg.util
innner Entry node class
IIntFIFO - Class in org.lionsoul.jcseg.util
int first in first out queue base on single link
IIntFIFO() - Constructor for class org.lionsoul.jcseg.util.IIntFIFO
 
IIntFIFO.Entry - Class in org.lionsoul.jcseg.util
Item Entry inner class
IIntQueue - Class in org.lionsoul.jcseg.util
char queue class base on double link Not thread safe
IIntQueue() - Constructor for class org.lionsoul.jcseg.util.IIntQueue
 
IIntQueue.Entry - Class in org.lionsoul.jcseg.util
innner Entry node class
ILexicon - Interface in org.lionsoul.jcseg.dic
lexicon configuration class.
implode(String, Object[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
String array implode internal method
increase(char) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
increase(char, int) - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
index - Variable in enum org.lionsoul.jcseg.IDictionary.Type
 
index - Variable in enum org.lionsoul.jcseg.ISegment.Type
 
indexOf(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
check and search the specified element in the Array
insertionSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
insert sort method
insertionSort(T[], int, int) - Static method in class org.lionsoul.jcseg.util.Sort
method to sort an subarray from start to end with insertion sort algorithm
IntArrayList - Class in org.lionsoul.jcseg.util
array list for basic int data type to instead of ArrayList Well, this will save a lot work to Reopened and Unpacking
IntArrayList() - Constructor for class org.lionsoul.jcseg.util.IntArrayList
 
IntArrayList(int) - Constructor for class org.lionsoul.jcseg.util.IntArrayList
 
IPushbackReader - Class in org.lionsoul.jcseg.util
IPushBackReader based on Reader Not thread safe support unlimited unread operation
IPushbackReader(Reader) - Constructor for class org.lionsoul.jcseg.util.IPushbackReader
 
is(int) - Method in interface org.lionsoul.jcseg.fi.CharTypeFunction
 
isAutoFilter() - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
isAutoload() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
about lexicon autoload
isb - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
 
isb - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
 
isCJK(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isCJK(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the specified string is all CJK chars
isCJKChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is CJK, Thai...
isCNNumeric(char) - Static method in class org.lionsoul.jcseg.util.NumericUtil
check if the given char is a Chinese numeric or not
isCNNumericString(String, int, int) - Static method in class org.lionsoul.jcseg.util.NumericUtil
check if the specified string is a Chinese numeric string
isCnPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isDate(String, char) - Static method in class org.lionsoul.jcseg.util.EntityFormat
check if the specified string is an valid Latin Date string like "2017/02/22", "2017-02-22" or "2017.02.22"
isDecimal(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isDecimal(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is a decimal including the full-width char
isDigit(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isDigit(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is a digit or not true will return if it is or return false this method can recognize full-with char
ISegment - Interface in org.lionsoul.jcseg
Jcseg segmentation interface
ISegment.Type - Enum in org.lionsoul.jcseg
 
isEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is a basic Latin and Russia and Greece letter.
isENKeepPunctuaton(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the given char is English keep punctuation
isEnLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
include the full-width and half-width char
isEnNumeric(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is an English numeric(48-57) including the full-width char
isEnPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the given char is half-width punctuation
isEnWordSeg() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
isFWEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the given char is a full-width char AT+reader: the full-width punctuation is not included here
isHWEnChar(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the given char is a half-width char or not
isIpAddress(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
 
isKeepEnSecOriginalWord() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
isKeepEnSegOriginalWord() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
isKeepPunctuation(char) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
isLatin(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isLatin(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the specified string is all Latin chars
isLetter(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isLetter(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the specified string is Latin letter
isLetterNumber(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is Letter number like 'ⅠⅡ' true will be return if it is, or return false
isLetterOrNumeric(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isLetterOrNumeric(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the specified string is Latin numeric or letter
isLowerCaseLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isMailAddress(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
check if the specified string is an email address or not
isMobileNumber(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
check if the specified string is a mobile number
isNoTailingPunctuation(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the given punctuation is the one that need to be cleared
isNumeric(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isNumeric(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the specified string is Latin numeric
isOtherNumber(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the specified char is other number like '①⑩⑽㈩' true will be return if it is, or return false
isPairPunctuation(char) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the given char is pair punctuation or not
isPunctuation(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check if the given char is a punctuation
isPunctuation(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isPunctuation(String, int, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
Check if the specified string is all punctuation chars (English and Chinese punctuation)
isSync() - Method in class org.lionsoul.jcseg.dic.ADictionary
 
isTime(String) - Static method in class org.lionsoul.jcseg.util.EntityFormat
check if the specified string is a valid time string like '12:45', '12:45:12'
IStringBuffer - Class in org.lionsoul.jcseg.util
string buffer class
IStringBuffer() - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
create a buffer with a default length 16
IStringBuffer(int) - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
create a buffer with a specified length
IStringBuffer(String) - Constructor for class org.lionsoul.jcseg.util.IStringBuffer
create a buffer with a specified string
isUpperCaseLetter(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
isUrlAddress(String, ADictionary) - Static method in class org.lionsoul.jcseg.util.EntityFormat
check if the specified string is an URL address or not
isWhitespace(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
check the given string is a whitespace
IWord - Interface in org.lionsoul.jcseg
Word Token interface

J

JcsegTest - Class in org.lionsoul.jcseg.test
Jcseg test program.
JcsegTest() - Constructor for class org.lionsoul.jcseg.test.JcsegTest
 

K

K1 - Static variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
KEEP_UNREG_WORDS - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
keepUnregWords() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
keyphrase(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
keyphrase extractor
KeyphraseExtractor - Class in org.lionsoul.jcseg.extractor
key phrase extractor
KeyphraseExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.KeyphraseExtractor
construct method
keywords(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
keywords extractor
KeywordsExtractor - Class in org.lionsoul.jcseg.extractor
document keywords extractor
KeywordsExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.KeywordsExtractor
construct method
keywordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
keywordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 

L

ladCJKPos() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
last() - Method in class org.lionsoul.jcseg.util.IStringBuffer
always return the last char
latinIndexOf(String) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
latinIndexOf(String, int) - Static method in class org.lionsoul.jcseg.util.StringUtil
get the index of the first Latin char of the specified string
length() - Method in class org.lionsoul.jcseg.util.IStringBuffer
return the length of the buffer
LEX_PROPERTY_FILE - Static variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
default lexicon property file name
load(File) - Method in class org.lionsoul.jcseg.dic.ADictionary
load all the words from a specified lexicon file
load(InputStream) - Method in class org.lionsoul.jcseg.dic.ADictionary
load all the words from a specified lexicon input stream
load(InputStream) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
initialize the value of its options from a InputStream of a jcseg.properties file
load(String) - Method in class org.lionsoul.jcseg.dic.ADictionary
load all the words from a specified lexicon path
load(String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
initialize the value of its options from a specified jcseg.properties propertie file
LOAD_CJK_ENTITY - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to load the entity define
LOAD_CJK_PINYIN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to load the Pinyin of the CJK_WORDS
LOAD_CJK_POS - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to load the word's part of speech
LOAD_CJK_SYN - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to load the synonym word of the CJK_WORDS.
LOAD_PARAMETER - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
whether to load the self-define parameter
loadCJKEntity() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
loadCJKPinyin() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
loadCJKSyn() - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
loadClassPath() - Method in class org.lionsoul.jcseg.dic.ADictionary
load all the words from all the files under the specified class path.
loadDirectory(String) - Method in class org.lionsoul.jcseg.dic.ADictionary
load the all the words from all the files under a specified lexicon directory
loadWords(SegmenterConfig, ADictionary, File, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
load all the words in the specified lexicon file into the dictionary
loadWords(SegmenterConfig, ADictionary, InputStream, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
load words from a InputStream
loadWords(SegmenterConfig, ADictionary, String, List<String[]>) - Static method in class org.lionsoul.jcseg.dic.ADictionary
load all the words from a specified lexicon file path

M

main(String[]) - Static method in class org.lionsoul.jcseg.test.JcsegTest
 
MAPPING - Static variable in enum org.lionsoul.jcseg.IDictionary.Type
the type index and type mapping for quick get the type by type index number.
MAPPING - Static variable in enum org.lionsoul.jcseg.ISegment.Type
the type index and type mapping for quick get the type by type index number.
match(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
match(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
loop up the dictionary, check the given key is in the dictionary or not
MAX_CN_LNADRON - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
the max length for the adron of the Chinese last name.like 老陈 “老”
MAX_LATIN_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
maximum length for Latin words
MAX_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
maximum length for maximum match(5-7)
MAX_UNIT_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
maximum length for unit words for the NLP algorithm added at 2016/11/18
maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
maxIterateNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
maxWordsNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
max phrase length
mergeSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
merge sort algorithm
MIX_ASSIST_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
special lexicon for Chinese-English[-Chinese] mixed word recognition For the optimization implementation of the mixed word recognition
MIX_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
mixPrefixLength - Variable in class org.lionsoul.jcseg.dic.ADictionary
 
mixSuffixLength - Variable in class org.lionsoul.jcseg.dic.ADictionary
maximum length for the Chinese words after the LATIN word or the one before it used to match Chinese and English mix word, like 'B超,AA制...' or style compose style like '卡拉ok'.
MMSegFilter - Class in org.lionsoul.jcseg.segmenter
MMSeg default filter class
MMSegFilter() - Constructor for class org.lionsoul.jcseg.segmenter.MMSegFilter
 
MOST - org.lionsoul.jcseg.ISegment.Type
 
MOST - Static variable in interface org.lionsoul.jcseg.ISegment
 
MOST_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
MostSeg - Class in org.lionsoul.jcseg.segmenter
Most mode implementation which all the possible combinations will be returned, and build it for information retrieval (better for index) of course.
MostSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.MostSeg
 

N

N - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
The N for n-gram, default to 1 and that is uni-gram
name - Variable in enum org.lionsoul.jcseg.IDictionary.Type
 
name - Variable in enum org.lionsoul.jcseg.ISegment.Type
 
NAME_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
NAME_SINGLE_THRESHOLD - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
the threshold of the single word that is a single word when it and the last char of the name make up a word.
next - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
 
next - Variable in class org.lionsoul.jcseg.util.IIntFIFO.Entry
 
next - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
 
next() - Method in interface org.lionsoul.jcseg.ISegment
segment a word from a char array from a specified position.
next() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
 
next() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
 
next() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
next() - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
Override the next method to add the date-time entity recognition And we also invoke the parent.next method to get the next token
next() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
 
next() - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
get the next sentence
nextCJKSentence(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
load a CJK char list from the stream start from the current position till the char is not a CJK char
nextCNNumeric(char[], int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find the Chinese number from the current position count until the char in the specified position is not a other number or whitespace
nextLatinString(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
the simple version of the next basic Latin fetch logic Just return the next Latin string with the keep punctuation after it
nextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.NLPSeg
find the letter or digit word from the current position count until the char is whitespace or not letter_digit
nextLatinWord(int, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find the letter or digit word from the current position count until the char is whitespace or not letter_digit
nextLetterNumber(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find the next other letter from the current position find the letter number from the current position count until the char in the specified position is not a letter number or whitespace
nextOtherNumber(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
find the other number from the current position count until the char in the specified position is not a other number or whitespace
NGRAM - org.lionsoul.jcseg.ISegment.Type
 
NGRAM - Static variable in interface org.lionsoul.jcseg.ISegment
 
NGRAM_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
NGramSeg - Class in org.lionsoul.jcseg.segmenter
Jcseg n-gram tokenizer implementation
NGramSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.NGramSeg
method to create a new ISegment
NLP - org.lionsoul.jcseg.ISegment.Type
 
NLP - Static variable in interface org.lionsoul.jcseg.ISegment
 
NLP_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
 
NLPSeg - Class in org.lionsoul.jcseg.segmenter
NLP segmentation implementation And this extends all the properties of the Complex one the rest of them are build for NLP only
NLPSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.NLPSeg
 
NUMBER_UNIT - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
 
NUMERIC_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
NumericUtil - Class in org.lionsoul.jcseg.util
a class to deal with Chinese numeric
NumericUtil() - Constructor for class org.lionsoul.jcseg.util.NumericUtil
 

O

org.lionsoul.jcseg - package org.lionsoul.jcseg
 
org.lionsoul.jcseg.dic - package org.lionsoul.jcseg.dic
 
org.lionsoul.jcseg.extractor - package org.lionsoul.jcseg.extractor
 
org.lionsoul.jcseg.extractor.impl - package org.lionsoul.jcseg.extractor.impl
 
org.lionsoul.jcseg.fi - package org.lionsoul.jcseg.fi
 
org.lionsoul.jcseg.segmenter - package org.lionsoul.jcseg.segmenter
 
org.lionsoul.jcseg.sentence - package org.lionsoul.jcseg.sentence
 
org.lionsoul.jcseg.test - package org.lionsoul.jcseg.test
 
org.lionsoul.jcseg.util - package org.lionsoul.jcseg.util
 

P

PPT_MAX_LENGTH - Variable in class org.lionsoul.jcseg.segmenter.SegmenterConfig
the maximum length for the text between the pair punctuation.
PPT_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
prev - Variable in class org.lionsoul.jcseg.util.IHashQueue.Entry
 
prev - Variable in class org.lionsoul.jcseg.util.IIntQueue.Entry
 
printChunks(String, ArrayList<IChunk>) - Static method in class org.lionsoul.jcseg.segmenter.ComplexSeg
 
printMatrix(double[][]) - Static method in class org.lionsoul.jcseg.util.Util
print the specified matrix
PUNCTUATION - Static variable in interface org.lionsoul.jcseg.IWord
 
pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
push back the data to the stream
pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
push back the data to the stream
pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
push back the data to the stream
pushBack(int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
push back the data to the stream.
pushBack(String) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
push back a string to the stream

Q

qCNNumericToArabic(String) - Static method in class org.lionsoul.jcseg.util.NumericUtil
 
QUANTIFIER - Static variable in interface org.lionsoul.jcseg.IWord
 
quickSelect(T[], int) - Static method in class org.lionsoul.jcseg.util.Sort
quick select algorithm
quicksort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
quick sort algorithm

R

read() - Method in class org.lionsoul.jcseg.util.IPushbackReader
read the next int from the stream this will check the buffer queue first and take the first item of the buffer as the result
read(char[], int, int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
read the specified block from the stream
reader - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
 
reader - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
 
reader - Variable in class org.lionsoul.jcseg.sentence.SentenceSeg
 
readNext() - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
read the next char from the current position
readNext() - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
read the next char from the current position
readNext() - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
read the next char from the current position
readNext() - Method in class org.lionsoul.jcseg.segmenter.Segmenter
read the next char from the current position
readNext() - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
read the next char from the current position
readUntil(char) - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
loop the reader until the specified char is found.
remove() - Method in class org.lionsoul.jcseg.util.IHashQueue
remove the node from the head and you should make sure the size is larger than 0 by calling size() before you invoke the method or you will just get null.
remove(int) - Method in class org.lionsoul.jcseg.util.IntArrayList
remove the element at the specified position use System.arraycopy instead of a loop may be more efficient
remove(int, String) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
remove(int, String) - Method in interface org.lionsoul.jcseg.IDictionary
remove the mapping associate with the given key
reset(Reader) - Method in interface org.lionsoul.jcseg.ISegment
reset the reader
reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
 
reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.DetectSeg
 
reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
reset(Reader) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
input stream and reader reset.
reset(Reader) - Method in class org.lionsoul.jcseg.sentence.SentenceSeg
stream/reader reset.
resetMode(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
 
resetPrefixLength(SegmenterConfig, ADictionary, int) - Static method in class org.lionsoul.jcseg.dic.ADictionary
check and reset the value of ADictionary.mixPrefixLength
resetSuffixLength(SegmenterConfig, ADictionary, int) - Static method in class org.lionsoul.jcseg.dic.ADictionary
check and reset the value of the ADictionary.mixSuffixLength
resetSynonymsNet() - Method in class org.lionsoul.jcseg.dic.ADictionary
1, synonyms words to synonyms entry 2, loop each synonyms word and set the IWord#synEntry 3, clear the synonyms buffer
rootWord - Variable in class org.lionsoul.jcseg.SynonymsEntry
the Synonyms root word

S

seg - Variable in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
the ISegment object
seg - Variable in class org.lionsoul.jcseg.extractor.KeywordsExtractor
the ISegment object
SegKit - Class in org.lionsoul.jcseg.segmenter
abstract segmentor kit class
SegKit() - Constructor for class org.lionsoul.jcseg.segmenter.SegKit
 
Segmenter - Class in org.lionsoul.jcseg.segmenter
abstract segmentation super class: 1.
Segmenter(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.Segmenter
initialize the segment
SegmenterConfig - Class in org.lionsoul.jcseg.segmenter
Jcseg segmenter configuration class
SegmenterConfig() - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
create the config and do nothing about initialize Note: this may cuz Incompatibility problems for the old version that has use this construct method
SegmenterConfig(boolean) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
create and initialize the config by autoload
SegmenterConfig(InputStream) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
create and initialize the task config from a InputStream
SegmenterConfig(String) - Constructor for class org.lionsoul.jcseg.segmenter.SegmenterConfig
create and initialize the task config from a properties file
SegmenterFunction - Interface in org.lionsoul.jcseg.fi
 
sentence(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
key sentence extractor
Sentence - Class in org.lionsoul.jcseg.sentence
sentence desc class
Sentence(String) - Constructor for class org.lionsoul.jcseg.sentence.Sentence
 
Sentence(String, int) - Constructor for class org.lionsoul.jcseg.sentence.Sentence
construct method
sentenceNum - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
sentenceSeg - Variable in class org.lionsoul.jcseg.extractor.SummaryExtractor
sentence splitter object
SentenceSeg - Class in org.lionsoul.jcseg.sentence
document sentence splitter
SentenceSeg() - Constructor for class org.lionsoul.jcseg.sentence.SentenceSeg
 
SentenceSeg(Reader) - Constructor for class org.lionsoul.jcseg.sentence.SentenceSeg
construct method
set(int, char) - Method in class org.lionsoul.jcseg.util.IStringBuffer
set the char at the specified index
set(int, int) - Method in class org.lionsoul.jcseg.util.IntArrayList
 
set(String, String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
set the option value from a specified key and value define in jcseg.properties
setAppendCJKPinyin(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setAppendCJKSyn(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setAppendPartOfSpeech(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setAutoFilter(boolean) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
setAutoload(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setAutoMinLength(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
setClearStopwords(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setCnFactionToArabic(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setCnNumToArabic(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setDelimiter(char) - Method in class org.lionsoul.jcseg.segmenter.DelimiterSeg
set the delimiter default to whitespace
setDELIMITER(char) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setEnMaxLen(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setEnSecondMinLen(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setEnSecondSeg(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setEntity(String[]) - Method in interface org.lionsoul.jcseg.IWord
set the entity name array of the word
setEntity(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setEntityForNull(String[]) - Method in interface org.lionsoul.jcseg.IWord
 
setEntityForNull(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setEnWordSeg(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setFrequency(int) - Method in interface org.lionsoul.jcseg.IWord
set the frequency of the word
setFrequency(int) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setGRAM(byte) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setICnName(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setIndex(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
setKeepEnSecOriginalWord(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setKeepEnSegOriginalWord(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setKeepPunctuations(String) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setKeepUnregWords(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setKeywordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
setKeywordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
setLastUpdateTime(long) - Method in class org.lionsoul.jcseg.dic.AutoLoadFile
 
setLength(int) - Method in interface org.lionsoul.jcseg.IWord
self define the length
setLength(int) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setLength(int) - Method in class org.lionsoul.jcseg.sentence.Sentence
 
setLength(int) - Method in class org.lionsoul.jcseg.util.IStringBuffer
set the length of the buffer actually it just override the count and the actual buffer has nothing changed
setLexiconPath(String[]) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setLoadCJKPinyin(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setLoadCJKPos(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setLoadCJKSyn(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setLoadEntity(boolean) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setMaxCnLnadron(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
setMaxIterateNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
setMaxLength(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setMaxWordsNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
setN(byte) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
 
setNameSingleThreshold(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setParameter(String) - Method in interface org.lionsoul.jcseg.IWord
set the string parameters
setParameter(String) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setPartSpeech(String[]) - Method in interface org.lionsoul.jcseg.IWord
 
setPartSpeech(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setPartSpeechForNull(String[]) - Method in interface org.lionsoul.jcseg.IWord
 
setPartSpeechForNull(String[]) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setPinyin(String) - Method in interface org.lionsoul.jcseg.IWord
set the pinyin of the word
setPinyin(String) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setPollTime(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setPosition(int) - Method in interface org.lionsoul.jcseg.IWord
set the position of the word
setPosition(int) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setPosition(int) - Method in class org.lionsoul.jcseg.sentence.Sentence
 
setPPT_MAX_LENGTH(int) - Method in class org.lionsoul.jcseg.segmenter.SegmenterConfig
 
setRootWord(IWord) - Method in class org.lionsoul.jcseg.SynonymsEntry
 
setScore(double) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
setSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.KeyphraseExtractor
 
setSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.KeywordsExtractor
 
setSentence(Sentence) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
setSentenceNum(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
setSentenceSeg(SentenceSeg) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
 
setSyn(SynonymsEntry) - Method in interface org.lionsoul.jcseg.IWord
 
setSyn(SynonymsEntry) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setType(int) - Method in interface org.lionsoul.jcseg.IWord
set the type of the word
setType(int) - Method in class org.lionsoul.jcseg.segmenter.Word
 
setValue(String) - Method in class org.lionsoul.jcseg.sentence.Sentence
 
setWindowSize(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
setWindowSize(int) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
setWords(List<IWord>) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor.Document
 
setWordSeg(ISegment) - Method in class org.lionsoul.jcseg.extractor.SummaryExtractor
 
shellSort(T[]) - Static method in class org.lionsoul.jcseg.util.Sort
shell sort algorithm
SIMPLE - org.lionsoul.jcseg.ISegment.Type
 
SIMPLE - Static variable in interface org.lionsoul.jcseg.ISegment
Segmentation type constants
SIMPLE_MODE - Static variable in interface org.lionsoul.jcseg.ISegment
Segmentation type index
SimpleSeg - Class in org.lionsoul.jcseg.segmenter
Jcseg simple segmentation implementation
SimpleSeg(SegmenterConfig, ADictionary) - Constructor for class org.lionsoul.jcseg.segmenter.SimpleSeg
 
SIMSTR - Static variable in class org.lionsoul.jcseg.util.STConverter
 
SimToTraditional(String) - Static method in class org.lionsoul.jcseg.util.STConverter
convert the simplified words to traditional words of the specified string.
SimToTraditional(String, IStringBuffer) - Static method in class org.lionsoul.jcseg.util.STConverter
 
size() - Method in class org.lionsoul.jcseg.SynonymsEntry
 
size() - Method in class org.lionsoul.jcseg.util.IHashQueue
get the size of the queue
size() - Method in class org.lionsoul.jcseg.util.IIntFIFO
get the size of the queue
size() - Method in class org.lionsoul.jcseg.util.IIntQueue
get the size of the queue
size() - Method in class org.lionsoul.jcseg.util.IntArrayList
 
size(int) - Method in class org.lionsoul.jcseg.dic.HashMapDictionary
 
size(int) - Method in interface org.lionsoul.jcseg.IDictionary
return the size of the dictionary
Sort - Class in org.lionsoul.jcseg.util
All kind of Sort algorithm implemented method use the default compare method
Sort() - Constructor for class org.lionsoul.jcseg.util.Sort
 
START_SS_MASK - Static variable in interface org.lionsoul.jcseg.ISegment
Whether to start the Latin secondary segmentation.
startAutoload() - Method in class org.lionsoul.jcseg.dic.ADictionary
start the lexicon autoload thread
startsWith(String, String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
check if there is an element that starts with the specified string
STConverter - Class in org.lionsoul.jcseg.util
Simplified and traditional chinese convert class all the search work base on String.indexOf(int) you may store all the words in a HashMap for the purpuse of a faster fetch
STConverter() - Constructor for class org.lionsoul.jcseg.util.STConverter
 
STOP_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
stop words
stopAutoload() - Method in class org.lionsoul.jcseg.dic.ADictionary
 
streamResetTo(String, int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
reset the data back from the specified position
StringUtil - Class in org.lionsoul.jcseg.util
a class to deal with the English stop char like the English punctuation
StringUtil() - Constructor for class org.lionsoul.jcseg.util.StringUtil
 
subWordPool - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
 
summary(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
summary extractor
SummaryExtractor - Class in org.lionsoul.jcseg.extractor
document summary extractor
SummaryExtractor(ISegment, SentenceSeg) - Constructor for class org.lionsoul.jcseg.extractor.SummaryExtractor
construct method
sync - Variable in class org.lionsoul.jcseg.dic.ADictionary
 
SynonymsEntry - Class in org.lionsoul.jcseg
Synonyms words entry class
SynonymsEntry() - Constructor for class org.lionsoul.jcseg.SynonymsEntry
 
SynonymsEntry(IWord) - Constructor for class org.lionsoul.jcseg.SynonymsEntry
 

T

T_BASIC_LATIN - Static variable in interface org.lionsoul.jcseg.IWord
Latin series including the Arabic numbers.
T_CJK_PINYIN - Static variable in interface org.lionsoul.jcseg.IWord
Chinese Pinyin
T_CJK_WORD - Static variable in interface org.lionsoul.jcseg.IWord
Chinese,Japanese,Korean words
T_CN_NAME - Static variable in interface org.lionsoul.jcseg.IWord
Chinese last name.
T_CN_NICKNAME - Static variable in interface org.lionsoul.jcseg.IWord
Chinese nickname like: 老陈
T_CN_NUMERIC - Static variable in interface org.lionsoul.jcseg.IWord
Chinese numeric
T_LEN - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
 
T_LETTER_NUMBER - Static variable in interface org.lionsoul.jcseg.IWord
letter number like 'ⅠⅡ'
T_MIXED_WORD - Static variable in interface org.lionsoul.jcseg.IWord
Chinese and English mix word like B超,SIM卡.
T_OTHER_NUMBER - Static variable in interface org.lionsoul.jcseg.IWord
other number like '①⑩⑽㈩'
T_PUNCTUATION - Static variable in interface org.lionsoul.jcseg.IWord
 
T_UNRECOGNIZE_WORD - Static variable in interface org.lionsoul.jcseg.IWord
useless chars like the CJK punctuation
TextRankKeyphraseExtractor - Class in org.lionsoul.jcseg.extractor.impl
document key phrase extractor base on textRank algorithm
TextRankKeyphraseExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
TextRankKeywordsExtractor - Class in org.lionsoul.jcseg.extractor.impl
document keywords extractor base on textRank algorithm
TextRankKeywordsExtractor(ISegment) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
textRankSortedDocuments(List<Sentence>, List<List<IWord>>) - Method in class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
get the documents order by relevance score.
TextRankSummaryExtractor - Class in org.lionsoul.jcseg.extractor.impl
TextRank summary extractor base on textRank algorithm
TextRankSummaryExtractor(ISegment, SentenceSeg) - Constructor for class org.lionsoul.jcseg.extractor.impl.TextRankSummaryExtractor
 
TextRankSummaryExtractor.Document - Class in org.lionsoul.jcseg.extractor.impl
summary document inner class
TIME_POSPEECH - Static variable in interface org.lionsoul.jcseg.IWord
 
TimeUtil - Class in org.lionsoul.jcseg.util
Time Util class
TimeUtil() - Constructor for class org.lionsoul.jcseg.util.TimeUtil
 
toJsonArray(String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
 
toJsonObject(String[]) - Static method in class org.lionsoul.jcseg.util.ArrayUtil
implode the array elements as a Json array string
tokenize(String) - Method in class org.lionsoul.jcseg.test.JcsegTest
string tokenize handler
toLowerCase(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
toString() - Method in class org.lionsoul.jcseg.segmenter.Chunk
 
toString() - Method in class org.lionsoul.jcseg.segmenter.Word
 
toString() - Method in class org.lionsoul.jcseg.sentence.Sentence
rewrite the toString method
toString() - Method in class org.lionsoul.jcseg.util.ByteCharCounter
 
toString() - Method in class org.lionsoul.jcseg.util.IStringBuffer
return the string of the current buffer
toUpperCase(int) - Static method in class org.lionsoul.jcseg.util.StringUtil
 
TRASTR - Static variable in class org.lionsoul.jcseg.util.STConverter
 
TraToSimplified(String) - Static method in class org.lionsoul.jcseg.util.STConverter
convert the traditional words to simplified words of the specified string.
TraToSimplified(String, IStringBuffer) - Static method in class org.lionsoul.jcseg.util.STConverter
 

U

UNMATCH_CJK_WORD - Static variable in interface org.lionsoul.jcseg.dic.ILexicon
unmatched word
unread(char[], int, int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
unread a block from a char array to the stream
unread(int) - Method in class org.lionsoul.jcseg.util.IPushbackReader
unread the specified data to the stream push the data back to the queue in fact, you know
UNRECOGNIZE - Static variable in interface org.lionsoul.jcseg.IWord
 
Util - Class in org.lionsoul.jcseg.util
static method for jcseg.
Util() - Constructor for class org.lionsoul.jcseg.util.Util
 

V

valueOf(String) - Static method in enum org.lionsoul.jcseg.IDictionary.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.lionsoul.jcseg.ISegment.Type
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.lionsoul.jcseg.IDictionary.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.lionsoul.jcseg.ISegment.Type
Returns an array containing the constants of this enum type, in the order they are declared.

W

windowSize - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeyphraseExtractor
 
windowSize - Variable in class org.lionsoul.jcseg.extractor.impl.TextRankKeywordsExtractor
 
Word - Class in org.lionsoul.jcseg.segmenter
word class for Jcseg with the IWord interface implemented at 2017/03/29: make the synonym series method Word.getSyn() Word.setSyn(SynonymsEntry) and the part of speech series method Word.getPartSpeech() Word.setPartSpeech(String[]) Word.addPartSpeech(String) and the Word.clone() method synchronized for might happen concurrent access.
Word(String, int) - Constructor for class org.lionsoul.jcseg.segmenter.Word
 
Word(String, int, int) - Constructor for class org.lionsoul.jcseg.segmenter.Word
 
Word(String, int, int, String[]) - Constructor for class org.lionsoul.jcseg.segmenter.Word
construct method to initialize the newly created Word instance
Word(String, int, String[]) - Constructor for class org.lionsoul.jcseg.segmenter.Word
 
wordNewOrClone(int, String, int) - Method in class org.lionsoul.jcseg.segmenter.NGramSeg
check if the specified word is existed in a specified dictionary and if does clone it or create a new one.
wordNewOrClone(int, String, int) - Method in class org.lionsoul.jcseg.segmenter.Segmenter
check if the specified word is existed in a specified dictionary and if it does clone it or create a new one.
wordPool - Variable in class org.lionsoul.jcseg.segmenter.DelimiterSeg
 
wordPool - Variable in class org.lionsoul.jcseg.segmenter.DetectSeg
 
wordPool - Variable in class org.lionsoul.jcseg.segmenter.NGramSeg
CJK word cache pool, Reusable string buffer
wordPool - Variable in class org.lionsoul.jcseg.segmenter.Segmenter
CJK word cache pool, Reusable string buffer and the array list for basic integer
wordSeg - Variable in class org.lionsoul.jcseg.extractor.SummaryExtractor
ISegment word tokenizer object

_

__toString() - Method in class org.lionsoul.jcseg.segmenter.Word
for debug only
A B C D E F G H I J K L M N O P Q R S T U V W _ 
All Classes All Packages