Uses of Interface
org.lionsoul.jcseg.IWord
-
Packages that use IWord Package Description org.lionsoul.jcseg org.lionsoul.jcseg.dic org.lionsoul.jcseg.extractor org.lionsoul.jcseg.extractor.impl org.lionsoul.jcseg.segmenter org.lionsoul.jcseg.util -
-
Uses of IWord in org.lionsoul.jcseg
Fields in org.lionsoul.jcseg declared as IWord Modifier and Type Field Description protected IWordSynonymsEntry. rootWordthe Synonyms root wordMethods in org.lionsoul.jcseg that return IWord Modifier and Type Method Description IWordIDictionary. add(int t, String key, int type)add a new word to the dictionaryIWordIDictionary. add(int t, String key, int fre, int type)add a new word to the dictionaryIWordIDictionary. add(int t, String key, int fre, int type, String[] entity)add a new word to the dictionary with its statistics frequencyIWordIDictionary. add(int t, String key, int type, String[] entity)add a new word to the dictionaryIWordIDictionary. add(int t, IWord word)directly add a IWord item to the dictionaryIWordIWord. clone()make clone availableIWordIDictionary. get(int t, String key)return the IWord associate with the given key.IWordSynonymsEntry. getRootWord()return the base wordIWord[]IChunk. getWords()get the all the words in the chunk.IWordISegment. next()segment a word from a char array from a specified position.Methods in org.lionsoul.jcseg that return types with arguments of type IWord Modifier and Type Method Description List<IWord>SynonymsEntry. getList()return the synonyms listMethods in org.lionsoul.jcseg with parameters of type IWord Modifier and Type Method Description IWordIDictionary. add(int t, IWord word)directly add a IWord item to the dictionaryvoidSynonymsEntry. add(IWord word)add a new synonyms word and the newly added word will extend the part of speech and the entity from the base word if there are not setvoidSynonymsEntry. setRootWord(IWord rootWord)Constructors in org.lionsoul.jcseg with parameters of type IWord Constructor Description SynonymsEntry(IWord rootWord) -
Uses of IWord in org.lionsoul.jcseg.dic
Methods in org.lionsoul.jcseg.dic that return IWord Modifier and Type Method Description IWordHashMapDictionary. add(int t, String key, int type)IWordHashMapDictionary. add(int t, String key, int fre, int type)IWordHashMapDictionary. add(int t, String key, int fre, int type, String[] entity)IWordHashMapDictionary. add(int t, String key, int type, String[] entity)IWordHashMapDictionary. add(int t, IWord word)IWordHashMapDictionary. get(int t, String key)Methods in org.lionsoul.jcseg.dic with parameters of type IWord Modifier and Type Method Description IWordHashMapDictionary. add(int t, IWord word) -
Uses of IWord in org.lionsoul.jcseg.extractor
Methods in org.lionsoul.jcseg.extractor with parameters of type IWord Modifier and Type Method Description protected booleanKeyphraseExtractor. filter(IWord word)word item filterprotected booleanKeywordsExtractor. filter(IWord word)word item filter -
Uses of IWord in org.lionsoul.jcseg.extractor.impl
Methods in org.lionsoul.jcseg.extractor.impl that return types with arguments of type IWord Modifier and Type Method Description List<IWord>TextRankSummaryExtractor.Document. getWords()Method parameters in org.lionsoul.jcseg.extractor.impl with type arguments of type IWord Modifier and Type Method Description voidTextRankSummaryExtractor.Document. setWords(List<IWord> words)Constructor parameters in org.lionsoul.jcseg.extractor.impl with type arguments of type IWord Constructor Description Document(int index, Sentence sentence, List<IWord> words, double score)construct method -
Uses of IWord in org.lionsoul.jcseg.segmenter
Classes in org.lionsoul.jcseg.segmenter that implement IWord Modifier and Type Class Description classWordword class for Jcseg with theIWordinterface implemented at 2017/03/29: make the synonym series methodWord.getSyn()Word.setSyn(SynonymsEntry)and the part of speech series methodWord.getPartSpeech()Word.setPartSpeech(String[])Word.addPartSpeech(String)and theWord.clone()method synchronized for might happen concurrent access.Fields in org.lionsoul.jcseg.segmenter with type parameters of type IWord Modifier and Type Field Description protected LinkedList<IWord>Segmenter. subWordPoolprotected LinkedList<IWord>DelimiterSeg. wordPoolprotected LinkedList<IWord>DetectSeg. wordPoolprotected LinkedList<IWord>NGramSeg. wordPoolCJK word cache pool, Reusable string bufferprotected LinkedList<IWord>Segmenter. wordPoolCJK word cache pool, Reusable string buffer and the array list for basic integerMethods in org.lionsoul.jcseg.segmenter that return IWord Modifier and Type Method Description IWordWord. clone()Interface to clone the current objectprotected IWordMostSeg. getNextCJKWord(int c, int pos)get the next CJK word from the current position of the input stream and this function is the core part the most segmentation implementsprotected IWordNLPSeg. getNextCJKWord(int c, int pos)protected IWordSegmenter. getNextCJKWord(int c, int pos)get the next CJK word from the current position of the input streamprotected IWordNLPSeg. getNextDatetimeWord(IWord word, int entityIdx)get and return the next date-time wordprotected IWordSegmenter. getNextLatinWord(int c, int pos)get the next Latin word from the current position of the input streamprotected IWord[]Segmenter. getNextMatch(int maxLen, char[] chars, int index, List<IWord> wList)match the next CJK word in the dictionaryprotected IWordSegmenter. getNextMixedWord(char[] chars, int cjkidx)get the next mixed word, CJK-English or CJK-English-CJK or whateverprotected IWordSegmenter. getNextPunctuationPairWord(int c, int pos)get the next punctuation pair word from the current position of the input stream.protected IWordNLPSeg. getNextTheWord(IWord word)get the next the_xxx word like '第x个', '第x集' EG ...protected IWordNLPSeg. getNextTimeMergedWord(IWord word, int eIdx)get and return the next time merged date-time wordIWordNLPSeg. getNumericUnitComposedWord(int numeric, IWord unitWord)IWord[]Chunk. getWords()IWordDelimiterSeg. next()IWordDetectSeg. next()IWordNGramSeg. next()IWordNLPSeg. next()Override the next method to add the date-time entity recognition And we also invoke the parent.next method to get the next tokenIWordSegmenter. next()protected IWordNLPSeg. nextLatinWord(int c, int pos)find the letter or digit word from the current position count until the char is whitespace or not letter_digitprotected IWordSegmenter. nextLatinWord(int c, int pos)find the letter or digit word from the current position count until the char is whitespace or not letter_digitIWordNGramSeg. wordNewOrClone(int t, String str, int type)check if the specified word is existed in a specified dictionary and if does clone it or create a new one.IWordSegmenter. wordNewOrClone(int t, String str, int type)check if the specified word is existed in a specified dictionary and if it does clone it or create a new one.Methods in org.lionsoul.jcseg.segmenter that return types with arguments of type IWord Modifier and Type Method Description protected LinkedList<IWord>Segmenter. enSecondSeg(IWord w, LinkedList<IWord> wList)Do the secondary split for the specified complex Latin word This will split a complex English, Arabic, punctuation compose word to multiple simple parts Like 'qq2013' will split to 'qq' and '2013'protected LinkedList<IWord>MostSeg. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation for search modeprotected LinkedList<IWord>Segmenter. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation.Methods in org.lionsoul.jcseg.segmenter with parameters of type IWord Modifier and Type Method Description protected voidSegmenter. appendCJKWordFeatures(IWord word)check and append the pinyin and the synonyms words of the specified wordprotected voidSegmenter. appendLatinWordFeatures(IWord w)Check and append the synonyms/pinyin words of specified word included the CJK and basic Latin words All the synonyms words share the same position part of speech, word type with the primitive wordstatic voidSegKit. appendPinyin(SegmenterConfig config, LinkedList<IWord> wordPool, IWord wd)Quick interface to do the pinyin word append.static voidSegKit. appendSynonyms(SegmenterConfig config, LinkedList<IWord> wordPool, IWord wd)quick interface to do the synonyms word append You got check if the specified word has any synonyms firstprotected LinkedList<IWord>Segmenter. enSecondSeg(IWord w, LinkedList<IWord> wList)Do the secondary split for the specified complex Latin word This will split a complex English, Arabic, punctuation compose word to multiple simple parts Like 'qq2013' will split to 'qq' and '2013'protected booleanMostSeg. enSecondSegFilter(IWord w)protected booleanNLPSeg. enSecondSegFilter(IWord w)protected booleanSegmenter. enSecondSegFilter(IWord w)interface to check and do the English secondary segmentation.protected LinkedList<IWord>MostSeg. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation for search modeprotected LinkedList<IWord>Segmenter. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation.protected IWordNLPSeg. getNextDatetimeWord(IWord word, int entityIdx)get and return the next date-time wordprotected IWordNLPSeg. getNextTheWord(IWord word)get the next the_xxx word like '第x个', '第x集' EG ...protected IWordNLPSeg. getNextTimeMergedWord(IWord word, int eIdx)get and return the next time merged date-time wordIWordNLPSeg. getNumericUnitComposedWord(int numeric, IWord unitWord)Method parameters in org.lionsoul.jcseg.segmenter with type arguments of type IWord Modifier and Type Method Description static voidSegKit. appendPinyin(SegmenterConfig config, LinkedList<IWord> wordPool, IWord wd)Quick interface to do the pinyin word append.static voidSegKit. appendSynonyms(SegmenterConfig config, LinkedList<IWord> wordPool, IWord wd)quick interface to do the synonyms word append You got check if the specified word has any synonyms firstprotected LinkedList<IWord>Segmenter. enSecondSeg(IWord w, LinkedList<IWord> wList)Do the secondary split for the specified complex Latin word This will split a complex English, Arabic, punctuation compose word to multiple simple parts Like 'qq2013' will split to 'qq' and '2013'protected LinkedList<IWord>MostSeg. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation for search modeprotected LinkedList<IWord>Segmenter. enWordSeg(IWord w, LinkedList<IWord> wList)Latin word lexicon based English word segmentation.protected IWord[]Segmenter. getNextMatch(int maxLen, char[] chars, int index, List<IWord> wList)match the next CJK word in the dictionaryConstructors in org.lionsoul.jcseg.segmenter with parameters of type IWord Constructor Description Chunk(IWord[] words) -
Uses of IWord in org.lionsoul.jcseg.util
Classes in org.lionsoul.jcseg.util with type parameters of type IWord Modifier and Type Class Description classIHashQueue<T extends IWord>A normal queue base one single link but with hash index, so, it is fast for searchingMethods in org.lionsoul.jcseg.util that return IWord Modifier and Type Method Description static IWord[]TimeUtil. createDateTimePool()create and return a date-time poolMethods in org.lionsoul.jcseg.util with parameters of type IWord Modifier and Type Method Description static voidTimeUtil. fillDateTimePool(IWord[] wPool, int pIdx, IWord word)fill the date-time pool specified part with part index constantstatic intTimeUtil. fillDateTimePool(IWord[] wPool, IWord word)fill the date-time pool specified part through the specified time entity string.static voidTimeUtil. fillTimeToPool(IWord[] wPool, String timeVal)fill a date-time time part with a standard time format like '15:45:36' to the specified time poolstatic StringTimeUtil. getTimeKey(IWord word)
-