| Package | Description |
|---|---|
| org.ansj.app.newWord | |
| org.ansj.domain | |
| org.ansj.library | |
| org.ansj.recognition | |
| org.ansj.splitWord | |
| org.ansj.splitWord.analysis | |
| org.ansj.util |
| Modifier and Type | Method and Description |
|---|---|
void |
PatHashMap.addList(java.util.List<Term> terms)
增加term链到树中
|
| Constructor and Description |
|---|
NewTerm(Term term) |
| Modifier and Type | Field and Description |
|---|---|
static Term |
Term.NULL |
| Modifier and Type | Method and Description |
|---|---|
Term |
Term.getFrom() |
Term |
Term.getNext() |
Term |
Term.getTo() |
Term |
Term.merage(Term to)
进行term合并
|
Term |
Term.setNext(Term next)
返回他自己
|
| Modifier and Type | Method and Description |
|---|---|
int |
Term.compareTo(Term o) |
Term |
Term.merage(Term to)
进行term合并
|
void |
Term.setFrom(Term from) |
Term |
Term.setNext(Term next)
返回他自己
|
void |
Term.setPathScore(Term from)
核心构建最优的路径
|
void |
Term.setPathSelfScore(Term from)
核心分数的最优的路径,越小越好
|
void |
Term.setTo(Term to) |
| Modifier and Type | Method and Description |
|---|---|
static int |
NatureLibrary.getTwoTermFreq(Term fromTerm,
Term toTerm)
获得两个term之间的频率
|
static int |
NgramLibrary.getTwoWordFreq(Term from,
Term to)
查找两个词与词之间的频率
|
| Modifier and Type | Method and Description |
|---|---|
static void |
AsianPersonRecognition.nameAmbiguity(Term[] terms)
人名消歧,比如.邓颖超生前->邓颖 超生 前 fix to 丁颖超 生 前!
|
static void |
NumRecognition.recognition(Term[] terms)
数字+数字合并,zheng
|
| Constructor and Description |
|---|
AsianPersonRecognition(Term[] terms) |
CompanyRecogntion(Term[] terms) |
ForeignPersonRecognition(Term[] terms) |
NewWordRecognition(Term[] terms,
LearnTool learn) |
UserDefineRecognition(Term[] terms) |
UserDefineRecognition(Term[] terms,
love.cq.domain.Forest forest) |
| Constructor and Description |
|---|
NatureRecognition(java.util.List<Term> terms)
构造方法.传入分词的最终结果
|
| Modifier and Type | Method and Description |
|---|---|
Term |
Analysis.next() |
| Modifier and Type | Method and Description |
|---|---|
protected abstract java.util.List<Term> |
Analysis.getResult(Graph graph) |
abstract java.util.List<Term> |
Analysis.Merger.merger() |
protected java.util.List<Term> |
Analysis.parseStr(java.lang.String temp) |
| Modifier and Type | Method and Description |
|---|---|
protected java.util.List<Term> |
ToAnalysis.getResult(Graph graph) |
protected java.util.List<Term> |
NlpAnalysis.getResult(Graph graph) |
protected java.util.List<Term> |
IndexAnalysis.getResult(Graph graph) |
protected java.util.List<Term> |
BaseAnalysis.getResult(Graph graph) |
static java.util.List<Term> |
ToAnalysis.parse(java.lang.String str) |
static java.util.List<Term> |
NlpAnalysis.parse(java.lang.String str) |
static java.util.List<Term> |
IndexAnalysis.parse(java.lang.String str) |
static java.util.List<Term> |
BaseAnalysis.parse(java.lang.String str) |
static java.util.List<Term> |
ToAnalysis.parse(java.lang.String str,
love.cq.domain.Forest... forests) |
static java.util.List<Term> |
NlpAnalysis.parse(java.lang.String str,
LearnTool learn) |
| Modifier and Type | Field and Description |
|---|---|
protected Term |
Graph.end |
protected Term |
Graph.root |
Term[] |
Graph.terms |
| Modifier and Type | Method and Description |
|---|---|
static Term |
TermUtil.makeNewTermNum(Term from,
Term to,
TermNatures termNatures)
将两个term合并为一个全新的term
|
protected Term |
Graph.optimalRoot()
取得最优路径的root Term
|
protected static Term |
TermUtil.setToAndfrom(Term to,
Term from) |
| Modifier and Type | Method and Description |
|---|---|
java.util.List<Term> |
Graph.getResult(Analysis.Merger merger)
构建最优路径
|
static java.util.List<Term> |
FilterModifWord.modifResult(java.util.List<Term> all)
停用词.并且更新用户词表.如果是用此方法.
|
static java.util.List<Term> |
FilterModifWord.modifResult(java.util.List<Term> all,
java.util.HashMap<java.lang.String,java.lang.String> updateDic) |
| Modifier and Type | Method and Description |
|---|---|
void |
Graph.addTerm(Term term)
增加一个词语到图中
|
static double |
MathUtil.compuScore(Term from,
Term to)
从一个词的词性到另一个词的词的分数
|
static double |
MathUtil.compuScoreFreq(Term from,
Term term)
词性词频词长.计算出来一个分数
|
static void |
TermUtil.insertTerm(Term[] terms,
java.util.List<Term> tempList,
TermNatures nr) |
static void |
TermUtil.insertTerm(Term[] terms,
Term term)
将一个term插入到链表中的对应位置中,不排序了.
|
static void |
TermUtil.insertTerm(Term[] terms,
Term term)
将一个term插入到链表中的对应位置中,不排序了.
|
static void |
TermUtil.insertTermNum(Term[] terms,
Term term) |
static void |
TermUtil.insertTermNum(Term[] terms,
Term term) |
static Term |
TermUtil.makeNewTermNum(Term from,
Term to,
TermNatures termNatures)
将两个term合并为一个全新的term
|
protected static Term |
TermUtil.setToAndfrom(Term to,
Term from) |
static void |
TermUtil.termLink(Term from,
Term to) |
| Modifier and Type | Method and Description |
|---|---|
static void |
TermUtil.insertTerm(Term[] terms,
java.util.List<Term> tempList,
TermNatures nr) |
static double |
MathUtil.leftRightEntropy(java.util.List<Term> all)
新词熵及其左右熵
|
static java.util.List<Term> |
FilterModifWord.modifResult(java.util.List<Term> all)
停用词.并且更新用户词表.如果是用此方法.
|
static java.util.List<Term> |
FilterModifWord.modifResult(java.util.List<Term> all,
java.util.HashMap<java.lang.String,java.lang.String> updateDic) |
Copyright © 2013. All Rights Reserved.