Package org.lionsoul.jcseg.segmenter
Class MMSegFilter
- java.lang.Object
-
- org.lionsoul.jcseg.segmenter.MMSegFilter
-
public class MMSegFilter extends Object
MMSeg default filter class- Author:
- chenxin
-
-
Constructor Summary
Constructors Constructor Description MMSegFilter()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static ArrayList<IChunk>getLargestAverageWordLengthChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)2.static ArrayList<IChunk>getLargestSingleMorphemicFreedomChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)the largest sum of degree of morphemic freedom of one-character words this rule will return the chunks that own the largest sum of degree of morphemic freedom of one-characterstatic ArrayList<IChunk>getMaximumMatchChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)1.static ArrayList<IChunk>getSmallestVarianceWordLengthChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)the smallest variance word length this rule will the chunks that one the smallest variance word length
-
-
-
Method Detail
-
getMaximumMatchChunks
public static ArrayList<IChunk> getMaximumMatchChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)
1. the maximum match rule this rule will return the chunks that own the largest word length
-
getLargestAverageWordLengthChunks
public static ArrayList<IChunk> getLargestAverageWordLengthChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)
2. largest average word length this rule will return the chunks that own the largest average word length
-
getSmallestVarianceWordLengthChunks
public static ArrayList<IChunk> getSmallestVarianceWordLengthChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)
the smallest variance word length this rule will the chunks that one the smallest variance word length
-
getLargestSingleMorphemicFreedomChunks
public static ArrayList<IChunk> getLargestSingleMorphemicFreedomChunks(ArrayList<IChunk> inChunks, ArrayList<IChunk> outChunks)
the largest sum of degree of morphemic freedom of one-character words this rule will return the chunks that own the largest sum of degree of morphemic freedom of one-character
-
-