uk.ac.shef.dcs.oak.jate.core.feature
Class FeatureBuilderCorpusTermFrequency
java.lang.Object
uk.ac.shef.dcs.oak.jate.core.feature.AbstractFeatureBuilder
uk.ac.shef.dcs.oak.jate.core.feature.FeatureBuilderCorpusTermFrequency
public class FeatureBuilderCorpusTermFrequency
- extends AbstractFeatureBuilder
A specific type of feature builder that builds an instance of FeatureCorpusTermFrequency from a GlobalIndex.
Counting of term frequency is case-sensitive. For each canonical term form, each of its variants (letter case,
inflections etc) are counted in the document.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
FeatureBuilderCorpusTermFrequency
public FeatureBuilderCorpusTermFrequency(TermFreqCounter counter1,
WordCounter counter2,
Normalizer normaliser)
- Creates an instance
- Parameters:
counter1
- candidate term counter, counting distributions of candidate termscounter2
- word counter, counting number of words in documentsnormaliser
- a normaliser for returning terms to their canonical forms
over the corpus and add up to the total frequencies of the lemma.
build
public FeatureCorpusTermFrequency build(GlobalIndex index)
throws JATEException
- Build an instance of FeatureCorpusTermFrequency
- Specified by:
build
in class AbstractFeatureBuilder
- Parameters:
index
- the global resource index
- Returns:
-
- Throws:
JATEException