uk.ac.shef.dcs.oak.jate.core.algorithm
Class GlossExAlgorithm

java.lang.Object
  extended by uk.ac.shef.dcs.oak.jate.core.algorithm.GlossExAlgorithm
All Implemented Interfaces:
Algorithm

public class GlossExAlgorithm
extends java.lang.Object
implements Algorithm

An implementation of the GlossEx term recognition algorithm. See Kozakov, et. al 2004, Glossary extraction and utilization in the information search and delivery system for IBM Technical Support . This is the implementation of the scoring formula only, and does not include the filtering algorithm as mentioned in the paper.

In the equation C(T) = a* TD(T) + B*TC(T), default a=0.2, B = 0.8.

You might need to modify the value of B by increasing it substaintially when the reference corpus is relatively much bigger than the target corpus, such as the BNC corpus. For details, please refer to the paper.


Constructor Summary
GlossExAlgorithm()
           
GlossExAlgorithm(double alpha, double beta)
           
 
Method Summary
 Term[] execute(AbstractFeatureWrapper store)
          Execute the algorithm by analysing the features stored in the AbstractFeatureWrapper and return terms extracted and sorted by their relevance
 java.lang.String toString()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Constructor Detail

GlossExAlgorithm

public GlossExAlgorithm()

GlossExAlgorithm

public GlossExAlgorithm(double alpha,
                        double beta)
Method Detail

execute

public Term[] execute(AbstractFeatureWrapper store)
               throws JATEException
Description copied from interface: Algorithm
Execute the algorithm by analysing the features stored in the AbstractFeatureWrapper and return terms extracted and sorted by their relevance

Specified by:
execute in interface Algorithm
Returns:
Throws:
JATEException

toString

public java.lang.String toString()
Overrides:
toString in class java.lang.Object