uk.ac.shef.dcs.oak.jate.core.algorithm
Class GlossExAlgorithm
java.lang.Object
uk.ac.shef.dcs.oak.jate.core.algorithm.GlossExAlgorithm
- All Implemented Interfaces:
- Algorithm
public class GlossExAlgorithm
- extends java.lang.Object
- implements Algorithm
An implementation of the GlossEx term recognition algorithm. See Kozakov, et. al 2004,
Glossary extraction and utilization in the information search and delivery system for IBM Technical Support
. This is the implementation of the scoring formula only, and does not include the filtering algorithm as mentioned
in the paper.
In the equation C(T) = a* TD(T) + B*TC(T), default a=0.2, B = 0.8.
You might need to modify the value of B by increasing it substaintially when the reference corpus is relatively
much bigger than the target corpus, such as the BNC corpus. For details, please refer to the paper.
Method Summary |
Term[] |
execute(AbstractFeatureWrapper store)
Execute the algorithm by analysing the features stored in the AbstractFeatureWrapper and return terms extracted and
sorted by their relevance |
java.lang.String |
toString()
|
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
GlossExAlgorithm
public GlossExAlgorithm()
GlossExAlgorithm
public GlossExAlgorithm(double alpha,
double beta)
execute
public Term[] execute(AbstractFeatureWrapper store)
throws JATEException
- Description copied from interface:
Algorithm
- Execute the algorithm by analysing the features stored in the AbstractFeatureWrapper and return terms extracted and
sorted by their relevance
- Specified by:
execute
in interface Algorithm
- Returns:
-
- Throws:
JATEException
toString
public java.lang.String toString()
- Overrides:
toString
in class java.lang.Object