de.l3s.boilerpipe.extractors
Class NumWordsRulesExtractor

java.lang.Object
  extended by de.l3s.boilerpipe.extractors.ExtractorBase
      extended by de.l3s.boilerpipe.extractors.NumWordsRulesExtractor
All Implemented Interfaces:
BoilerpipeExtractor, BoilerpipeFilter

public class NumWordsRulesExtractor
extends ExtractorBase

A generic full-text extractor that classifies each block solely by word count: that of the current block, the previous block, and the next block.
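To make the idea of word-count rules concrete, the following is a minimal, self-contained sketch and not the library's implementation. The class name WordCountRulesSketch, its methods, and all three thresholds are hypothetical and chosen only for illustration; the actual rules used by NumWordsRulesExtractor are not stated on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only: NOT the boilerpipe implementation.
final class WordCountRulesSketch {
    // Hypothetical thresholds, assumed for this example.
    static final int CURRENT_MAX_SHORT = 16;
    static final int NEXT_MAX_SHORT = 15;
    static final int PREVIOUS_MAX_SHORT = 4;

    // Counts whitespace-separated tokens in a block of text.
    static int countWords(String block) {
        String trimmed = block.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    // A block is treated as content if it is long itself, or if one of its
    // neighbours is long enough to suggest it sits inside running text.
    static boolean isContent(int prevWords, int currWords, int nextWords) {
        if (currWords > CURRENT_MAX_SHORT) {
            return true;
        }
        if (nextWords > NEXT_MAX_SHORT) {
            return true;
        }
        return prevWords > PREVIOUS_MAX_SHORT;
    }

    // Labels every block using the word counts of the previous, current and
    // next block; a missing neighbour counts as zero words.
    static List<Boolean> classify(List<String> blocks) {
        List<Boolean> labels = new ArrayList<>();
        for (int i = 0; i < blocks.size(); i++) {
            int prev = i > 0 ? countWords(blocks.get(i - 1)) : 0;
            int curr = countWords(blocks.get(i));
            int next = i + 1 < blocks.size() ? countWords(blocks.get(i + 1)) : 0;
            labels.add(isContent(prev, curr, next));
        }
        return labels;
    }

    public static void main(String[] args) {
        List<String> blocks = Arrays.asList("Home About", "Login");
        System.out.println(classify(blocks));
    }
}
```

The sketch looks at one neighbour on each side, mirroring the "current, previous and next block" description above. Any real rule set would need thresholds tuned on labelled data.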

Author:
Christian Kohlschütter

Field Summary
static NumWordsRulesExtractor INSTANCE
           
 
Constructor Summary
NumWordsRulesExtractor()
           
 
Method Summary
static NumWordsRulesExtractor getInstance()
          Returns the singleton instance for NumWordsRulesExtractor.
 boolean process(TextDocument doc)
          Processes the given document doc.
 
Methods inherited from class de.l3s.boilerpipe.extractors.ExtractorBase
getText, getText, getText, getText, getText
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

INSTANCE

public static final NumWordsRulesExtractor INSTANCE

Constructor Detail

NumWordsRulesExtractor

public NumWordsRulesExtractor()

Method Detail

getInstance

public static NumWordsRulesExtractor getInstance()
Returns the singleton instance for NumWordsRulesExtractor.


process

public boolean process(TextDocument doc)
                throws BoilerpipeProcessingException
Description copied from interface: BoilerpipeFilter
Processes the given document doc.

Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException
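
The contract described above (return true when the document was changed, signal failure with a checked exception) can be sketched with stand-in types. Everything in this example is hypothetical: Doc, Filter, DROP_BLANK and apply are illustrative names, not boilerpipe classes, and a plain Exception stands in for BoilerpipeProcessingException.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: stand-in types, NOT the boilerpipe API.
final class FilterContractSketch {
    // Hypothetical stand-in for a document: a mutable list of text blocks.
    static final class Doc {
        final List<String> blocks;

        Doc(List<String> blocks) {
            this.blocks = new ArrayList<>(blocks);
        }
    }

    // Hypothetical stand-in for a filter whose process method reports
    // whether it modified the document and may throw a checked exception.
    interface Filter {
        boolean process(Doc doc) throws Exception;
    }

    // Example filter: removes blank blocks; removeIf returns true only if
    // at least one block was removed, which matches the contract.
    static final Filter DROP_BLANK = doc -> doc.blocks.removeIf(b -> b.trim().isEmpty());

    // Caller-side handling: run the filter and convert the checked
    // exception into an unchecked one with the cause preserved.
    static boolean apply(Filter filter, Doc doc) {
        try {
            return filter.process(doc);
        } catch (Exception e) {
            throw new IllegalStateException("Processing failed", e);
        }
    }
}
```

A caller can use the boolean result to skip follow-up work when nothing changed; a second pass of DROP_BLANK over the same document, for instance, reports false.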