de.l3s.boilerpipe.filters.heuristics
Class DocumentTitleMatchClassifier

java.lang.Object
  extended by de.l3s.boilerpipe.filters.heuristics.DocumentTitleMatchClassifier
All Implemented Interfaces:
BoilerpipeFilter

public final class DocumentTitleMatchClassifier
extends java.lang.Object
implements BoilerpipeFilter

Marks TextBlocks which contain parts of the HTML <TITLE> tag, using some heuristics which are quite specific to the news domain.

Author:
Christian Kohlschütter

Constructor Summary
DocumentTitleMatchClassifier(java.lang.String title)
           
 
Method Summary
 boolean process(TextDocument doc)
          Processes the given document doc.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DocumentTitleMatchClassifier

public DocumentTitleMatchClassifier(java.lang.String title)
Method Detail

process

public boolean process(TextDocument doc)
                throws BoilerpipeProcessingException
Description copied from interface: BoilerpipeFilter
Processes the given document doc.

Specified by:
process in interface BoilerpipeFilter
Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException