Uses of Class
de.l3s.boilerpipe.document.TextDocument

Packages that use TextDocument
de.l3s.boilerpipe The Boilerpipe top-level package. 
de.l3s.boilerpipe.extractors This package contains some standard extractors (i.e., completely piped BoilerpipeFilters). 
de.l3s.boilerpipe.filters.english The BoilerpipeFilters in this package have only been tested on English text. 
de.l3s.boilerpipe.filters.heuristics The BoilerpipeFilters in this package are pure heuristics. 
de.l3s.boilerpipe.filters.simple The BoilerpipeFilters in this package are straightforward and probably not specific to English. 
de.l3s.boilerpipe.sax Classes related to parsing and producing HTML from/to Boilerpipe TextDocuments. 
 

Uses of TextDocument in de.l3s.boilerpipe
 

Methods in de.l3s.boilerpipe that return TextDocument
 TextDocument BoilerpipeInput.getTextDocument()
          Returns a TextDocument; how it is obtained is up to the implementation.
 

Methods in de.l3s.boilerpipe with parameters of type TextDocument
 java.lang.String BoilerpipeExtractor.getText(TextDocument doc)
          Extracts text from the given TextDocument object.
 boolean BoilerpipeFilter.process(TextDocument doc)
          Processes the given document doc.
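The two signatures above suggest a simple pattern: a filter mutates a document and reports a boolean, and an extractor produces text from a document. The following is a minimal sketch of that pattern only. Block, Doc, Filter, minWords and getText here are hypothetical stand-ins written for illustration, not the real de.l3s.boilerpipe classes, and the meaning of the boolean result (true if the document was changed) is an assumption that this page does not state.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins for illustration; NOT the real de.l3s.boilerpipe classes.
final class FilterPipelineSketch {

    /** Simplified stand-in for a text block: some text plus a content flag. */
    static final class Block {
        final String text;
        boolean isContent; // false until a filter marks the block as content
        Block(String text) { this.text = text; }
    }

    /** Simplified stand-in for TextDocument: an ordered list of blocks. */
    static final class Doc {
        final List<Block> blocks = new ArrayList<>();
        Doc(String... texts) {
            for (String t : texts) blocks.add(new Block(t));
        }
        /** Joins the text of all blocks currently marked as content. */
        String content() {
            StringBuilder sb = new StringBuilder();
            for (Block b : blocks) {
                if (b.isContent) sb.append(b.text).append('\n');
            }
            return sb.toString();
        }
    }

    /** Mirrors the shape of boolean process(TextDocument doc). */
    interface Filter {
        boolean process(Doc doc); // assumed contract: true if doc was changed
    }

    /** Marks blocks with at least minWords words as content (hypothetical rule). */
    static Filter minWords(int minWords) {
        return doc -> {
            boolean changed = false;
            for (Block b : doc.blocks) {
                boolean keep = b.text.trim().split("\\s+").length >= minWords;
                if (b.isContent != keep) {
                    b.isContent = keep;
                    changed = true;
                }
            }
            return changed;
        };
    }

    /** Runs the filters in order, then returns the content: the getText(doc) shape. */
    static String getText(Doc doc, List<Filter> pipeline) {
        for (Filter f : pipeline) {
            f.process(doc);
        }
        return doc.content();
    }

    public static void main(String[] args) {
        Doc doc = new Doc("Home | About", "This paragraph is long enough to keep.");
        System.out.print(getText(doc, Arrays.asList(minWords(5))));
    }
}
```

In this sketch the short navigation-like block is left unmarked and only the longer block reaches the output. Returning a boolean from process lets a caller tell whether a pass had any effect, which is one plausible reason for the signature listed above.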
 

Uses of TextDocument in de.l3s.boilerpipe.extractors
 

Methods in de.l3s.boilerpipe.extractors with parameters of type TextDocument
 java.lang.String ExtractorBase.getText(TextDocument doc)
          Extracts text from the given TextDocument object.
 boolean NumWordsRulesExtractor.process(TextDocument doc)
           
 boolean LargestContentExtractor.process(TextDocument doc)
           
 boolean KeepEverythingWithMinKWordsExtractor.process(TextDocument doc)
           
 boolean KeepEverythingExtractor.process(TextDocument doc)
           
 boolean DefaultExtractor.process(TextDocument doc)
           
 boolean ArticleSentencesExtractor.process(TextDocument doc)
           
 boolean ArticleExtractor.process(TextDocument doc)
           
 

Uses of TextDocument in de.l3s.boilerpipe.filters.english
 

Methods in de.l3s.boilerpipe.filters.english with parameters of type TextDocument
 boolean TerminatingBlocksFinder.process(TextDocument doc)
           
 boolean NumWordsRulesClassifier.process(TextDocument doc)
           
 boolean MinFulltextWordsFilter.process(TextDocument doc)
           
 boolean KeepLargestFulltextBlockFilter.process(TextDocument doc)
           
 boolean IgnoreBlocksAfterContentFilter.process(TextDocument doc)
           
 boolean DensityRulesClassifier.process(TextDocument doc)
           
 

Uses of TextDocument in de.l3s.boilerpipe.filters.heuristics
 

Methods in de.l3s.boilerpipe.filters.heuristics with parameters of type TextDocument
 boolean SimpleBlockFusionProcessor.process(TextDocument doc)
           
 boolean KeepLargestBlockFilter.process(TextDocument doc)
           
 boolean ExpandTitleToContentFilter.process(TextDocument doc)
           
 boolean DocumentTitleMatchClassifier.process(TextDocument doc)
           
 boolean BlockProximityFusion.process(TextDocument doc)
           
 

Uses of TextDocument in de.l3s.boilerpipe.filters.simple
 

Methods in de.l3s.boilerpipe.filters.simple with parameters of type TextDocument
 boolean SplitParagraphBlocksFilter.process(TextDocument doc)
           
 boolean MinWordsFilter.process(TextDocument doc)
           
 boolean MinClauseWordsFilter.process(TextDocument doc)
           
 boolean MarkEverythingContentFilter.process(TextDocument doc)
           
 boolean InvertedFilter.process(TextDocument doc)
           
 boolean BoilerplateBlockFilter.process(TextDocument doc)
           
 

Uses of TextDocument in de.l3s.boilerpipe.sax
 

Methods in de.l3s.boilerpipe.sax that return TextDocument
 TextDocument BoilerpipeSAXInput.getTextDocument()
           
 TextDocument BoilerpipeHTMLParser.toTextDocument()
          Returns a TextDocument containing the extracted TextBlocks.
 TextDocument BoilerpipeHTMLContentHandler.toTextDocument()
          Returns a TextDocument containing the extracted TextBlocks.
 

Constructors in de.l3s.boilerpipe.sax with parameters of type TextDocument
HTMLHighlighter(TextDocument doc, org.xml.sax.InputSource is)
          Prepares the HTMLHighlighter for the given TextDocument and the original HTML text (as an InputSource).
HTMLHighlighter(TextDocument doc, java.lang.String origHTML)
          Prepares the HTMLHighlighter for the given TextDocument and the original HTML text (as a String).