Packages that use BoilerpipeExtractor | |
---|---|
de.l3s.boilerpipe.extractors | This package contains some standard extractors (i.e., completely piped BoilerpipeFilters). |
de.l3s.boilerpipe.sax | Classes related to parsing and producing HTML from/to Boilerpipe TextDocuments. |
Uses of BoilerpipeExtractor in de.l3s.boilerpipe.extractors |
---|
Classes in de.l3s.boilerpipe.extractors that implement BoilerpipeExtractor | |
---|---|
class |
ArticleExtractor
A full-text extractor which is tuned towards news articles. |
class |
ArticleSentencesExtractor
A full-text extractor which is tuned towards extracting sentences from news articles. |
class |
CanolaExtractor
A full-text extractor trained on krdwrd Canola. |
class |
DefaultExtractor
A quite generic full-text extractor. |
class |
ExtractorBase
The base class of Extractors. |
class |
KeepEverythingExtractor
Marks everything as content. |
class |
KeepEverythingWithMinKWordsExtractor
A full-text extractor which marks everything as content but keeps only text blocks with at least k words. |
class |
LargestContentExtractor
A full-text extractor which extracts the largest text component of a page. |
class |
NumWordsRulesExtractor
A quite generic full-text extractor solely based upon the number of words per block (the current, the previous and the next block). |
Uses of BoilerpipeExtractor in de.l3s.boilerpipe.sax |
---|
Methods in de.l3s.boilerpipe.sax with parameters of type BoilerpipeExtractor | |
---|---|
java.lang.String |
HTMLHighlighter.process(java.net.URL url, BoilerpipeExtractor extractor)
|
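The tables above describe one pattern: several extractor classes implement a common interface, and a method such as HTMLHighlighter.process accepts any of them as a parameter. The following is a minimal, self-contained sketch of that pattern only. It does not use the boilerpipe library: TextExtractor, KEEP_EVERYTHING, LARGEST_BLOCK and extractWith are hypothetical stand-ins invented for illustration, and the regex-based logic is a toy approximation, not the algorithms the real extractors use.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class ExtractorSketch {

    /** Hypothetical stand-in for the BoilerpipeExtractor interface (not the real API). */
    interface TextExtractor {
        String getText(String html);
    }

    /** Toy analogue of "marks everything as content": drop the tags, keep all text. */
    static final TextExtractor KEEP_EVERYTHING = html -> stripTags(html);

    /** Toy analogue of "largest text component": keep the <p> block with the most words. */
    static final TextExtractor LARGEST_BLOCK = html -> {
        Matcher m = Pattern.compile("<p>(.*?)</p>", Pattern.DOTALL).matcher(html);
        String best = "";
        while (m.find()) {
            String text = stripTags(m.group(1));
            if (wordCount(text) > wordCount(best)) {
                best = text;
            }
        }
        return best;
    };

    /** Accepts any extractor, in the way a method takes a parameter of the interface type. */
    static String extractWith(String html, TextExtractor extractor) {
        return extractor.getText(html);
    }

    /** Replaces tags with spaces, then collapses runs of whitespace. */
    static String stripTags(String html) {
        return html.replaceAll("<[^>]*>", " ").replaceAll("\\s+", " ").trim();
    }

    static int wordCount(String text) {
        return text.isEmpty() ? 0 : text.split("\\s+").length;
    }

    public static void main(String[] args) {
        String html = "<html><body><p>Home | About</p>"
                + "<p>The quick brown fox jumps over the lazy dog.</p></body></html>";
        // The caller picks the strategy; extractWith itself does not change.
        System.out.println(extractWith(html, KEEP_EVERYTHING));
        System.out.println(extractWith(html, LARGEST_BLOCK));
    }
}
```

With the real library, the caller would pass one of the listed classes (for example ArticleExtractor or DefaultExtractor) where this sketch passes a lambda; the point of the shared interface is that the receiving method stays the same while the extraction strategy varies.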