|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectde.l3s.boilerpipe.sax.CommonTagActions
public abstract class CommonTagActions
Defines an action that is to be performed whenever a particular tag occurs during HTML parsing.
Nested Class Summary | |
---|---|
static class |
CommonTagActions.BlockTagLabelAction
CommonTagActions for block-level elements, which triggers some LabelAction on the generated
TextBlock . |
static class |
CommonTagActions.Chained
|
static class |
CommonTagActions.InlineTagLabelAction
CommonTagActions for inline elements, which triggers some LabelAction on the generated
TextBlock . |
Field Summary | |
---|---|
static TagAction |
TA_ANCHOR_TEXT
Marks this tag as "anchor" (this should usually only be set for the <A> tag). |
static TagAction |
TA_BLOCK_LEVEL
Explicitly marks this tag a simple "block-level" element, which always generates whitespace |
static TagAction |
TA_BODY
Marks this tag the body element (this should usually only be set for the <BODY> tag). |
static TagAction |
TA_FONT
Special TagAction for the <FONT> tag, which keeps track of the
absolute and relative font size. |
static TagAction |
TA_IGNORABLE_ELEMENT
Marks this tag as "ignorable", i.e. all its inner content is silently skipped. |
static TagAction |
TA_INLINE
Deprecated. Use TA_INLINE_WHITESPACE instead |
static TagAction |
TA_INLINE_NO_WHITESPACE
Marks this tag a simple "inline" element, which neither generates whitespace, nor a new block. |
static TagAction |
TA_INLINE_WHITESPACE
Marks this tag a simple "inline" element, which generates whitespace, but no new block. |
Method Summary |
---|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final TagAction TA_IGNORABLE_ELEMENT
public static final TagAction TA_ANCHOR_TEXT
<A>
tag).
Anchor tags may not be nested.
There is a bug in certain versions of NekoHTML which still allows nested tags.
If boilerpipe encounters such nestings, a SAXException is thrown.
public static final TagAction TA_BODY
<BODY>
tag).
public static final TagAction TA_INLINE_WHITESPACE
@Deprecated public static final TagAction TA_INLINE
TA_INLINE_WHITESPACE
insteadpublic static final TagAction TA_INLINE_NO_WHITESPACE
public static final TagAction TA_BLOCK_LEVEL
public static final TagAction TA_FONT
<FONT>
tag, which keeps track of the
absolute and relative font size.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |