de.l3s.boilerpipe.sax
Class HTMLDocument

java.lang.Object
  extended by de.l3s.boilerpipe.sax.HTMLDocument
All Implemented Interfaces:
InputSourceable

public class HTMLDocument
extends java.lang.Object
implements InputSourceable

An InputSourceable for HTMLFetcher.

Author:
Christian Kohlschütter

Constructor Summary
HTMLDocument(byte[] data, java.nio.charset.Charset charset)
           
HTMLDocument(java.lang.String data)
           
 
Method Summary
 java.nio.charset.Charset getCharset()
           
 byte[] getData()
           
 org.xml.sax.InputSource toInputSource()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLDocument

public HTMLDocument(byte[] data,
                    java.nio.charset.Charset charset)

HTMLDocument

public HTMLDocument(java.lang.String data)
Method Detail

getCharset

public java.nio.charset.Charset getCharset()

getData

public byte[] getData()

toInputSource

public org.xml.sax.InputSource toInputSource()
Specified by:
toInputSource in interface InputSourceable