HTML Lexer Jar Home Page
A B C D E F G H I K L M N O P Q R S T U V

A

AbstractNode - Class in org.htmlparser.nodes
The concrete base class for all types of nodes (tags, text remarks).
AbstractNode(Page, int, int) - Constructor for class org.htmlparser.nodes.AbstractNode
Create an abstract node with the page positions given.
accept(NodeVisitor) - Method in interface org.htmlparser.Node
Apply the visitor to this node.
accept(Node) - Method in interface org.htmlparser.NodeFilter
Predicate to determine whether or not to keep the given node.
accept(NodeVisitor) - Method in class org.htmlparser.nodes.AbstractNode
Visit this node.
accept(NodeVisitor) - Method in class org.htmlparser.nodes.RemarkNode
Remark visiting code.
accept(NodeVisitor) - Method in class org.htmlparser.nodes.TagNode
Default tag visiting code.
accept(NodeVisitor) - Method in class org.htmlparser.nodes.TextNode
String visiting code.
add(Cursor) - Method in class org.htmlparser.lexer.PageIndex
Add an element to the list
add(int) - Method in class org.htmlparser.lexer.PageIndex
Add an element to the list
add(Node) - Method in class org.htmlparser.util.NodeList
 
add(NodeList) - Method in class org.htmlparser.util.NodeList
Add another node list to this one.
addCookies(URLConnection) - Method in class org.htmlparser.http.ConnectionManager
Generate a HTTP cookie header value string from the cookie jar.
addCookies(Vector, String, Vector) - Method in class org.htmlparser.http.ConnectionManager
Add qualified cookies from cookies into list.
advance() - Method in class org.htmlparser.lexer.Cursor
Move the cursor position ahead one character.
asString() - Method in class org.htmlparser.util.NodeList
 
Attribute - Class in org.htmlparser
An attribute within a tag.
Attribute(String, String, String, char) - Constructor for class org.htmlparser.Attribute
Create an attribute with the name, assignment, value and quote given.
Attribute(String, String, char) - Constructor for class org.htmlparser.Attribute
Create an attribute with the name, value and quote given.
Attribute(String) - Constructor for class org.htmlparser.Attribute
Create a whitespace attribute with the value given.
Attribute(String, String) - Constructor for class org.htmlparser.Attribute
Create an attribute with the name and value given.
Attribute(String, String, String) - Constructor for class org.htmlparser.Attribute
Create an attribute with the name, assignment string and value given.
Attribute() - Constructor for class org.htmlparser.Attribute
Create an empty attribute.
available() - Method in class org.htmlparser.lexer.InputStreamSource
Get the number of available characters.
available() - Method in class org.htmlparser.lexer.Source
Get the number of available characters.
available() - Method in class org.htmlparser.lexer.Stream
Returns the number of bytes that can be read (or skipped over) from this input stream without blocking by the next caller of a method for this input stream.
available() - Method in class org.htmlparser.lexer.StringSource
Get the number of available characters.

B

beginParsing() - Method in class org.htmlparser.visitors.NodeVisitor
Override this method if you wish to do special processing prior to the start of parsing.
breaksFlow() - Method in class org.htmlparser.nodes.TagNode
Determines if the given tag breaks the flow of text.
breaksFlow() - Method in interface org.htmlparser.Tag
Determines if the given tag breaks the flow of text.
breakTags - Static variable in class org.htmlparser.nodes.TagNode
Set of tags that breaks the flow.
bsearch(int) - Method in class org.htmlparser.lexer.PageIndex
Binary search for the element.
bsearch(int, int, int) - Method in class org.htmlparser.lexer.PageIndex
Binary search for the element.
bsearch(Sortable, Ordered, int, int) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
bsearch(Sortable, Ordered) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
bsearch(Vector, Ordered, int, int) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
bsearch(Vector, Ordered) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
bsearch(Ordered[], Ordered, int, int) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
bsearch(Ordered[], Ordered) - Static method in class org.htmlparser.util.sort.Sort
Binary search for an object
BUFFER_SIZE - Static variable in class org.htmlparser.lexer.InputStreamSource
An initial buffer size.
BUFFER_SIZE - Static variable in class org.htmlparser.lexer.Stream
An initial buffer size.

C

capacity() - Method in class org.htmlparser.lexer.PageIndex
Get the capacity for elements without reallocation.
ChainedException - Exception in org.htmlparser.util
 
ChainedException() - Constructor for exception org.htmlparser.util.ChainedException
 
ChainedException(String) - Constructor for exception org.htmlparser.util.ChainedException
 
ChainedException(Throwable) - Constructor for exception org.htmlparser.util.ChainedException
 
ChainedException(String, Throwable) - Constructor for exception org.htmlparser.util.ChainedException
 
children - Variable in class org.htmlparser.nodes.AbstractNode
The children of this node.
clone() - Method in class org.htmlparser.http.Cookie
Returns a copy of this object.
clone() - Method in interface org.htmlparser.Node
Allow cloning of nodes.
clone() - Method in class org.htmlparser.nodes.AbstractNode
Clone this object.
close() - Method in class org.htmlparser.lexer.InputStreamSource
Does nothing.
close() - Method in class org.htmlparser.lexer.Page
Close the page by destroying the source of characters.
close() - Method in class org.htmlparser.lexer.Source
Does nothing.
close() - Method in class org.htmlparser.lexer.Stream
Closes this input stream and releases any system resources associated with the stream.
close() - Method in class org.htmlparser.lexer.StringSource
Does nothing.
collectInto(NodeList, NodeFilter) - Method in interface org.htmlparser.Node
Collect this node and its child nodes into a list, provided the node satisfies the filtering criteria.
collectInto(NodeList, NodeFilter) - Method in class org.htmlparser.nodes.AbstractNode
Collect this node and its child nodes (if-applicable) into the collectionList parameter, provided the node satisfies the filtering criteria.
column(Cursor) - Method in class org.htmlparser.lexer.Page
Get the column number for a cursor.
column(int) - Method in class org.htmlparser.lexer.Page
Get the column number for a cursor.
column(Cursor) - Method in class org.htmlparser.lexer.PageIndex
Get the column number for a cursor.
column(int) - Method in class org.htmlparser.lexer.PageIndex
Get the column number for a position.
compare(Object) - Method in class org.htmlparser.lexer.Cursor
Compare one reference to another.
compare(Object) - Method in interface org.htmlparser.util.sort.Ordered
Compares this object with another for order.
ConnectionManager - Class in org.htmlparser.http
Handles proxies, password protected URLs and request properties including cookies.
ConnectionManager() - Constructor for class org.htmlparser.http.ConnectionManager
Create a connection manager.
ConnectionManager(Hashtable) - Constructor for class org.htmlparser.http.ConnectionManager
Create a connection manager with the given connection properties.
ConnectionMonitor - Interface in org.htmlparser.http
Interface for HTTP connection notification callbacks.
constructUrl(String, String) - Method in class org.htmlparser.lexer.Page
Build a URL from the link and base provided using non-strict rules.
constructUrl(String, String, boolean) - Method in class org.htmlparser.lexer.Page
Build a URL from the link and base provided.
contains(Node) - Method in class org.htmlparser.util.NodeList
Check to see if the NodeList contains the supplied Node.
Cookie - Class in org.htmlparser.http
A HTTP cookie.
Cookie(String, String) - Constructor for class org.htmlparser.http.Cookie
Defines a cookie with an initial name/value pair.
copyToNodeArray(Node[]) - Method in class org.htmlparser.util.NodeList
 
createRemarkNode(Page, int, int) - Method in class org.htmlparser.lexer.Lexer
Create a new remark node.
createRemarkNode(Page, int, int) - Method in interface org.htmlparser.NodeFactory
Create a new remark node.
createStringNode(Page, int, int) - Method in class org.htmlparser.lexer.Lexer
Create a new string node.
createStringNode(Page, int, int) - Method in interface org.htmlparser.NodeFactory
Create a new text node.
createTagNode(Page, int, int, Vector) - Method in class org.htmlparser.lexer.Lexer
Create a new tag node.
createTagNode(Page, int, int, Vector) - Method in interface org.htmlparser.NodeFactory
Create a new tag node.
Cursor - Class in org.htmlparser.lexer
A bookmark in a page.
Cursor(Page, int) - Constructor for class org.htmlparser.lexer.Cursor
Construct a Cursor from the page and position given.

D

DEFAULT_CHARSET - Static variable in class org.htmlparser.lexer.Page
The default charset.
DEFAULT_CONTENT_TYPE - Static variable in class org.htmlparser.lexer.Page
The default content type.
destroy() - Method in class org.htmlparser.lexer.InputStreamSource
Close the source.
destroy() - Method in class org.htmlparser.lexer.Source
Close the source.
destroy() - Method in class org.htmlparser.lexer.StringSource
Close the source.
doSemanticAction() - Method in interface org.htmlparser.Node
Perform the meaning of this tag.
doSemanticAction() - Method in class org.htmlparser.nodes.AbstractNode
Perform the meaning of this tag.
dup() - Method in class org.htmlparser.lexer.Cursor
Make a new cursor just like this one.

E

elementAt(int) - Method in class org.htmlparser.lexer.PageIndex
Get an element from the list.
elementAt(int) - Method in class org.htmlparser.util.NodeList
 
elements() - Method in class org.htmlparser.util.NodeList
 
encode(byte[]) - Static method in class org.htmlparser.http.ConnectionManager
Encodes a byte array into BASE64 in accordance with RFC 2045.
EncodingChangeException - Exception in org.htmlparser.util
The encoding is changed invalidating already scanned characters.
EncodingChangeException(String, String, String) - Constructor for exception org.htmlparser.util.EncodingChangeException
Create an exception idicative of a problematic encoding change.
EOF - Static variable in class org.htmlparser.lexer.Page
Character value when the page is exhausted.
EOF - Static variable in class org.htmlparser.lexer.Source
Return value when the source is exhausted.
EOF - Static variable in class org.htmlparser.lexer.Stream
Return value when no more characters are left.
extractAllNodesThatMatch(NodeFilter) - Method in class org.htmlparser.util.NodeList
Filter the list with the given filter non-recursively.
extractAllNodesThatMatch(NodeFilter, boolean) - Method in class org.htmlparser.util.NodeList
Filter the list with the given filter.

F

fetch(int, Ordered) - Method in class org.htmlparser.lexer.PageIndex
Fetch the object at the given index.
fetch(int, Ordered) - Method in interface org.htmlparser.util.sort.Sortable
Fetch the object at the given index.
fill(int) - Method in class org.htmlparser.lexer.InputStreamSource
Fetch more characters from the underlying reader.
fill(boolean) - Method in class org.htmlparser.lexer.Stream
Fetch more bytes from the underlying stream.
fills - Variable in class org.htmlparser.lexer.Stream
The number of calls to fill.
finalize() - Method in class org.htmlparser.lexer.Page
Clean up this page, releasing resources.
findCharset(String, String) - Static method in class org.htmlparser.lexer.Page
Lookup a character set name.
finishedParsing() - Method in class org.htmlparser.visitors.NodeVisitor
Override this method if you wish to do special processing upon completion of parsing.
first() - Method in class org.htmlparser.lexer.PageIndex
Returns the first index of the Sortable.
first() - Method in interface org.htmlparser.util.sort.Sortable
Returns the first index of the Sortable.
fixSpaces(String) - Method in class org.htmlparser.http.ConnectionManager
Turn spaces into %20.

G

generateCookieProperty(Vector) - Method in class org.htmlparser.http.ConnectionManager
Creates the cookie request property value from the list of valid cookies for the domain.
get() - Method in class org.htmlparser.lexer.PageIndex
Get the elements as an array of int.
getAbsoluteURL(String) - Method in class org.htmlparser.lexer.Page
Create an absolute URL from a relative link.
getAbsoluteURL(String, boolean) - Method in class org.htmlparser.lexer.Page
Create an absolute URL from a relative link.
getAssignment() - Method in class org.htmlparser.Attribute
Get the assignment string of this attribute.
getAssignment(StringBuffer) - Method in class org.htmlparser.Attribute
Get the assignment string of this attribute.
getAssignment() - Method in class org.htmlparser.lexer.PageAttribute
Get the assignment string of this attribute.
getAssignment(StringBuffer) - Method in class org.htmlparser.lexer.PageAttribute
Get the assignment string of this attribute.
getAttribute(String) - Method in class org.htmlparser.nodes.TagNode
Returns the value of an attribute.
getAttribute(String) - Method in interface org.htmlparser.Tag
Returns the value of an attribute.
getAttributeEx(String) - Method in class org.htmlparser.nodes.TagNode
Returns the attribute with the given name.
getAttributeEx(String) - Method in interface org.htmlparser.Tag
Returns the attribute with the given name.
getAttributesEx() - Method in class org.htmlparser.nodes.TagNode
Gets the attributes in the tag.
getAttributesEx() - Method in interface org.htmlparser.Tag
Gets the attributes in the tag.
getBaseUrl() - Method in class org.htmlparser.lexer.Page
Gets the baseUrl.
getCharacter(int) - Method in class org.htmlparser.lexer.InputStreamSource
Retrieve a character again.
getCharacter(Cursor) - Method in class org.htmlparser.lexer.Page
Read the character at the given cursor position.
getCharacter(int) - Method in class org.htmlparser.lexer.Source
Retrieve a character again.
getCharacter(int) - Method in class org.htmlparser.lexer.StringSource
Retrieve a character again.
getCharacters(char[], int, int, int) - Method in class org.htmlparser.lexer.InputStreamSource
Retrieve characters again.
getCharacters(StringBuffer, int, int) - Method in class org.htmlparser.lexer.InputStreamSource
Append characters already read into a StringBuffer.
getCharacters(char[], int, int, int) - Method in class org.htmlparser.lexer.Source
Retrieve characters again.
getCharacters(StringBuffer, int, int) - Method in class org.htmlparser.lexer.Source
Append characters already read into a StringBuffer.
getCharacters(char[], int, int, int) - Method in class org.htmlparser.lexer.StringSource
Retrieve characters again.
getCharacters(StringBuffer, int, int) - Method in class org.htmlparser.lexer.StringSource
Append characters already read into a StringBuffer.
getCharset(String) - Method in class org.htmlparser.lexer.Page
Get a CharacterSet name corresponding to a charset parameter.
getChildren() - Method in interface org.htmlparser.Node
Get the children of this node.
getChildren() - Method in class org.htmlparser.nodes.AbstractNode
Get the children of this node.
getComment() - Method in class org.htmlparser.http.Cookie
Returns the comment describing the purpose of this cookie, or null if no such comment has been defined.
getConnection() - Method in class org.htmlparser.lexer.Page
Get the connection, if any.
getConnectionManager() - Static method in class org.htmlparser.lexer.Page
Get the connection manager all Parsers use.
getContentType() - Method in class org.htmlparser.lexer.Page
Try and extract the content type from the HTTP header.
getCookieProcessingEnabled() - Method in class org.htmlparser.http.ConnectionManager
Predicate to determine if cookie processing is currently enabled.
getCurrentLine() - Method in class org.htmlparser.lexer.Lexer
Get the current line.
getCurrentLineNumber() - Method in class org.htmlparser.lexer.Lexer
Get the current line number.
getCursor() - Method in class org.htmlparser.lexer.Lexer
Get the current scanning position.
getDefaultRequestProperties() - Static method in class org.htmlparser.http.ConnectionManager
Get the current default request header properties.
getDomain(String) - Method in class org.htmlparser.http.ConnectionManager
Get the domain from a host.
getDomain() - Method in class org.htmlparser.http.Cookie
Returns the domain of this cookie.
getEncoding() - Method in class org.htmlparser.lexer.InputStreamSource
Get the encoding being used to convert characters.
getEncoding() - Method in class org.htmlparser.lexer.Page
Get the current encoding being used.
getEncoding() - Method in class org.htmlparser.lexer.Source
Get the encoding being used to convert characters.
getEncoding() - Method in class org.htmlparser.lexer.StringSource
Get the encoding being used to convert characters.
getEnders() - Method in class org.htmlparser.nodes.TagNode
Return the set of tag names that cause this tag to finish.
getEnders() - Method in interface org.htmlparser.Tag
Return the set of tag names that cause this tag to finish.
getEndingLineNumber() - Method in class org.htmlparser.nodes.TagNode
Get the line number where this tag ends.
getEndingLineNumber() - Method in interface org.htmlparser.Tag
Get the line number where this tag ends.
getEndPosition() - Method in interface org.htmlparser.Node
Gets the ending position of the node.
getEndPosition() - Method in class org.htmlparser.nodes.AbstractNode
Gets the ending position of the node.
getEndTag() - Method in class org.htmlparser.nodes.TagNode
Get the end tag for this (composite) tag.
getEndTag() - Method in interface org.htmlparser.Tag
Get the end tag for this (composite) tag.
getEndTagEnders() - Method in class org.htmlparser.nodes.TagNode
Return the set of end tag names that cause this tag to finish.
getEndTagEnders() - Method in interface org.htmlparser.Tag
Return the set of end tag names that cause this tag to finish.
getExpiryDate() - Method in class org.htmlparser.http.Cookie
Returns the expiry date of the cookie.
getFirstChild() - Method in interface org.htmlparser.Node
Get the first child of this node.
getFirstChild() - Method in class org.htmlparser.nodes.AbstractNode
Get the first child of this node.
getIds() - Method in class org.htmlparser.nodes.TagNode
Return the set of names handled by this tag.
getIds() - Method in interface org.htmlparser.Tag
Return the set of names handled by this tag.
getLastChild() - Method in interface org.htmlparser.Node
Get the last child of this node.
getLastChild() - Method in class org.htmlparser.nodes.AbstractNode
Get the last child of this node.
getLength() - Method in class org.htmlparser.Attribute
Get the length of the string value of this attribute.
getLength() - Method in class org.htmlparser.lexer.PageAttribute
Get the length of the string value of this attribute.
getLine(Cursor) - Method in class org.htmlparser.lexer.Page
Get the text line the position of the cursor lies on.
getLine(int) - Method in class org.htmlparser.lexer.Page
Get the text line the position of the cursor lies on.
getLocation(HttpURLConnection) - Method in class org.htmlparser.http.ConnectionManager
Get the Location field if any.
getMessageChain() - Method in exception org.htmlparser.util.ChainedException
 
getMessageList() - Method in exception org.htmlparser.util.ChainedException
 
getMonitor() - Method in class org.htmlparser.http.ConnectionManager
Get the monitoring object, if any.
getName() - Method in class org.htmlparser.Attribute
Get the name of this attribute.
getName(StringBuffer) - Method in class org.htmlparser.Attribute
Get the name of this attribute.
getName() - Method in class org.htmlparser.http.Cookie
Returns the name of the cookie.
getName() - Method in class org.htmlparser.lexer.PageAttribute
Get the name of this attribute.
getName(StringBuffer) - Method in class org.htmlparser.lexer.PageAttribute
Get the name of this attribute.
getNameEndPosition() - Method in class org.htmlparser.lexer.PageAttribute
Get the ending position of the attribute name.
getNameStartPosition() - Method in class org.htmlparser.lexer.PageAttribute
Get the starting position of the attribute name.
getNewEncoding() - Method in exception org.htmlparser.util.EncodingChangeException
Gets the new encoding.
getNextSibling() - Method in interface org.htmlparser.Node
Get the next sibling to this node.
getNextSibling() - Method in class org.htmlparser.nodes.AbstractNode
Get the next sibling to this node.
getNodeFactory() - Method in class org.htmlparser.lexer.Lexer
Get the current node factory.
getOldEncoding() - Method in exception org.htmlparser.util.EncodingChangeException
Gets the old encoding.
getPage() - Method in class org.htmlparser.lexer.Cursor
Get this cursor's page.
getPage() - Method in class org.htmlparser.lexer.Lexer
Get the page this lexer is working on.
getPage() - Method in class org.htmlparser.lexer.PageAttribute
Get the page this attribute is anchored to, if any.
getPage() - Method in class org.htmlparser.lexer.PageIndex
Get this index's page.
getPage() - Method in interface org.htmlparser.Node
Get the page this node came from.
getPage() - Method in class org.htmlparser.nodes.AbstractNode
Get the page this node came from.
getParent() - Method in interface org.htmlparser.Node
Get the parent of this node.
getParent() - Method in class org.htmlparser.nodes.AbstractNode
Get the parent of this node.
getPassword() - Method in class org.htmlparser.http.ConnectionManager
Get the URL users's password.
getPath() - Method in class org.htmlparser.http.Cookie
Returns the prefix of all URLs for which this cookie is targetted.
getPosition() - Method in class org.htmlparser.lexer.Cursor
Get the position of this cursor.
getPosition() - Method in class org.htmlparser.lexer.Lexer
Get the current cursor position.
getPreviousSibling() - Method in interface org.htmlparser.Node
Get the previous sibling to this node.
getPreviousSibling() - Method in class org.htmlparser.nodes.AbstractNode
Get the previous sibling to this node.
getProxyHost() - Method in class org.htmlparser.http.ConnectionManager
Get the proxy host name, if any.
getProxyPassword() - Method in class org.htmlparser.http.ConnectionManager
Set the proxy user's password.
getProxyPort() - Method in class org.htmlparser.http.ConnectionManager
Get the proxy port number.
getProxyUser() - Method in class org.htmlparser.http.ConnectionManager
Get the user name for proxy authorization, if any.
getQuote() - Method in class org.htmlparser.Attribute
Get the quote, if any, surrounding the value of the attribute, if any.
getQuote(StringBuffer) - Method in class org.htmlparser.Attribute
Get the quote, if any, surrounding the value of the attribute, if any.
getRawTagName() - Method in class org.htmlparser.nodes.TagNode
Return the name of this tag.
getRawTagName() - Method in interface org.htmlparser.Tag
Return the name of this tag.
getRawValue() - Method in class org.htmlparser.Attribute
Get the raw value of the attribute.
getRawValue(StringBuffer) - Method in class org.htmlparser.Attribute
Get the raw value of the attribute.
getRawValue() - Method in class org.htmlparser.lexer.PageAttribute
Get the raw value of the attribute.
getRawValue(StringBuffer) - Method in class org.htmlparser.lexer.PageAttribute
Get the raw value of the attribute.
getRedirectionProcessingEnabled() - Method in class org.htmlparser.http.ConnectionManager
Predicate to determine if url redirection processing is currently enabled.
getRequestProperties() - Method in class org.htmlparser.http.ConnectionManager
Get the current request header properties.
getSecure() - Method in class org.htmlparser.http.Cookie
Returns the value of the 'secure' flag.
getSource() - Method in class org.htmlparser.lexer.Page
Get the source this page is reading from.
getStartingLineNumber() - Method in class org.htmlparser.nodes.TagNode
Get the line number where this tag starts.
getStartingLineNumber() - Method in interface org.htmlparser.Tag
Get the line number where this tag starts.
getStartPosition() - Method in interface org.htmlparser.Node
Gets the starting position of the node.
getStartPosition() - Method in class org.htmlparser.nodes.AbstractNode
Gets the starting position of the node.
getStream() - Method in class org.htmlparser.lexer.InputStreamSource
Get the input stream being used.
getString(int, int) - Method in class org.htmlparser.lexer.InputStreamSource
Retrieve a string.
getString(int, int) - Method in class org.htmlparser.lexer.Source
Retrieve a string comprised of characters already read.
getString(int, int) - Method in class org.htmlparser.lexer.StringSource
Retrieve a string comprised of characters already read.
getTagBegin() - Method in class org.htmlparser.nodes.TagNode
Gets the nodeBegin.
getTagEnd() - Method in class org.htmlparser.nodes.TagNode
Gets the nodeEnd.
getTagName() - Method in class org.htmlparser.nodes.TagNode
Return the name of this tag.
getTagName() - Method in interface org.htmlparser.Tag
Return the name of this tag.
getText(int, int) - Method in class org.htmlparser.lexer.Page
Get the text identified by the given limits.
getText(StringBuffer, int, int) - Method in class org.htmlparser.lexer.Page
Put the text identified by the given limits into the given buffer.
getText() - Method in class org.htmlparser.lexer.Page
Get all text read so far from the source.
getText(StringBuffer) - Method in class org.htmlparser.lexer.Page
Put all text read so far from the source into the given buffer.
getText(char[], int, int, int) - Method in class org.htmlparser.lexer.Page
Put the text identified by the given limits into the given array at the specified offset.
getText() - Method in interface org.htmlparser.Node
Returns the text of the node.
getText() - Method in class org.htmlparser.nodes.AbstractNode
Returns the text of the node.
getText() - Method in class org.htmlparser.nodes.RemarkNode
Returns the text contents of the comment tag.
getText() - Method in class org.htmlparser.nodes.TagNode
Return the text contained in this tag.
getText() - Method in class org.htmlparser.nodes.TextNode
Returns the text of the node.
getText() - Method in interface org.htmlparser.Remark
Returns the text contents of the comment tag.
getText() - Method in interface org.htmlparser.Text
Accesses the textual contents of the node.
getThisScanner() - Method in class org.htmlparser.nodes.TagNode
Return the scanner associated with this tag.
getThisScanner() - Method in interface org.htmlparser.Tag
Return the scanner associated with this tag.
getThrowable() - Method in exception org.htmlparser.util.ChainedException
 
getUrl() - Method in class org.htmlparser.lexer.Page
Get the URL for this page.
getUser() - Method in class org.htmlparser.http.ConnectionManager
Get the user name to access the URL.
getValue() - Method in class org.htmlparser.Attribute
Get the value of the attribute.
getValue(StringBuffer) - Method in class org.htmlparser.Attribute
Get the value of the attribute.
getValue() - Method in class org.htmlparser.http.Cookie
Returns the value of the cookie.
getValue() - Method in class org.htmlparser.lexer.PageAttribute
Get the value of the attribute.
getValue(StringBuffer) - Method in class org.htmlparser.lexer.PageAttribute
Get the value of the attribute.
getValueEndPosition() - Method in class org.htmlparser.lexer.PageAttribute
Get the ending position of the attribute value.
getValueStartPosition() - Method in class org.htmlparser.lexer.PageAttribute
Get the starting position of the attribute value.
getVersion() - Method in class org.htmlparser.http.Cookie
Returns the version of the cookie.
getVersion() - Static method in class org.htmlparser.lexer.Lexer
Return the version string of this parser.

H

hasMoreNodes() - Method in interface org.htmlparser.util.NodeIterator
Check if more nodes are available.
hasMoreNodes() - Method in interface org.htmlparser.util.SimpleNodeIterator
Check if more nodes are available.

I

indexOf(Node) - Method in class org.htmlparser.util.NodeList
Finds the index of the supplied Node.
InputStreamSource - Class in org.htmlparser.lexer
A source of characters based on an InputStream such as from a URLConnection.
InputStreamSource(InputStream) - Constructor for class org.htmlparser.lexer.InputStreamSource
Create a source of characters using the default character set.
InputStreamSource(InputStream, String) - Constructor for class org.htmlparser.lexer.InputStreamSource
Create a source of characters.
InputStreamSource(InputStream, String, int) - Constructor for class org.htmlparser.lexer.InputStreamSource
Create a source of characters.
insertElementAt(int, int) - Method in class org.htmlparser.lexer.PageIndex
Inserts an element into the list.
isEmpty() - Method in class org.htmlparser.Attribute
Predicate to determine if this attribute has an equals sign but no value.
isEmpty() - Method in class org.htmlparser.lexer.PageAttribute
Predicate to determine if this attribute has an equals sign but no value.
isEmptyXmlTag() - Method in class org.htmlparser.nodes.TagNode
Is this an empty xml tag of the form <tag/>.
isEmptyXmlTag() - Method in interface org.htmlparser.Tag
Is this an empty xml tag of the form <tag/>.
isEndTag() - Method in class org.htmlparser.nodes.TagNode
Predicate to determine if this tag is an end tag (i.e.
isEndTag() - Method in interface org.htmlparser.Tag
Predicate to determine if this tag is an end tag (i.e.
isStandAlone() - Method in class org.htmlparser.Attribute
Predicate to determine if this attribute has no equals sign (or value).
isStandAlone() - Method in class org.htmlparser.lexer.PageAttribute
Predicate to determine if this attribute has no equals sign (or value).
isValued() - Method in class org.htmlparser.Attribute
Predicate to determine if this attribute has a value.
isValued() - Method in class org.htmlparser.lexer.PageAttribute
Predicate to determine if this attribute has a value.
isWhitespace() - Method in class org.htmlparser.Attribute
Predicate to determine if this attribute is whitespace.
isWhitespace() - Method in class org.htmlparser.lexer.PageAttribute
Predicate to determine if this attribute is whitespace.
isWhiteSpace() - Method in class org.htmlparser.nodes.TextNode
Returns true if the node consists of only white space.

K

keepAllNodesThatMatch(NodeFilter) - Method in class org.htmlparser.util.NodeList
Remove nodes not matching the given filter non-recursively.
keepAllNodesThatMatch(NodeFilter, boolean) - Method in class org.htmlparser.util.NodeList
Remove nodes not matching the given filter.

L

last() - Method in class org.htmlparser.lexer.PageIndex
Returns the last index of the Sortable.
last() - Method in interface org.htmlparser.util.sort.Sortable
Returns the last index of the Sortable.
Lexer - Class in org.htmlparser.lexer
This class parses the HTML stream into nodes.
Lexer() - Constructor for class org.htmlparser.lexer.Lexer
Creates a new instance of a Lexer.
Lexer(Page) - Constructor for class org.htmlparser.lexer.Lexer
Creates a new instance of a Lexer.
Lexer(String) - Constructor for class org.htmlparser.lexer.Lexer
Creates a new instance of a Lexer.
Lexer(URLConnection) - Constructor for class org.htmlparser.lexer.Lexer
Creates a new instance of a Lexer.

M

main(String[]) - Static method in class org.htmlparser.lexer.Lexer
Mainline for command line operation
makeRemark(int, int) - Method in class org.htmlparser.lexer.Lexer
Create a remark node based on the current cursor and the one provided.
makeString(int, int) - Method in class org.htmlparser.lexer.Lexer
Create a string node based on the current cursor and the one provided.
makeTag(int, int, Vector) - Method in class org.htmlparser.lexer.Lexer
Create a tag node based on the current cursor and the one provided.
mark(int) - Method in class org.htmlparser.lexer.InputStreamSource
Mark the present position in the source.
mark(int) - Method in class org.htmlparser.lexer.Source
Mark the present position.
mark(int) - Method in class org.htmlparser.lexer.Stream
Marks the current position in this input stream.
mark(int) - Method in class org.htmlparser.lexer.StringSource
Mark the present position in the source.
markSupported() - Method in class org.htmlparser.lexer.InputStreamSource
Tell whether this source supports the mark() operation.
markSupported() - Method in class org.htmlparser.lexer.Source
Tell whether this source supports the mark() operation.
markSupported() - Method in class org.htmlparser.lexer.Stream
Tests if this input stream supports the mark and reset methods.
markSupported() - Method in class org.htmlparser.lexer.StringSource
Tell whether this source supports the mark() operation.
mAssignment - Variable in class org.htmlparser.Attribute
The assignment string of the attribute.
mAttributes - Variable in class org.htmlparser.nodes.TagNode
The tag attributes.
mBaseUrl - Variable in class org.htmlparser.lexer.Page
The base URL for this page.
mBuffer - Variable in class org.htmlparser.lexer.InputStreamSource
The characters read so far.
mBuffer - Variable in class org.htmlparser.lexer.Stream
The bytes read so far.
mComment - Variable in class org.htmlparser.http.Cookie
Describes the cookie's use.
mConnection - Variable in class org.htmlparser.lexer.Page
The connection this page is coming from or null.
mConnectionManager - Static variable in class org.htmlparser.lexer.Page
Connection control (proxy, cookies, authorization).
mContentLength - Variable in class org.htmlparser.lexer.Stream
The content length from the HTTP header.
mCookieJar - Variable in class org.htmlparser.http.ConnectionManager
Cookie storage, a hashtable (by site or host) of vectors of Cookies.
mCount - Variable in class org.htmlparser.lexer.PageIndex
The number of valid elements.
mCursor - Variable in class org.htmlparser.lexer.Lexer
The current position on the page.
mDebugLineTrigger - Static variable in class org.htmlparser.lexer.Lexer
Line number to trigger on.
mDefaultRequestProperties - Static variable in class org.htmlparser.http.ConnectionManager
Default Request header fields.
mDefaultScanner - Static variable in class org.htmlparser.nodes.TagNode
The default scanner for non-composite tags.
mDomain - Variable in class org.htmlparser.http.Cookie
Domain that sees cookie.
mEncoding - Variable in class org.htmlparser.lexer.InputStreamSource
The character set in use.
mEncoding - Variable in class org.htmlparser.lexer.StringSource
The encoding to report.
mExpiry - Variable in class org.htmlparser.http.Cookie
Cookie expires after this date.
mFactory - Variable in class org.htmlparser.lexer.Lexer
The factory for new nodes.
mFormat - Static variable in class org.htmlparser.http.ConnectionManager
Cookie expiry date format for parsing.
mIn - Variable in class org.htmlparser.lexer.Stream
The underlying stream.
mIncrement - Variable in class org.htmlparser.lexer.PageIndex
Increment for allocations.
mIndex - Variable in class org.htmlparser.lexer.Page
Character positions of the first character in each line.
mIndices - Variable in class org.htmlparser.lexer.PageIndex
The elements.
mLevel - Variable in class org.htmlparser.lexer.InputStreamSource
The number of valid bytes in the buffer.
mLevel - Variable in class org.htmlparser.lexer.Stream
The number of valid bytes in the buffer.
mMark - Variable in class org.htmlparser.lexer.InputStreamSource
The bookmark.
mMark - Variable in class org.htmlparser.lexer.Stream
The bookmark.
mMark - Variable in class org.htmlparser.lexer.StringSource
The bookmark.
mMonitor - Variable in class org.htmlparser.http.ConnectionManager
The object to be notified prior to and after each connection.
mName - Variable in class org.htmlparser.Attribute
The name of this attribute.
mName - Variable in class org.htmlparser.http.Cookie
The name of the cookie.
mNameEnd - Variable in class org.htmlparser.lexer.PageAttribute
The ending offset of the name within the page.
mNameStart - Variable in class org.htmlparser.lexer.PageAttribute
The starting offset of the name within the page.
mOffset - Variable in class org.htmlparser.lexer.InputStreamSource
The offset of the next byte returned by read().
mOffset - Variable in class org.htmlparser.lexer.Stream
The offset of the next byte returned by read().
mOffset - Variable in class org.htmlparser.lexer.StringSource
The current offset into the string.
mPage - Variable in class org.htmlparser.lexer.Cursor
This cursor's page.
mPage - Variable in class org.htmlparser.lexer.Lexer
The page lexemes are retrieved from.
mPage - Variable in class org.htmlparser.lexer.PageAttribute
The page this attribute is extracted from.
mPage - Variable in class org.htmlparser.lexer.PageIndex
The page associated with this index.
mPage - Variable in class org.htmlparser.nodes.AbstractNode
The page this node came from.
mPassword - Variable in class org.htmlparser.http.ConnectionManager
The user password for accessing the URL.
mPath - Variable in class org.htmlparser.http.Cookie
URLs that see the cookie.
mPosition - Variable in class org.htmlparser.lexer.Cursor
This cursor's position.
mProxyHost - Variable in class org.htmlparser.http.ConnectionManager
The proxy server name.
mProxyPassword - Variable in class org.htmlparser.http.ConnectionManager
The proxy user password.
mProxyPort - Variable in class org.htmlparser.http.ConnectionManager
The proxy port number.
mProxyUser - Variable in class org.htmlparser.http.ConnectionManager
The proxy username name.
mQuote - Variable in class org.htmlparser.Attribute
The quote, if any, surrounding the value of the attribute, if any.
mReader - Variable in class org.htmlparser.lexer.InputStreamSource
The converter from bytes to characters.
mRedirectionProcessingEnabled - Variable in class org.htmlparser.http.ConnectionManager
Flag determining if redirection processing is being handled manually.
mRequestProperties - Variable in class org.htmlparser.http.ConnectionManager
Request header fields.
mSecure - Variable in class org.htmlparser.http.Cookie
Use SSL.
mSource - Variable in class org.htmlparser.lexer.Page
The source of characters.
mStartIncrement - Static variable in class org.htmlparser.lexer.PageIndex
Starting increment for allocations.
mStream - Variable in class org.htmlparser.lexer.InputStreamSource
The stream of bytes.
mString - Variable in class org.htmlparser.lexer.StringSource
The source of characters.
mText - Variable in class org.htmlparser.nodes.RemarkNode
The contents of the remark node, or override text.
mText - Variable in class org.htmlparser.nodes.TextNode
The contents of the string node, or override text.
mUrl - Variable in class org.htmlparser.lexer.Page
The URL this page is coming from.
mUser - Variable in class org.htmlparser.http.ConnectionManager
The username name for accessing the URL.
mValue - Variable in class org.htmlparser.Attribute
The value of the attribute.
mValue - Variable in class org.htmlparser.http.Cookie
The cookie value.
mValueEnd - Variable in class org.htmlparser.lexer.PageAttribute
The ending offset of the name within the page.
mValueStart - Variable in class org.htmlparser.lexer.PageAttribute
The starting offset of the value within the page.
mVersion - Variable in class org.htmlparser.http.Cookie
If Version=1 it means RFC 2109++ style cookies.

N

nextNode() - Method in class org.htmlparser.lexer.Lexer
Get the next node from the source.
nextNode(boolean) - Method in class org.htmlparser.lexer.Lexer
Get the next node from the source.
nextNode() - Method in interface org.htmlparser.util.NodeIterator
Get the next node.
nextNode() - Method in interface org.htmlparser.util.SimpleNodeIterator
Get the next node.
Node - Interface in org.htmlparser
Specifies the minimum requirements for nodes returned by the Lexer or Parser.
nodeBegin - Variable in class org.htmlparser.nodes.AbstractNode
The beginning position of the tag in the line
nodeEnd - Variable in class org.htmlparser.nodes.AbstractNode
The ending position of the tag in the line
NodeFactory - Interface in org.htmlparser
This interface defines the methods needed to create new nodes.
NodeFilter - Interface in org.htmlparser
Implement this interface to select particular nodes.
NodeIterator - Interface in org.htmlparser.util
 
NodeList - Class in org.htmlparser.util
 
NodeList() - Constructor for class org.htmlparser.util.NodeList
 
NodeList(Node) - Constructor for class org.htmlparser.util.NodeList
Create a one element node list.
NodeVisitor - Class in org.htmlparser.visitors
The base class for the 'Visitor' pattern.
NodeVisitor() - Constructor for class org.htmlparser.visitors.NodeVisitor
Creates a node visitor that recurses itself and it's children.
NodeVisitor(boolean) - Constructor for class org.htmlparser.visitors.NodeVisitor
Creates a node visitor that recurses itself and it's children only if recurseChildren is true.
NodeVisitor(boolean, boolean) - Constructor for class org.htmlparser.visitors.NodeVisitor
Creates a node visitor that recurses itself only if recurseSelf is true and it's children only if recurseChildren is true.

O

offset() - Method in class org.htmlparser.lexer.InputStreamSource
Get the position (in characters).
offset() - Method in class org.htmlparser.lexer.Source
Get the position (in characters).
offset() - Method in class org.htmlparser.lexer.StringSource
Get the position (in characters).
openConnection(URL) - Method in class org.htmlparser.http.ConnectionManager
Opens a connection using the given url.
openConnection(String) - Method in class org.htmlparser.http.ConnectionManager
Opens a connection based on a given string.
Ordered - Interface in org.htmlparser.util.sort
Describes an object that knows about ordering.
org.htmlparser - package org.htmlparser
The basic API classes which will be used by most developers when working with the HTML Parser.
org.htmlparser.http - package org.htmlparser.http
The http package is responsible for HTTP connections to servers.
org.htmlparser.lexer - package org.htmlparser.lexer
The lexer package is the base level I/O subsystem.
org.htmlparser.nodes - package org.htmlparser.nodes
The nodes package has the concrete node implementations.
org.htmlparser.scanners - package org.htmlparser.scanners
The scanners package contains classes responsible for the tertiary identification of tags.
org.htmlparser.util - package org.htmlparser.util
 
org.htmlparser.util.sort - package org.htmlparser.util.sort
Provides generic sorting and searching.
org.htmlparser.visitors - package org.htmlparser.visitors
 

P

Page - Class in org.htmlparser.lexer
Represents the contents of an HTML page.
Page() - Constructor for class org.htmlparser.lexer.Page
Construct an empty page.
Page(URLConnection) - Constructor for class org.htmlparser.lexer.Page
Construct a page reading from a URL connection.
Page(InputStream, String) - Constructor for class org.htmlparser.lexer.Page
Construct a page from a stream encoded with the given charset.
Page(String, String) - Constructor for class org.htmlparser.lexer.Page
Construct a page from the given string.
Page(String) - Constructor for class org.htmlparser.lexer.Page
Construct a page from the given string.
Page(Source) - Constructor for class org.htmlparser.lexer.Page
Construct a page from a source.
PageAttribute - Class in org.htmlparser.lexer
An attribute within a tag on a page.
PageAttribute(Page, int, int, int, int, char) - Constructor for class org.htmlparser.lexer.PageAttribute
Create an attribute.
PageAttribute(String, String, String, char) - Constructor for class org.htmlparser.lexer.PageAttribute
Create an attribute with the name, assignment string, value and quote given.
PageAttribute(String, String, char) - Constructor for class org.htmlparser.lexer.PageAttribute
Create an attribute with the name, value and quote given.
PageAttribute(String) - Constructor for class org.htmlparser.lexer.PageAttribute
Create a whitespace attribute with the value given.
PageAttribute(String, String) - Constructor for class org.htmlparser.lexer.PageAttribute
Create an attribute with the name and value given.
PageAttribute(String, String, String) - Constructor for class org.htmlparser.lexer.PageAttribute
Create an attribute with the name, assignment string and value given.
PageAttribute() - Constructor for class org.htmlparser.lexer.PageAttribute
Create an empty attribute.
PageIndex - Class in org.htmlparser.lexer
A sorted array of integers, the positions of the first characters of each line.
PageIndex(Page) - Constructor for class org.htmlparser.lexer.PageIndex
Create an empty index.
PageIndex(Page, int) - Constructor for class org.htmlparser.lexer.PageIndex
Create an index with the one element given.
PageIndex(Page, int[]) - Constructor for class org.htmlparser.lexer.PageIndex
Create an index with the elements given.
parent - Variable in class org.htmlparser.nodes.AbstractNode
The parent of this node.
parseCDATA() - Method in class org.htmlparser.lexer.Lexer
Return CDATA as a text node.
parseCDATA(boolean) - Method in class org.htmlparser.lexer.Lexer
Return CDATA as a text node.
parseCookies(URLConnection) - Method in class org.htmlparser.http.ConnectionManager
Check for cookie and parse into cookie jar.
parseJsp(int) - Method in class org.htmlparser.lexer.Lexer
Parse a java server page node.
parsePI(int) - Method in class org.htmlparser.lexer.Lexer
Parse an XML processing instruction.
parseRemark(int, boolean) - Method in class org.htmlparser.lexer.Lexer
Parse a comment.
ParserException - Exception in org.htmlparser.util
Library-specific support for chained exceptions.
ParserException() - Constructor for exception org.htmlparser.util.ParserException
 
ParserException(String) - Constructor for exception org.htmlparser.util.ParserException
 
ParserException(Throwable) - Constructor for exception org.htmlparser.util.ParserException
 
ParserException(String, Throwable) - Constructor for exception org.htmlparser.util.ParserException
 
parseString(int, boolean) - Method in class org.htmlparser.lexer.Lexer
Parse a string node.
parseTag(int) - Method in class org.htmlparser.lexer.Lexer
Parse a tag.
postConnect(HttpURLConnection) - Method in interface org.htmlparser.http.ConnectionMonitor
Called just after calling connect.
preConnect(HttpURLConnection) - Method in interface org.htmlparser.http.ConnectionMonitor
Called just prior to calling connect.
prepend(Node) - Method in class org.htmlparser.util.NodeList
Insert the given node at the head of the list.
printStackTrace() - Method in exception org.htmlparser.util.ChainedException
 
printStackTrace(PrintStream) - Method in exception org.htmlparser.util.ChainedException
 
printStackTrace(PrintWriter) - Method in exception org.htmlparser.util.ChainedException
 

Q

QuickSort(Vector) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Vector, int, int) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Ordered[]) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Ordered[], int, int) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(String[]) - Static method in class org.htmlparser.util.sort.Sort
This is a string version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(String[], int, int) - Static method in class org.htmlparser.util.sort.Sort
This is a string version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Sortable, int, int) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Sortable) - Static method in class org.htmlparser.util.sort.Sort
This is a generic version of C.A.R Hoare's Quick Sort algorithm.
QuickSort(Hashtable) - Static method in class org.htmlparser.util.sort.Sort
Sort a Hashtable.

R

read() - Method in class org.htmlparser.lexer.InputStreamSource
Read a single character.
read(char[], int, int) - Method in class org.htmlparser.lexer.InputStreamSource
Read characters into a portion of an array.
read(char[]) - Method in class org.htmlparser.lexer.InputStreamSource
Read characters into an array.
read() - Method in class org.htmlparser.lexer.Source
Read a single character.
read(char[], int, int) - Method in class org.htmlparser.lexer.Source
Read characters into a portion of an array.
read(char[]) - Method in class org.htmlparser.lexer.Source
Read characters into an array.
read() - Method in class org.htmlparser.lexer.Stream
Reads the next byte of data from the input stream.
read() - Method in class org.htmlparser.lexer.StringSource
Read a single character.
read(char[], int, int) - Method in class org.htmlparser.lexer.StringSource
Read characters into a portion of an array.
read(char[]) - Method in class org.htmlparser.lexer.StringSource
Read characters into an array.
ready() - Method in class org.htmlparser.lexer.InputStreamSource
Tell whether this source is ready to be read.
ready() - Method in class org.htmlparser.lexer.Source
Tell whether this source is ready to be read.
ready() - Method in class org.htmlparser.lexer.StringSource
Tell whether this source is ready to be read.
reallocations - Variable in class org.htmlparser.lexer.Stream
The number of reallocations.
Remark - Interface in org.htmlparser
This interface represents a comment in the HTML document.
RemarkNode - Class in org.htmlparser.nodes
The remark tag is identified and represented by this class.
RemarkNode(String) - Constructor for class org.htmlparser.nodes.RemarkNode
Constructor takes in the text string.
RemarkNode(Page, int, int) - Constructor for class org.htmlparser.nodes.RemarkNode
Constructor takes in the page and beginning and ending posns.
remove(Cursor) - Method in class org.htmlparser.lexer.PageIndex
Remove an element from the list
remove(int) - Method in class org.htmlparser.lexer.PageIndex
Remove an element from the list
remove(int) - Method in class org.htmlparser.util.NodeList
Remove the node at index.
remove(Node) - Method in class org.htmlparser.util.NodeList
Remove the supplied Node from the list.
removeAll() - Method in class org.htmlparser.util.NodeList
 
removeAttribute(String) - Method in class org.htmlparser.nodes.TagNode
Remove the attribute with the given key, if it exists.
removeAttribute(String) - Method in interface org.htmlparser.Tag
Remove the attribute with the given key, if it exists.
removeElementAt(int) - Method in class org.htmlparser.lexer.PageIndex
Remove an element from the list.
reset() - Method in class org.htmlparser.lexer.InputStreamSource
Reset the source.
reset() - Method in class org.htmlparser.lexer.Lexer
Reset the lexer to start parsing from the beginning again.
reset() - Method in class org.htmlparser.lexer.Page
Reset the page by resetting the source of characters.
reset() - Method in class org.htmlparser.lexer.Source
Reset the source.
reset() - Method in class org.htmlparser.lexer.Stream
Repositions this stream to the position at the time the mark method was last called on this input stream.
reset() - Method in class org.htmlparser.lexer.StringSource
Reset the source.
retreat() - Method in class org.htmlparser.lexer.Cursor
Move the cursor position back one character.
row(Cursor) - Method in class org.htmlparser.lexer.Page
Get the line number for a cursor.
row(int) - Method in class org.htmlparser.lexer.Page
Get the line number for a cursor.
row(Cursor) - Method in class org.htmlparser.lexer.PageIndex
Get the line number for a cursor.
row(int) - Method in class org.htmlparser.lexer.PageIndex
Get the line number for a position.
run() - Method in class org.htmlparser.lexer.Stream
Continually read the underlying stream untill exhausted.

S

saveCookies(Vector, URLConnection) - Method in class org.htmlparser.http.ConnectionManager
Save the cookies received in the response header.
scan(Tag, Lexer, NodeList) - Method in interface org.htmlparser.scanners.Scanner
Scan the tag.
scan(Tag, Lexer, NodeList) - Method in class org.htmlparser.scanners.TagScanner
Scan the tag.
scanJIS(Cursor) - Method in class org.htmlparser.lexer.Lexer
Advance the cursor through a JIS escape sequence.
Scanner - Interface in org.htmlparser.scanners
Generic interface for scanning.
setAssignment(String) - Method in class org.htmlparser.Attribute
Set the assignment string of this attribute.
setAttribute(String, String) - Method in class org.htmlparser.nodes.TagNode
Set attribute with given key, value pair.
setAttribute(String, String, char) - Method in class org.htmlparser.nodes.TagNode
Set attribute with given key, value pair where the value is quoted by quote.
setAttribute(Attribute) - Method in class org.htmlparser.nodes.TagNode
Set an attribute.
setAttribute(String, String) - Method in interface org.htmlparser.Tag
Set attribute with given key, value pair.
setAttribute(String, String, char) - Method in interface org.htmlparser.Tag
Set attribute with given key/value pair, the value is quoted by quote.
setAttributeEx(Attribute) - Method in class org.htmlparser.nodes.TagNode
Set an attribute.
setAttributeEx(Attribute) - Method in interface org.htmlparser.Tag
Set an attribute.
setAttributesEx(Vector) - Method in class org.htmlparser.nodes.TagNode
Sets the attributes.
setAttributesEx(Vector) - Method in interface org.htmlparser.Tag
Sets the attributes.
setBaseUrl(String) - Method in class org.htmlparser.lexer.Page
Sets the baseUrl.
setChildren(NodeList) - Method in interface org.htmlparser.Node
Set the children of this node.
setChildren(NodeList) - Method in class org.htmlparser.nodes.AbstractNode
Set the children of this node.
setComment(String) - Method in class org.htmlparser.http.Cookie
If a user agent (web browser) presents this cookie to a user, the cookie's purpose will be described using this comment.
setConnection(URLConnection) - Method in class org.htmlparser.lexer.Page
Set the URLConnection to be used by this page.
setConnectionManager(ConnectionManager) - Static method in class org.htmlparser.lexer.Page
Set the connection manager to use.
setCookie(Cookie, String) - Method in class org.htmlparser.http.ConnectionManager
Adds a cookie to the cookie jar.
setCookieProcessingEnabled(boolean) - Method in class org.htmlparser.http.ConnectionManager
Enables and disabled cookie processing.
setCursor(Cursor) - Method in class org.htmlparser.lexer.Lexer
Set the current scanning position.
setDefaultRequestProperties(Hashtable) - Static method in class org.htmlparser.http.ConnectionManager
Set the default request header properties.
setDomain(String) - Method in class org.htmlparser.http.Cookie
This cookie should be presented only to hosts satisfying this domain name pattern.
setEmptyXmlTag(boolean) - Method in class org.htmlparser.nodes.TagNode
Set this tag to be an empty xml node, or not.
setEmptyXmlTag(boolean) - Method in interface org.htmlparser.Tag
Set this tag to be an empty xml node, or not.
setEncoding(String) - Method in class org.htmlparser.lexer.InputStreamSource
Begins reading from the source with the given character set.
setEncoding(String) - Method in class org.htmlparser.lexer.Page
Begins reading from the source with the given character set.
setEncoding(String) - Method in class org.htmlparser.lexer.Source
Set the encoding to the given character set.
setEncoding(String) - Method in class org.htmlparser.lexer.StringSource
Set the encoding to the given character set.
setEndPosition(int) - Method in interface org.htmlparser.Node
Sets the ending position of the node.
setEndPosition(int) - Method in class org.htmlparser.nodes.AbstractNode
Sets the ending position of the node.
setEndTag(Tag) - Method in class org.htmlparser.nodes.TagNode
Set the end tag for this (composite) tag.
setEndTag(Tag) - Method in interface org.htmlparser.Tag
Set the end tag for this (composite) tag.
setExpiryDate(Date) - Method in class org.htmlparser.http.Cookie
Sets the expiry date of the cookie.
setMonitor(ConnectionMonitor) - Method in class org.htmlparser.http.ConnectionManager
Set the monitoring object.
setName(String) - Method in class org.htmlparser.Attribute
Set the name of this attribute.
setNameEndPosition(int) - Method in class org.htmlparser.lexer.PageAttribute
Set the ending position of the attribute name.
setNameStartPosition(int) - Method in class org.htmlparser.lexer.PageAttribute
Set the starting position of the attribute name.
setNodeFactory(NodeFactory) - Method in class org.htmlparser.lexer.Lexer
Set the current node factory.
setPage(Page) - Method in class org.htmlparser.lexer.Lexer
Set the page this lexer is working on.
setPage(Page) - Method in class org.htmlparser.lexer.PageAttribute
Set the page this attribute is anchored to.
setPage(Page) - Method in interface org.htmlparser.Node
Set the page this node came from.
setPage(Page) - Method in class org.htmlparser.nodes.AbstractNode
Set the page this node came from.
setParent(Node) - Method in interface org.htmlparser.Node
Sets the parent of this node.
setParent(Node) - Method in class org.htmlparser.nodes.AbstractNode
Sets the parent of this node.
setPassword(String) - Method in class org.htmlparser.http.ConnectionManager
Set the URL users's password.
setPath(String) - Method in class org.htmlparser.http.Cookie
This cookie should be presented only with requests beginning with this URL.
setPosition(int) - Method in class org.htmlparser.lexer.Cursor
Set the position of this cursor.
setPosition(int) - Method in class org.htmlparser.lexer.Lexer
Set the current cursor position.
setProxyHost(String) - Method in class org.htmlparser.http.ConnectionManager
Set the proxy host to use.
setProxyPassword(String) - Method in class org.htmlparser.http.ConnectionManager
Get the proxy user's password.
setProxyPort(int) - Method in class org.htmlparser.http.ConnectionManager
Set the proxy port number.
setProxyUser(String) - Method in class org.htmlparser.http.ConnectionManager
Set the user name for proxy authorization.
setQuote(char) - Method in class org.htmlparser.Attribute
Set the quote surrounding the value of the attribute.
setRawValue(String) - Method in class org.htmlparser.Attribute
Set the value of the attribute and the quote character.
setRedirectionProcessingEnabled(boolean) - Method in class org.htmlparser.http.ConnectionManager
Enables or disables manual redirection handling.
setRequestProperties(Hashtable) - Method in class org.htmlparser.http.ConnectionManager
Set the current request properties.
setSecure(boolean) - Method in class org.htmlparser.http.Cookie
Indicates to the user agent that the cookie should only be sent using a secure protocol (https).
setStartPosition(int) - Method in interface org.htmlparser.Node
Sets the starting position of the node.
setStartPosition(int) - Method in class org.htmlparser.nodes.AbstractNode
Sets the starting position of the node.
setTagBegin(int) - Method in class org.htmlparser.nodes.TagNode
Sets the nodeBegin.
setTagEnd(int) - Method in class org.htmlparser.nodes.TagNode
Sets the nodeEnd.
setTagName(String) - Method in class org.htmlparser.nodes.TagNode
Set the name of this tag.
setTagName(String) - Method in interface org.htmlparser.Tag
Set the name of this tag.
setText(String) - Method in interface org.htmlparser.Node
Sets the string contents of the node.
setText(String) - Method in class org.htmlparser.nodes.AbstractNode
Sets the string contents of the node.
setText(String) - Method in class org.htmlparser.nodes.RemarkNode
Sets the string contents of the node.
setText(String) - Method in class org.htmlparser.nodes.TagNode
Parses the given text to create the tag contents.
setText(String) - Method in class org.htmlparser.nodes.TextNode
Sets the string contents of the node.
setText(String) - Method in interface org.htmlparser.Remark
Sets the string contents of the node.
setText(String) - Method in interface org.htmlparser.Text
Sets the contents of the node.
setThisScanner(Scanner) - Method in class org.htmlparser.nodes.TagNode
Set the scanner associated with this tag.
setThisScanner(Scanner) - Method in interface org.htmlparser.Tag
Set the scanner associated with this tag.
setUrl(String) - Method in class org.htmlparser.lexer.Page
Set the URL for this page.
setUser(String) - Method in class org.htmlparser.http.ConnectionManager
Set the user name to access the URL.
setValue(String) - Method in class org.htmlparser.Attribute
Set the value of the attribute.
setValue(String) - Method in class org.htmlparser.http.Cookie
Sets the value of the cookie.
setValueEndPosition(int) - Method in class org.htmlparser.lexer.PageAttribute
Set the ending position of the attribute value.
setValueStartPosition(int) - Method in class org.htmlparser.lexer.PageAttribute
Set the starting position of the attribute value.
setVersion(int) - Method in class org.htmlparser.http.Cookie
Sets the version of the cookie protocol used when this cookie saves itself.
shouldRecurseChildren() - Method in class org.htmlparser.visitors.NodeVisitor
Depth traversal predicate.
shouldRecurseSelf() - Method in class org.htmlparser.visitors.NodeVisitor
Self traversal predicate.
SimpleNodeIterator - Interface in org.htmlparser.util
The HTMLSimpleEnumeration interface is similar to NodeIterator, except that it does not throw exceptions.
size() - Method in class org.htmlparser.lexer.PageIndex
Get the count of elements.
size() - Method in class org.htmlparser.util.NodeList
 
skip(long) - Method in class org.htmlparser.lexer.InputStreamSource
Skip characters.
skip(long) - Method in class org.htmlparser.lexer.Source
Skip characters.
skip(long) - Method in class org.htmlparser.lexer.StringSource
Skip characters.
Sort - Class in org.htmlparser.util.sort
A quick sort algorithm to sort Vectors or arrays.
Sortable - Interface in org.htmlparser.util.sort
Provides a mechanism to abstract the sort process.
Source - Class in org.htmlparser.lexer
A buffered source of characters.
Source() - Constructor for class org.htmlparser.lexer.Source
 
Stream - Class in org.htmlparser.lexer
Provides for asynchronous fetching from a stream.
Stream(InputStream) - Constructor for class org.htmlparser.lexer.Stream
Construct a stream with no assumptions about the number of bytes available.
Stream(InputStream, int) - Constructor for class org.htmlparser.lexer.Stream
Construct a stream to read the given number of bytes.
STRICT_REMARKS - Static variable in class org.htmlparser.lexer.Lexer
Process remarks strictly flag.
StringSource - Class in org.htmlparser.lexer
A source of characters based on a String.
StringSource(String) - Constructor for class org.htmlparser.lexer.StringSource
Construct a source using the provided string.
StringSource(String, String) - Constructor for class org.htmlparser.lexer.StringSource
Construct a source using the provided string and encoding.
swap(int, int) - Method in class org.htmlparser.lexer.PageIndex
Swaps the elements at the given indicies.
swap(int, int) - Method in interface org.htmlparser.util.sort.Sortable
Swaps the elements at the given indicies.
synchronous - Variable in class org.htmlparser.lexer.Stream
The number of synchronous (blocking) fills.

T

Tag - Interface in org.htmlparser
This interface represents a tag (<xxx yyy="zzz">) in the HTML document.
TagNode - Class in org.htmlparser.nodes
TagNode represents a generic tag.
TagNode() - Constructor for class org.htmlparser.nodes.TagNode
Create an empty tag.
TagNode(Page, int, int, Vector) - Constructor for class org.htmlparser.nodes.TagNode
Create a tag with the location and attributes provided
TagNode(TagNode, TagScanner) - Constructor for class org.htmlparser.nodes.TagNode
Create a tag like the one provided.
TagScanner - Class in org.htmlparser.scanners
TagScanner is an abstract superclass, subclassed to create specific scanners.
TagScanner() - Constructor for class org.htmlparser.scanners.TagScanner
Create a (non-composite) tag scanner.
Text - Interface in org.htmlparser
This interface represents a piece of the content of the HTML document.
TextNode - Class in org.htmlparser.nodes
Normal text in the HTML document is represented by this class.
TextNode(String) - Constructor for class org.htmlparser.nodes.TextNode
Constructor takes in the text string.
TextNode(Page, int, int) - Constructor for class org.htmlparser.nodes.TextNode
Constructor takes in the page and beginning and ending posns.
throwable - Variable in exception org.htmlparser.util.ChainedException
 
toHtml() - Method in interface org.htmlparser.Node
Return the HTML for this node.
toHtml(boolean) - Method in interface org.htmlparser.Node
Return the HTML for this node.
toHtml() - Method in class org.htmlparser.nodes.AbstractNode
Return the HTML for this node.
toHtml(boolean) - Method in class org.htmlparser.nodes.AbstractNode
Return the HTML for this node.
toHtml(boolean) - Method in class org.htmlparser.nodes.RemarkNode
Return The full HTML remark.
toHtml(boolean) - Method in class org.htmlparser.nodes.TagNode
Render the tag as HTML.
toHtml(boolean) - Method in class org.htmlparser.nodes.TextNode
Returns the text of the node.
toHtml(boolean) - Method in class org.htmlparser.util.NodeList
Convert this nodelist into the equivalent HTML.
toHtml() - Method in class org.htmlparser.util.NodeList
Convert this nodelist into the equivalent HTML.
toNodeArray() - Method in class org.htmlparser.util.NodeList
 
toPlainTextString() - Method in interface org.htmlparser.Node
A string representation of the node.
toPlainTextString() - Method in class org.htmlparser.nodes.AbstractNode
Returns a string representation of the node.
toPlainTextString() - Method in class org.htmlparser.nodes.RemarkNode
Return the remark text.
toPlainTextString() - Method in class org.htmlparser.nodes.TagNode
Get the plain text from this node.
toPlainTextString() - Method in class org.htmlparser.nodes.TextNode
Returns the text of the node.
toString() - Method in class org.htmlparser.Attribute
Get a text representation of this attribute.
toString(StringBuffer) - Method in class org.htmlparser.Attribute
Get a text representation of this attribute.
toString() - Method in class org.htmlparser.http.Cookie
Convert this cookie into a user friendly string.
toString() - Method in class org.htmlparser.lexer.Cursor
Return a string representation of this cursor
toString() - Method in class org.htmlparser.lexer.Page
Display some of this page as a string.
toString() - Method in interface org.htmlparser.Node
Return the string representation of the node.
toString() - Method in class org.htmlparser.nodes.AbstractNode
Return a string representation of the node.
toString() - Method in class org.htmlparser.nodes.RemarkNode
Print the contents of the remark tag.
toString() - Method in class org.htmlparser.nodes.TagNode
Print the contents of the tag.
toString() - Method in class org.htmlparser.nodes.TextNode
Express this string node as a printable string This is suitable for display in a debugger or output to a printout.
toString() - Method in class org.htmlparser.util.NodeList
Return the contents of the list as a string.
toTagHtml() - Method in class org.htmlparser.nodes.TagNode
Return the tag HTML.
toTagHtml() - Method in interface org.htmlparser.Tag
Return the tag HTML.

U

ungetCharacter(Cursor) - Method in class org.htmlparser.lexer.Page
Return a character.
unread() - Method in class org.htmlparser.lexer.InputStreamSource
Undo the read of a single character.
unread() - Method in class org.htmlparser.lexer.Source
Undo the read of a single character.
unread() - Method in class org.htmlparser.lexer.StringSource
Undo the read of a single character.

V

VERSION_DATE - Static variable in class org.htmlparser.lexer.Lexer
The date of the version ("Sep 17, 2006").
VERSION_NUMBER - Static variable in class org.htmlparser.lexer.Lexer
The floating point version number (2.0).
VERSION_STRING - Static variable in class org.htmlparser.lexer.Lexer
The display version ("2.0 (Release Build Sep 17, 2006)").
VERSION_TYPE - Static variable in class org.htmlparser.lexer.Lexer
The type of version ("Release Build").
visitAllNodesWith(NodeVisitor) - Method in class org.htmlparser.util.NodeList
Utility to apply a visitor to a node list.
visitEndTag(Tag) - Method in class org.htmlparser.visitors.NodeVisitor
Called for each Tag visited that is an end tag.
visitRemarkNode(Remark) - Method in class org.htmlparser.visitors.NodeVisitor
Called for each RemarkNode visited.
visitStringNode(Text) - Method in class org.htmlparser.visitors.NodeVisitor
Called for each StringNode visited.
visitTag(Tag) - Method in class org.htmlparser.visitors.NodeVisitor
Called for each Tag visited.

A B C D E F G H I K L M N O P Q R S T U V
© 2006 Derrick Oswald
April 1, 2006

HTML Parser is an open source library released under Common Public License.SourceForge.net