com.raritantechnologies.HTML
Interface IHTMLScraperProcessor

All Superinterfaces:
IConfigurable
All Known Implementing Classes:
AbstractPostProcessor

public interface IHTMLScraperProcessor
extends IConfigurable

Post-processor for HTMLScraper.

Scenario: HTMLScraper generates XML that needs further processing. Examples: a second process that builds the Parametric Index from a site scrape; an iterative form processor.

A post-processor may do additional HTML scraping. The XML document contains whatever information the post-processor needs for the additional page or site scraping.
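The following is a minimal sketch of how a post-processor chain might be wired, not the actual Raritan implementation. The `ScraperProcessor` interface is a hypothetical stand-in that only mirrors the method signatures listed on this page (without `IConfigurable`), and `TaggingProcessor` and `PostProcessorSketch` are invented for illustration. It assumes that `setPostProcessor` registers a processor to run after the current one, which this page does not state explicitly.

```java
import java.io.StringWriter;
import java.io.Writer;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;

// Hypothetical stand-in that mirrors the signatures of IHTMLScraperProcessor.
interface ScraperProcessor {
    void process(Document doc);
    Document getOutputDoc();
    void outputXML(Writer writer);
    void outputXML(Writer writer, Transformer transformer);
    void setPostProcessor(ScraperProcessor postProcessor);
}

// Illustrative processor: appends one marker element to the document root,
// then hands the document to the next processor in the chain (assumed behavior).
class TaggingProcessor implements ScraperProcessor {
    private final String tag;
    private Document output;
    private ScraperProcessor next;

    TaggingProcessor(String tag) { this.tag = tag; }

    public void process(Document doc) {
        doc.getDocumentElement().appendChild(doc.createElement(tag));
        output = doc;
        if (next != null) {
            next.process(output);
        }
    }

    // Returns the result of the last processor in the chain.
    public Document getOutputDoc() {
        return next != null ? next.getOutputDoc() : output;
    }

    // Serializes with a default identity transformer, omitting the XML declaration.
    public void outputXML(Writer writer) {
        try {
            Transformer identity = TransformerFactory.newInstance().newTransformer();
            identity.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            outputXML(writer, identity);
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    // Serializes through a caller-supplied transformer (for example an XSLT stylesheet).
    public void outputXML(Writer writer, Transformer transformer) {
        try {
            transformer.transform(new DOMSource(getOutputDoc()), new StreamResult(writer));
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public void setPostProcessor(ScraperProcessor postProcessor) {
        this.next = postProcessor;
    }
}

public class PostProcessorSketch {
    // Builds a tiny stand-in for the scraper's XML, runs a two-stage chain,
    // and returns the serialized result.
    static String run() {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
            doc.appendChild(doc.createElement("scrape"));

            TaggingProcessor first = new TaggingProcessor("indexed");
            first.setPostProcessor(new TaggingProcessor("formsProcessed"));
            first.process(doc);

            StringWriter out = new StringWriter();
            first.outputXML(out);
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(run());
    }
}
```

In this sketch each stage works on the same DOM document, so a later stage sees everything earlier stages added. A real implementation could equally build a new document per stage; the interface as listed does not constrain that choice.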


Developed by Raritan Technologies Inc.

Author:
Ted Sullivan

Method Summary
 org.w3c.dom.Document getOutputDoc()
           
 void outputXML(java.io.Writer writer)
           
 void outputXML(java.io.Writer writer, javax.xml.transform.Transformer transformer)
           
 void process(org.w3c.dom.Document doc)
           
 void setPostProcessor(IHTMLScraperProcessor postProcessor)
           
 
Methods inherited from interface com.raritantechnologies.searchApp.IConfigurable
initialize
 

Method Detail

process

public void process(org.w3c.dom.Document doc)

getOutputDoc

public org.w3c.dom.Document getOutputDoc()

outputXML

public void outputXML(java.io.Writer writer)

outputXML

public void outputXML(java.io.Writer writer,
                      javax.xml.transform.Transformer transformer)

setPostProcessor

public void setPostProcessor(IHTMLScraperProcessor postProcessor)