com.raritantechnologies.HTML
Class HTMLScraperGateway

java.lang.Object
  extended bycom.raritantechnologies.HTML.HTMLScraperGateway
All Implemented Interfaces:
ICollectionGateway, java.lang.Runnable

public class HTMLScraperGateway
extends java.lang.Object
implements ICollectionGateway

Creates an IResultSet for processing from HTMLScraper output.

XML Configuration Template:
 <CollectionGateway name="ScrapProc_1" class="com.raritantechnologies.HTML.HTMLScraperGateway"
                       useCookies="[ true|false(default) ]" >

  <!-- One or more SourceURL's -->
  <SourceURL>http://www.sec.gov/divisions/corpfin/organization/cf2003.shtml</SourceURL>

  <LoginProcess>
    <Step type="[getURL|getURLSocket|postURL|postURLSocket]" URL="[the URL that the form should be sent to]" >
      <params>

      </params>
    </Step>
  </LoginProcess>

  <HTMLScraperConfig>BASE_PATH/EdgarCIK/CIKBrowseScraper.xml</HTMLScraperConfig>
  
  <ResultSetSize>1</ResultSetSize>

  <XMLResultProcessor>
     <!-- see XMLResultProcessor configuration -->
  </XMLResultProcessor>

 </CollectionGateway>
 


Developed by Raritan Technologies .

Author:
Ted Sullivan

Constructor Summary
HTMLScraperGateway()
           
 
Method Summary
 java.lang.String getConfigurationXML()
           
 void initialize(org.w3c.dom.Element gatewayElem, ISearchFieldMap sfMap)
          Initialize the ICollectionGateway from a Configuration XML element.
 boolean isPaused()
           
 void pause()
           
 void resume()
           
 void run()
           
 void run(ILoginInfo userInfo)
          Need to have loginInfo for sources which require login/pw access.
 void setDataAvailable(IResultSet rs)
           
 void setDataComplete()
           
 void setGatewayListener(IGatewayListener listener)
          Sets the IGatewayListener interface.
 void setInputParameters(OrderedMap inputParams)
          set input parameters...
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLScraperGateway

public HTMLScraperGateway()
Method Detail

setInputParameters

public void setInputParameters(OrderedMap inputParams)
Description copied from interface: ICollectionGateway
set input parameters...

Specified by:
setInputParameters in interface ICollectionGateway

run

public void run()
Specified by:
run in interface ICollectionGateway

run

public void run(ILoginInfo userInfo)
Need to have loginInfo for sources which require login/pw access.

Specified by:
run in interface ICollectionGateway

setGatewayListener

public void setGatewayListener(IGatewayListener listener)
Description copied from interface: ICollectionGateway
Sets the IGatewayListener interface.

Specified by:
setGatewayListener in interface ICollectionGateway

setDataAvailable

public void setDataAvailable(IResultSet rs)

setDataComplete

public void setDataComplete()

initialize

public void initialize(org.w3c.dom.Element gatewayElem,
                       ISearchFieldMap sfMap)
Description copied from interface: ICollectionGateway
Initialize the ICollectionGateway from a Configuration XML element.

Specified by:
initialize in interface ICollectionGateway

pause

public void pause()
Specified by:
pause in interface ICollectionGateway

resume

public void resume()
Specified by:
resume in interface ICollectionGateway

isPaused

public boolean isPaused()
Specified by:
isPaused in interface ICollectionGateway

getConfigurationXML

public java.lang.String getConfigurationXML()
Specified by:
getConfigurationXML in interface ICollectionGateway