|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectcom.raritantechnologies.searchApp.dataCollection.WebCrawlerGateway
Collection Gateway that uses a Web Crawler or "spider" to collect document URLs for processing.
Can incorporate a configurable HTMLScraper to extract metadata from HTML
pages and/or an HTMLFilter to pre-process HTML pages.
| Constructor Summary | |
WebCrawlerGateway()
|
|
| Method Summary | |
java.lang.String |
getConfigurationXML()
|
void |
initialize(org.w3c.dom.Element gatewayElem,
ISearchFieldMap sfMap)
Initialize the ICollectionGateway from a Configuration XML element. |
boolean |
isPaused()
|
void |
pause()
|
void |
resume()
|
void |
run()
|
void |
run(ILoginInfo userInfo)
Need to have loginInfo for sources which require login/pw access. |
void |
setGatewayListener(IGatewayListener listener)
Sets the IGatewayListener interface. |
void |
setInputParameters(OrderedMap inputParams)
set input parameters... |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
public WebCrawlerGateway()
| Method Detail |
public void run()
run in interface ICollectionGatewaypublic void run(ILoginInfo userInfo)
ICollectionGateway
run in interface ICollectionGatewaypublic void setInputParameters(OrderedMap inputParams)
ICollectionGateway
setInputParameters in interface ICollectionGatewaypublic void setGatewayListener(IGatewayListener listener)
ICollectionGateway
setGatewayListener in interface ICollectionGateway
public void initialize(org.w3c.dom.Element gatewayElem,
ISearchFieldMap sfMap)
ICollectionGateway
initialize in interface ICollectionGatewaypublic void pause()
pause in interface ICollectionGatewaypublic void resume()
resume in interface ICollectionGatewaypublic boolean isPaused()
isPaused in interface ICollectionGatewaypublic java.lang.String getConfigurationXML()
getConfigurationXML in interface ICollectionGateway
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||