Package com.raritantechnologies.searchApp.dataCollection

Describes objects and interfaces used to build data collection / processing pipelines.

See:
          Description

Interface Summary
ICollectionBuilder Base Interface for objects that can create a search index 'collection' or other data processing output.
ICollectionBuilderFactory
ICollectionGateway Collection Gateway implementations are responsible for collecting raw data and formatting it as IResultSet objects.
ICollectionIndexer  
IDataCollector Basic data collector interface
IGatewayListener Base interface for processors that convert IResultSets into collections and/or parametric indexes.
IGatewayOutputProcessor Base interface for objects that process ICollectionGateway results.
 

Class Summary
CachingResultSetProcessor Caches the IResultSet in session cache.
CollectionBuilder Central dispatcher for data collection and processing builds.
CollectionBuildManager Singleton class that maintains / manages a set of ongoing CollectionBuild operations.
CollectionBuildProcess Job Process that builds a search engine collection or other type of output.
CollectionStatisticsCollector Produces a flat file output of collection statistics.
CurrentResultsProcessor
DataCollectorSearchSource
DeduplicatingGatewayProcessorFilter
DispatchResultProcessor
FileSystemOutputProcessor
FilteredCollectionGateway Collection Gateway that enables an IResultSetFilter to be inserted.
FilteredOutputProcessor
FlatFileOutputProcessor Stores IResult fields in a flat file - for example for import into a spreadsheet.
FormattingOutputProcessor
FormPersistTag
GatewayFormatterFilter Subclass of GatewayProcessorFilter that enables IFieldFormatters to be plugged into a data collection / processing pipeline.
GatewayProcessorFilter Base class for objects that can add / remove or modify IResult items collected by a ICollectionGateway data collection / processing stream.
LinkVerifierGatewayFilter '404' Checker as links are generated by Collection buildware - check for correctness (i.e.
NestedResultExtractorFilter Extracts nested IResultSets from IResult objects.
ProcessFilesPostProcessorFilter  
QueryProcessorGateway
ResultTermsCollector
SequentialOutputProcessor Implementation of IGatewayOutputProcessor that contains one or more nested IGatewayOutputProcessors in which data processing operations are performed sequentially.
SequentialResultProcessor Implementation of IGatewayOutputProcessor and {com.raritantechnologies.searchApp.IResultSetProcessor} that contains one or more nested IResultSetProcessors in which data processing operations are performed sequentially.
SystemObjectCollectionGateway RecordAdapter that uses a Raritan ICollectionGateway.
TimeReportingOutputProcessor  
UpdateProcessorFilter Provides an update / delete checking service for an OutputProcessor using a Database.
WebCrawlerGateway Collection Gateway that uses a Web Crawler or "spider" to collect document URLs for processing.
XMLCollectionBuilderFactory Implementation of ICollectionBuilderFactory that builds a VerityCollectionBuilder from an XML configuration file.
XMLDispatchFilterListener
 

Package com.raritantechnologies.searchApp.dataCollection Description

Describes objects and interfaces used to build data collection / processing pipelines.