com.raritantechnologies.verity.profiler
Class ProfilerDocumentKeywordProcessor

java.lang.Object
  extended by com.raritantechnologies.concept.AbstractDocumentKeywordProcessor
      extended by com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor
All Implemented Interfaces:
IConfigurable, IDocumentKeywordProcessor, IGatewayOutputProcessor, IResultSetProcessor

public class ProfilerDocumentKeywordProcessor
extends AbstractDocumentKeywordProcessor

Uses Verity Profiler to get a set of keywords for document clustering analysis.

XML Configuration Template:
  <DocumentProcessor class="com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor" >

    <ProfilerFilter>

    </ProfilerFilter>

  </DocumentProcessor>
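
The template above is the kind of element that initialize(org.w3c.dom.Element) receives. As a minimal sketch using only the JDK's XML parser, the following shows one way to turn the template into an Element. The class name ConfigTemplateSketch and the helper parseRoot are hypothetical names introduced for illustration, and the call into the processor is left as a comment because it needs the Raritan library on the classpath.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper class, not part of the documented API.
final class ConfigTemplateSketch {

    // The template from this page; <ProfilerFilter> stays empty because
    // its contents are not documented here.
    static final String TEMPLATE =
        "<DocumentProcessor class=\"com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor\">"
        + "<ProfilerFilter></ProfilerFilter>"
        + "</DocumentProcessor>";

    // Parses an XML string and returns its root element.
    static Element parseRoot(String xml) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        return doc.getDocumentElement();
    }

    public static void main(String[] args) throws Exception {
        Element root = parseRoot(TEMPLATE);
        System.out.println(root.getTagName());
        System.out.println(root.getAttribute("class"));

        NodeList filters = root.getElementsByTagName("ProfilerFilter");
        System.out.println(filters.getLength());

        // Assumption: with the library available, the element would be
        // passed to the processor like this:
        // new ProfilerDocumentKeywordProcessor().initialize(root);
    }
}
```

In a real deployment the framework presumably reads this element from its own configuration file, so parsing it by hand is mainly useful for tests.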
 


Field Summary
 
Fields inherited from class com.raritantechnologies.concept.AbstractDocumentKeywordProcessor
documents, resKeyField
 
Constructor Summary
ProfilerDocumentKeywordProcessor()
           
 
Method Summary
 java.lang.String getConfigurationXML()
           
protected  void getWords(IResult result, java.lang.String text, java.lang.String docKey)
          Subclasses must implement this method: extract keywords from the text for the document identified by docKey.
 void initialize(org.w3c.dom.Element elem)
          Initialize from the XML Element.
 void initialize(java.util.Map initParams)
          Dynamic initialization.
 boolean isKeyword(WordCount wordCount)
           
 
Methods inherited from class com.raritantechnologies.concept.AbstractDocumentKeywordProcessor
addWord, addWord, dataComplete, dataComplete, getDocuments, getDocuments, getKeywordAssociations, getKeywords, getWordCounts, getWordDocumentMap, getWordDocumentMap, initialize, processData, processResult, processResultSet, reset
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ProfilerDocumentKeywordProcessor

public ProfilerDocumentKeywordProcessor()
Method Detail

getWords

protected void getWords(IResult result,
                        java.lang.String text,
                        java.lang.String docKey)
Description copied from class: AbstractDocumentKeywordProcessor
Subclasses must implement this method to extract keywords from the text of the document identified by docKey. The implementation should call the addWord() method once for each keyword or word it extracts.

Specified by:
getWords in class AbstractDocumentKeywordProcessor
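
The getWords/addWord contract described above can be sketched with stand-in classes. Everything in this example is hypothetical: BaseProcessor is not the real AbstractDocumentKeywordProcessor, the addWord signature is assumed (the actual overloads are not shown on this page), the IResult parameter is omitted, and a naive whitespace tokenizer takes the place of the Verity Profiler call. It only illustrates the pattern in which the base class drives processing and the subclass reports each word through addWord.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch; none of these types belong to the documented API.
final class GetWordsSketch {

    // Stand-in for the abstract base class: models only the
    // getWords/addWord contract.
    abstract static class BaseProcessor {
        // word -> number of times addWord was called for it
        private final Map<String, Integer> counts = new LinkedHashMap<>();

        // Assumed signature; the real addWord overloads may differ.
        protected void addWord(String word, String docKey) {
            counts.merge(word, 1, Integer::sum);
        }

        // Subclasses extract words from the text and call addWord for each.
        protected abstract void getWords(String text, String docKey);

        // Runs the subclass extraction and returns the accumulated counts.
        Map<String, Integer> process(String text, String docKey) {
            getWords(text, docKey);
            return counts;
        }
    }

    // Naive whitespace tokenizer standing in for the Verity Profiler.
    static final class WhitespaceProcessor extends BaseProcessor {
        @Override
        protected void getWords(String text, String docKey) {
            for (String token : text.toLowerCase().split("\\s+")) {
                if (!token.isEmpty()) {
                    addWord(token, docKey);
                }
            }
        }
    }

    public static void main(String[] args) {
        Map<String, Integer> counts =
            new WhitespaceProcessor().process("Verity profiler verity", "doc-1");
        System.out.println(counts);
    }
}
```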

isKeyword

public boolean isKeyword(WordCount wordCount)
Specified by:
isKeyword in class AbstractDocumentKeywordProcessor

initialize

public void initialize(java.util.Map initParams)
Description copied from interface: IResultSetProcessor
Dynamic initialization.

Specified by:
initialize in interface IResultSetProcessor
Overrides:
initialize in class AbstractDocumentKeywordProcessor

initialize

public void initialize(org.w3c.dom.Element elem)
Description copied from interface: IResultSetProcessor
Initialize from the XML Element.

Specified by:
initialize in interface IResultSetProcessor
Overrides:
initialize in class AbstractDocumentKeywordProcessor

getConfigurationXML

public java.lang.String getConfigurationXML()