com.raritantechnologies.verity.profiler
Class ProfilerDocumentKeywordProcessor
java.lang.Object
com.raritantechnologies.concept.AbstractDocumentKeywordProcessor
com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor
- All Implemented Interfaces:
- IConfigurable, IDocumentKeywordProcessor, IGatewayOutputProcessor, IResultSetProcessor
- public class ProfilerDocumentKeywordProcessor
- extends AbstractDocumentKeywordProcessor
Uses Verity Profiler to get a set of keywords for document clustering analysis.
XML Configuration Template:
<DocumentProcessor class="com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor" >
<ProfilerFilter>
</ProfilerFilter>
</DocumentProcessor>
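As a rough illustration of how a template like the one above could be turned into the org.w3c.dom.Element that initialize(org.w3c.dom.Element elem) accepts, the sketch below parses the template with the standard JDK XML parser. The class name ConfigTemplateSketch and its parseRoot helper are hypothetical and are not part of the Raritan API; how the framework actually loads and dispatches this configuration is not described here.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

/** Hypothetical helper: parses the configuration template with the JDK parser only. */
class ConfigTemplateSketch {
    // The template from the documentation, with an empty ProfilerFilter element.
    static final String TEMPLATE =
        "<DocumentProcessor class=\"com.raritantechnologies.verity.profiler.ProfilerDocumentKeywordProcessor\">"
        + "<ProfilerFilter></ProfilerFilter>"
        + "</DocumentProcessor>";

    /** Parses an XML string and returns its root element. */
    static Element parseRoot(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            return doc.getDocumentElement();
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse configuration XML", e);
        }
    }

    public static void main(String[] args) {
        Element root = parseRoot(TEMPLATE);
        // The class attribute names the processor implementation to instantiate.
        System.out.println(root.getAttribute("class"));
        // Count of ProfilerFilter child elements found in the template.
        System.out.println(root.getElementsByTagName("ProfilerFilter").getLength());
    }
}
```

An Element obtained this way is the kind of argument initialize(org.w3c.dom.Element elem) is declared to take; which attributes or children of ProfilerFilter the processor reads is not specified in this page.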
Method Summary
| java.lang.String | getConfigurationXML() |
| protected void | getWords(IResult result, java.lang.String text, java.lang.String docKey) Subclasses must implement this method: extract keywords from the text for the document given by resultKey. |
| void | initialize(org.w3c.dom.Element elem) Initialize the processor from an XML Element. |
| void | initialize(java.util.Map initParams) Dynamic initialization. |
| boolean | isKeyword(WordCount wordCount) |
Methods inherited from class com.raritantechnologies.concept.AbstractDocumentKeywordProcessor
addWord, addWord, dataComplete, dataComplete, getDocuments, getDocuments, getKeywordAssociations, getKeywords, getWordCounts, getWordDocumentMap, getWordDocumentMap, initialize, processData, processResult, processResultSet, reset
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
ProfilerDocumentKeywordProcessor
public ProfilerDocumentKeywordProcessor()
getWords
protected void getWords(IResult result,
java.lang.String text,
java.lang.String docKey)
- Description copied from class:
AbstractDocumentKeywordProcessor
- Subclasses must implement this method: extract keywords from the text for the document
given by resultKey. The implementation should call the addWord() method with each keyword
or word.
- Specified by:
getWords in class AbstractDocumentKeywordProcessor
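The contract described above (extract words from the text, then report each one through addWord) can be sketched as follows. This is a minimal illustration under stated assumptions, not the library's implementation: KeywordProcessorStandIn is a hypothetical stand-in for AbstractDocumentKeywordProcessor, the two-argument addWord(String, String) signature is assumed because the real overloads are not listed in this page, and the IResult parameter is omitted so the example compiles without the Raritan classes. The real ProfilerDocumentKeywordProcessor obtains its keywords from Verity Profiler rather than by splitting text.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Hypothetical stand-in for AbstractDocumentKeywordProcessor; NOT the real class. */
abstract class KeywordProcessorStandIn {
    // Words reported so far, stored as "docKey:word" for easy inspection.
    final List<String> recorded = new ArrayList<>();

    /** Assumed shape of addWord: records one word for one document. */
    protected void addWord(String word, String docKey) {
        recorded.add(docKey + ":" + word);
    }

    /** Mirrors the documented contract, minus the IResult parameter. */
    protected abstract void getWords(String text, String docKey);
}

/** Illustrative subclass: treats every lower-cased token as a candidate word. */
class SimpleTokenKeywordProcessor extends KeywordProcessorStandIn {
    @Override
    protected void getWords(String text, String docKey) {
        // Split on runs of non-word characters and report each non-empty token.
        for (String token : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!token.isEmpty()) {
                addWord(token, docKey);
            }
        }
    }
}
```

In this sketch every token is reported; in the documented class, isKeyword(WordCount wordCount) is the hook that decides which counted words qualify as keywords.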
isKeyword
public boolean isKeyword(WordCount wordCount)
- Specified by:
isKeyword in class AbstractDocumentKeywordProcessor
initialize
public void initialize(java.util.Map initParams)
- Description copied from interface:
IResultSetProcessor
- Dynamic initialization.
- Specified by:
initialize in interface IResultSetProcessor
- Overrides:
initialize in class AbstractDocumentKeywordProcessor
initialize
public void initialize(org.w3c.dom.Element elem)
- Description copied from interface:
IResultSetProcessor
- Initialize the processor from an XML Element.
- Specified by:
initialize in interface IResultSetProcessor
- Overrides:
initialize in class AbstractDocumentKeywordProcessor
getConfigurationXML
public java.lang.String getConfigurationXML()