专利内容由知识产权出版社提供
专利名称:System and method for performing efficient
document scoring and clustering
发明人:Kenji Kawai,Lynne Marie Evans申请号:US10626984申请日:20030725
公开号:US20050022106A1公开日:20050127
专利附图:
摘要:A system and method for providing efficient document scoring of conceptswithin a document set is described. A frequency of occurrence of at least one conceptwithin a document retrieved from the document set is determined. A concept weight is
analyzed reflecting a specificity of meaning for the at least one concept within thedocument. A structural weight is analyzed reflecting a degree of significance based onstructural location within the document for the at least one concept. A corpus weight isanalyzed inversely weighing a reference count of occurrences for the at least oneconcept within the document. A score associated with the at least one concept isevaluated as a function of the frequency, concept weight, structural weight, and corpusweight.
申请人:Kenji Kawai,Lynne Marie Evans
地址:Seattle WA US,Renton WA US
国籍:US,US
更多信息请下载全文后查看