Information processing apparatus and method, recording medium, and program PDF

59 Pages·2013·2.81 MB·English

Preview Information processing apparatus and method, recording medium, and program

US 20030220922A1

(19) United States
(12) Patent Application Publication
     Yamamoto et al.
(10) Pub. No.: US 2003/0220922 A1
(43) Pub. Date: Nov. 27, 2003

(54) INFORMATION PROCESSING APPARATUS AND METHOD, RECORDING MEDIUM, AND PROGRAM

(76) Inventors: Noriyuki Yamamoto, Tokyo (JP); Mari Saito, Kanagawa (JP)

Correspondence Address:
William S. Frommer, Esq.
FROMMER LAWRENCE & HAUG LLP
745 Fifth Avenue
New York, NY 10151 (US)

(21) Appl. No.: 10/401,345
(22) Filed: Mar. 28, 2003
(30) Foreign Application Priority Data
     Mar. 29, 2002 (JP) 2002-095413

Publication Classification
(51) Int. Cl.⁷: G06F 7/00
(52) U.S. Cl.: 707/7

(57) ABSTRACT

The present invention relates to an information processing apparatus. The information processing apparatus has database creating means for classifying existing document information into groups and creating a database having associated information about each of the groups; search means for searching predetermined document information for characteristic words; and presenting means for presenting, of the associated information created by the database creating means, associated information associated with the characteristic words searched by the search means. The database creating means includes: selecting means for selecting, of document information about all of the existing document information, the existing document information to be classified into the groups; classifying means for classifying the existing document information selected by the selecting means into the groups; single-out means for singling out at least one of the groups having the existing document information; acquiring means for acquiring associated information about at least one of the groups having the existing document information; and accumulating means for accumulating the associated information acquired by the acquiring means by relating the associated information with the groups.

[Representative drawing on the front page: the database creating processing flowchart, steps S1 to S13, as in FIG. 3.]

Drawing sheets (Patent Application Publication, Nov. 27, 2003, US 2003/0220922 A1)

Sheet 1 of 37, FIG. 1: block diagram. Labeled blocks: document acquisition block (21), document attribute processing block (22), document contents processing block (23), document characteristics database creating block (24), associated information search block (25), mailer, word processor program, event management block, database inquiry block, associated information presentation block, agent control block. Further reference numerals (including 31, 31A, and 32) are not legible enough to assign.

Sheet 2 of 37, FIG. 2: [figure text not recoverable]

Sheet 3 of 37, FIG. 3: database creating processing
  S1   Select electronic mail messages to be analyzed
  S2   Create topics (put electronic mail message into groups)
  S3   Single out primary topics
  S4   Perform morphological analysis
  S5   Delete unnecessary words
  S6   Compute evaluation values for words
  S7   Correct evaluation values for each of the words
  S8   Establish characteristic vector and sort word vectors based on evaluation values
  S9   Single out secondary topics
  S10  Determine recommended topic candidate
  S11  Determine recommended topic
  S12  Perform web search
  S13  Determine recommended associated information

Sheet 4 of 37, FIG. 4: selecting electronic mail to be analyzed
  S21  Reference send folder. Is the number of electronic mail messages sent in the last one week ≥ a predetermined number? (YES: go to S22; NO: go to S26)
  S26  Reference receive folder. Is the number of electronic mail messages received in the last one week ≥ a predetermined number? (YES: go to S22; NO: end)
  S22  Set date/time condition and address attribute condition
  S23  Perform filtering based on date/time condition and address attribute condition
  S24  Determine address condition
  S25  Perform filtering on all electronic mail messages based on address condition and date/time condition

Sheet 5 of 37, FIG. 5: [figure text not recoverable]

Sheet 6 of 37, FIG. 6: data structure (61)
  62  Topic ID (abcdef01)
  63  Date/time
  64  Subject
      Member (mail address)
  66  Mail message ID
  67  Characteristic vector
  69  Word vector
  68  Linked body

Sheet 7 of 37, FIG. 7: data structure (70)
  71  Character string
  72  Part of word
  73  Frequency
  74  Evaluation value

Sheet 8 of 37, FIG. 8: primary topic single-out processing
  S41  Is the number of groups (topics) ≥ a predetermined number? (YES: go to S42; NO: go to S43)
  S42  Set constituent mail count condition to deletion of equal to or less than the number of "a" mail messages
  S43  Set constituent mail count condition to deletion of equal to or less than the number of "b" mail messages
  S44  Based on constituent mail count condition, perform filtering on groups (topics)
       Return

Sheet 9 of 37, FIG. 9: morphological analysis processing
  S51  Any topic on which morphological analysis has not been performed? (YES: go to S52)
  S52  Perform morphological analysis on linked body corresponding to groups (topics)
  S53  Extract nouns
  S54  Generate word vectors
       Generate topic word table, word index table, and topic evaluation value table
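
The primary topic single-out processing of FIG. 8 (steps S41 to S44) can be illustrated with a minimal Python sketch. It is not taken from the patent text: the function name, the dictionary layout, and the parameter names `predetermined_number`, `a`, and `b` are assumptions made for illustration, and the comparison in S41 is read as "greater than or equal to" because the operator is garbled in the scanned figure.

```python
from typing import Dict, List


def single_out_primary_topics(
    topics: Dict[str, List[str]],
    predetermined_number: int,
    a: int,
    b: int,
) -> Dict[str, List[str]]:
    """Hypothetical sketch of FIG. 8 (S41 to S44).

    `topics` maps a topic ID to the IDs of its constituent mail messages.
    All names here are illustrative, not the patent's.
    """
    # S41: compare the number of groups (topics) with a predetermined number.
    # Assumption: the garbled operator in the figure is ">=".
    if len(topics) >= predetermined_number:
        # S42: delete topics with "a" or fewer constituent mail messages.
        max_deleted_count = a
    else:
        # S43: delete topics with "b" or fewer constituent mail messages.
        max_deleted_count = b

    # S44: filter the groups (topics) by the constituent mail count condition.
    return {
        topic_id: messages
        for topic_id, messages in topics.items()
        if len(messages) > max_deleted_count
    }


example_topics = {
    "t1": ["m1", "m2", "m3"],
    "t2": ["m4"],
    "t3": ["m5", "m6"],
}
# Three topics reach the predetermined number 3, so the "a" condition applies.
many = single_out_primary_topics(example_topics, predetermined_number=3, a=2, b=1)
# Three topics fall short of the predetermined number 5, so "b" applies.
few = single_out_primary_topics(example_topics, predetermined_number=5, a=2, b=1)
```

With these example values, `many` keeps only topic `t1`, while `few` keeps `t1` and `t3`.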
