Title:
Device and method for detecting changes in quoted documents
Document Type and Number:
Japanese Patent JP5948304
Kind Code:
B2
Abstract:
PROBLEM TO BE SOLVED: To detect that a small portion of a quoted document, such as a single character or a single numeral, was modified when the document was excerpted.
SOLUTION: A character string segment is cut out of one sentence of an input document, and the start point of the segment is determined. Starting from that point, a window is slid by a predetermined number of characters at a time; at each position, the character string of a predetermined number of characters is converted into a digest by a hash function, and the document ID and the resulting digest group are stored in a digest DB. The digests are then read out from the digest DB. Given a window size w and an allowable error α for the start positions of character string segments, when character string segments that share digests with the same document are detected at positions separated by w+α, it is determined that a modification of size w or less has been made.
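The abstract's two stages (building the digest DB, then looking for a small gap between matching segments) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names, the choice of SHA-1, the in-memory dictionary standing in for the digest DB, and the reading of "separated by w+α" as "a gap of more than 1 and at most w+α between consecutive matching start positions" are all assumptions made for this example.

```python
import hashlib


def digests(text, w, step=1):
    """Yield (start, digest) for every w-character window, sliding by `step`.

    SHA-1 is an arbitrary choice here; the abstract only says "hash function".
    """
    for start in range(0, len(text) - w + 1, step):
        segment = text[start:start + w]
        yield start, hashlib.sha1(segment.encode("utf-8")).hexdigest()


def build_digest_db(docs, w):
    """Hypothetical digest DB: maps each digest to the document IDs containing it."""
    db = {}
    for doc_id, text in docs.items():
        for _, digest in digests(text, w):
            db.setdefault(digest, set()).add(doc_id)
    return db


def find_small_modifications(query, db, w, alpha):
    """Return {doc_id: [(prev_start, next_start), ...]} for suspicious gaps.

    A gap is reported when two consecutive matching windows of the query
    (both sharing a digest with the same document) start more than one
    position apart but no more than w + alpha apart. That threshold is this
    sketch's interpretation of the abstract, not a confirmed detail.
    """
    matched = {}  # doc_id -> set of query start positions with a shared digest
    for start, digest in digests(query, w):
        for doc_id in db.get(digest, ()):
            matched.setdefault(doc_id, set()).add(start)

    result = {}
    for doc_id, starts in matched.items():
        ordered = sorted(starts)
        gaps = [(a, b) for a, b in zip(ordered, ordered[1:])
                if 1 < b - a <= w + alpha]
        if gaps:
            result[doc_id] = gaps
    return result


# Example: one numeral of the original is altered in the quotation.
original = "The price is 1000 yen per unit today"
quoted = "The price is 9000 yen per unit today"
db = build_digest_db({"doc1": original}, w=4)
report = find_small_modifications(quoted, db, w=4, alpha=1)
```

In this example the four windows that overlap the altered character stop matching, so the matching start positions jump by w+1; with α = 1 the gap is reported, while an unmodified quotation produces no gaps at all.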
Inventors:
Funakoshi Kaname
Seiji Washizaki
Application Number:
JP2013229440A
Publication Date:
July 06, 2016
Filing Date:
November 05, 2013
Assignee:
Nippon Telegraph and Telephone Corporation
International Classes:
G06F17/27; G06F17/30
Domestic Patent References:
JP2010182238A
Foreign References:
US20130232160
Attorney, Agent or Firm:
Tadashige Ito
Tadahiko Ito
Ryuji Ishihara