To provide a new mechanism for recognizing characters written in a document more accurately.
A plurality of field-by-field term dictionary databases 11a, 11b and 11c, each storing terms or characters classified by field, are prepared, and the field to which the content described in the document belongs is determined. The field-by-field term dictionary database related to the determined field is selected from the plurality of field-by-field term dictionary databases 11a, 11b and 11c, and character recognition is performed using the terms or characters stored in the selected database as candidates. Because the dictionary used for recognition is matched to the field of the document's content, the recognition accuracy can be expected to improve.
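The flow described above can be illustrated with a minimal Python sketch. It is not the method of the disclosure: the abstract does not say how the field is determined or how candidates are scored, so the sketch assumes the field is chosen by counting how many tokens of a rough first-pass reading already appear in each dictionary, and that candidates are matched by string similarity using the standard library's `difflib.get_close_matches`. The names `FIELD_DICTIONARIES`, `determine_field` and `recognize`, as well as the sample terms, are hypothetical.

```python
from difflib import get_close_matches

# Hypothetical stand-ins for the field-by-field term dictionary
# databases 11a, 11b and 11c; the fields and terms are invented.
FIELD_DICTIONARIES = {
    "medical": ["diagnosis", "prescription", "symptom", "dosage"],
    "legal": ["plaintiff", "defendant", "statute", "injunction"],
    "finance": ["dividend", "interest", "collateral", "principal"],
}


def determine_field(rough_tokens):
    """Assumed heuristic: pick the field whose dictionary contains the
    most tokens of the first-pass reading; return None if none match."""
    scores = {
        field: sum(1 for tok in rough_tokens if tok in terms)
        for field, terms in FIELD_DICTIONARIES.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def recognize(rough_tokens, cutoff=0.6):
    """Select the dictionary for the determined field and use its terms
    as candidates when correcting each first-pass token."""
    field = determine_field(rough_tokens)
    if field is None:
        # No field could be determined; leave the reading unchanged.
        return None, list(rough_tokens)
    candidates = FIELD_DICTIONARIES[field]
    corrected = []
    for tok in rough_tokens:
        match = get_close_matches(tok, candidates, n=1, cutoff=cutoff)
        # Keep the original token when no candidate is similar enough.
        corrected.append(match[0] if match else tok)
    return field, corrected
```

For example, calling `recognize(["diagnosis", "prescr1ption", "dosage", "the"])` first determines the field from the tokens that were read cleanly, then uses only that field's terms as candidates for the misread token. Restricting the candidates to one field is what keeps a similar-looking term from an unrelated field from being chosen.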
NAKAMURA KOTARO
TATENO SHOICHI
TANAKA KEI
SAITO TERUKA
KOYAMA TOSHIYA