Title:
Method and apparatus for processing semistructured textual data
Document Type and Number:
Japanese Patent JP2002534741
Kind Code:
A
Abstract:
A method of processing semistructured data, in particular semistructured textual data, to output data that conforms to a predetermined structure. The semistructured data is structured into one or more elements according to a given syntax; the actual content of a syntax element is variable and is called a token. The method comprises extracting one or more tokens from the semistructured data by means of an extractor ("parser"), the parser being capable of returning at least one token in response to a specific command that identifies the requested token by a token identifier. The method further comprises: providing a sequence of commands and an associated data structure definition, together called a loader, the loader comprising the commands necessary to cause the parser to return the one or more tokens to be extracted; causing the parser, by the loader's sequence of commands, to extract the tokens from the semistructured data; and converting the extracted tokens into the predetermined data structure defined by the associated structure definition.
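
The parser/loader split described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The flat-file syntax (a two-letter line code followed by content), the sample record, and the names `Parser`, `get_token`, `Loader`, `Entry` and `split_keywords` are all assumptions made for illustration. The parser returns a token only when asked for it by identifier; the loader pairs a sequence of such requests with a structure definition and converts the returned tokens into that structure.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Hypothetical semistructured record: "<CODE>   <content>" per line.
# The content after the line code plays the role of the "token".
SAMPLE = """\
ID   P12345
DE   Example protein
KW   kinase; transferase
"""


class Parser:
    """Extractor that returns one token per command, looked up by token identifier."""

    def __init__(self, text: str) -> None:
        self._text = text

    def get_token(self, token_id: str) -> Optional[str]:
        # Scan the text on demand for the line carrying the requested identifier.
        pattern = rf"^{re.escape(token_id)}[ \t]+(.*)$"
        match = re.search(pattern, self._text, re.MULTILINE)
        return match.group(1).strip() if match else None


@dataclass
class Entry:
    """The predetermined output structure (an assumed example)."""
    accession: Optional[str]
    description: Optional[str]
    keywords: Optional[List[str]]


@dataclass
class Loader:
    """A sequence of commands plus the associated structure definition."""
    # Each command: (token identifier, target field, converter for the raw token).
    commands: List[Tuple[str, str, Callable[[str], object]]]
    structure: type

    def load(self, parser: Parser) -> object:
        values = {}
        for token_id, field_name, convert in self.commands:
            token = parser.get_token(token_id)  # command issued to the parser
            values[field_name] = convert(token) if token is not None else None
        return self.structure(**values)  # tokens converted into the structure


def split_keywords(token: str) -> List[str]:
    # Turn "a; b" into ["a", "b"].
    return [part.strip() for part in token.split(";") if part.strip()]


loader = Loader(
    commands=[
        ("ID", "accession", str),
        ("DE", "description", str),
        ("KW", "keywords", split_keywords),
    ],
    structure=Entry,
)
entry = loader.load(Parser(SAMPLE))
print(entry)
```

In this sketch the parser does no work for identifiers the loader never requests, so a different loader (other commands, other structure) can reuse the same parser unchanged.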

Inventors:
Esourt, Toure
Coupie, Thieri
Application Number:
JP2000592752A
Publication Date:
October 15, 2002
Filing Date:
December 23, 1999
Assignee:
Lion Bioscience Aktiengesellschaft
International Classes:
G06F17/22; G06F17/27; G06F17/30; (IPC1-7): G06F17/30
Attorney, Agent or Firm:
Takashi Ishida (4 others)