Title:
PROBABILISTIC DATA MINING MODEL COMPARISON ENGINE
Document Type and Number:
WIPO Patent Application WO/2012/045496
Kind Code:
A3
Abstract:
A comparison engine for comparing a first data mining model and a second data mining model is disclosed. A first data mining model M1 represents results of a first data mining task on a first data set D1 and provides a set of first prediction values. A second data mining model M2 represents results of a second data mining task on a second data set D2 and provides a set of second prediction values. A relation R is determined between said sets of prediction values. For at least a first record of an input data set, a first and a second probability distribution are created based on the first and second data mining models applied to the first record, said probability distributions associating probabilities with said sets of prediction values. A distance measure d is calculated for said first record using the first and second probability distributions and the relation. At least one region of interest is determined based on said distance measure d.
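
The steps in the abstract can be illustrated with a minimal sketch. The abstract does not define the distance measure d or how a region of interest is derived, so everything concrete here is an assumption made for illustration: d is taken to be the probability that the two predictions are not related under R, a "region of interest" is simplified to the set of records whose d exceeds a threshold, and the names `distance`, `regions_of_interest`, `model1`, `model2` and the toy data are hypothetical.

```python
from typing import Callable, Dict, List, Set, Tuple

Distribution = Dict[str, float]    # prediction value -> probability
Relation = Set[Tuple[str, str]]    # pairs (value of M1, value of M2) deemed related
Record = Dict[str, float]
Model = Callable[[Record], Distribution]


def distance(p1: Distribution, p2: Distribution, relation: Relation) -> float:
    """Assumed measure: probability that the two predictions are NOT related by R."""
    agreement = sum(p1.get(a, 0.0) * p2.get(b, 0.0) for a, b in relation)
    return 1.0 - agreement


def regions_of_interest(records: List[Record], m1: Model, m2: Model,
                        relation: Relation, threshold: float = 0.5):
    """Simplification: return (record, d) for every record with d above threshold."""
    flagged = []
    for record in records:
        d = distance(m1(record), m2(record), relation)
        if d > threshold:
            flagged.append((record, d))
    return flagged


# Hypothetical stand-ins for M1 and M2; each maps a record to a distribution.
def model1(record: Record) -> Distribution:
    if record["age"] < 30:
        return {"low": 0.2, "high": 0.8}
    return {"low": 0.9, "high": 0.1}


def model2(record: Record) -> Distribution:
    if record["income"] > 50000:
        return {"accept": 0.9, "reject": 0.1}
    return {"accept": 0.3, "reject": 0.7}


# Relation R between the two sets of prediction values (assumed for the example).
R: Relation = {("low", "accept"), ("high", "reject")}

records = [
    {"age": 25, "income": 60000},
    {"age": 45, "income": 60000},
    {"age": 25, "income": 20000},
]

flagged = regions_of_interest(records, model1, model2, R)
for record, d in flagged:
    print(record, round(d, 2))
```

With this toy data only the first record is flagged: M1 leans towards "high" while M2 leans towards "accept", a pair that R does not relate, so the assumed distance is large.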

Inventors:
LINGENFELDER CHRISTOPH (DE)
WURST MICHAEL (DE)
POMPEY PASCAL (IE)
Application Number:
PCT/EP2011/062076
Publication Date:
September 07, 2012
Filing Date:
July 14, 2011
Assignee:
IBM (US)
LINGENFELDER CHRISTOPH (DE)
WURST MICHAEL (DE)
POMPEY PASCAL (IE)
International Classes:
G06F17/18; G06K9/62
Foreign References:
US7636698B2 2009-12-22
Other References:
POMPEY P: "Aufbau einer Methode zum semantischen Vergleich von heterogenen Data Mining Modellen", RECORD OF DIPLOMA THESIS SUBMITTED AT THE INSTITUTE OF APPLIED INFORMATICS AND FORMAL DESCRIPTION METHODS (AIFB), KARLSRUHE, GERMANY, 2010, XP055031381, Retrieved from the Internet [retrieved on 20120629]
DEMSAR J: "Statistical comparisons of classifiers over multiple data sets", JOURNAL OF MACHINE LEARNING RESEARCH, vol. 7, 2006, pages 1-30, XP055031390
SALZBERG S L: "On comparing classifiers: pitfalls to avoid and a recommended approach", DATA MINING AND KNOWLEDGE DISCOVERY, vol. 1, 1997, pages 317-328, XP055031391
Attorney, Agent or Firm:
KUISMA, Sirpa (IBM-Allee 1, Ehningen, DE)