karl bühler digital

Home > Edited Book >

Publication details

Verlag: Springer

Ort: Berlin

Jahr: 2002

Pages: 123-139

ISBN (Hardback): 9783540433385

Volle Referenz:

Hiroki Arimura, Hiroshi Sakamoto, Setsuo Arikawa, "Efficient data mining from large text databases", in: Progress in discovery science, Berlin, Springer, 2002

Abstrakt

In this paper, we consider the problem of discovering a simple class of combinatorial patterns from a large collection of unstructured text data. As a framework of data mining, we adopted optimized pattern discovery in which a mining algorithm discovers the best patterns that optimize a given statistical measure within a class of hypothesis patterns on a given data set. We present efficient algorithms for the classes of proximity word association patterns and report the experiments on the keyword discovery from Web data.

Publication details

Verlag: Springer

Ort: Berlin

Jahr: 2002

Pages: 123-139

ISBN (Hardback): 9783540433385

Volle Referenz:

Hiroki Arimura, Hiroshi Sakamoto, Setsuo Arikawa, "Efficient data mining from large text databases", in: Progress in discovery science, Berlin, Springer, 2002