karl bühler digital

Home > Edited Book > Contribution

Publication details

Publisher: Springer

Place: Berlin

Year: 2002

Pages: 123-139

ISBN (Hardback): 9783540433385

Full citation:

Hiroki Arimura, Hiroshi Sakamoto, Setsuo Arikawa, "Efficient data mining from large text databases", in: Progress in discovery science, Berlin, Springer, 2002

Abstract

In this paper, we consider the problem of discovering a simple class of combinatorial patterns from a large collection of unstructured text data. As a framework of data mining, we adopted optimized pattern discovery in which a mining algorithm discovers the best patterns that optimize a given statistical measure within a class of hypothesis patterns on a given data set. We present efficient algorithms for the classes of proximity word association patterns and report the experiments on the keyword discovery from Web data.

Publication details

Publisher: Springer

Place: Berlin

Year: 2002

Pages: 123-139

ISBN (Hardback): 9783540433385

Full citation:

Hiroki Arimura, Hiroshi Sakamoto, Setsuo Arikawa, "Efficient data mining from large text databases", in: Progress in discovery science, Berlin, Springer, 2002