Research at Sofia University >
Faculty of Mathematics and Informatics >
Papers >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10506/24
|
Title: | Feature Selection and Generalisation for Retrieval of Textual Cases |
Authors: | Wiratung, Nirmalie Koychev, Ivan Massie, Stewart |
Keywords: | Case-Based Reasoning Knowledge Acquisition |
Issue Date: | 2004 |
Publisher: | Proceeding of the 7-th European Conference on Case-Based Reasoning. Lecture Notes in Artificial Intelligence |
Citation: | Wiratunga, N., Koychev, I., Massie, S. (2004). Feature Selection and Generalisation for Retrieval of Textual Cases – in the Proceeding of the 7-th European Conference on Case-Based Reasoning. Lecture Notes in Artificial Intelligence, Springer, Berlin, Heidelberg, New York. (Best paper award). |
Abstract: | Textual CBR systems solve problems by reusing experiences that are in textual form. Knowledge-rich comparison of textual cases remains an important challenge for these systems. However mapping text data into a structured case representation requires a significant knowledge engineering effort. In this paper we look at automated acquisition of the case indexing vocabulary as a two step process involving feature selection followed by feature generalisation. Boosted decision stumps are employed as a means to select features that are predictive and relatively orthogonal. Association rule induction is employed to capture feature co-occurrence patterns. Generalised features are constructed by applying these rules. Essentially, rules preserve implicit semantic relationships between features and applying them has the desired effect of bringing together cases that would have otherwise been overlooked during case retrieval. Experiments with four textual data sets show significant improvement in retrieval accuracy whenever gener¬alised features are used. The results further suggest that boosted decision stumps with generalised features to be a promising combination. |
URI: | http://hdl.handle.net/10506/24 |
Appears in Collections: | Papers
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|