pith. sign in

arxiv: 0910.0735 · v1 · submitted 2009-10-05 · 💻 cs.OH

A methodology for semi-automatic classification schema building

classification 💻 cs.OH
keywords classificationschemaapproachmethodologydocumentextensionalintensionalsemi-automatic
0
0 comments X
read the original abstract

This paper describe a methodology for semi-automatic classification schema definition (a classification schema is a taxonomy of categories useful for automatic document classification). The methodology is based on: (i) an extensional approach useful to create a typology starting from a document base, and (ii) an intensional approach to build the classification schema starting from the typology. The extensional approach uses clustering techniques to group together documents on the basis of a similarity measure, whereas the intensional approach uses different operations (aggregation, reduction, generalization specialization) to define classes. keywords: ontology, classification schema, fundamentum divisionis, cluster analysis classification task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.