Automatic taxonomy construction

Automatic taxonomy construction (ATC) is the use of software programs to generate taxonomical classifications from a body of texts called a corpus. ATC is a branch of natural language processing, which in turn is a branch of artificial intelligence.

A taxonomy (or taxonomical classification) is a scheme of classification, especially, a hierarchical classification, in which things are organized into groups or types.[1][2][3][4][5][6] Among other things, a taxonomy can be used to organize and index knowledge (stored as documents, articles, videos, etc.), such as in the form of a library classification system, or a search engine taxonomy, so that users can more easily find the information they are searching for. Many taxonomies are hierarchies (and thus, have an intrinsic tree structure), but not all are.

Manually developing and maintaining a taxonomy is a labor-intensive task requiring significant time and resources, including familiarity of or expertise in the taxonomy's domain (scope, subject, or field), which drives the costs and limits the scope of such projects. Also, domain modelers have their own points of view which inevitably, even if unintentionally, work their way into the taxonomy. ATC uses artificial intelligence techniques to quickly automatically generate a taxonomy for a domain in order to avoid these problems and remove limitations.

  1. ^ "Taxonomy". 10 October 2021.
  2. ^ "Taxonomy Definition & Meaning". Dictionary.com. Retrieved 2022-05-13.
  3. ^ "What is Taxonomy?". 14 August 2017.
  4. ^ "TAXONOMY | Meaning & Definition for UK English". Lexico.com. Archived from the original on March 2, 2021. Retrieved 2022-05-13.
  5. ^ "What is Taxonomy?". 20 August 2003.
  6. ^ "TAXONOMY (Noun) definition and synonyms | Macmillan Dictionary".

© MMXXIII Rich X Search. We shall prevail. All rights reserved. Rich X Search