A text mining system for bioinformatics: requirements and architecture

Ilkka Karanta, Antti Pesonen, Lauri Seitsonen, Paula Silvonen

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Abstract

    We describe OAT, a new information extraction system under development. It extracts (subject, predicate, object) triplets from natural language texts. It uses ontologies extensively: the results are saved in an ontology, and ontologies are used in the information extraction process itself. It is adaptable both to a domain of discourse and within a domain of discourse (finding new concepts). This paper concentrates on the requirements and architecture of OAT.
    Original languageEnglish
    Title of host publicationPoster proceedings
    Subtitle of host publicationIndustrial Conference on Data Mining, ICDM 2006. Leipzig, DE, 13 - 14 July 2006.
    EditorsPetra Perner
    Place of PublicationLeipzig
    Pages225-229
    Publication statusPublished - 2006
    MoE publication typeNot Eligible

    Keywords

    • information extraction
    • ontology
    • software requirements
    • software architecture

    Fingerprint Dive into the research topics of 'A text mining system for bioinformatics: requirements and architecture'. Together they form a unique fingerprint.

  • Cite this

    Karanta, I., Pesonen, A., Seitsonen, L., & Silvonen, P. (2006). A text mining system for bioinformatics: requirements and architecture. In P. Perner (Ed.), Poster proceedings: Industrial Conference on Data Mining, ICDM 2006. Leipzig, DE, 13 - 14 July 2006. (pp. 225-229).