A text mining system for bioinformatics: requirements and architecture

Ilkka Karanta, Antti Pesonen, Lauri Seitsonen, Paula Silvonen

    Research output: Chapter in Book/Report/Conference proceeding, Conference article in proceedings, Scientific, peer-review


    We describe OAT, a new information extraction system under development. It extracts (subject, predicate, object) triplets from natural language texts. It uses ontologies extensively: the results are saved in an ontology, and ontologies are used in the information extraction process itself. It can be adapted to a new domain of discourse, and it can also adapt within a domain of discourse by finding new concepts. This paper concentrates on the requirements and architecture of OAT.
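As a rough illustration of the (subject, predicate, object) triplets the abstract mentions, the sketch below shows one way such triplets could be represented and collected. It is not OAT's implementation: the names `Triplet` and `TripletStore` and the example facts are hypothetical. The triplets are written by hand rather than extracted from text, and a plain in-memory set stands in for the ontology.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triplet:
    """One (subject, predicate, object) statement. Hypothetical type, not from OAT."""
    subject: str
    predicate: str
    obj: str


class TripletStore:
    """Toy stand-in for an ontology: a set of triplets with two simple lookups."""

    def __init__(self):
        self._triplets = set()

    def add(self, triplet):
        # Adding the same triplet twice has no effect because a set is used.
        self._triplets.add(triplet)

    def concepts(self):
        # Every term that appears as a subject or an object.
        return {t.subject for t in self._triplets} | {t.obj for t in self._triplets}

    def about(self, subject):
        # All (predicate, object) pairs recorded for one subject, sorted for stable output.
        return sorted((t.predicate, t.obj) for t in self._triplets if t.subject == subject)


# Hand-written example triplets (illustrative only, not output of any extractor).
store = TripletStore()
store.add(Triplet("p53", "activates", "MDM2"))
store.add(Triplet("MDM2", "inhibits", "p53"))
store.add(Triplet("p53", "activates", "MDM2"))  # duplicate, ignored

print(store.about("p53"))
print(sorted(store.concepts()))
```

In this sketch a concept is "new" to the store simply when it is not yet in `concepts()`. How OAT itself finds new concepts is not described in the abstract.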
    Original language: English
    Title of host publication: Poster proceedings
    Subtitle of host publication: Industrial Conference on Data Mining, ICDM 2006. Leipzig, DE, 13 - 14 July 2006.
    Editors: Petra Perner
    Place of Publication: Leipzig
    Publication status: Published - 2006
    MoE publication type: Not Eligible


    • information extraction
    • ontology
    • software requirements
    • software architecture

