This dissertation proposes a novel methodology for knowledge discovery in large data sets, with a focus on unstructured and semi-structured textual data. To our knowledge, extracting knowledge from unstructured and semi-structured textual data is a major unsolved problem in the area of knowledge discovery in databases (KDD). The problem becomes particularly acute due to ambiguity and lexical variations in natural language. This thesis seeks to address these problems. Firstly, it proposes a unified methodology, called the Ontologybased Knowledge Discovery in unstructured and semi-structured Text (On-KDT) methodology, to discover knowledge from unstructured/semi-structured texts. This approach leverages semantic information encoded in ontolog...