It is challenging to discover relevant features from long documents that describe user information needs due to the nature of text where synonymy, polysemy noise, and high dimensionality are inherited problems. Traditional feature selection methods could not effectively deal with these problems, because they assume that documents describe one topic only. Topic-based techniques, such as Latent Dirichlet Allocation (LDA), relax this assumption. They have been developed on the basis that a document can exhibit multiple hidden topics. However, LDA does not show encouraging results in selecting relevant features, because LDA calculates the weight of terms based on their local documents and does not generalise it globally at the collection level....
Collection selection is a crucial function, central to the effectiveness and efficiency of a federat...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Unsupervised topic models, such as Latent Dirichlet Allocation (LDA), are widely used as automated f...
Selecting features from documents that describe user information needs is challenging due to the nat...
Many mature term-based or pattern-based approaches have been used in the field of information filter...
This thesis presents innovative and effective feature selection models and frameworks to select and ...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
Abstract — Text classification has become a critical step in big data analytics. For supervised mach...
Collection selection is a crucial function, central to the effectiveness and efficiency of a federate...
Topic modelling, such as Latent Dirichlet Allocation (LDA), was proposed to generate statistical mod...
Recently, a probabilistic topic modelling approach, latent dirichlet allocation (LDA), has been exte...
The machine learning & text mining area topic modeling has been extensively accepted etc. To generat...
Topic modelling methods such as Latent Dirichlet Allocation (LDA) have been successfully applied to ...
Topic modelling methods such as Latent Dirichlet Allocation (LDA) have been successfully applied to ...
Collection selection is a crucial function, central to the effectiveness and efficiency of a federat...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
Unsupervised topic models, such as Latent Dirichlet Allocation (LDA), are widely used as automated f...
Selecting features from documents that describe user information needs is challenging due to the nat...
Many mature term-based or pattern-based approaches have been used in the field of information filter...
This thesis presents innovative and effective feature selection models and frameworks to select and ...
Abstract-Text categorization is the task of automatically assigning unlabeled text documents to some...
Abstract — Text classification has become a critical step in big data analytics. For supervised mach...
Collection selection is a crucial function, central to the effectiveness and efficiency of a federate...
Topic modelling, such as Latent Dirichlet Allocation (LDA), was proposed to generate statistical mod...
Recently, a probabilistic topic modelling approach, latent dirichlet allocation (LDA), has been exte...
The machine learning & text mining area topic modeling has been extensively accepted etc. To generat...
Topic modelling methods such as Latent Dirichlet Allocation (LDA) have been successfully applied to ...
Topic modelling methods such as Latent Dirichlet Allocation (LDA) have been successfully applied to ...
Collection selection is a crucial function, central to the effectiveness and efficiency of a federat...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...
This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientif...