In this study parsimonious language models were used to construct word clouds of the proceedings of the European Parliament. Multiple design choices had to be made and are discussed. Important features are stemming during tokenization, including bigrams into the word cloud and multilingualism. Also, the original parsimonious language models were extended with an additional term dampening unigrams that already occurred in the word cloud. This algorithm was tested in a small user study, using proceedings of the University of Amsterdam Science faculty's student council. Members of this council had to give their preference for multiple word clouds constructed using either parsimonious language models or simple Term Frequencies (TF) with stop wo...
Abstract. We study the problem of computing semantics-preserving word clouds in which semantically r...
Word Clouds have gained an impressive momentum for summarizing text documents in the last years. The...
Tag cloud is a visual interface that summarizes an underlying data by depicting the most frequent te...
Word clouds are a summarised representation of a document’s text, similar to tag clouds which summar...
The size of the words (phecode) in each cloud indicates the weights of the phenotypes on the topic. ...
Word cloud representing the country of included studies, the size of each term is in proportion to i...
Word clouds are an increasingly popular means of presenting statistical summaries of document collec...
The size of each of the words within the categorical word clouds correlates to its frequency in the ...
This article examines student responses to a technique for summarizing electronically avai...
Abstract: Intuitive and effective access to large volumes of information is increasingly important. ...
Word Clouds to Visually Present the Most Informative Words in Subject CategoriesApril 2020 by Neslih...
We here demonstrate how two types of NLP models - a topic model and a word2vec model - can be combin...
In recent years, automated political text processing became an indispensable requirement for providi...
by Monika Barget In the second edition of Doing digital history with Python, I would like to address...
Word clouds are a popular tool for visualizing documents, but they are not a good tool for comparing...
Abstract. We study the problem of computing semantics-preserving word clouds in which semantically r...
Word Clouds have gained an impressive momentum for summarizing text documents in the last years. The...
Tag cloud is a visual interface that summarizes an underlying data by depicting the most frequent te...
Word clouds are a summarised representation of a document’s text, similar to tag clouds which summar...
The size of the words (phecode) in each cloud indicates the weights of the phenotypes on the topic. ...
Word cloud representing the country of included studies, the size of each term is in proportion to i...
Word clouds are an increasingly popular means of presenting statistical summaries of document collec...
The size of each of the words within the categorical word clouds correlates to its frequency in the ...
This article examines student responses to a technique for summarizing electronically avai...
Abstract: Intuitive and effective access to large volumes of information is increasingly important. ...
Word Clouds to Visually Present the Most Informative Words in Subject CategoriesApril 2020 by Neslih...
We here demonstrate how two types of NLP models - a topic model and a word2vec model - can be combin...
In recent years, automated political text processing became an indispensable requirement for providi...
by Monika Barget In the second edition of Doing digital history with Python, I would like to address...
Word clouds are a popular tool for visualizing documents, but they are not a good tool for comparing...
Abstract. We study the problem of computing semantics-preserving word clouds in which semantically r...
Word Clouds have gained an impressive momentum for summarizing text documents in the last years. The...
Tag cloud is a visual interface that summarizes an underlying data by depicting the most frequent te...