The focus of this paper is to investigate the possibility of predicting several user and message attributes in text-based, real-time, online messaging services. For this purpose, a large collection of chat messages is examined. The applicability of various supervised classification techniques for extracting information from the chat messages is evaluated. Two competing models are used for defining the chat mining problem. A term-based approach is used to investigate the user and message attributes in the context of vocabulary use while a style-based approach is used to examine the chat messages according to the variations in the authors' writing styles. Among 100 authors, the identity of an author is correctly predicted with 99.7% accuracy....
Recent years have seen an abundance of user-generated texts published online. Mining these texts for...
Providing personalized e-learning environment is normally relying on a domain model representing the...
Providing personalized e-learning environment is normally relying on a domain model representing the...
Cataloged from PDF version of article.The focus of this paper is to investigate the possibility of p...
The aim of this paper is to investigate the feasibility of predicting the gender of a text document'...
Authorship attribution, conceived as the identification of the origin of a text be- tween different ...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
Email, chat, instant messaging, blogs, and newsgroups are now common ways for people to interact. Al...
Social media and Instant Messaging applications have become popular among teenagers, making them vul...
In recent years, author identification has become an active research area, where the major differenc...
Various forms of computer-mediated communication (CMC) have become ubiquitous, and influence our liv...
Identifying the author of an electroni message is one of the main problems in text classification an...
In this paper, we report the collection and analysis of a corpus containing over 29,447 words, 2,541...
Recent years have seen an abundance of user-generated texts published online. Mining these texts for...
Providing personalized e-learning environment is normally relying on a domain model representing the...
Providing personalized e-learning environment is normally relying on a domain model representing the...
Cataloged from PDF version of article.The focus of this paper is to investigate the possibility of p...
The aim of this paper is to investigate the feasibility of predicting the gender of a text document'...
Authorship attribution, conceived as the identification of the origin of a text be- tween different ...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
The problem of online threats and abuse directed at public figures could potentially be mitigated wi...
Email, chat, instant messaging, blogs, and newsgroups are now common ways for people to interact. Al...
Social media and Instant Messaging applications have become popular among teenagers, making them vul...
In recent years, author identification has become an active research area, where the major differenc...
Various forms of computer-mediated communication (CMC) have become ubiquitous, and influence our liv...
Identifying the author of an electroni message is one of the main problems in text classification an...
In this paper, we report the collection and analysis of a corpus containing over 29,447 words, 2,541...
Recent years have seen an abundance of user-generated texts published online. Mining these texts for...
Providing personalized e-learning environment is normally relying on a domain model representing the...
Providing personalized e-learning environment is normally relying on a domain model representing the...