Automatic identification of offensive or abusive language is necessary to curb unwanted behavior. However, generalizing a solution is challenging because each language has its own grammatical structure and vocabulary. Most prior work targeted Western languages; one study targeted a low-resource language (Urdu), but it used basic linguistic features and a small dataset. This study designed a new dataset (collected from popular Pakistani Facebook pages) containing 7,500 posts for offensive language detection in Urdu. The proposed methodology used four types of feature engineering models: three are frequency-based and the fourth is an embedding model. Frequency-based are either determined b...
Urdu is a language of the Indo-Aryan family, widely spoken in India and Pakistan, and an important m...
Nowadays, online social networks (OSNs) have become an integral part of our daily life and online users...
Offensive content is pervasive in social media and a reason for concern to companies and government ...
With the growth of social media platform influence, the effect of their misuse becomes more and more...
The pervasiveness of offensive content in social media has become an important reason for concern fo...
Social media platforms have become a substratum for people to enunciate their opinions and ideas acr...
The presence of offensive language on social media is very common, motivating platforms to invest in ...
Offensive content is pervasive in social media and a reason for concern to companies and government ...
Threatening content detection on social media has recently gained attention. There is very limited w...
Text classification of low-resource languages is always a non-trivial and challenging problem. This paper...
In our increasingly interconnected digital world, social media platforms have emerged as powerful ch...
The presence of offensive language on social media is very common, motivating platforms to invest in ...
An increase in the volume of false information circulating as a direct consequence of the rise in th...
Although over 169 million people in the world are familiar with the Urdu language and a large quanti...