The days of large amorphous corpora collected with armies of Web crawlers and stored indefinitely are, or should be, coming to an end. A wealth of linguistic information is increasingly difficult to access because it is hidden in personal and private data that would be unethical and technically challenging to collect using traditional methods such as Web crawling and mass surveillance of online discussion spaces. Advances in privacy regulations such as GDPR and changes in the zeitgeist call into question the ethics of extracting information from unaware if not unwilling participants. Modern corpora need to adapt, be focused on testing specific hypotheses, and be respectful of the privacy of the people who ge...
Over the past few years, there has been an increase in the development and improvement of circumven...
With the growing popularity of social networks, cloud services and online applications, people are b...
Leakage of personal information in online conversations raises serious privacy concerns. For example...
The Coronavirus Discourses project supports public health partners Public Health Wales, Public Healt...
This article presents the privacy dictionary, a new linguistic resource for automated content analys...
Large multimodal language models have proven transformative in numerous applications. However, these...
Natural language privacy policies have become a de facto standard to address expectations of “notice...
The rapid advancement and widespread use of large language models (LLMs) have raised significant con...
The growing development of artificial intelligence (AI), particularly neural networks, is transformi...
Advanced Large Language Models (LLMs) struggle to produce accurate results and preserve user privacy...
The acceptable threshold for privacy is an individual choice, informed by culture, tradition and exp...
Speech recordings are a rich source of personal, sensitive data that can be used to support a pletho...
In recent events, user-privacy has been a main focus for all technological and data-holding companie...