In online domain-specific customer service applications, many companies struggle to deploy advanced NLP models successfully, due to the limited availability of and noise in their datasets. While prior research demonstrated the potential of migrating large open-domain pretrained models for domain-specific tasks, the appropriate (pre)training strategies have not yet been rigorously evaluated in such social media customer service settings, especially under multilingual conditions. We address this gap by collecting a multilingual social media corpus containing customer service conversations (865k tweets), comparing various pipelines of pretraining and finetuning approaches, applying them on 5 different end tasks. We show that pretraining a gene...
Fine-tuning pre-trained language models has significantly advanced the state of art in a wide range ...
The appearance of complex attention-based language models such as BERT, Roberta or GPT-3 has allowed...
Social media is commonly used by organizations to address customer service issues like complaints. F...
In online domain-specific customer service applications, many companies struggle to deploy advanced ...
This paper describes the submission of UZH_CLyp for the SemEval 2023 Task 9 "Multilingual Tweet Inti...
Language models are ubiquitous in current NLP, and their multilingual capacity has recently attracte...
We present TwHIN-BERT, a multilingual language model trained on in-domain data from the popular soci...
With growing societal acceptance and increasing cost efficiency due to mass production, service robo...
Digital connectivity is revolutionising people’s quality of life. As broadband and mobile services b...
Twitter has become an immensely popular platform where the users can share information within a cert...
We carried out a study in which we explored the feasibility of machine translation for Twitter for t...
We carried out a study in which we explored the feasibility of machine translation for Twitter for ...
Social media data such as Twitter messages ("tweets") pose a particular challenge to NLP systems bec...
Companies constantly rely on customer support to deliver pre-and post-sale services to their client...
Language models can be applied to a diverse set of tasks with great results, but training a language...
Fine-tuning pre-trained language models has significantly advanced the state of art in a wide range ...
The appearance of complex attention-based language models such as BERT, Roberta or GPT-3 has allowed...
Social media is commonly used by organizations to address customer service issues like complaints. F...
In online domain-specific customer service applications, many companies struggle to deploy advanced ...
This paper describes the submission of UZH_CLyp for the SemEval 2023 Task 9 "Multilingual Tweet Inti...
Language models are ubiquitous in current NLP, and their multilingual capacity has recently attracte...
We present TwHIN-BERT, a multilingual language model trained on in-domain data from the popular soci...
With growing societal acceptance and increasing cost efficiency due to mass production, service robo...
Digital connectivity is revolutionising people’s quality of life. As broadband and mobile services b...
Twitter has become an immensely popular platform where the users can share information within a cert...
We carried out a study in which we explored the feasibility of machine translation for Twitter for t...
We carried out a study in which we explored the feasibility of machine translation for Twitter for ...
Social media data such as Twitter messages ("tweets") pose a particular challenge to NLP systems bec...
Companies constantly rely on customer support to deliver pre-and post-sale services to their client...
Language models can be applied to a diverse set of tasks with great results, but training a language...
Fine-tuning pre-trained language models has significantly advanced the state of art in a wide range ...
The appearance of complex attention-based language models such as BERT, Roberta or GPT-3 has allowed...
Social media is commonly used by organizations to address customer service issues like complaints. F...