Traditional text classification approaches often require a substantial amount of labeled data, which is difficult to obtain, especially in restricted domains or less widespread languages. This lack of labeled data has led to the rise of low-resource methods, which assume low data availability in natural language processing. Among them, zero-shot learning stands out: it consists of learning a classifier without any previously labeled data. The best results reported with this approach use language models such as Transformers, but they run into two problems: high execution time and an inability to handle long texts as input. This paper proposes a new model, ZeroBERTo, which leverages an unsupervised clustering step to obtain a compressed data representation...
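As a rough illustration of the kind of pipeline this abstract describes (cluster first, then classify the compressed representation), the sketch below embeds documents, clusters them, builds a short keyword summary per cluster, and runs an NLI-based zero-shot classifier once per cluster instead of once per document. The embedding model, KMeans clustering, keyword summaries, and the bart-large-mnli classifier are all illustrative assumptions, not the authors' exact ZeroBERTo implementation.

```python
# Minimal sketch of a "cluster, then classify" zero-shot pipeline in the
# spirit of the abstract above. The embedding model, KMeans clustering,
# keyword-based cluster summaries, and the NLI classifier are illustrative
# assumptions, not the exact ZeroBERTo implementation.
import numpy as np
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import CountVectorizer
from transformers import pipeline


def cluster_then_classify(docs, candidate_labels, n_clusters=10):
    # 1) Unsupervised step: embed the documents and cluster the embeddings.
    #    The clusters act as a compressed representation of the corpus.
    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    embeddings = embedder.encode(docs, show_progress_bar=False)
    clusters = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(embeddings)

    # 2) Summarize each cluster by its most frequent terms, so the classifier
    #    sees one short text per cluster instead of many long documents.
    vectorizer = CountVectorizer(stop_words="english", max_features=5000)
    counts = vectorizer.fit_transform(docs)
    vocab = vectorizer.get_feature_names_out()
    cluster_summary = {}
    for c in np.unique(clusters):
        term_counts = np.asarray(counts[clusters == c].sum(axis=0)).ravel()
        top_terms = vocab[np.argsort(term_counts)[::-1][:10]]
        cluster_summary[c] = " ".join(top_terms)

    # 3) Zero-shot step: classify each cluster summary once with an NLI model
    #    and propagate the predicted label to every document in the cluster.
    classifier = pipeline("zero-shot-classification",
                          model="facebook/bart-large-mnli")
    cluster_label = {
        c: classifier(summary, candidate_labels)["labels"][0]
        for c, summary in cluster_summary.items()
    }
    return [cluster_label[c] for c in clusters]
```

Because the (potentially long) documents are only used to form clusters, the expensive NLI forward passes scale with the number of clusters rather than the number of documents, which is the efficiency argument the abstract sketches.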
Object recognition systems usually require fully complete manually labeled training data t...
Within a situation where Semi-Supervised Learning (SSL) is available to exploit unlabeled data, this...
Humans can obtain the knowledge of novel visual concepts from language descriptions, and we thus use...
In recent years, the community of natural language processing (NLP) has seen amazing progress in the...
Nowadays, owing to the superior capacity of the large pre-trained language models (PLM), the PLM-bas...
Pretrained language models (PLMs) have demonstrated remarkable performance in various natural langua...
There is a growing interest in dataset generation recently due to the superior generative capacity o...
Existing solutions to zero-shot text classification either conduct prompting with pre-trained langu...
Our research focuses on solving the zero-shot text classification problem in NLP, with a particular ...
Existing Zero-Shot Learning (ZSL) techniques for text classification typically assign a label to a p...
We propose a novel framework for zero-shot learning of topic-dependent language models, which enable...
Classifying a visual concept merely from its associated online textual source, such as a Wikipedia a...
This paper presents the findings of the LoResMT 2020 Shared Task on zero-shot translation for low re...
Pretrained language models have shown success in various areas of natural language processing, inclu...
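Several of the snippets above refer to zero-shot classification by prompting a pretrained language model. A common way to make that concrete, shown below as a hedged sketch rather than any specific paper's method, is to cast each candidate label as an entailment hypothesis ("This text is about X.") and score it with an NLI model; the model name and hypothesis template are assumptions.

```python
# Hedged sketch of entailment-based zero-shot classification: each candidate
# label is turned into a hypothesis and scored against the document with an
# NLI model. The model name and hypothesis template are assumptions, not a
# reconstruction of any particular paper's setup.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_NAME = "facebook/bart-large-mnli"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME)


def zero_shot_scores(text, candidate_labels, template="This text is about {}."):
    entail_id = model.config.label2id["entailment"]
    contra_id = model.config.label2id["contradiction"]
    scores = {}
    for label in candidate_labels:
        inputs = tokenizer(text, template.format(label),
                           truncation=True, return_tensors="pt")
        with torch.no_grad():
            logits = model(**inputs).logits[0]
        # Probability of entailment vs. contradiction, ignoring "neutral".
        probs = torch.softmax(logits[[contra_id, entail_id]], dim=-1)
        scores[label] = probs[1].item()
    return max(scores, key=scores.get), scores
```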