Structure prediction (SP) tasks are important in natural language understanding because they extract complex, structured knowledge from text. Recently, unified text-to-text transformer models such as T5 and TANL have produced competitive results on SP tasks. These models cast SP tasks as a seq2seq problem, where a transformer generates sequences with special tokens representing the extracted spans, labels, and relationships. Compared with many popular natural language understanding models that are designed for a single task, the output of a text-to-text transformer is more flexible. With a suitable output format, it can be trained on multiple tasks jointly and take advantage of the shared knowledge between t...
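As a minimal sketch of the linearization idea described above, the snippet below wraps annotated spans with inline labels so a text-to-text model can be trained to generate them. The tag format and the `linearize` helper are illustrative assumptions, not the exact scheme used by T5 or TANL.

```python
# Sketch: turn a structured-prediction example into a seq2seq target string.
# Tag format "[ span | label ]" is assumed for illustration only.

def linearize(tokens, entities):
    """tokens: list of input tokens; entities: list of (start, end, label), end exclusive."""
    starts = {s: (e, lbl) for s, e, lbl in entities}
    out, i = [], 0
    while i < len(tokens):
        if i in starts:
            end, lbl = starts[i]
            out.append("[ " + " ".join(tokens[i:end]) + " | " + lbl + " ]")
            i = end
        else:
            out.append(tokens[i])
            i += 1
    return " ".join(out)


if __name__ == "__main__":
    toks = "Barack Obama was born in Hawaii .".split()
    ents = [(0, 2, "person"), (5, 6, "location")]
    # -> "[ Barack Obama | person ] was born in [ Hawaii | location ] ."
    print(linearize(toks, ents))
```

The model is then trained to map the plain token sequence to this augmented sequence, from which spans and labels can be parsed back out after decoding.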
Unsupervised learning of text representations aims at converting natural language into vector represen...
This article describes our experiments in neural machine translation using the recent Tensor2Tensor ...
In this paper, we take advantage of previous pre-trained models (PTMs) and propose a novel Chine...
The Transformer model is a recent, fast, and powerful development in neural machine translation. W...
The utility of linguistic annotation in neural machine translation seemed to have been established in...
Transformer networks have seen great success in natural language processing and machine vision, wher...
Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on...
Thesis (Master's)--University of Washington, 2021. Transformer models perform well on NLP tasks, but r...
Some Transformer-based models can perform cross-lingual transfer learning: those models can be train...
We introduce a method for improving the structural understanding abilities of language models. Unlik...
Multi-Task Learning (MTL) models have shown their robustness, effectiveness, and efficiency for tran...
Neural Machine Translation (NMT) is notorious for its need for large amounts of bilingual data. An ef...
Task-conditional architecture offers an advantage in parameter efficiency but falls short in performanc...
Natural language processing (NLP) techniques have improved significantly with the introduction of pre-trained l...
Text classification is one of the most important tasks in natural language processing (NLP). Recent...