The ability to identify influential training examples enables us to debug training data and explain model behavior. Existing techniques for doing so trace the flow of training-data influence through the model parameters. For large models in NLP applications, it is often computationally infeasible to study this flow through all model parameters, so techniques usually restrict attention to the last layer of weights. However, we observe that because the activation feeding the last layer of weights encodes ``shared logic'', the data influence calculated via the last-layer weights is prone to a ``cancellation effect'', in which the influences of different examples have large magnitudes that contradict each other. The cancellation effect lowers the d...
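To make the cancellation effect concrete, the following is a minimal sketch (an illustration under stated assumptions, not the paper's implementation) of a TracIn-style influence score restricted to the last layer of weights. The gradient factorization, the function names, and the toy pair of oppositely labeled training examples sharing one penultimate activation are all assumptions made for the example.

    import numpy as np

    # Minimal sketch: TracIn-style influence via last-layer gradient dot products.
    # For a linear last layer with softmax cross-entropy, the loss gradient w.r.t.
    # the weight matrix factorizes as outer(p - y, h), where h is the penultimate
    # activation, p the predicted distribution, and y the one-hot label.

    def last_layer_grad(h, p, y):
        """Loss gradient w.r.t. the last-layer weight matrix, flattened."""
        return np.outer(p - y, h).ravel()

    def influence(train_ex, test_ex):
        """Gradient dot product: > 0 pushes the test prediction toward its
        label, < 0 pushes it away."""
        return float(last_layer_grad(*train_ex) @ last_layer_grad(*test_ex))

    # Two training examples with opposite labels but the same penultimate
    # activation h -- the ``shared logic'' -- yield influences of equal
    # magnitude and opposite sign, so their aggregate influence cancels.
    rng = np.random.default_rng(0)
    h = rng.normal(size=16)
    p = np.array([0.5, 0.5])                   # undecided prediction
    y_a, y_b = np.array([1.0, 0.0]), np.array([0.0, 1.0])
    test = (h, p, y_a)

    inf_a = influence((h, p, y_a), test)       # large positive
    inf_b = influence((h, p, y_b), test)       # equally large negative
    print(inf_a, inf_b, inf_a + inf_b)         # sum is 0: cancellation

Because the two gradients are exact negations of each other, summing the per-example influences hides the fact that each one is individually large, which is the pathology the abstract describes.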
Traditionally, it has been assumed that rules are necessary to explain language acquisition. Recentl...
Despite the recent trend of developing and applying neural source code models to software engineerin...
Knowledge Distillation (KD) is a prominent neural model compression technique which heavily relies o...
Biases and artifacts in training data can cause unwelcome behavior in text classifiers (such as shal...
Good models require good training data. For overparameterized deep models, the causal relationship b...
Deep neural networks that dominate NLP rely on an immense amount of parameters and require large tex...
In recent years, the field of language modelling has witnessed exciting developments. Especially, th...
As the complexity of machine learning (ML) models increases, resulting in a lack of prediction expla...
Modern supervised learning algorithms can learn very accurate and complex discriminating functions. ...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
We argue that extrapolation to examples outside the training space will often be easier for models t...
Language Models (LMs) pre-trained with self-supervision on large text corpora have become the defaul...
Many recent works indicate that deep neural networks tend to take dataset biases as shortcuts to...
This paper aims to compare different regularization strategies to address a common phenomenon, sever...
Programmatic Weak Supervision (PWS) aggregates the source votes of multiple weak supervision sources...