Transformers have emerged as the state-of-the-art neural network architecture for natural language processing and computer vision. In the foundation model paradigm, large transformer models (BERT, GPT-3/4, BLOOM, ViT) are pre-trained on self-supervised tasks such as word or image masking, and then adapted through fine-tuning for downstream user applications, including instruction following and question answering. While many approaches have been developed for model fine-tuning, including low-rank weight update strategies (e.g., LoRA), the underlying mathematical principles that enable network adaptation without knowledge loss remain poorly understood. Here, we introduce a differential geometry framework, functionally invariant paths (FIP), that prov...
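The low-rank update strategy mentioned above can be illustrated with a minimal sketch. This is not the FIP paper's method, only the generic LoRA-style idea of freezing a pre-trained weight and training a small rank-r correction; all shapes, names, and initializations here are illustrative assumptions.

```python
import numpy as np

# Minimal sketch of a LoRA-style low-rank weight update (illustrative
# shapes; a real adapter would be trained with gradient descent).
d_out, d_in, r = 64, 64, 4           # rank r is much smaller than d_in, d_out
W0 = np.random.randn(d_out, d_in)    # pre-trained weight, kept frozen
A = np.random.randn(r, d_in) * 0.01  # trainable down-projection
B = np.zeros((d_out, r))             # trainable up-projection, initialized to zero

def adapted_forward(x):
    # Effective weight is W0 + B @ A; the correction B @ A has rank <= r,
    # so only r * (d_in + d_out) parameters are learned instead of d_in * d_out.
    return (W0 + B @ A) @ x

x = np.random.randn(d_in)
y = adapted_forward(x)
```

Because B starts at zero, the adapted model initially reproduces the frozen pre-trained model exactly, which is one reason this family of methods tends to preserve prior knowledge.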
The highly non-linear nature of deep neural networks causes them to be susceptible to adversarial ex...
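As a concrete illustration of the susceptibility referred to above, here is a hedged sketch of the fast gradient sign method (FGSM), one standard way adversarial examples are crafted; the abstract does not specify its attack, and `grad_loss` below is a hypothetical stand-in for the gradient of a model's loss with respect to its input.

```python
import numpy as np

def fgsm_perturb(x, grad_loss, eps=0.03):
    # Move each input coordinate by eps in the direction that increases
    # the loss: an L-infinity-bounded adversarial perturbation.
    return x + eps * np.sign(grad_loss)

x = np.array([0.2, -0.5, 0.9])
grad_loss = np.array([1.3, -0.7, 0.0])  # toy gradient values
x_adv = fgsm_perturb(x, grad_loss)
```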
In recent years, artificial intelligence and machine learning have witnessed a radical transform...
Deep neural networks of sizes commonly encountered in practice are proven to c...
Machine learning models are subject to changing circumstances and will degrade over time. Nowadays,...
Artificial neural network learning is typically accomplished via adaptation between neurons. This pa...
Learning the gradient of a neuron's activity function, like the weights of its links, gives rise to a new specificat...
In this work, we suggest Kernel Filtering Linear Overparameterization (KFLO), where a linear cascade...
Deep learning has produced impressive empirical breakthroughs, but many theoretical ques...
We study the theory of neural networks (NNs) through the lens of classical nonparametric regression probl...
Over the past decade, deep neural networks have solved ever more complex tasks across many fronts in...
Thesis (Ph.D.), University of Washington, 2020. In the past decade, deep learning has revolutionized ma...
It is now widely acknowledged that neural network language models outperform backoff language models in a...
In contrast to SGD, adaptive gradient methods like Adam allow robust training of modern deep network...
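For reference, the Adam update this abstract contrasts with SGD can be sketched as follows; this is the standard Kingma & Ba formulation of a single step, with illustrative variable names and default hyperparameters, not code from the cited work.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    # Exponential moving averages of the gradient and its elementwise square.
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)  # bias-corrected first moment
    v_hat = v / (1 - b2 ** t)  # bias-corrected second moment
    # Per-coordinate adaptive step: large accumulated gradients get scaled down.
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

theta, m, v = np.zeros(3), np.zeros(3), np.zeros(3)
grad = np.array([0.1, -0.2, 0.3])
theta, m, v = adam_step(theta, grad, m, v, t=1)
```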
Across scientific and engineering disciplines, the algorithmic pipeline for processing and understand...
Deep neural networks have relieved human experts of a great deal of the burden of feature en...