22.01.2024

Presentation Series: Train Your Engineering Network. Emin Nakilcioglu

Image generated by DALL·E 3, using Microsoft Copilot Image Creator.

Parameter-Efficient Fine-Tuning for Domain-Specific Automatic Speech Recognition

Abstract

Since the introduction of early pre-trained language models such as Google’s BERT and the early GPT models, excitement and interest in foundation models have grown steadily. To adapt an existing pre-trained foundation model to a specific task or domain, the model needs to be fine-tuned on domain-specific data. Full fine-tuning, however, is resource-intensive and costly, because millions or even billions of parameters are updated during training.
Parameter-efficient fine-tuning (PEFT) is a technique designed to fine-tune models while minimizing the need for extensive resources and cost. It achieves this efficiency by freezing most of the pre-trained model’s layers and training only a small set of parameters specific to the downstream task. With the help of PEFT, we can strike a balance between retaining the valuable knowledge captured by the pre-trained model and adapting it effectively to the downstream task with far fewer trainable parameters.
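
As a rough illustration of the idea, the sketch below freezes a pre-trained Whisper ASR model and attaches small trainable LoRA adapters using the Hugging Face peft library. The model name, target modules, and hyperparameters are assumptions chosen for illustration, not necessarily the configuration discussed in the talk.

```python
# Minimal PEFT sketch: freeze a pre-trained ASR model and train only
# small LoRA adapters. Model and hyperparameters are illustrative.
from transformers import WhisperForConditionalGeneration
from peft import LoraConfig, get_peft_model

# Load a pre-trained ASR foundation model (assumed checkpoint).
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-small")

# Freeze the pre-trained weights so they are not updated during training.
for param in model.parameters():
    param.requires_grad = False

# Attach low-rank LoRA adapters to the attention projections; only these
# small matrices are optimized during fine-tuning on domain-specific data.
lora_config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # assumed attention projection names
    lora_dropout=0.05,
)
model = get_peft_model(model, lora_config)

# Typically only a small fraction of the parameters remain trainable.
model.print_trainable_parameters()
```

The resulting model can then be trained with a standard fine-tuning loop; since the base weights stay frozen, memory use and compute drop substantially compared with full fine-tuning.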

Talk in the series “Train Your Engineering Network”.