Improving Reliability of Fine-tuning with Block-wise Optimisation

Barakat, Basel and Huang, Qiang. 2023. Improving Reliability of Fine-tuning with Block-wise Optimisation. arXiv, New York. [Other]

[img]
Preview
Text
2301.06133v1.pdf - Published Version
Available under License Creative Commons Attribution.

Download (3MB) | Preview

Abstract or Description

Finetuning can be used to tackle domain specific tasks by transferring knowledge learned from pre-trained models. However, previous studies on finetuning focused on adapting only the weights of a task-specific classifier or re-optimising all layers of the pre-trained model using the new task data. The first type of method cannot mitigate the mismatch between a pre-trained model and the new task data, and the second type of method easily causes over-fitting when processing tasks with limited data. To explore the effectiveness of fine-tuning, we propose a novel block-wise optimisation mechanism, which adapts the weights of a group of layers of a pre-trained model. This work presents a theoretical framework and empirical evaluation of block-wise fine-tuning to find a reliable fine tuning strategy. The proposed approach is evaluated on two datasets, Oxford Flowers (OXF) and Caltech 101 (CAL), using 15 commonly used pre-trained base models. Results indicate that the proposed strategy consistently outperforms the baselines in terms of classification accuracy, although the specific block leading to optimal performance may vary across models. The investigation reveals that selecting a block from the fourth quarter of a base model generally yields improved performance compared to the baselines. Overall, the block-wise approach consistently outperforms the baselines and exhibits higher accuracy and reliability. This study provides valuable insights into the selection of salient blocks and highlights the effectiveness of block-wise fine-tuning in achieving improved classification accuracy in various models and datasets.

Item Type:

Other

Identification Number (DOI):

https://doi.org/10.48550/arXiv.2301.06133

Departments, Centres and Research Units:

Computing

Date:

15 January 2023

Item ID:

38177

Date Deposited:

31 Jan 2025 12:52

Last Modified:

31 Jan 2025 12:52

URI:

https://research.gold.ac.uk/id/eprint/38177

View statistics for this item...

Edit Record Edit Record (login required)