Is One Teacher Model Enough to Transfer Knowledge to a Student Model?
Nowadays, the transfer learning technique can be successfully applied in the deep learning field through techniques that fine-tune the CNN’s starting point so it may learn over a huge dataset such as ImageNet and continue to learn on a fixed dataset to achieve better performance. In this paper, we d...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
MDPI AG
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/b5508f2d11a04b6db13a9ff530052572 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:b5508f2d11a04b6db13a9ff530052572 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:b5508f2d11a04b6db13a9ff5300525722021-11-25T16:13:20ZIs One Teacher Model Enough to Transfer Knowledge to a Student Model?10.3390/a141103341999-4893https://doaj.org/article/b5508f2d11a04b6db13a9ff5300525722021-11-01T00:00:00Zhttps://www.mdpi.com/1999-4893/14/11/334https://doaj.org/toc/1999-4893Nowadays, the transfer learning technique can be successfully applied in the deep learning field through techniques that fine-tune the CNN’s starting point so it may learn over a huge dataset such as ImageNet and continue to learn on a fixed dataset to achieve better performance. In this paper, we designed a transfer learning methodology that combines the learned features of different teachers to a student network in an end-to-end model, improving the performance of the student network in classification tasks over different datasets. In addition to this, we tried to answer the following questions which are in any case directly related to the transfer learning problem addressed here. Is it possible to improve the performance of a small neural network by using the knowledge gained from a more powerful neural network? Can a deep neural network outperform the teacher using transfer learning? Experimental results suggest that neural networks can transfer their learning to student networks using our proposed architecture, designed to bring to light a new interesting approach for transfer learning techniques. Finally, we provide details of the code and the experimental settings.Nicola LandroIgnazio GalloRiccardo La GrassaMDPI AGarticledeep learningtransfer learningloss functionsIndustrial engineering. Management engineeringT55.4-60.8Electronic computers. Computer scienceQA75.5-76.95ENAlgorithms, Vol 14, Iss 334, p 334 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
deep learning transfer learning loss functions Industrial engineering. Management engineering T55.4-60.8 Electronic computers. Computer science QA75.5-76.95 |
spellingShingle |
deep learning transfer learning loss functions Industrial engineering. Management engineering T55.4-60.8 Electronic computers. Computer science QA75.5-76.95 Nicola Landro Ignazio Gallo Riccardo La Grassa Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
description |
Nowadays, the transfer learning technique can be successfully applied in the deep learning field through techniques that fine-tune the CNN’s starting point so it may learn over a huge dataset such as ImageNet and continue to learn on a fixed dataset to achieve better performance. In this paper, we designed a transfer learning methodology that combines the learned features of different teachers to a student network in an end-to-end model, improving the performance of the student network in classification tasks over different datasets. In addition to this, we tried to answer the following questions which are in any case directly related to the transfer learning problem addressed here. Is it possible to improve the performance of a small neural network by using the knowledge gained from a more powerful neural network? Can a deep neural network outperform the teacher using transfer learning? Experimental results suggest that neural networks can transfer their learning to student networks using our proposed architecture, designed to bring to light a new interesting approach for transfer learning techniques. Finally, we provide details of the code and the experimental settings. |
format |
article |
author |
Nicola Landro Ignazio Gallo Riccardo La Grassa |
author_facet |
Nicola Landro Ignazio Gallo Riccardo La Grassa |
author_sort |
Nicola Landro |
title |
Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
title_short |
Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
title_full |
Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
title_fullStr |
Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
title_full_unstemmed |
Is One Teacher Model Enough to Transfer Knowledge to a Student Model? |
title_sort |
is one teacher model enough to transfer knowledge to a student model? |
publisher |
MDPI AG |
publishDate |
2021 |
url |
https://doaj.org/article/b5508f2d11a04b6db13a9ff530052572 |
work_keys_str_mv |
AT nicolalandro isoneteachermodelenoughtotransferknowledgetoastudentmodel AT ignaziogallo isoneteachermodelenoughtotransferknowledgetoastudentmodel AT riccardolagrassa isoneteachermodelenoughtotransferknowledgetoastudentmodel |
_version_ |
1718413278521065472 |