Robust feature space separation for deep convolutional neural network training
Abstract This paper introduces two deep convolutional neural network training techniques that lead to more robust feature subspace separation in comparison to traditional training. Assume that dataset has M labels. The first method creates M deep convolutional neural networks called $$\{\text {DCNN}...
Guardado en:
Autores principales: | , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Springer
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/0c4975ea7511433cbff1e6d106499b8e |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:0c4975ea7511433cbff1e6d106499b8e |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:0c4975ea7511433cbff1e6d106499b8e2021-11-21T12:31:43ZRobust feature space separation for deep convolutional neural network training10.1007/s44163-021-00013-12731-0809https://doaj.org/article/0c4975ea7511433cbff1e6d106499b8e2021-11-01T00:00:00Zhttps://doi.org/10.1007/s44163-021-00013-1https://doaj.org/toc/2731-0809Abstract This paper introduces two deep convolutional neural network training techniques that lead to more robust feature subspace separation in comparison to traditional training. Assume that dataset has M labels. The first method creates M deep convolutional neural networks called $$\{\text {DCNN}_i\}_{i=1}^{M}$$ { DCNN i } i = 1 M . Each of the networks $$\text {DCNN}_i$$ DCNN i is composed of a convolutional neural network ( $$\text {CNN}_i$$ CNN i ) and a fully connected neural network ( $$\text {FCNN}_i$$ FCNN i ). In training, a set of projection matrices $$\{\mathbf {P}_i\}_{i=1}^M$$ { P i } i = 1 M are created and adaptively updated as representations for feature subspaces $$\{\mathcal {S}_i\}_{i=1}^M$$ { S i } i = 1 M . A rejection value is computed for each training based on its projections on feature subspaces. Each $$\text {FCNN}_i$$ FCNN i acts as a binary classifier with a cost function whose main parameter is rejection values. A threshold value $$t_i$$ t i is determined for $$i^{th}$$ i th network $$\text {DCNN}_i$$ DCNN i . A testing strategy utilizing $$\{t_i\}_{i=1}^M$$ { t i } i = 1 M is also introduced. The second method creates a single DCNN and it computes a cost function whose parameters depend on subspace separations using the geodesic distance on the Grasmannian manifold of subspaces $$\mathcal {S}_i$$ S i and the sum of all remaining subspaces $$\{\mathcal {S}_j\}_{j=1,j\ne i}^M$$ { S j } j = 1 , j ≠ i M . The proposed methods are tested using multiple network topologies. It is shown that while the first method works better for smaller networks, the second method performs better for complex architectures.Ali SekmenMustafa ParlaktunaAyad Abdul-MalekErdem ErdemirAhmet Bugra KokuSpringerarticleDeep Convolutional Neural NetworksSubspace SeparationRobust Deep LearningComputational linguistics. Natural language processingP98-98.5Electronic computers. Computer scienceQA75.5-76.95ENDiscover Artificial Intelligence, Vol 1, Iss 1, Pp 1-11 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Deep Convolutional Neural Networks Subspace Separation Robust Deep Learning Computational linguistics. Natural language processing P98-98.5 Electronic computers. Computer science QA75.5-76.95 |
spellingShingle |
Deep Convolutional Neural Networks Subspace Separation Robust Deep Learning Computational linguistics. Natural language processing P98-98.5 Electronic computers. Computer science QA75.5-76.95 Ali Sekmen Mustafa Parlaktuna Ayad Abdul-Malek Erdem Erdemir Ahmet Bugra Koku Robust feature space separation for deep convolutional neural network training |
description |
Abstract This paper introduces two deep convolutional neural network training techniques that lead to more robust feature subspace separation in comparison to traditional training. Assume that dataset has M labels. The first method creates M deep convolutional neural networks called $$\{\text {DCNN}_i\}_{i=1}^{M}$$ { DCNN i } i = 1 M . Each of the networks $$\text {DCNN}_i$$ DCNN i is composed of a convolutional neural network ( $$\text {CNN}_i$$ CNN i ) and a fully connected neural network ( $$\text {FCNN}_i$$ FCNN i ). In training, a set of projection matrices $$\{\mathbf {P}_i\}_{i=1}^M$$ { P i } i = 1 M are created and adaptively updated as representations for feature subspaces $$\{\mathcal {S}_i\}_{i=1}^M$$ { S i } i = 1 M . A rejection value is computed for each training based on its projections on feature subspaces. Each $$\text {FCNN}_i$$ FCNN i acts as a binary classifier with a cost function whose main parameter is rejection values. A threshold value $$t_i$$ t i is determined for $$i^{th}$$ i th network $$\text {DCNN}_i$$ DCNN i . A testing strategy utilizing $$\{t_i\}_{i=1}^M$$ { t i } i = 1 M is also introduced. The second method creates a single DCNN and it computes a cost function whose parameters depend on subspace separations using the geodesic distance on the Grasmannian manifold of subspaces $$\mathcal {S}_i$$ S i and the sum of all remaining subspaces $$\{\mathcal {S}_j\}_{j=1,j\ne i}^M$$ { S j } j = 1 , j ≠ i M . The proposed methods are tested using multiple network topologies. It is shown that while the first method works better for smaller networks, the second method performs better for complex architectures. |
format |
article |
author |
Ali Sekmen Mustafa Parlaktuna Ayad Abdul-Malek Erdem Erdemir Ahmet Bugra Koku |
author_facet |
Ali Sekmen Mustafa Parlaktuna Ayad Abdul-Malek Erdem Erdemir Ahmet Bugra Koku |
author_sort |
Ali Sekmen |
title |
Robust feature space separation for deep convolutional neural network training |
title_short |
Robust feature space separation for deep convolutional neural network training |
title_full |
Robust feature space separation for deep convolutional neural network training |
title_fullStr |
Robust feature space separation for deep convolutional neural network training |
title_full_unstemmed |
Robust feature space separation for deep convolutional neural network training |
title_sort |
robust feature space separation for deep convolutional neural network training |
publisher |
Springer |
publishDate |
2021 |
url |
https://doaj.org/article/0c4975ea7511433cbff1e6d106499b8e |
work_keys_str_mv |
AT alisekmen robustfeaturespaceseparationfordeepconvolutionalneuralnetworktraining AT mustafaparlaktuna robustfeaturespaceseparationfordeepconvolutionalneuralnetworktraining AT ayadabdulmalek robustfeaturespaceseparationfordeepconvolutionalneuralnetworktraining AT erdemerdemir robustfeaturespaceseparationfordeepconvolutionalneuralnetworktraining AT ahmetbugrakoku robustfeaturespaceseparationfordeepconvolutionalneuralnetworktraining |
_version_ |
1718418926690369536 |