bibtype C - Conference Paper (international conference)
ARLID 0577116
utime 20240402214613.7
mtime 20231026235959.9
SCOPUS 85179548445
DOI 10.1109/IPTA59101.2023.10320086
title (primary) (eng) CNN Ensemble Robust to Rotation Using Radon Transform
specification
page_count 6 s.
media_type E
serial
ARLID cav_un_epca*0576904
ISBN 979-8-3503-2541-6
title Proceedings of The 12th International Conference on Image Processing Theory, Tools and Applications (IPTA 2023)
publisher
place Piscataway
name IEEE
year 2023
keyword CNN
keyword rotation invariance
keyword equivariance
keyword Radon transform
keyword network fusion
keyword network ensemble
author (primary)
ARLID cav_un_auth*0457087
name1 Košík
name2 Václav
institution UTIA-B
full_dept (cz) Zpracování obrazové informace
full_dept (eng) Department of Image Processing
department (cz) ZOI
department (eng) ZOI
country CZ
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
author
ARLID cav_un_auth*0438860
name1 Karella
name2 Tomáš
institution UTIA-B
full_dept (cz) Zpracování obrazové informace
full_dept Department of Image Processing
department (cz) ZOI
department ZOI
country CZ
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
author
ARLID cav_un_auth*0101087
name1 Flusser
name2 Jan
institution UTIA-B
full_dept (cz) Zpracování obrazové informace
full_dept Department of Image Processing
department (cz) ZOI
department ZOI
full_dept Department of Image Processing
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
source
url http://library.utia.cas.cz/separaty/2023/ZOI/flusser-0577116.pdf
cas_special
project
project_id GA21-03921S
agency GA ČR
ARLID cav_un_auth*0412209
abstract (eng) A great deal of attention has been paid to alternative techniques to data augmentation in the literature. Their goal is to make convolutional neural networks (CNNs) invariant or at least robust to various transformations. In this paper, we present an ensemble model combining a classic CNN with an invariant CNN\nwhere both were trained without any augmentation. The goal is to preserve the performance of the classic CNN on nondeformed images (where it is supposed to classify more accurately) and the performance of the invariant CNN on deformed images (where it is the other way around). The combination is controlled by another network which outputs a coefficient that determines the fusion rule of the two networks. The auxiliary network is trained to output the coefficient depending on the intensity of the image deformation. In the experiments, we focus on rotation as a simple and most frequently studied case of transformation. In addition, we present a network invariant to rotation that is fed with the Radon transform of the input images. The performance of this network is tested on rotated MNIST and is further used in the ensemble whose performance is demonstrated on the CIFAR10- dataset.
action
ARLID cav_un_auth*0456973
name International Conference on Image Processing Theory, Tools and Applications (IPTA 2023) /12./
dates 20231016
mrcbC20-s 20231019
place Paris
country FR
RIV JD
FORD0 20000
FORD1 20200
FORD2 20204
reportyear 2024
num_of_auth 3
presentation_type PR
inst_support RVO:67985556
permalink https://hdl.handle.net/11104/0346499
confidential S
article_num 10320086
arlyear 2023
mrcbU14 85179548445 SCOPUS
mrcbU24 PUBMED
mrcbU34 WOS
mrcbU63 cav_un_epca*0576904 Proceedings of The 12th International Conference on Image Processing Theory, Tools and Applications (IPTA 2023) IEEE 2023 Piscataway 979-8-3503-2541-6