Facial Expression Recognition using Recent Convolutional Neural Network Models
dc.contributor.author | Yenilmez, Fulya | |
dc.contributor.author | Yildiz, Canan | |
dc.contributor.author | Ugur, Mukden | |
dc.date.accessioned | 2025-02-20T08:46:30Z | |
dc.date.available | 2025-02-20T08:46:30Z | |
dc.date.issued | 2023 | |
dc.department | Türk-Alman Üniversitesi | en_US |
dc.description | 26th International Conference on Computer and Information Technology, ICCIT 2023 -- 13 December 2023 through 15 December 2023 -- Cox's Bazar -- 197664 | en_US |
dc.description.abstract | In recent years, deep learning architectures have demonstrated remarkable achievements on diverse computer vision tasks, including the recognition of facial expressions. The development of precise and robust facial expression recognition (FER) models has the potential to improve a wide range of applications in various fields, such as human-computer interaction, emotion analysis, marketing, robotics, psychology, and health care.Within this domain, several Convolutional Neural Network (CNN) architectures have been employed to yield promising results. In this paper, we undertake a comparative analysis with the latest iterations of five widely adopted CNN models for the FER task: VGG19, InceptionV3, ResNet152V2, MobileNetV2, and EfficientNetV2B1. We also explore the effect of transfer learning, specifically the difference between pretraining on VGGFace versus ImageNet. Our primary goal is to comprehensively assess these architectures under identical conditions and provide clarity on their performance in comparison to each other and to previous works using various training strategies. Our assessments allow the conclusion that increasing the size and depth of these common "backbone"models leads to only minor improvements. Augmenting these models with additional architectural elements such as attention mechanisms seems much more promising in comparison. Our results further show that pretraining on a domain-specific dataset can lead to significant improvements, demonstrating the importance of a large FER related dataset. © 2023 IEEE. | |
dc.identifier.doi | 10.1109/ICCIT60459.2023.10441242 | |
dc.identifier.isbn | 979-835035901-5 | |
dc.identifier.scopus | 2-s2.0-85187362886 | |
dc.identifier.uri | https://doi.org/10.1109/ICCIT60459.2023.10441242 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12846/1753 | |
dc.indekslendigikaynak | Scopus | |
dc.language.iso | en | |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | |
dc.relation.ispartof | 2023 26th International Conference on Computer and Information Technology, ICCIT 2023 | |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.snmz | KA_Scopus_20250220 | |
dc.subject | CNN architectures | en_US |
dc.subject | face expression recognition | en_US |
dc.subject | fine-tuning | en_US |
dc.subject | neural networks | en_US |
dc.subject | transfer learning | en_US |
dc.title | Facial Expression Recognition using Recent Convolutional Neural Network Models | |
dc.type | Conference Object |