TY - JOUR
T1 - Human activity recognition with fine-tuned CNN-LSTM
AU - Genc, Erdal
AU - Yildirim, Mustafa Eren
AU - Salman, Yucel Batu
N1 - Publisher Copyright:
© This is an open access article licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/).
PY - 2024/2/1
Y1 - 2024/2/1
N2 - Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.
AB - Human activity recognition (HAR) by deep learning is a challenging and interesting topic. Although there are robust models, there is also a bunch of parameters and variables, which affect the performance such as the number of layers, pooling type. This study presents a new deep learning architecture that is obtained by fine-tuning of the conventional CNN-LSTM model, namely, CNN (+3)-LSTM. Three changes are made to the conventional model to increase the accuracy. Firstly, kernel size is set to 1×1 to extract more information. Secondly, three convolutional layers are added to the model. Lastly, average pooling is used instead of max-pooling. Performance analysis of the proposed model is conducted on the KTH dataset and implemented on Keras. In addition to the overall accuracy of the proposed model, the contribution of each change is observed individually. Results show that adding layers made the highest contribution followed by kernel size and pooling, respectively. The proposed model is compared with state-of-art and outperformed some of the recent studies with a 94.1% recognition rate.
KW - KTH dataset
KW - convolutional neural network
KW - human activity recognition
UR - http://www.scopus.com/inward/record.url?scp=85187250614&partnerID=8YFLogxK
U2 - 10.2478/jee-2024-0002
DO - 10.2478/jee-2024-0002
M3 - Article
AN - SCOPUS:85187250614
SN - 1335-3632
VL - 75
SP - 8
EP - 13
JO - Journal of Electrical Engineering
JF - Journal of Electrical Engineering
IS - 1
ER -