Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module

Hassan Imani, Selim Zaim, Md Baharul Islam, Masum Shah Junayed

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Deep learning techniques are utilized for most computer vision tasks. Especially, Convolutional Neural Networks (CNNs) have shown great performance in detection and classification tasks. Recently, in the field of Stereoscopic Video Quality Assessment (SVQA), 3D CNNs are used to extract spatial and temporal features from stereoscopic videos, but the importance of the disparity information which is very important did not consider well. Most of the recently proposed deep learning-based methods mostly used cost volume methods to produce the stereo correspondence for large disparities. Because the disparities can differ considerably for stereo cameras with different configurations, recently the Parallax Attention Mechanism (PAM) is proposed that captures the stereo correspondence disregarding the disparity changes. In this paper, we propose a new SVQA model using a base 3D CNN-based network, and a modified PAM-based left and right feature fusion model. Firstly, we use 3D CNNs and residual blocks to extract features from the left and right views of a stereo video patch. Then, we modify the PAM model to fuse the left and right features with considering the disparity information, and using some fully connected layers, we calculate the quality score of a stereoscopic video. We divided the input videos into cube patches for data augmentation and remove some cubes that confuse our model from the training dataset. Two standard stereoscopic video quality assessment benchmarks of LFOVIAS3DPh2 and NAMA3DS1-COSPAD1 are used to train and test our model. Experimental results indicate that our proposed model is very competitive with the state-of-the-art methods in the NAMA3DS1-COSPAD1 dataset, and it is the state-of-the-art method in the LFOVIAS3DPh2 dataset.

Original languageEnglish
Title of host publicationDigitizing Production Systems - Selected Papers from ISPR 2021
EditorsNuman M. Durakbasa, M. Güneş Gençyılmaz
PublisherSpringer Science and Business Media Deutschland GmbH
Pages39-50
Number of pages12
ISBN (Print)9783030904203
DOIs
Publication statusPublished - 2022
EventInternational Symposium for Production Research, ISPR2021 - Virtual, Online
Duration: 7 Oct 20219 Oct 2021

Publication series

NameLecture Notes in Mechanical Engineering
ISSN (Print)2195-4356
ISSN (Electronic)2195-4364

Conference

ConferenceInternational Symposium for Production Research, ISPR2021
CityVirtual, Online
Period7/10/219/10/21

Keywords

  • 3D convolutional neural networks
  • Deep learning
  • Disparity
  • Parallax attention mechanism
  • Quality assessment
  • Stereoscopic video

Fingerprint

Dive into the research topics of 'Stereoscopic Video Quality Assessment Using Modified Parallax Attention Module'. Together they form a unique fingerprint.

Cite this