Spatio-Temporal Consistent Non-homogeneous Extreme Video Retargeting

Hassan Imani, Md Baharul Islam

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Due to the availability of heterogeneous display devices and their aspect ratios, video retargeting has received considerable research attention among researchers. Non-consistent video retargeting can significantly affect a video's spatial and temporal quality, particularly for extreme retargeting cases. Since no perfectly annotated datasets exist for video retargeting, deep learning-based techniques are rarely utilized. This paper proposes a method that learns to retarget videos by detecting the salient areas and shifting them to the appropriate location. First, we segment the salient objects using a unified Transformer model. Using convolutional layers and a shifting strategy, we shift and warp objects to the suitable size and location in the frame. We use 1D convolution for shifting the salient objects. We also use a frame interpolation technique to preserve temporal information. To train the network, we feed the retargeted frames to a variational auto-encoder network to map the retargeted frames back to the input frames. Besides, we design perceptual and wavelet-based loss functions to train our model. Thus, we train the network unsupervised. Extensive qualitative and quantitative experiments and ablation studies on the DAVIS dataset show the superiority of the proposed method over the existing state-of-The-Art methods.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Consumer Electronics, ICCE 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350324136
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event2024 IEEE International Conference on Consumer Electronics, ICCE 2024 - Las Vegas, United States
Duration: 6 Jan 20248 Jan 2024

Publication series

NameDigest of Technical Papers - IEEE International Conference on Consumer Electronics
ISSN (Print)0747-668X
ISSN (Electronic)2159-1423

Conference

Conference2024 IEEE International Conference on Consumer Electronics, ICCE 2024
Country/TerritoryUnited States
CityLas Vegas
Period6/01/248/01/24

Keywords

  • CNNs
  • Salient objects
  • Segmentation
  • Spatial and temporal coherence
  • Video retargeting

Fingerprint

Dive into the research topics of 'Spatio-Temporal Consistent Non-homogeneous Extreme Video Retargeting'. Together they form a unique fingerprint.

Cite this