DeepPyNet: A Deep Feature Pyramid Network for Optical Flow Estimation

Afsana Ahsan Jeny, Md Baharul Islam, Tarkan Aydin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Recent advances in optical flow prediction have been made possible by using feature pyramids and iterative refining. Though downsampling in feature pyramids may cause foreground items to merge with the background, the iterative processing could be incorrect in optical flow experiments. Particularly the outcomes of the movement of narrow and tiny objects can be more invisible in the flow scene. We introduce a novel method called DeepPyNet for optical flow estimation that includes feature extractor, multi-channel cost volume, and flow decoder. In this method, we propose a deep recurrent feature pyramid-based network for the end-to-end optical flow estimation. The feature extraction from each pixel of the feature map keeps essential information without modifying the feature receptive field. Then, a multi-scale 4D correlation volume is built from the visual similarity of each pair of pixels. Finally, we utilize the multi-scale correlation volumes to continuously update the flow field through an iterative recurrent method. Experimental results demonstrate that DeepPyNet significantly eliminates flow errors and provides state-of-the-art performance in various datasets. Moreover, DeepPyNet is less complex and uses only 6.1M parameters 81% and 35% smaller than the popular FlowNet and PWC-Net+, respectively.

Original languageEnglish
Title of host publicationProceedings of the 2021 36th International Conference on Image and Vision Computing New Zealand, IVCNZ 2021
EditorsMichael J. Cree
PublisherIEEE Computer Society
ISBN (Electronic)9781665406451
DOIs
Publication statusPublished - 2021
Externally publishedYes
Event36th International Conference on Image and Vision Computing New Zealand, IVCNZ 2021 - Tauranga, New Zealand
Duration: 9 Dec 202110 Dec 2021

Publication series

NameInternational Conference Image and Vision Computing New Zealand
Volume2021-December
ISSN (Print)2151-2191
ISSN (Electronic)2151-2205

Conference

Conference36th International Conference on Image and Vision Computing New Zealand, IVCNZ 2021
Country/TerritoryNew Zealand
CityTauranga
Period9/12/2110/12/21

Keywords

  • Convolutional neural network
  • Feature pyramid networks
  • Iterative recurrent unit
  • Multi-scale correlation volume
  • Optical flow estimation

Fingerprint

Dive into the research topics of 'DeepPyNet: A Deep Feature Pyramid Network for Optical Flow Estimation'. Together they form a unique fingerprint.

Cite this