Download PDFOpen PDF in browser

Predicting Episodic Video Memorability Using Deep Features Fusion Strategy

EasyChair Preprint 7501

16 pagesDate: February 24, 2022

Abstract

Video memorability prediction has become an important research topic in computer vision in recent years. The movie's input is highly remembered and gains much attention with unbounded time constraints. Episodic memory is a fascinating research area that needs much attention using video processing tools and techniques. Episodic memories are long-lasting with complete detail. Movies are one of the best instances of episodic memory. This paper proposes a novel framework to fuse deep features to predict the probability of recalling episodic events. Memories are reproducible and sensitive to a sophisticated set of properties rather than low-level properties—the proposed framework pin up the fusion of text, visual and motion features. A fuzzy-based FastText model, a supervised text extraction module, is designed to extract the annotations with their relevant classes. The color histogram analysis is done to determine the dominant color region that performs as a connected fragment to form episodic video sequences. A novel Faster R-CNN is designed to discover the scene objects using an informative regional proposal network formation. Here, the modified loss function sorts out the lowest overlapping regions yielding the best proposals. The 'high-level' properties are collected using Principal Component Analysis (PCA) to form episodic shots. These are fused to estimate the memorability score. The proposed framework is implemented in Mediaeval 2018 datasets. A superior spearman's rank correlation result is achieved as 0.6428 short-term and 0.4285 long-term memorabilities than the latest comparable methods.

Keyphrases: Faster R-CNN, Fuzzy based FastText, Regional Proposal Network & MediaEval 2018 datasets, Video memorability prediction, episodic memory

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:7501,
  author    = {Hasnain Ali and Syed Omer Gilani and Muhammad Jawad Khan and Mohsin Jamil and Muazzam Khattak},
  title     = {Predicting Episodic Video Memorability Using Deep Features Fusion Strategy},
  howpublished = {EasyChair Preprint 7501},
  year      = {EasyChair, 2022}}
Download PDFOpen PDF in browser