Jul 3, 2020 · In this challenge, action recognition is posed as the problem of simultaneously predicting a single verb and noun class label given an input trimmed video clip.
A recently proposed spatial-temporal video attention model, called 'W3' ('What-Where-When') attention, is introduced, which introduces a simple yet effective ...
Jul 3, 2020 · The challenging aspects of this real-life action recognition task include small fast moving objects, complex hand-object interactions, and ...
Jul 3, 2020 · In this attempt, we present a novel egocentric action recognition solution based on video attention learning and temporal contextual learning ...
Abstract: In egocentric videos, actions occur in quick succession. We capitalise on the action's temporal context and propose a method that learns to attend ...
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries - Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz, T-PAMI 2021.
Abstract—We present EgoACO, a deep neural architecture for video action recognition that learns to pool action-context-object.
Egocentric activity recognition is one of the most challenging tasks in video analysis. It requires a fine-grained discrimination of small objects and ...
Sep 22, 2023 · This work proposes a method for video action anticipation that integrates prior action predictions into the hidden-state of a recurrent model.
Feb 11, 2021 · We present EgoACO, a deep neural architecture for video action recognition that learns to pool action-context-object descriptors from frame-level features.