ContextVP: Fully Context-Aware Video Prediction

Byeon, Wonmin; Wang, Qin; Srivastava, Rupesh Kumar; Koumoutsakos, Petros

arXiv:1710.08518 (cs)

[Submitted on 23 Oct 2017 (v1), last revised 9 Sep 2018 (this version, v3)]

Abstract: Video prediction models based on convolutional networks, recurrent networks, and their combinations often result in blurry predictions. We identify an important contributing factor for imprecise predictions that has not been studied adequately in the literature: blind spots, i.e., lack of access to all relevant past information for accurately predicting the future. To address this issue, we introduce a fully context-aware architecture that captures the entire available past context for each pixel using Parallel Multi-Dimensional LSTM units and aggregates it using blending units. Our model outperforms a strong baseline network of 20 recurrent convolutional layers and yields state-of-the-art performance for next step prediction on three challenging real-world video datasets: Human 3.6M, Caltech Pedestrian, and UCF-101. Moreover, it does so with fewer parameters than several recently proposed models, and does not rely on deep convolutional networks, multi-scale architectures, separation of background and foreground modeling, motion flow learning, or adversarial training. These results highlight that full awareness of past context is of crucial importance for video prediction.

Comments:	19 pages. ECCV 2018 oral presentation. Project webpage is at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1710.08518 [cs.CV]
	(or arXiv:1710.08518v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1710.08518

Submission history

From: Wonmin Byeon
[v1] Mon, 23 Oct 2017 21:55:12 UTC (314 KB)
[v2] Mon, 14 May 2018 01:18:16 UTC (2,586 KB)
[v3] Sun, 9 Sep 2018 09:55:04 UTC (1,335 KB)
