StartNet: Online Detection of Action Start in Untrimmed Videos

Gao, Mingfei; Xu, Mingze; Davis, Larry S.; Socher, Richard; Xiong, Caiming

Computer Science > Computer Vision and Pattern Recognition

arXiv:1903.09868 (cs)

[Submitted on 23 Mar 2019]

Title:StartNet: Online Detection of Action Start in Untrimmed Videos

Authors:Mingfei Gao, Mingze Xu, Larry S. Davis, Richard Socher, Caiming Xiong

View PDF

Abstract:We propose StartNet to address Online Detection of Action Start (ODAS) where action starts and their associated categories are detected in untrimmed, streaming videos. Previous methods aim to localize action starts by learning feature representations that can directly separate the start point from its preceding background. It is challenging due to the subtle appearance difference near the action starts and the lack of training data. Instead, StartNet decomposes ODAS into two stages: action classification (using ClsNet) and start point localization (using LocNet). ClsNet focuses on per-frame labeling and predicts action score distributions online. Based on the predicted action scores of the past and current frames, LocNet conducts class-agnostic start detection by optimizing long-term localization rewards using policy gradient methods. The proposed framework is validated on two large-scale datasets, THUMOS'14 and ActivityNet. The experimental results show that StartNet significantly outperforms the state-of-the-art by 15%-30% p-mAP under the offset tolerance of 1-10 seconds on THUMOS'14, and achieves comparable performance on ActivityNet with 10 times smaller time offset.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1903.09868 [cs.CV]
	(or arXiv:1903.09868v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1903.09868

Submission history

From: Mingfei Gao [view email]
[v1] Sat, 23 Mar 2019 19:14:53 UTC (366 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mingfei Gao
Mingze Xu
Larry S. Davis
Richard Socher
Caiming Xiong

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:StartNet: Online Detection of Action Start in Untrimmed Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:StartNet: Online Detection of Action Start in Untrimmed Videos

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators