default search action
27th ACM Multimedia 2019: Nice, France
- Laurent Amsaleg, Benoit Huet, Martha A. Larson, Guillaume Gravier, Hayley Hung, Chong-Wah Ngo, Wei Tsang Ooi:
Proceedings of the 27th ACM International Conference on Multimedia, MM 2019, Nice, France, October 21-25, 2019. ACM 2019, ISBN 978-1-4503-6889-6
Keynote I
- Jean Carrive:
Using Artificial Intelligence to Preserve Audiovisual Archives: New Horizons, More Questions. 1-2
Session 1A: Multimodal Fusion&Visual Relations
- Chunxiao Liu, Zhendong Mao, An-An Liu, Tianzhu Zhang, Bin Wang, Yongdong Zhang:
Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching. 3-11 - Tan Wang, Xing Xu, Yang Yang, Alan Hanjalic, Heng Tao Shen, Jingkuan Song:
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking. 12-20 - Shijie Yang, Liang Li, Shuhui Wang, Dechao Meng, Qingming Huang, Qi Tian:
Structured Stochastic Recurrent Network for Linguistic Video Prediction. 21-29 - Hao Zhou, Chongyang Zhang, Chuanping Hu:
Visual Relationship Detection with Relative Location Mining. 30-38 - Tong Yu, Yilin Shen, Ruiyi Zhang, Xiangyu Zeng, Hongxia Jin:
Vision-Language Recommendation via Attribute Augmented Multimodal Reinforcement Learning. 39-47 - Huafeng Kuang, Rongrong Ji, Hong Liu, Shengchuan Zhang, Xiaoshuai Sun, Feiyue Huang, Baochang Zhang:
Multi-modal Multi-layer Fusion Network with Average Binary Center Loss for Face Anti-spoofing. 48-56 - Yi Hao, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang:
Dual-alignment Feature Embedding for Cross-modality Person Re-identification. 57-65 - Lan Wang, Jiahao Shi, Yang Wang, Feng Su:
Video Text Detection by Attentive Spatiotemporal Fusion of Deep Convolutional Features. 66-74 - David Semedo, João Magalhães:
Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints. 75-83 - Xufeng Qian, Yueting Zhuang, Yimeng Li, Shaoning Xiao, Shiliang Pu, Jun Xiao:
Video Relation Detection with Spatio-Temporal Graph. 84-93 - Xu Sun, Yuan Zi, Tongwei Ren, Jinhui Tang, Gangshan Wu:
Hierarchical Visual Relationship Detection. 94-102 - Wanneng Wang, Yanan Ma, Ke Gao, Juan Cao:
Cost-free Transfer Learning Mechanism: Deep Digging Relationships of Action Categories. 103-111 - Lixi Deng, Jingjing Chen, Qianru Sun, Xiangnan He, Sheng Tang, Zhaoyan Ming, Yongdong Zhang, Tat-Seng Chua:
Mixed-dish Recognition with Contextual Relation Networks. 112-120 - Sipeng Zheng, Shizhe Chen, Qin Jin:
Visual Relation Detection with Multi-Level Attention. 121-129
Session 1B: Affective Computing&Facial Analytics
- Yaochen Zhu, Zhenzhong Chen, Feng Wu:
Multimodal Deep Denoise Framework for Affective Video Content Analysis. 130-138 - Raj Kumar Gupta, Yinping Yang:
Predicting and Understanding News Social Popularity with Emotional Salience Features. 139-147 - Dong Zhang, Shoushan Li, Qiaoming Zhu, Guodong Zhou:
Effective Sentiment-relevant Word Selection for Multi-modal Sentiment Analysis in Spoken Language. 148-156 - Yue Gu, Xinyu Lyu, Weijia Sun, Weitian Li, Shuhong Chen, Xinyu Li, Ivan Marsic:
Mutual Correlation Attentive Factors in Dyadic Fusion Networks for Speech Emotion Recognition. 157-166 - Timothy Greer, Benjamin Ma, Matthew E. Sachs, Assal Habibi, Shrikanth S. Narayanan:
A Multimodal View into Music's Effect on Human Neural, Physiological, and Emotional Experience. 167-175 - Jia-Xin Ma, Hao Tang, Wei-Long Zheng, Bao-Liang Lu:
Emotion Recognition using Multimodal Residual LSTM Network. 176-183 - Yang Zhou, Wanli Yu, Zhu Li, Haibing Yin:
Stereoscopic Visual Discomfort Prediction Using Multi-scale DCT Features. 184-191 - Sicheng Zhao, Zizhou Jia, Hui Chen, Leida Li, Guiguang Ding, Kurt Keutzer:
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression. 192-201 - K. R. Prajwal, C. V. Jawahar, Ponnurangam Kumaraguru:
Towards Increased Accessibility of Meme Images with the Help of Rich Face Emotion Captions. 202-210 - Wenxuan Wang, Qiang Sun, Yanwei Fu, Tao Chen, Chenjie Cao, Ziqi Zheng, Guoqiang Xu, Han Qiu, Yu-Gang Jiang, Xiangyang Xue:
Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression. 211-219 - Juntong Cheng, Yi-Ping Phoebe Chen, Minjun Li, Yu-Gang Jiang:
TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved. 220-228 - Shiming Ge, Shengwei Zhao, Xindi Gao, Jia Li:
Fewer-Shots and Lower-Resolutions: Towards Ultrafast Face Recognition in the Wild. 229-237 - Can Wang, Shangfei Wang, Guang Liang:
Identity- and Pose-Robust Facial Expression Recognition through Adversarial Feature Learning. 238-246 - Veith Röthlingshöfer, Vivek Sharma, Rainer Stiefelhagen:
Self-supervised Face-Grouping on Graphs. 247-256
Session 1C: Fashion&Human Analysis
- Yunshan Ma, Xun Yang, Lizi Liao, Yixin Cao, Tat-Seng Chua:
Who, Where, and What to Wear?: Extracting Fashion Knowledge from Social Media. 257-265 - Na Zheng, Xuemeng Song, Zhaozheng Chen, Linmei Hu, Da Cao, Liqiang Nie:
Virtually Trying on New Clothing with Arbitrary Poses. 266-274 - Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng:
FashionOn: Semantic-guided Image-based Virtual Try-on with Detailed Human and Clothing Information. 275-283 - Weijian Ruan, Wu Liu, Qian Bao, Jun Chen, Yuhao Cheng, Tao Mei:
POINet: Pose-Guided Ovonic Insight Network for Multi-Person Pose Tracking. 284-292 - Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai:
M2E-Try On Net: Fashion from Model to Everyone. 293-301 - Xue Dong, Xuemeng Song, Fuli Feng, Peiguang Jing, Xin-Shun Xu, Liqiang Nie:
Personalized Capsule Wardrobe Creation with Garment and User Modeling. 302-310 - Xin Jin, Le Wu, Geng Zhao, Xiaodong Li, Xiaokun Zhang, Shiming Ge, Dongqing Zou, Bin Zhou, Xinghui Zhou:
Aesthetic Attributes Assessment of Images. 311-319 - Xuemeng Song, Xianjing Han, Yunkai Li, Jingyuan Chen, Xin-Shun Xu, Liqiang Nie:
GP-BPR: Personalized Compatibility Modeling for Clothing Matching. 320-328 - Xin Wang, Bo Wu, Yueqi Zhong:
Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network. 329-337 - Xinchen Liu, Meng Zhang, Wu Liu, Jingkuan Song, Tao Mei:
BraidNet: Braiding Semantics and Details for Accurate Human Parsing. 338-346 - Mang Ye, Xiangyuan Lan, Qingming Leng:
Modality-aware Collaborative Learning for Visible Thermal Person Re-Identification. 347-355 - Yuyu Guo, Lianli Gao, Jingkuan Song, Peng Wang, Wuyuan Xie, Heng Tao Shen:
Adaptive Multi-Path Aggregation for Human DensePose Estimation in the Wild. 356-364 - Yukun Huang, Zheng-Jun Zha, Xueyang Fu, Wei Zhang:
Illumination-Invariant Person Re-Identification. 365-373 - Jianbo Wang, Kai Qiu, Houwen Peng, Jianlong Fu, Jianke Zhu:
AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance. 374-382
Session 1D: Live Multimedia Applications&Streaming
- Xiao Liu, Lin Zhang, Ying Shen, Shaoming Zhang, Shengjie Zhao:
Online Camera Pose Optimization for the Surround-view System. 383-391 - Xiang Chen, Tam V. Nguyen, Zhiqi Shen, Mohan S. Kankanhalli:
LiveSense: Contextual Advertising in Live Streaming Videos. 392-400 - Nicholas Diliberti, Chao Peng, Christopher Kaufman, Yangzi Dong, Jeffrey T. Hansberger:
Real-Time Gesture Recognition Using 3D Sensory Data and a Light Convolutional Neural Network. 401-410 - Yuqian Fu, Chengrong Wang, Yanwei Fu, Yu-Xiong Wang, Cong Bai, Xiangyang Xue, Yu-Gang Jiang:
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent. 411-419 - Rui-Xiao Zhang, Ming Ma, Tianchi Huang, Haitian Pang, Xin Yao, Chenglei Wu, Jiangchuan Liu, Lifeng Sun:
Livesmart: A QoS-Guaranteed Cost-Minimum Framework of Viewer Scheduling for Crowdsourced Live Streaming. 420-428 - Tianchi Huang, Chao Zhou, Rui-Xiao Zhang, Chenglei Wu, Xin Yao, Lifeng Sun:
Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning. 429-437 - Silas L. Fong, Salma Emara, Baochun Li, Ashish Khisti, Wai-Tian Tan, Xiaoqing Zhu, John G. Apostolopoulos:
Low-Latency Network-Adaptive Error Control for Interactive Streaming. 438-446 - Jounsup Park, Klara Nahrstedt:
Navigation Graph for Tiled Media Streaming. 447-455 - Yu Guan, Xinggong Zhang, Zongming Guo:
CACA: Learning-based Content-aware Cache Admission for Video Content in Edge Caching. 456-464 - Yabin Zhu, Chenglong Li, Bin Luo, Jin Tang, Xiao Wang:
Dense Feature Aggregation and Pruning for RGBT Tracking. 465-472 - Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang:
Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking. 473-481 - Gaoang Wang, Yizhou Wang, Haotian Zhang, Renshu Gu, Jenq-Neng Hwang:
Exploit the Connectivity: Multi-Object Tracking with TrackletNet. 482-490 - Yusen Li, Haoyuan Liu, Xiwei Wang, Lingjun Pu, Trent G. Marbach, Shanjiang Tang, Gang Wang, Xiaoguang Liu:
Themis: Efficient and Adaptive Resource Partitioning for Reducing Response Delay in Cloud Gaming. 491-499 - Can Zhang, Yuexian Zou, Guang Chen, Lei Gan:
PAN: Persistent Appearance Network with an Efficient Motion Cue for Fast Action Recognition. 500-509
Keynote II
- Pernille Bjørn, María Menéndez-Blanco:
FemTech: Broadening Participation to Digital Technology Development. 510-511
Session 2A: Knowledge Processing&Action Analysis
- Peng Zhang, Li Su, Liang Li, Bing-Kun Bao, Pamela C. Cosman, Guorong Li, Qingming Huang:
Training Efficient Saliency Prediction Models with Knowledge Distillation. 512-520 - Tao Zhuo, Zhiyong Cheng, Peng Zhang, Yongkang Wong, Mohan S. Kankanhalli:
Explainable Video Action Reasoning via Prior Knowledge and State Transitions. 521-529 - Guohao Li, Xin Wang, Wenwu Zhu:
Perceptual Visual Reasoning with Knowledge Propagation. 530-538 - Xuejing Liu, Liang Li, Shuhui Wang, Zheng-Jun Zha, Li Su, Qingming Huang:
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding. 539-547 - Xiaowen Huang, Quan Fang, Shengsheng Qian, Jitao Sang, Yan Li, Changsheng Xu:
Explainable Interaction-driven User Modeling over Knowledge Graph for Sequential Recommendation. 548-556 - Lei Meng, Long Chen, Xun Yang, Dacheng Tao, Hanwang Zhang, Chunyan Miao, Tat-Seng Chua:
Learning Using Privileged Information for Food Recognition. 557-565 - Bowen Pan, Shangfei Wang, Bin Xia:
Occluded Facial Expression Recognition Enhanced through Privileged Information. 566-573 - Yanli Ji, Feixiang Xu, Yang Yang, Ning Xie, Heng Tao Shen, Tatsuya Harada:
Attention Transfer (ANT) Network for View-invariant Action Recognition. 574-582 - Ziming Liu, Guangyu Gao, A. Kai Qin, Tong Wu, Chi Harold Liu:
Action Recognition with Bootstrapping based Long-range Temporal Context Attention. 583-591 - Changmao Cheng, Chi Zhang, Yichen Wei, Yu-Gang Jiang:
Sparse Temporal Causal Convolution for Efficient Action Modeling. 592-600 - Xiang Gao, Wei Hu, Jiaxiang Tang, Jiaying Liu, Zongming Guo:
Optimized Skeleton-based Action Recognition via Sparsified Graph Regression. 601-610 - Wanru Xu, Jian Yu, Zhenjiang Miao, Lili Wan, Qiang Ji:
Prediction-CGAN: Human Action Prediction with Conditional Generative Adversarial Networks. 611-619 - Haoze Wu, Zheng-Jun Zha, Xin Wen, Zhenzhong Chen, Dong Liu, Xuejin Chen:
Cross-Fiber Spatial-Temporal Co-enhanced Networks for Video Action Recognition. 620-628 - Dong Li, Ting Yao, Zhaofan Qiu, Houqiang Li, Tao Mei:
Long Short-Term Relation Networks for Video Action Detection. 629-637
Session 2B: Adversarial Learning
- Meijuan Jia, Hongyu Yang, Di Huang, Yunhong Wang:
Attacking Gait Recognition Systems via Silhouette Guided GANs. 638-646 - Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei:
Mocycle-GAN: Unpaired Video-to-Video Translation. 647-655 - Zitai Wang, Qianqian Xu, Ke Ma, Yangbangyan Jiang, Xiaochun Cao, Qingming Huang:
Adversarial Preference Learning with Pairwise Comparisons. 656-664 - Jiawei Liu, Zheng-Jun Zha, Richang Hong, Meng Wang, Yongdong Zhang:
Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search. 665-673 - Zhaoyu Zhang, Jun Yu:
STDGAN: ResBlock Based Generative Adversarial Nets Using Spectral Normalization and Two Different Discriminators. 674-682 - Tsai-Ho Sun, Chien-Hsun Lai, Sai-Keung Wong, Yu-Shuen Wang:
Adversarial Colorization of Icons Based on Contour and Color Conditions. 683-691 - Chen Ma, Chenxu Zhao, Hailin Shi, Li Chen, Jun-Hai Yong, Dan Zeng:
MetaAdvDet: Towards Robust Detection of Evolving Adversarial Attacks. 692-701 - Jen-Chun Lin, Wen-Li Wei, Tyng-Luh Liu, C.-C. Jay Kuo, Mark Liao:
Tell Me Where It is Still Blurry: Adversarial Blurred Region Mining and Refining. 702-710 - Rong Chen, Yuan Xie, Xiaotong Luo, Yanyun Qu, Cuihua Li:
Joint-attention Discriminator for Accurate Super-resolution via Adversarial Training. 711-719 - Hsin-Ying Hsieh, Chieh-Yu Chen, Yu-Shuen Wang, Jung-Hong Chuang:
BasketballGAN: Generating Basketball Play Simulation Through Sketching. 720-728 - Shuang Li, Chi Harold Liu, Binhui Xie, Limin Su, Zhengming Ding, Gao Huang:
Joint Adversarial Domain Adaptation. 729-737 - Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Fei Wu, Futai Zou:
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization. 738-746 - Jingjing Li, Erpeng Chen, Zhengming Ding, Lei Zhu, Ke Lu, Zi Huang:
Cycle-consistent Conditional Adversarial Transfer Networks. 747-755 - Peiying Li, Shikui Tu, Lei Xu:
GAN Flexible Lmser for Super-resolution. 756-764
Session 2C: Captioning&Video Analysis
- Longteng Guo, Jing Liu, Jinhui Tang, Jiangwei Li, Wei Luo, Hanqing Lu:
Aligning Linguistic Words and Visual Semantic Units for Image Captioning. 765-773 - Yaosi Hu, Zhenzhong Chen, Zheng-Jun Zha, Feng Wu:
Hierarchical Global-Local Temporal Modeling for Video Captioning. 774-783 - Yuqing Song, Shizhe Chen, Yida Zhao, Qin Jin:
Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards. 784-792 - Xinhang Song, Bohan Wang, Gongwei Chen, Shuqiang Jiang:
MUCH: Mutual Coupling Enhancement of Scene Recognition and Dense Captioning. 793-801 - Yongqing Zhu, Shuqiang Jiang:
Attention-based Densely Connected LSTM for Video Captioning. 802-810 - Elaheh Barati, Xuewen Chen:
Critic-based Attention Network for Event-based Video Captioning. 811-817 - Xiangxi Shi, Jianfei Cai, Shafiq R. Joty, Jiuxiang Gu:
Watch It Twice: Video Captioning with a Refocused Video Encoder. 818-826 - Jiaxin Wu, Sheng-Hua Zhong, Yan Liu:
MvsGCN: A Novel Graph Convolutional Network for Multi-video Summarization. 827-835 - Junbo Wang, Wei Wang, Zhiyong Wang, Liang Wang, Dagan Feng, Tieniu Tan:
Stacked Memory Network for Video Summarization. 836-844 - Jingyi Zhang, Zhen Wei, Ionut Cosmin Duta, Fumin Shen, Li Liu, Fan Zhu, Xing Xu, Ling Shao, Heng Tao Shen:
Generative Reconstructive Hashing for Incomplete Video Analysis. 845-854 - Zhanzhan Cheng, Jing Lu, Yi Niu, Shiliang Pu, Fei Wu, Shuigeng Zhou:
You Only Recognize Once: Towards Fast Video Text Spotting. 855-863 - Linxi Jiang, Xingjun Ma, Shaoxiang Chen, James Bailey, Yu-Gang Jiang:
Black-box Adversarial Attacks on Video Recognition Models. 864-872 - Zheng Wang, Xinyu Yan, Yahong Han, Meijun Sun:
Ranking Video Salient Object Detection. 873-881 - Donghyeon Cho, Yunjae Jung, François Rameau, Dahun Kim, Sanghyun Woo, In So Kweon:
Video Retargeting: Trade-off between Content Preservation and Spatio-temporal Consistency. 882-889
Session 2D: 3D Visual Processing
- Tianxin Huang, Yong Liu:
3D Point Cloud Geometry Compression on Deep Learning. 890-898 - Haotian Zhang, Gaoang Wang, Zhichao Lei, Jenq-Neng Hwang:
Eye in the Sky: Drone-Based Object Tracking and 3D Localization. 899-907 - Weizhi Nie, Qi Liang, An-An Liu, Zhendong Mao, Yangyang Li:
MMJN: Multi-Modal Joint Networks for 3D Shape Recognition. 908-916 - Yizhou Wang, Yen-Ting Huang, Jenq-Neng Hwang:
Monocular Visual Object 3D Localization in Road Scenes. 917-925 - Xiheng Zhang, Yongkang Wong, Mohan S. Kankanhalli, Weidong Geng:
Unsupervised Domain Adaptation for 3D Human Pose Estimation. 926-934 - Hongwen Zhang, Jie Cao, Guo Lu, Wanli Ouyang, Zhenan Sun:
DaNet: Decompose-and-aggregate Network for 3D Human Shape and Pose Estimation. 935-944 - Jun Yu, Chang Wen Chen, Zengfu Wang:
3D Singing Head for Music VR: Learning External and Internal Articulatory Synchronicity from Lyric, Audio and Notes. 945-952 - Shan Huang, Zhi Wang, Laizhong Cui, Yong Jiang, Rui Gao:
Fine-grained Fitting Experience Prediction: A 3D-slicing Attention Approach. 953-961 - Dawei Zhong, Lei Han, Lu Fang:
iDFusion: Globally Consistent Dense 3D Reconstruction from RGB-D and Inertial Measurements. 962-970 - Jian Wu, Jianbo Jiao, Qingxiong Yang, Zheng-Jun Zha, Xuejin Chen:
Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. 971-979 - Xiao Sun, Zhouhui Lian, Jianguo Xiao:
SRINet: Learning Strictly Rotation-Invariant Representations for Point Cloud Classification and Segmentation. 980-988 - Xinhai Liu, Zhizhong Han, Xin Wen, Yu-Shen Liu, Matthias Zwicker:
L2G Auto-encoder: Understanding Point Clouds by Local-to-Global Reconstruction with Hierarchical Self-Attention. 989-997 - Junnan Li, Jianquan Liu, Yongkang Wong, Shoji Nishimura, Mohan S. Kankanhalli:
Self-supervised Representation Learning Using 360° Data. 998-1006 - Ioannis Agtzidis, Mikhail Startsev, Michael Dorr:
360-degree Video Gaze Behaviour: A Ground-Truth Data Set and a Classification Algorithm for Eye Movements. 1007-1015
Demonstration I
- Ruben Tolosana, Rubén Vera-Rodríguez, Julian Fiérrez, Aythami Morales:
BioTouchPass Demo: Handwritten Passwords for Touchscreen Biometrics. 1023-1025 - Hannes Fassold:
Adapting Computer Vision Algorithms for Omnidirectional Video. 1026-1028 - Hanna Ragnarsdóttir, Þórhildur Þorleiksdóttir, Omar Shahbaz Khan, Björn Þór Jónsson, Gylfi Þór Guðmundsson, Jan Zahálka, Stevan Rudinac, Laurent Amsaleg, Marcel Worring:
Exquisitor: Breaking the Interaction Barrier for Exploration of 100 Million Images. 1029-1031 - Scott A. Carter, Laurent Denoue, Daniel Avrahami:
Documenting Physical Objects with Live Video and Object Detection. 1032-1034 - Maarten Wijnants, Sven Coppers, Gustavo Alberto Rovelo Ruiz, Peter Quax, Wim Lamotte:
Split & Dual Screen Comparison of Classic vs Object-based Video. 1035-1037 - Laurent Denoue, Scott A. Carter, Chelhwon Kim:
CamaLeon: Smart Camera for Conferencing in the Wild. 1038-1040 - Yi Dong, Chang Liu, Zhiqi Shen, Han Yu, Zhanning Gao, Pan Wang, Changgong Zhang, Peiran Ren, Xuansong Xie:
Personalized Video Summarization with Idiom Adaptation. 1041-1043 - Christine Bauer, Markus Schedl, Vera Angerer, Stefan Wegenkittl:
Tastalyzer: Audiovisual Exploration of Urban and Rural Variations in Music Taste. 1044-1046 - Yunjin Wu, Ziyuan Zhao, Shengqiang Zhang, Lulu Yao, Yan Yang, Tom Z. J. Fu, Stefan Winkler:
Interactive Multi-camera Soccer Video Analysis System. 1047-1049 - Naoki Sugimoto, Yuko Iinuma, Kiyoharu Aizawa:
Walker's Movie Map: Route Vies Synthesis Using Omni-directional Videos. 1050-1052 - Gjorgji Strezoski, Arumoy Shome, Riccardo Bianchi, Shruti Rao, Marcel Worring:
ACE: Art, Color and Emotion. 1053-1055 - Takumi Kiriu, Mohit Mittal, Panote Siriaraya, Yukiko Kawai, Shinsuke Nakajima:
Development of an Acoustic AR Gamification System to Support Physical Exercise. 1056-1058 - Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, Radu Horaud, Xiaofei Li, Bastien Mourgue, Guillaume Sarrazin:
Audio-Visual Variational Fusion for Multi-Person Tracking with Robots. 1059-1061 - Benjamin Renoust, Matheus Oliveira Franca, Jacob Chan, Van Le, Ayaka Uesaka, Yuta Nakashima, Hajime Nagahara, Jueren Wang, Yutaka Fujioka:
BUDA.ART: A Multimodal Content Based Analysis and Retrieval System for Buddha Statues. 1062-1064 - Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo:
Fast Video Quality Enhancement using GANs. 1065-1067 - Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei:
Animating Your Life: Real-Time Video-to-Animation Translation. 1068-1070
Reproducibility
- Kanchan Bahirat, Yu-Yen Chung, Thiru Annaswamy, Gargi Raval, Kevin Desai, Balakrishnan Prabhakaran, Michael Riegler:
Using Mr. MAPP for Lower Limb Phantom Pain Management. 1071-1075 - Zhengyu Zhao, Zhuoran Liu, Martha A. Larson, Ahmet Iscen, Naoko Nitta:
Reproducible Experiments on Adaptive Discriminative Region Discovery for Scene Recognition. 1076-1079 - Ilya Makarov, Dmitrii Maslov, Olga Gerasimova, Vladimir Aliev, Alisa Korinevskaya, Ujjwal Sharma, Haoliang Wang:
On Reproducing Semi-dense Depth Map Reconstruction using Deep Convolutional Neural Networks with Perceptual Loss. 1080-1084 - Mengbai Xiao, Shuoqian Wang, Chao Zhou, Li Liu, Zhenhua Li, Yao Liu, Songqing Chen, Lucile Sassatelli, Gwendal Simon:
Companion Paper for. 1085-1088
Best Paper Session (*note: Honorable Mentions*)
- Yingying Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu:
Multi-modal Knowledge-aware Hierarchical Attention Network for Explainable Medical Question Answering. 1089-1097 - Liqiang Nie, Wenjie Wang, Richang Hong, Meng Wang, Qi Tian:
Multimodal Dialog System: Generating Responses via Adaptive Decoders. 1098-1106 - Arun Asokan Nair, Austin Reiter, Changxi Zheng, Shree K. Nayar:
Audiovisual Zooming: What You See Is What You Hear. 1107-1118 - Zhiqi Shen, Shaojing Fan, Yongkang Wong, Tian-Tsong Ng, Mohan S. Kankanhalli:
Human-imperceptible Privacy Protection Against Machines. 1119-1128 - Xu Lu, Lei Zhu, Zhiyong Cheng, Jingjing Li, Xiushan Nie, Huaxiang Zhang:
Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval. 1129-1137
Multimedia Art Exhibition
- Refik Anadol:
Latent History. 1138 - Peter A. C. Nelson:
Data Stones. 1139-1140 - Jean-Marc Chomaz, Laurent Karst, Gregory Louis:
Unresolved Sun / Soleil Irrésolu: An Art-Science Installation on the Origin of Time. 1141-1142 - Olivain Porry:
Toasters: Collective inter-connected behavioral objects and passive interaction. 1143-1144 - Lyn Chao-ling Chen:
The One: An Interactive Installation for Visualizing the Cognition of Mind State by Capturing Face Expression, Body Shape, Wearing Cloth and Talking Voice. 1145-1146 - Hiesun Cecilia Suhr:
I, You, We: Exploring Interactive Multimedia Performance. 1147-1148 - Yen-Ting Cho, Yen-Ling Kuo, Yen-Ting Yeh, Yi-Chin Lee:
MovIPrint: Move, Explore and Fabricate. 1151-1152 - Paul Chable, Gilles Azzaro, Jean Mélou, Yvain Quéau, Axel Carlier, Jean-Denis Durou:
Macrogroove: A Sound 3D-sculpture Interactive Player. 1153-1154
Keynote III
- Mireille Hildebrandt:
EU Data Protection Law: An Ally for Scientific Reproducibility? 1155-1156
Session 3A: Multimodal QA&Content Generation
- Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:
Hierarchical Graph Semantic Pooling Network for Multi-modal Community Question Answer Matching. 1157-1165 - Xiangpeng Li, Lianli Gao, Xuanhan Wang, Wu Liu, Xing Xu, Heng Tao Shen, Jingkuan Song:
Learnable Aggregating Net with Diversity Learning for Video Question Answering. 1166-1174 - Fei Liu, Jing Liu, Richang Hong, Hanqing Lu:
Erasing-based Attention Learning for Visual Question Answering. 1175-1183 - Tianhao Yang, Zheng-Jun Zha, Hongtao Xie, Meng Wang, Hanwang Zhang:
Question-Aware Tube-Switch Network for Video Question Answering. 1184-1192 - Weike Jin, Zhou Zhao, Mao Gu, Jun Yu, Jun Xiao, Yueting Zhuang:
Multi-interaction Network with Object Relation for Video Question Answering. 1193-1201 - Liang Peng, Yang Yang, Zheng Wang, Xiao Wu, Zi Huang:
CRA-Net: Composed Relation Attention Network for Visual Question Answering. 1202-1210 - Juncheng Li, Siliang Tang, Fei Wu, Yueting Zhuang:
Walking with MIND: Mental Imagery eNhanceD Embodied QA. 1211-1219 - Lejian Ren, Si Liu, Han Huang, Jizhong Han, Shuicheng Yan, Bo Li:
Finding Images by Dialoguing with Image. 1220-1229 - Songyang Zhang, Jinsong Su, Jiebo Luo:
Exploiting Temporal Relationships in Video Moment Localization with Natural Language. 1230-1238 - Yue Liu, Xin Wang, Yitian Yuan, Wenwu Zhu:
Cross-Modal Dual Learning for Sentence-to-Video Generation. 1239-1247 - KwanYong Park, Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon:
Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation. 1248-1257 - Chao Zhang, Weiming Li, Wanli Ouyang, Qiang Wang, Woo-Shik Kim, Sunghoon Hong:
Referring Expression Comprehension with Semantic Visual Relationship and Word Mapping. 1258-1266 - Yaxing Wang, Abel Gonzalez-Garcia, Joost van de Weijer, Luis Herranz:
SDIT: Scalable and Diverse Cross-domain Image Translation. 1267-1276 - Pengfei Wang, Chengquan Zhang, Fei Qi, Zuming Huang, Mengyi En, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi:
A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning. 1277-1285
Session 3B: Attention&Saliency
- Xinhang Song, Sixian Zhang, Yuyun Hua, Shuqiang Jiang:
Aberrance-aware Gradient-sensitive Attentions for Scene Recognition with RGB-D Videos. 1286-1294 - Sheng-hua Zhong, Ahmed Fares, Jianmin Jiang:
An Attentional-LSTM for Improved Classification of Brain Activities Evoked by Images. 1295-1303 - Yifang Yin, Meng-Jiun Chiou, Zhenguang Liu, Harsh Shrivastava, Rajiv Ratn Shah, Roger Zimmermann:
Multi-Level Fusion based Class-aware Attention Model for Weakly Labeled Audio Tagging. 1304-1312 - MeiYu Liang, Junping Du, Wu Liu, Zhe Xue, Yue Geng, Cong-Xian Yang:
Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network. 1313-1321 - Suping Zhou, Jia Jia, Yufeng Yin, Xiang Li, Yang Yao, Ying Zhang, Zeyang Ye, Kehua Lei, Yan Huang, Jialie Shen:
Understanding the Teaching Styles by an Attention based Multi-task Cross-media Dimensional Modeling. 1322-1330 - Weiqing Min, Linhu Liu, Zhengdong Luo, Shuqiang Jiang:
Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition. 1331-1339 - Lian Gao, Di Huang, Yuanfang Guo, Yunhong Wang:
Pedestrian Attribute Recognition via Hierarchical Multi-task Learning and Relationship Attention. 1340-1348 - Zhong Ji, Qiankun Kong, Haoran Wang, Yanwei Pang:
Small and Dense Commodity Object Detection with Multi-Scale Receptive Field Attention. 1349-1357 - Huangyue Yu, Minjie Cai, Yunfei Liu, Feng Lu:
What I See Is What You See: Joint Attention Learning for First and Third Person Video Co-analysis. 1358-1366 - Souad Chaabouni, Frédéric Precioso:
Impact of Saliency and Gaze Features on Visual Control: Gaze-Saliency Interest Estimator. 1367-1374 - Bo Jiang, Xingyue Jiang, Ajian Zhou, Jin Tang, Bin Luo:
A Unified Multiple Graph Learning and Convolutional Network Model for Co-saliency Estimation. 1375-1382 - Sheng Yang, Qiuping Jiang, Weisi Lin, Yongtao Wang:
SGDNet: An End-to-End Saliency-Guided Deep Neural Network for No-Reference Image Quality Assessment. 1383-1391 - Bo Li, Zhengxing Sun, Quan Wang, Qian Li:
Co-saliency Detection Based on Hierarchical Consistency. 1392-1400
Session 3C: Smart Applications
- Xiao Zhang, Fuzhen Zhuang, Wenzhong Li, Haochao Ying, Hui Xiong, Sanglu Lu:
Inferring Mood Instability via Smartphone Sensing: A Multi-View Learning Approach. 1401-1409 - Zikang Yuan, Dongfu Zhu, Cheng Chi, Jinhui Tang, Chunyuan Liao, Xin Yang:
Visual-Inertial State Estimation with Pre-integration Correction for Robust Mobile Augmented Reality. 1410-1418 - Cong Wang, Yanru Xiao, Xing Gao, Li Li, Jun Wang:
Close the Gap between Deep Learning and Mobile Intelligence by Incorporating Training in the Loop. 1419-1427 - K. R. Prajwal, Rudrabha Mukhopadhyay, Jerin Philip, Abhishek Jha, Vinay P. Namboodiri, C. V. Jawahar:
Towards Automatic Face-to-Face Translation. 1428-1436 - Yinwei Wei, Xiang Wang, Liqiang Nie, Xiangnan He, Richang Hong, Tat-Seng Chua:
MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video. 1437-1445 - Yinwei Wei, Zhiyong Cheng, Xuzheng Yu, Zhou Zhao, Lei Zhu, Liqiang Nie:
Personalized Hashtag Recommendation for Micro-videos. 1446-1454 - Maarten Sukel, Stevan Rudinac, Marcel Worring:
Multimodal Classification of Urban Micro-Events. 1455-1463 - Yongqi Li, Meng Liu, Jianhua Yin, Chaoran Cui, Xin-Shun Xu, Liqiang Nie:
Routing Micro-videos via A Temporal Graph-guided Recommendation System. 1464-1472 - Bowen Yang, Chun Yang, Qi Liu, Xu-Cheng Yin:
Joint Rotation-Invariance Face Detection and Alignment with Angle-Sensitivity Cascaded Networks. 1473-1480 - Daiqian Ma, Yan Bai, Renjie Wan, Ce Wang, Boxin Shi, Ling-Yu Duan:
See Through the Windshield from Surveillance Camera. 1481-1489 - Kun Liu, Huadong Ma:
Exploring Background-bias for Anomaly Detection in Surveillance Videos. 1490-1499 - Liang Wu, Chengquan Zhang, Jiaming Liu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai:
Editing Text in the Wild. 1500-1508 - Yang Liu, Mengxi Guo, Jian Zhang, Yuesheng Zhu, Xiaodong Xie:
A Novel Two-stage Separable Deep Learning Framework for Practical Blind Watermarking. 1509-1517 - Ishwarya Ananthabhotla, Sebastian Ewert, Joseph A. Paradiso:
Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models. 1518-1525
Session 3D: Algorithms in Multimedia
- Fan Liu, Zhiyong Cheng, Changchang Sun, Yinglong Wang, Liqiang Nie, Mohan S. Kankanhalli:
User Diverse Preference Modeling by Multimodal Attentive Metric Learning. 1526-1534 - Cheng Yan, Guansong Pang, Xiao Bai, Chunhua Shen, Jun Zhou, Edwin R. Hancock:
Deep Hashing by Discriminating Hard Examples. 1535-1542 - Xuguang Duan, Qi Wu, Chuang Gan, Yiwei Zhang, Wenbing Huang, Anton van den Hengel, Wenwu Zhu:
Watch, Reason and Code: Learning to Represent Videos Using Program. 1543-1551 - Bin-Cheng Yang:
Super Resolution Using Dual Path Connections. 1552-1560 - Xingbo Liu, Xiushan Nie, Quan Zhou, Yilong Yin:
Supervised Discrete Hashing With Mutual Linear Regression. 1561-1568 - Zhao Zhang, Jiahuan Ren, Sheng Li, Richang Hong, Zhengjun Zha, Meng Wang:
Robust Subspace Discovery by Block-diagonal Adaptive Locality-constrained Representation. 1569-1577 - Yuan Yao, Yu Zhang, Xutao Li, Yunming Ye:
Heterogeneous Domain Adaptation via Soft Transfer Network. 1578-1586 - Jingjing Li, Mengmeng Jing, Ke Lu, Lei Zhu, Yang Yang, Zi Huang:
Alleviating Feature Confusion for Generative Zero-shot Learning. 1587-1595 - Yangbangyan Jiang, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang:
Duet Robust Deep Subspace Clustering. 1596-1604 - Hui Liu, Yuheng Jia, Junhui Hou, Qingfu Zhang:
Imbalance-aware Pairwise Constraint Propagation. 1605-1613 - Jie Huang, Zhiwei Xiong, Xueyang Fu, Dong Liu, Zheng-Jun Zha:
Hybrid Image Enhancement With Progressive Laplacian Enhancing Unit. 1614-1622 - Lin Zhang, Lijun Zhang, Xiao Liu, Ying Shen, Shaoming Zhang, Shengjie Zhao:
Zero-Shot Restoration of Back-lit Images Using Deep Internal Learning. 1623-1631 - Yonghua Zhang, Jiawan Zhang, Xiaojie Guo:
Kindling the Darkness: A Practical Low-light Image Enhancer. 1632-1640 - Chenrui Zhang, Xiaoqing Lyu, Zhi Tang:
TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning. 1641-1649
Doctoral Symposium
- Amanda Cardoso Duarte:
Cross-modal Neural Sign Language Translation. 1650-1654 - Michael Kerr:
On-Camera Digital Watermarking and its Application for Law Enforcement and Public Safety. 1655-1659 - Marc A. Kastner:
On Quantizing the Mental Image of Concepts for Visual Semantic Analyses. 1660-1664
Keynote IV
- Jean-Marc Chomaz:
Inventing Narratives of the Anthropocene: Microclimate Machines and Arts & Sciences Installations. 1665-1666
Session 4A: Cross-Modal Retrieval
- Heyu Zhou, An-An Liu, Weizhi Nie:
Dual-level Embedding Alignment Network for 2D Image-Based 3D Object Retrieval. 1667-1675 - Hangyu Lin, Yanwei Fu, Peng Lu, Shaogang Gong, Xiangyang Xue, Yu-Gang Jiang:
TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval. 1676-1684 - Da Cao, Zhiwang Yu, Hanling Zhang, Jiansheng Fang, Liqiang Nie, Qi Tian:
Video-Based Cross-Modal Recipe Retrieval. 1685-1693 - Zhen-Duo Chen, Yongxin Wang, Hui-Qiong Li, Xin Luo, Liqiang Nie, Xin-Shun Xu:
A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps. 1694-1702 - Zhenfang Chen, Zhanghui Kuang, Wayne Zhang, Kwan-Yee K. Wong:
Learning Local Similarity with Spatial Relations for Object Retrieval. 1703-1711 - Weikuo Guo, Huaibo Huang, Xiangwei Kong, Ran He:
Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation. 1712-1720 - Peng Hu, Xu Wang, Liangli Zhen, Dezhong Peng:
Separated Variational Hashing Networks for Cross-Modal Retrieval. 1721-1729 - Xin Wang, Wenwu Zhu, Chenghao Liu:
Semi-supervised Deep Quantization for Cross-modal Search. 1730-1739 - Xiangteng He, Yuxin Peng, Liu Xie:
A New Benchmark and Approach for Fine-grained Cross-media Retrieval. 1740-1748 - Hui Chen, Guiguang Ding, Zijia Lin, Sicheng Zhao, Jungong Han:
Cross-Modal Image-Text Retrieval with Semantic Consistency. 1749-1757 - Po-Yao Huang, Guoliang Kang, Wenhe Liu, Xiaojun Chang, Alexander G. Hauptmann:
Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment. 1758-1767 - Yinzheng Gu, Chuanpeng Li, Yu-Gang Jiang:
Towards Optimal CNN Descriptors for Large-Scale Image Retrieval. 1768-1776 - Jakub Lokoc, Gregor Kovalcík, Tomás Soucek, Jaroslav Moravec, Premysl Cech:
A Framework for Effective Known-item Search in Video. 1777-1785 - Xirong Li, Chaoxi Xu, Gang Yang, Zhineng Chen, Jianfeng Dong:
W2VV++: Fully Deep Learning for Ad-hoc Video Search. 1786-1794
Session 4B: Visual Analysis&Applications
- Weijiang Yu, Zhe Huang, Wayne Zhang, Litong Feng, Nong Xiao:
Gradual Network for Single Image De-raining. 1795-1804 - Muchao Ye, Xiaojiang Peng, Weihao Gan, Wei Wu, Yu Qiao:
AnoPCN: Video Anomaly Detection via Deep Predictive Coding Network. 1805-1813 - Youzhao Yang, Hong Lu:
Single Image Deraining via Recurrent Hierarchy Enhancement Network. 1814-1822 - Dan Guo, Kun Li, Zheng-Jun Zha, Meng Wang:
DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting. 1823-1832 - Zheng Wang, Jianwu Li, Ge Song:
DTDN: Dual-task De-raining Network. 1833-1841 - Zehui Yao, Boyan Zhang, Zhiyong Wang, Wanli Ouyang, Dong Xu, Dagan Feng:
IntersectGAN: Learning Domain Intersection for Generating Images with Multiple Attributes. 1842-1850 - Zhihui Wang, Shijie Wang, Pengbo Zhang, Haojie Li, Wei Zhong, Jianjun Li:
Weakly Supervised Fine-grained Image Classification via Correlation-guided Discriminative Learning. 1851-1860 - Ling Shen, Richang Hong, Haoran Zhang, Hanwang Zhang, Meng Wang:
Single-shot Semantic Image Inpainting with Densely Connected Generative Networks. 1861-1869 - Jianfu Zhang, Li Niu, Dexin Yang, Liwei Kang, Yaoyi Li, Weijie Zhao, Liqing Zhang:
GAIN: Gradient Augmented Inpainting Network for Irregular Holes. 1870-1878 - Zan Gao, Li-Shuai Gao, Hua Zhang, Zhiyong Cheng, Richang Hong:
Deep Spatial Pyramid Features Collaborative Reconstruction for Partial Person ReID. 1879-1887 - Ziling Huang, Zheng Wang, Wei Hu, Chia-Wen Lin, Shin'ichi Satoh:
DoT-GNN: Domain-Transferred Graph Neural Network for Group Re-identification. 1888-1896 - Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Jun-Yan He, Alexander G. Hauptmann:
Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting. 1897-1906 - Xin Tan, Chun Tao, Tongwei Ren, Jinhui Tang, Gangshan Wu:
Crowd Counting via Multi-layer Regression. 1907-1915 - Yahui Liu, Marco De Nadai, Gloria Zen, Nicu Sebe, Bruno Lepri:
Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps. 1916-1924
Session 4C: Social Computing&Image Processing
- Tian Gan, Shaokun Wang, Meng Liu, Xuemeng Song, Yiyang Yao, Liqiang Nie:
Seeking Micro-influencers for Brand Promotion. 1933-1941 - Huaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu:
Multi-modal Knowledge-aware Event Memory Network for Social Media Rumor Detection. 1942-1951 - Jiawei Wang, Jiansheng Fang, Jiao Xu, Shifeng Huang, Da Cao, Ming Yang:
MOC: Measuring the Originality of Courseware in Online Education Systems. 1952-1960 - Wim Boes, Hugo Van hamme:
Audiovisual Transformer Architectures for Large-Scale Classification and Synchronization of Weakly Labeled Audio Events. 1961-1969 - Xueting Wang, Yiwei Zhang, Toshihiko Yamasaki:
User-Aware Folk Popularity Rank: User-Popularity-Based Tag Recommendation That Can Enhance Social Popularity. 1970-1978 - Keyan Ding, Kede Ma, Shiqi Wang:
Intrinsic Image Popularity Assessment. 1979-1987 - Liang Han, Zhaozheng Yin, Zhurong Xia, Li Guo, Mingqian Tang, Rong Jin:
Vision-based Price Suggestion for Online Second-hand Items. 1988-1996 - Fan Yu, Haonan Wang, Tongwei Ren, Jinhui Tang, Gangshan Wu:
Instance of Interest Detection. 1997-2005 - Lijian Gao, Qirong Mao, Ming Dong, Yu Jing, Ratna Babu Chinnam:
On Learning Disentangled Representation for Acoustic Event Detection. 2006-2014 - Yang Wang, Yang Cao, Zheng-Jun Zha, Jing Zhang, Zhiwei Xiong, Wei Zhang, Feng Wu:
Progressive Retinex: Mutually Reinforced Illumination-Noise Perception Network for Low-Light Image Enhancement. 2015-2023 - Zheng Hui, Xinbo Gao, Yunchu Yang, Xiumei Wang:
Lightweight Image Super-Resolution with Information Multi-distillation Network. 2024-2032 - Xin Hong, Pengfei Xiong, Renhe Ji, Haoqiang Fan:
Deep Fusion Network for Image Completion. 2033-2042 - Jiangxin Sun, Jiafeng Xie, Jianfang Hu, Zihang Lin, Jianhuang Lai, Wenjun Zeng, Wei-Shi Zheng:
Predicting Future Instance Segmentation with Contextual Pyramid ConvLSTMs. 2043-2051 - Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, Nicu Sebe, Yan Yan:
Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation. 2052-2060
Session 4D: Embedding&Network Learning
- David Semedo, João Magalhães:
Diachronic Cross-modal Embeddings. 2061-2069 - Shaobo Min, Hantao Yao, Hongtao Xie, Zheng-Jun Zha, Yongdong Zhang:
Domain-Specific Embedding Network for Zero-Shot Recognition. 2070-2078 - Shilong Bao, Qianqian Xu, Ke Ma, Zhiyong Yang, Xiaochun Cao, Qingming Huang:
Collaborative Preference Embedding against Sparse Labels. 2079-2087 - Yiling Wu, Shuhui Wang, Guoli Song, Qingming Huang:
Learning Fragment Self-Attention Embeddings for Image-Text Matching. 2088-2096 - Shuo Yang, Wei Yu, Ying Zheng, Hongxun Yao, Tao Mei:
Adaptive Semantic-Visual Tree for Hierarchical Embeddings. 2097-2105 - Yingying Hua, Shiming Ge, Xindi Gao, Xin Jin, Dan Zeng:
Defending Against Adversarial Examples via Soft Decision Trees Embedding. 2106-2114 - Yaoyu Li, Hantao Yao, Lingyu Duan, Hanxing Yao, Changsheng Xu:
Adaptive Feature Fusion via Graph Neural Network for Person Re-identification. 2115-2123 - Ziheng Zhang, Anpei Chen, Ling Xie, Jingyi Yu, Shenghua Gao:
Learning Semantics-aware Distance Map with Semantics Layering Network for Amodal Instance Segmentation. 2124-2132 - Xulun Ye, Jieyu Zhao:
Open Set Deep Learning with A Bayesian Nonparametric Generative Model. 2133-2141 - Lu Chi, Guiyu Tian, Yadong Mu, Lingxi Xie, Qi Tian:
Fast Non-Local Neural Networks with Spectral Residual Learning. 2142-2151 - Congcong Li, Dawei Du, Libo Zhang, Tiejian Luo, Yanjun Wu, Qi Tian, Longyin Wen, Siwei Lyu:
Data Priming Network for Automatic Check-Out. 2152-2160 - Wen Su, Haifeng Zhang, Jia Li, Wenzhen Yang, Zengfu Wang:
Monocular Depth Estimation as Regression of Classification using Piled Residual Networks. 2161-2169 - Yunze Man, Xinshuo Weng, Xi Li, Kris Kitani:
GroundNet: Monocular Ground Plane Normal Estimation with Geometric Consistency. 2170-2178 - Bingyan Liu, Yao Guo, Xiangqun Chen:
WealthAdapt: A General Network Adaptation Framework for Small Data Tasks. 2179-2187
Demonstration II
- Hao Jiang, Wenjie Wang, Meng Liu, Liqiang Nie, Ling-Yu Duan, Changsheng Xu:
Market2Dish: A Health-aware Food Recommendation System. 2188-2190 - Mikko Pitkänen, Marko Viitanen, Alexandre Mercat, Jarno Vanne:
Remote VR Gaming on Mobile Devices. 2191-2193 - Kele Xu, Yuxiang Wu, Zhifeng Gao:
Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder. 2194-2195 - Sarah Ibrahimi, Shuo Chen, Devanshu Arya, Arthur Câmara, Yunlu Chen, Tanja Crijns, Maurits van der Goes, Thomas Mensink, Emiel van Miltenburg, Daan Odijk, William Thong, Jiaojiao Zhao, Pascal Mettes:
Interactive Exploration of Journalistic Video Footage through Multimodal Semantic Matching. 2196-2198 - Federico Becattini, Andrea Ferracani, Filippo Principi, Marioemanuele Ghianni, Alberto Del Bimbo:
NeuronUnityIntegration2.0. A Unity Based Application for Motion Capture and Gesture Recognition. 2199-2201 - Kai Uwe Barthel, Nico Hezel, Konstantin Schall, Klaus Jung:
Real-Time Visual Navigation in Huge Image Sets Using Similarity Graphs. 2202-2204 - Arunodhayan Sampath Kumar, René Erler, Danny Kowerko:
A Real-Time Demo for Acoustic Event Classification in Ambient Assisted Living Contexts. 2205-2207 - Lucile Sassatelli, Marco Winckler, Thomas Fisichella, Ramon Aparicio:
User-Adaptive Editing for 360 degree Video Streaming with Deep Reinforcement Learning. 2208-2210 - Toshiharu Horiuchi, Sumaru Niida, Yasuhiro Takishima:
OtonoVR: Arbitrarily Angled Audio-visual VR Experience Using Selective Synthesis Sound Field Technique. 2211-2213 - Jiang Gao:
Active Learning of Identity Agnostic Roles for Character Grounding in Videos. 2214-2216 - Jaehyeong Cho, Wataru Shimoda, Keiji Yanai:
Ramen as You Like: Sketch-based Food Image Generation and Editing. 2217-2218 - Gianmarco Sanesi, Andrew D. Bagdanov, Marco Bertini, Alberto Del Bimbo:
DeepPhysio: Monitored Physiotherapeutic Exercise in the Comfort of your Own Home. 2219-2220 - Thomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi, Vincent Charvillat:
Using 3D Bookmarks for Desktop and Mobile DASH-3D Clients. 2221-2222 - Yunshan Ma, Lizi Liao, Tat-Seng Chua:
Automatic Fashion Knowledge Extraction from Social Media. 2223-2224 - Takuya Yonezawa, Yuanyuan Wang, Yukiko Kawai, Kazutoshi Sumiya:
A Cooking Support System by Extracting Difficult Scenes for Cooking Operations from Recipe Short Videos. 2225-2227 - Jianbo Wang, Kai Qiu, Houwen Peng, Jianlong Fu, Jianke Zhu:
AI Coach: Deep Human Pose Estimation and Analysis for Personalized Athletic Training Assistance. 2228-2230 - Zhaolin Qiu, Yufan Ren, Canchen Li, Hongfu Liu, Yifan Huang, Yiheng Yang, Songruoyao Wu, Hanjia Zheng, Juntao Ji, Jianjia Yu, Kejun Zhang:
Mind Band: A Crossmedia AI Music Composing Platform. 2231-2233
Panel 1
- Shih-Fu Chang, Louis-Philippe Morency, Alexander G. Hauptmann, Alberto Del Bimbo, Cathal Gurrin, Hayley Hung, Heng Ji, Alan F. Smeaton:
PANEL: Challenges for Multimedia/Multimodal Research in the Next Decade. 2234-2235
Brave New Ideas
- Shizhe Chen, Bei Liu, Jianlong Fu, Ruihua Song, Qin Jin, Pingping Lin, Xiaoyu Qi, Chunting Wang, Jin Zhou:
Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences. 2236-2244 - Devanshu Arya, Stevan Rudinac, Marcel Worring:
HyperLearn: A Distributed Approach for Representation Learning in Datasets With Many Modalities. 2245-2253 - Michael Xuelin Huang, Jiajia Li, Grace Ngai, Hong Va Leong, Andreas Bulling:
Moment-to-Moment Detection of Internal Thought during Video Viewing from Eye Vergence Behavior. 2254-2262 - Francesco Gelli, Tiberio Uricchio, Xiangnan He, Alberto Del Bimbo, Tat-Seng Chua:
Learning Subjective Attributes of Images from Auxiliary Sources. 2263-2271
Open Source Software Competition
- Jianhao Zhang, Yingwei Pan, Ting Yao, He Zhao, Tao Mei:
daBNN: A Super Fast Inference Framework for Binary Neural Networks on ARM devices. 2272-2275 - Abhishek Dutta, Andrew Zisserman:
The VIA Annotation Software for Images, Audio and Video. 2276-2279 - Junwei Liang, Jay D. Aronson, Alexander G. Hauptmann:
Shooter Localization Using Social Media Videos. 2280-2283 - Chun-Xun Lin, Tsung-Wei Huang, Guannan Guo, Martin D. F. Wong:
A Modern C++ Parallel Task Programming Library. 2284-2287 - Cise Midoglu, Anatoliy Zabrovskiy, Ozgu Alay, Daniel Hoelbling-Inzko, Carsten Griwodz, Christian Timmerer:
Docker-Based Evaluation Framework for Video Streaming QoE in Broadband Networks. 2288-2291 - Shinya Sumikura, Mikiya Shibuya, Ken Sakurada:
OpenVSLAM: A Versatile Visual SLAM Framework. 2292-2295
Session 5A: Summaries&Generation
- Xufeng He, Yang Hua, Tao Song, Zongpu Zhang, Zhengui Xue, Ruhui Ma, Neil Martin Robertson, Haibing Guan:
Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks. 2296-2304 - Anuj Rathore, Pravin Nagar, Chetan Arora, C. V. Jawahar:
Generating 1 Minute Summaries of Day Long Egocentric Videos. 2305-2313 - Jiacheng Li, Haizhou Shi, Siliang Tang, Fei Wu, Yueting Zhuang:
Informative Visual Storytelling with Cross-modal Rules. 2314-2322 - Yuhang Li, Xuejin Chen, Feng Wu, Zheng-Jun Zha:
LinesToFacePhoto: Face Photo Generation From Lines With Conditional Self-Attention Generative Adversarial Networks. 2323-2331 - Yitian Yuan, Lin Ma, Wenwu Zhu:
Sentence Specified Dynamic Video Thumbnail Generation. 2332-2340 - Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Jingjing Li, Yang Yang:
Curiosity-driven Reinforcement Learning for Diverse Visual Paragraph Generation. 2341-2350
Session 5B: Quality of Experience&Interaction
- Dingquan Li, Tingting Jiang, Ming Jiang:
Quality Assessment of In-the-Wild Videos. 2351-2359 - Jia Li, Kaiwen Yu, Yifan Zhao, Yu Zhang, Long Xu:
Cross-Reference Stitching Quality Assessment for 360° Omnidirectional Images. 2360-2368 - Eric Lindskog, Jesper Wrang, Madeleine Bäckström, Linn Hallonqvist, Niklas Carlsson:
Generalized Playback Bar for Interactive Branched Video. 2369-2377 - Alexandra Covaci, Ramona Trestian, Estêvão Bissoli Saleme, Ioan-Sorin Comsa, Gebremariam Assres, Celso A. S. Santos, Gheorghita Ghinea:
360° Mulsemedia: A Way to Improve Subjective QoE in 360° Videos. 2378-2386 - Simon Wedel, Michael Koppetz, Janto Skowronek, Alexander Raake:
ViProVoQ: Towards a Vocabulary for Video Quality Assessment in the Context of Creative Video Production. 2387-2395 - Saurabh Kumar, Yagnesh Badiyani, Subhasis Chaudhuri:
DeepQuantizedCS: Quantized Compressive Video Recovery using Deep Convolutional Networks. 2396-2404
Session 5C: Transport&Delivery
- Jeroen van der Hooft, Tim Wauters, Filip De Turck, Christian Timmerer, Hermann Hellwagner:
Towards 6DoF HTTP Adaptive Streaming Through Point Cloud Compression. 2405-2413 - Zhuo Chen, Kui Fan, Shiqi Wang, Ling-Yu Duan, Weisi Lin, Alex C. Kot:
Lossy Intermediate Deep Learning Feature Compression and Evaluation. 2414-2422 - Mohammad Amin Arab, Kiana Calagari, Mohamed Hefeeda:
Band and Quality Selection for Efficient Transmission of Hyperspectral Images. 2423-2430 - Zili Meng, Jing Chen, Yaning Guo, Chen Sun, Hongxin Hu, Mingwei Xu:
PiTree: Practical Implementation of ABR Algorithms Using Decision Trees. 2431-2439 - Hongshan Li, Yu Guo, Zhi Wang, Shutao Xia, Wenwu Zhu:
AdaCompress: Adaptive Compression for Online Computer Vision Services. 2440-2448 - Maarten Wijnants, Sven Coppers, Gustavo Alberto Rovelo Ruiz, Peter Quax, Wim Lamotte:
Talking Video Heads: Saving Streaming Bitrate by Adaptively Applying Object-based Video Principles to Interview-like Footage. 2449-2458
Session 5D: Art&Culture
- Liyi Chen, Jufeng Yang:
Recognizing the Style of Visual Arts via Adaptive Cross-layer Correlation. 2459-2467 - Masatoshi Hamanaka:
Melody Slot Machine: A Controllable Holographic Virtual Performer. 2468-2477 - Shurong Sheng, Marie-Francine Moens:
Generating Captions for Images of Ancient Artworks. 2478-2486 - Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang:
GP-GAN: Towards Realistic High-Resolution Image Blending. 2487-2495 - Zongyu Guo, Zhibo Chen, Tao Yu, Jiale Chen, Sen Liu:
Progressive Image Inpainting with Full-Resolution Residual Network. 2496-2504 - Guangyao Shen, Wenbing Huang, Chuang Gan, Mingkui Tan, Junzhou Huang, Wenwu Zhu, Boqing Gong:
Facial Image-to-Video Translation by a Hidden Affine Transformation. 2505-2513
Panel 2
- Vivek K. Singh, Elisabeth André, Susanne Boll, Mireille Hildebrandt, David A. Shamma, Tat-Seng Chua:
Legal and Ethical Challenges in Multimedia Research. 2514-2515
Grand Challenge: iQIYI Celebrity Video Identification
- Yuanliu Liu, Peipei Shi, Bo Peng, He Yan, Yong Zhou, Bing Han, Yi Zheng, Chao Lin, Jianbin Jiang, Yin Fan, Tingwei Gao, Ganwen Wang, Jian Liu, Xiangju Lu, Junhui Liu, Danming Xie:
iQIYI Celebrity Video Identification Challenge. 2516-2520 - Zixuan Huang, Yuan Chang, Weizhao Chen, Qiwei Shen, Jianxin Liao:
ResidualDenseNetwork: A Simple Approach for Video Person Identification. 2521-2525 - Xi Fang, Ying Zou:
Make the Best of Face Clues in iQIYI Celebrity VideoIdentification Challenge 2019. 2526-2530 - Chuanqi Dong, Zheng Gu, Zhonghao Huang, Wen Ji, Jing Huo, Yang Gao:
DeepMEF: A Deep Model Ensemble Framework for Video Based Multi-modal Person Identification. 2531-2534 - Jianrong Chen, Li Yang, Yuanyuan Xu, Jing Huo, Yinghuan Shi, Yang Gao:
A Novel Deep Multi-Modal Feature Fusion Method for Celebrity Video Identification. 2535-2538 - Shichuan Zhang, Zengming Tang, Hao Pan, Xinyu Wei, Jun Huang:
A Hierarchical Framwork with Improved Loss for Large-scale Multi-modal Video Identification. 2539-2542
Grand Challenge: AI Meets Beauty
- Zehang Lin, Haoran Xie, Peipei Kang, Zhenguo Yang, Wenyin Liu, Qing Li:
Cross-domain Beauty Item Retrieval via Unsupervised Embedding Learning. 2543-2547 - Jiawei Wang, Shuai Zhu, Jiao Xu, Da Cao:
The Retrieval of the Beautiful: Self-Supervised Salient Object Detection for Beauty Product Retrieval. 2548-2552 - Jun Yu, Guochen Xie, Mengyan Li, Haonian Xie, Lingyun Yu:
Beauty Product Retrieval Based on Regional Maximum Activation of Convolutions with Generalized Attention. 2553-2557 - Yi Zhang, Linzi Qu, Lihuo He, Wen Lu, Xinbo Gao:
Beauty Aware Network: An Unsupervised Method for Makeup Product Retrieval. 2558-2562
Grand Challenge: BioMedia
- Steven Alexander Hicks, Michael Riegler, Pia H. Smedsrud, Trine B. Haugen, Kristin Ranheim Randel, Konstantin Pogorelov, Håkon Kvale Stensland, Duc-Tien Dang-Nguyen, Mathias Lux, Andreas Petlund, Thomas de Lange, Peter Thelin Schmidt, Pål Halvorsen:
ACM Multimedia BioMedia 2019 Grand Challenge Overview. 2563-2567 - Yuan Chang, Zixuan Huang, Weizhao Chen, Qiwei Shen:
Gastrointestinal Tract Diseases Detection with Deep Attention Neural Network. 2568-2572 - Philipp Harzig, Moritz Einfalt, Rainer Lienhart:
Automatic Disease Detection and Report Generation for Gastrointestinal Tract Examination. 2573-2577 - Trung-Hieu Hoang, Hai-Dang Nguyen, Viet-Anh Nguyen, Thanh-An Nguyen, Vinh-Tiep Nguyen, Minh-Triet Tran:
Enhancing Endoscopic Image Classification with Symptom Localization and Data Augmentation. 2578-2582 - Zhipeng Luo, Xiaowei Wang, Zhenyu Xu, Xue Li, Jiadong Li:
Adaptive Ensemble: Solution to the Biomedia ACM MM GrandChallenge 2019. 2583-2587 - Wenhua Meng, Shan Zhang, Xudong Yao, Xiaoshan Yang, Changsheng Xu, Xiaowen Huang:
Biomedia ACM MM Grand Challenge 2019: Using Data Enhancement to Solve Sample Unbalance. 2588-2592
Grand Challenge: Content-based video relevance prediction
- Peng Wang, Yunsheng Jiang, Chunxu Xu, Xiaohui Xie:
Overview of Content-Based Click-Through Rate Prediction Challenge for Video Recommendation. 2593-2596 - Xusong Chen, Dong Liu, Chenyi Lei, Rui Li, Zheng-Jun Zha, Zhiwei Xiong:
BERT4SessRec: Content-Based Video Relevance Prediction with Bidirectional Encoder Representations from Transformer. 2597-2601 - Xun Wang, Yali Du, Leimin Zhang, Xirong Li, Miao Zhang, Jianfeng Dong:
Exploring Content-based Video Relevance for Video Click-Through Rate Prediction. 2602-2606 - Zeyuan Chen, Kai Xu, Wei Zhang:
Content-Based Video Relevance Prediction with Multi-view Multi-level Deep Interest Network. 2607-2611 - Xinran Zhang, Xin Yuan, Yunwei Li, Yanru Zhang:
Cold-Start Representation Learning: A Recommendation Approach with Bert4Movie and Movie2Vec. 2612-2616 - Qidi Xu, Haocheng Xu, Weilong Chen, Chaojun Han, Haoyang Li, Wenxin Tan, Fumin Shen, Heng Tao Shen:
Time-aware Session Embedding for Click-Through-Rate Prediction. 2617-2621
Grand Challenge: Live Video Streaming
- Gang Yi, Dan Yang, Abdelhak Bentaleb, Weihua Li, Yi Li, Kai Zheng, Jiangchuan Liu, Wei Tsang Ooi, Yong Cui:
The ACM Multimedia 2019 Live Video Streaming Grand Challenge. 2622-2626 - Huan Peng, Yuan Zhang, Yongbei Yang, Jinyao Yan:
A Hybrid Control Scheme for Adaptive Live Streaming. 2627-2631 - Xiaolan Jiang, Yusheng Ji:
HD3: Distributed Dueling DQN with Discrete-Continuous Hybrid Action Spaces for Live Video Streaming. 2632-2636 - Ruying Hong, Qiwei Shen, Lei Zhang, Jing Wang:
Continuous Bitrate & Latency Control with Deep Reinforcement Learning for Live Video Streaming. 2637-2641 - Chen Wang, Jianfeng Guan, Tongtong Feng, Neng Zhang, Tengfei Cao:
BitLat: Bitrate-adaptivity and Latency-awareness Algorithm for Live Video Streaming. 2642-2646 - Yin Zhao, Qi-Wei Shen, Wei Li, Tong Xu, Wei-Hua Niu, Si-Ran Xu:
Latency Aware Adaptive Video Streaming using Ensemble Deep Reinforcement Learning. 2647-2651
Grand Challenge: Relation Understanding in Videos
- Xindi Shang, Junbin Xiao, Donglin Di, Tat-Seng Chua:
Relation Understanding in Videos: A Grand Challenge Overview. 2652-2656 - Xu Sun, Tongwei Ren, Yuan Zi, Gangshan Wu:
Video Visual Relation Detection via Multi-modal Feature Fusion. 2657-2661 - Sipeng Zheng, Xiangyu Chen, Shizhe Chen, Qin Jin:
Relation Understanding in Videos. 2662-2666
Grand Challenge: Social Media Prediction
- Bo Wu, Wen-Huang Cheng, Peiye Liu, Bei Liu, Zhaoyang Zeng, Jiebo Luo:
SMP Challenge: An Overview of Social Media Prediction Challenge 2019. 2667-2671 - Ziliang He, Zijian He, Jiahong Wu, Zhenguo Yang:
Feature Construction for Posts and Users Combined with LightGBM for Social Media Popularity Prediction. 2672-2676 - Peipei Kang, Zehang Lin, Shaohua Teng, Guipeng Zhang, Lingni Guo, Wei Zhang:
Catboost-based Framework with Additional User Information for Social Media Popularity Prediction. 2677-2681 - Keyan Ding, Ronggang Wang, Shiqi Wang:
Social Media Popularity Prediction: A Multiple Feature Fusion Approach with Deep Neural Networks. 2682-2686 - Chih-Chung Hsu, Li-Wei Kang, Chia-Yen Lee, Jun-Yi Lee, Zhong-Xuan Zhang, Shao-Min Wu:
Popularity Prediction of Social Media based on Multi-Modal Feature Mining. 2687-2691 - Junhong Chen, Dayong Liang, Zhanmo Zhu, Xiaojing Zhou, Zihan Ye, Xiuyun Mo:
Social Media Popularity Prediction Based on Visual-Textual Features with XGBoost. 2692-2696
Tutorials
- Winston H. Hsu:
Learning from 3D (Point Cloud) Data. 2697-2698 - Wenwu Zhu, Xin Wang, Wenpeng Zhang:
AutoML and Meta-learning for Multimedia. 2699-2700 - Luisa Verdoliva, Paolo Bestagini:
Multimedia Forensics. 2701-2702 - Christian Timmerer, Ali C. Begen:
A Journey Towards Fully Immersive Media Access. 2703-2705 - Muthusamy Chelliah, Soma Biswas, Lucky Dhakad:
Principle-to-program: Neural Fashion Recommendation with Multi-modal Input. 2706-2708 - Gerald Friedland:
Reproducibility and Experimental Design for Machine Learning on Audio and Multimedia Data. 2709-2710 - Pål Halvorsen, Michael Alexander Riegler, Klaus Schoeffmann:
Medical Multimedia Systems and Applications. 2711-2713 - Hayley Hung, Chirag Raman, Ekin Gedik, Stephanie Tan, Jose Vargas Quiros:
Multimodal Data Collection for Social Interaction Analysis In-the-Wild. 2714-2715
Workshop Summaries
- Raphaël Troncy, Jorma Laaksonen, Hamed R. Tavakoli, Lyndon J. B. Nixon, Vasileios Mezaris:
AI4TV 2019: 1st International Workshop on AI for Smart TV Content Production, Access and Delivery. 2716-2717 - Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Nicholas Cummins, Roddy Cowie, Maja Pantic:
AVEC'19: Audio/Visual Emotion Challenge and Workshop. 2718-2719 - Susanne Boll, Jeannie S. Lee, Jochen Meyer, Nitish Nag, Noel E. O'Connor:
HealthMedia'19: 4th International Workshop on Multimedia for Personal Health and Health Care. 2720-2721 - Stavroula G. Mougiakakou, Giovanni Maria Farinella, Keiji Yanai, Dario Allegra:
MADiMA'19: 5th International Workshop on Multimedia Assisted Dietary Management. 2722-2723 - Ralph Ewerth, Stefan Dietze, Anett Hoppe, Ran Yu:
SALMM'19: First International Workshop on Search as Learning with Multimedia Information. 2724-2725 - Valérie Gouet-Brunet, Margarita Khokhlova, Liming Chen, Sander Münster:
SUMAC 2019: The 1st workshop on Structuring and Understanding of Multimedia heritAge Contents. 2726-2727 - Xavier Alameda-Pineda, Miriam Redi, L. Elisa Celis, Nicu Sebe, Shih-Fu Chang:
FAT/MM'19: 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia. 2728-2729 - Xueliang Liu, Rui Min, Troy McDaniel:
MAHCI 2019: The 2nd Workshop on Multimedia for Accessible Human Computer Interface. 2730-2731 - Rainer Lienhart, Thomas B. Moeslund, Hideo Saito:
MMSports'19: 2nd ACM International Workshop on Multimedia Content Analysis in Sports. 2732-2733 - Jiang John Gao, Jia-Yu Tim Pan:
MULEA'19: The First International Workshop on Multimodal Understanding and Learning for Embodied Applications. 2734-2735
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.