default search action
International Journal of Multimedia Information Retrieval, Volume 11
Volume 11, Number 1, March 2022
- Silvan Heller, Viktor Gsteiger, Werner Bailer, Cathal Gurrin, Björn Þór Jónsson, Jakub Lokoc, Andreas Leibetseder, Frantisek Mejzlík, Ladislav Peska, Luca Rossetto, Konstantin Schall, Klaus Schoeffmann, Heiko Schuldt, Florian Spiess, Ly-Duyen Tran, Lucia Vadicamo, Patrik Veselý, Stefanos Vrochidis, Jiaxin Wu:
Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown. 1-18 - S. Suganyadevi, V. Seethalakshmi, K. Balasamy:
A review on deep learning in medical image analysis. 19-38 - Sinda Elghoul, Faouzi Ghorbel:
A fast and robust affine-invariant method for shape registration under partial occlusion. 39-59 - Mohammad Farhad Bulbul, Saiful Islam, Zannatul Azme, Preksha Pareek, Md. Humaun Kabir, Hazrat Ali:
Enhancing the performance of 3D auto-correlation gradient features in depth action classification. 61-76 - Carlos de la Fuente, Jose J. Valero-Mas, Francisco J. Castellanos, Jorge Calvo-Zaragoza:
Multimodal image and audio music transcription. 77-84
Volume 11, Number 2, June 2022
- Devashree R. Patrikar, Mayur Rajaram Parate:
Anomaly detection using edge computing in video surveillance system: review. 85-110 - Jie Yan, Yuxiang Xie, Xidao Luan, Yanming Guo, Quanzhi Gong, Suru Feng:
Caption TLSTMs: combining transformer with LSTMs for image captioning. 111-121 - Md. Meraz, Md Afzal Ansari, Mohammed Javed, Pavan Chakraborty:
DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud. 123-133 - Ohoud Nafea, Wadood Abdul, Ghulam Muhammad:
Multi-sensor human activity recognition using CNN and GRU. 135-147 - Xiaoyi Wang, Jun Huang:
A local representation-enhanced recurrent convolutional network for image captioning. 149-157 - Marco Fisichella:
Siamese coding network and pair similarity prediction for near-duplicate image detection. 159-170 - Masum Shah Junayed, Md Baharul Islam, Hassan Imani, Tarkan Aydin:
PDS-Net: A novel point and depth-wise separable convolution for real-time object detection. 171-188 - Jian Li, Yanming Guo, Songyang Lao, Xi Zhao, Liang Bai, Haoran Wang:
Few2Decide: towards a robust model via using few neuron connections to decide. 189-198
Volume 11, Number 3, September 2022
- Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang:
Cross-domain image retrieval: methods and applications. 199-218 - Deepak Dagar, Dinesh Kumar Vishwakarma:
A literature review and perspectives in deepfakes: generation, detection, and applications. 219-289 - Veronica Naosekpam, Nilkanta Sahu:
Text detection, recognition, and script identification in natural scene images: a Review. 291-314 - Ademola Enitan Ilesanmi, Taiwo Ilesanmi, Oluwagbenga Paul Idowu, Drew A. Torigian, Jayaram K. Udupa:
Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review. 315-331 - Ahmed Iqbal, Muhammad Sharif, Mussarat Yasmin, Mudassar Raza, Shabib Aftab:
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey. 333-368 - Hao Pan, Jun Huang:
Semantic-enhanced discriminative embedding learning for cross-modal retrieval. 369-382 - Na He, Sam Ferguson:
Music emotion recognition based on segment-level two-stage learning. 383-394 - Ihssane Houhou, Athmane Zitouni, Yassine Ruichek, Salah Eddine Bekhouche, Mohamed Kas, Abdelmalik Taleb-Ahmed:
RGBD deep multi-scale network for background subtraction. 395-407 - Sweta Panigrahi, U. S. N. Raju:
InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection. 409-430 - Mehdi Ellouze:
How can users' comments posted on social media videos be a source of effective tags? 431-443 - Deepika Varshney, Dinesh Kumar Vishwakarma:
A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content. 445-459
Volume 11, Number 4, December 2022
- Pranjal Kumar, Piyush Rawat, Siddhartha Chauhan:
Contrastive self-supervised learning: review, progress, challenges and future research directions. 461-488 - Pranjal Kumar, Siddhartha Chauhan, Lalit Kumar Awasthi:
Human pose estimation using deep learning: review, methodologies, progress and future research directions. 489-521 - Jianlong Wu, Richang Hong, Qi Tian:
Special issue on cross-modal retrieval and analysis. 523-524 - Lingtao Meng, Feifei Zhang, Xi Zhang, Changsheng Xu:
Prototype local-global alignment network for image-text retrieval. 525-538 - Zhengjie Huang, Zhenguang Liu, Jianhai Chen, Qinming He, Shuang Wu, Lei Zhu, Meng Wang:
Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods. 539-551 - Ren Zhang, Ning He, Shengjie Liu, Ying Wu, Kang Yan, Yuzhe He, Ke Lu:
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition. 553-566 - Zefan Zhang, Tianling Jiang, Chunping Liu, Yi Ji:
Multi-aware coreference relation network for visual dialog. 567-576 - Keyang Cheng, Xuesen Zhu, Yongzhao Zhan, Yunshen Pei:
Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos. 577-588 - Xiaowei Zhang, Quan Fang, Jun Hu, Shengsheng Qian, Changsheng Xu:
TCKGE: Transformers with contrastive learning for knowledge graph embedding. 589-597 - Silin Cai, Changping Wang, Jiajun Ding, Jun Yu, Jianping Fan:
FDAM: full-dimension attention module for deep convolutional neural networks. 599-610 - Yuxiang Xie, Jie Yan, Lai Kang, Yanming Guo, Jiahui Zhang, Xidao Luan:
FCT: fusing CNN and transformer for scene classification. 611-618 - Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar:
Semantic-aware visual scene representation. 619-638 - Mohamed Kas, Youssef El Merabet, Yassine Ruichek, Rochdi Messoussi:
Generative adversarial networks for 2D-based CNN pose-invariant face recognition. 639-651 - Benoughidene Abdel Halim, Titouna Faiza:
A novel method for video shot boundary detection using CNN-LSTM approach. 653-667 - Zhiguang Liu, Liangwei Wang, Jian Qiao:
Visual and semantic ensemble for scene text recognition with gated dual mutual attention. 669-680 - Junyan Yang, Jie Jiang, Yanming Guo:
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning. 681-694 - Mohammadreza Sheikh Fathollahi, Rezvan Heidari:
Gender classification from face images using central difference convolutional networks. 695-703 - You Yang, Yongzhi An, Juntao Hu, Longyue Pan:
Tri-RAT: optimizing the attention scores for image captioning. 705-715 - Stefanos-Iordanis Papadopoulos, Christos Koutlis, Symeon Papadopoulos, Ioannis Kompatsiaris:
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products. 717-729 - Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, Miki Haseyama:
Similar interior coordination image retrieval with multi-view features. 731-740
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.