


ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 20
Volume 20, Number 1, January 2024
- Zhenbo Xu, Hai-Miao Hu, Liu Liu, Dongping Zhang, Shifeng Zhang, Wenming Tan: Instance-Based Continual Learning: A Real-World Dataset and Baseline for Fresh Recognition. 1:1-1:23
- Xiaoping Liang, Zhenjun Tang, Zhixin Li, Mengzhu Yu, Hanyun Zhang, Xianquan Zhang: Robust Hashing via Global and Local Invariant Features for Image Copy Detection. 2:1-2:22
- Sandipan Sarma, Arijit Sur: DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning. 3:1-3:23
- Chengyu Zheng, Ning Song, Ruoyu Zhang, Lei Huang, Zhiqiang Wei, Jie Nie: Scale-Semantic Joint Decoupling Network for Image-Text Retrieval in Remote Sensing. 4:1-4:20
- Jiankai Li, Yunhong Wang, Weixin Li: Zero-shot Scene Graph Generation via Triplet Calibration and Reduction. 5:1-5:21
- Abid Yaqoob, Gabriel-Miro Muntean: Advanced Predictive Tile Selection Using Dynamic Tiling for Prioritized 360° Video VR Streaming. 6:1-6:28
- Jia Wang, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng: Language-guided Residual Graph Attention Network and Data Augmentation for Visual Grounding. 7:1-7:23
- Haoran Wang, Yajie Wang, Baosheng Yu, Yibing Zhan, Chunfeng Yuan, Wankou Yang: Attentional Composition Networks for Long-Tailed Human Action Recognition. 8:1-8:18
- Zi-Chao Zhang, Zhen-Duo Chen, Zhen-Yu Xie, Xin Luo, Xin-Shun Xu: S3Mix: Same Category Same Semantics Mixing for Augmenting Fine-grained Images. 9:1-9:16
- Mingkui Tan, Zhiquan Wen, Leyuan Fang, Qi Wu: Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. 10:1-10:23
- Yiming Yang, Weipeng Hu, Haifeng Hu: Syncretic Space Learning Network for NIR-VIS Face Recognition. 11:1-11:25
- Chenghua Li, Zongze Li, Jing Sun, Yun Zhang, Xiaoping Jiang, Fan Zhang: Dynamic Weighted Gradient Reversal Network for Visible-infrared Person Re-identification. 12:1-12:23
- Jiajun Song, Zhuo Li, Weiqing Min, Shuqiang Jiang: Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design. 13:1-13:19
- Yiting Jin, Jie Wu, Wanliang Wang, Yidong Yan, Jiawei Jiang, Jianwei Zheng: Cascading Blend Network for Image Inpainting. 14:1-14:21
- Kehua Guo, Liang Chen, Xiangyuan Zhu, Xiaoyan Kui, Jian Zhang, Heyuan Shi: Double-Layer Search and Adaptive Pooling Fusion for Reference-Based Image Super-Resolution. 15:1-15:23
- Jing Zhao, Bin Li, Jiahao Li, Ruiqin Xiong, Yan Lu: A Universal Optimization Framework for Learning-based Image Codec. 16:1-16:19
- Liping Zhang, Shukai Chen, Fei Lin, Wei Ren, Kim-Kwang Raymond Choo, Geyong Min: 1DIEN: Cross-session Electrocardiogram Authentication Using 1D Integrated EfficientNet. 17:1-17:17
- Baian Chen, Zhilei Chen, Xiaowei Hu, Jun Xu, Haoran Xie, Jing Qin, Mingqiang Wei: Dynamic Message Propagation Network for RGB-D and Video Salient Object Detection. 18:1-18:21
- Xiang Gao, Wei Hu, Guo-Jun Qi: Self-supervised Multi-view Learning via Auto-encoding 3D Transformations. 19:1-19:23
- Dewang Wang, Gaobo Yang, Zhiqing Guo, Jiyou Chen: Enhancing Adversarial Embedding based Image Steganography via Clustering Modification Directions. 20:1-20:20
- Xiaojia Zhao, Tingting Xu, Qiangqiang Shen, Youfa Liu, Yongyong Chen, Jingyong Su: Double High-Order Correlation Preserved Robust Multi-View Ensemble Clustering. 21:1-21:21
- Shuji Tasaka: Usefulness of QoS in Multidimensional QoE Prediction for Haptic-Audiovisual Communications. 22:1-22:24
- Ching-Nung Yang, Xiaotian Wu, Min-Jung Chung: Enhancement of Information Carrying and Decoding for Visual Cryptography with Error Correction. 23:1-23:24
- Yuqing Zhang, Yong Zhang, Shaofan Wang, Yun Liang, Baocai Yin: Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network. 24:1-24:23
- Wenying Wen, Minghui Huang, Yushu Zhang, Yuming Fang, Yifan Zuo: Visual Security Index Combining CNN and Filter for Perceptually Encrypted Light Field Images. 25:1-25:15
- Linlin Liu, Haijun Zhang, Qun Li, Jianghong Ma, Zhao Zhang: Collocated Clothing Synthesis with GANs Aided by Textual Information: A Multi-Modal Framework. 26:1-26:25
- Xulei Lou, Tinghui Wu, Haifeng Hu, Dihu Chen: Self-Supervised Consistency Based on Joint Learning for Unsupervised Person Re-identification. 27:1-27:20
- Yichi Zhang, Gongchun Ding, Dandan Ding, Zhan Ma, Zhu Li: On Content-Aware Post-Processing: Adapting Statistically Learned Models to Dynamic Content. 28:1-28:23
- Jing Xu, Bing Liu, Yong Zhou, Mingming Liu, Rui Yao, Zhiwen Shao: Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning. 29:1-29:16
- Cong Zou, Rui Wang, Cheng Jin, Sanyi Zhang, Xin Wang: S2CL-Leaf Net: Recognizing Leaf Images Like Human Botanists. 30:1-30:20
Volume 20, Number 2, February 2024
- Suyel Namasudra, Pascal Lorenz, Seifedine Kadry, Syed Ahmad Chan Bukhari: Introduction to the Special Issue on DNA-centric Modeling and Practice for Next-generation Computing and Communication Systems. 31:1-31:2
- Shaohua Wan, Yi Jin, Guandong Xu, Michele Nappi: Editorial to Special Issue on Multimedia Cognitive Computing for Intelligent Transportation System. 32:1-32:2
- Ruonan Zhao, Laurence T. Yang, Debin Liu, Wanli Lu, Chenlu Zhu, Yiheng Ruan: Tensor-Empowered LSTM for Communication-Efficient and Privacy-Enhanced Cognitive Federated Learning in Intelligent Transportation Systems. 33:1-33:21
- Hongjian Shi, Hao Wang, Ruhui Ma, Yang Hua, Tao Song, Honghao Gao, Haibing Guan: Robust Searching-Based Gradient Collaborative Management in Intelligent Transportation System. 34:1-34:23
- Zejia Weng, Zuxuan Wu, Hengduo Li, Jingjing Chen, Yu-Gang Jiang: HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition. 35:1-35:18
- Shixiong Zhang, Wenmin Wang, Honglei Li, Shenyong Zhang: E-detector: Asynchronous Spatio-temporal for Event-based Object Detection in Intelligent Transportation System. 36:1-36:20
- Ram Prasad Padhy, Pankaj Kumar Sa, Fabio Narducci, Carmen Bisogni, Sambit Bakshi: Monocular Vision-aided Depth Measurement from RGB Images for Autonomous UAV Navigation. 37:1-37:22
- Zhihan Lv, Fabio Poiesi, Qi Dong, Jaime Lloret, Houbing Song: Special Issue on Deep Learning for Intelligent Human Computer Interaction. 38:1-38:5
- Wenjuan Gong, Yue Zhang, Wei Wang, Peng Cheng, Jordi Gonzàlez: Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition. 39:1-39:20
- Youcef Djenouri, Asma Belhadi, Gautam Srivastava, Jerry Chun-Wei Lin: An Efficient and Accurate GPU-based Deep Learning Model for Multimedia Recommendation. 40:1-40:18
- Loveleen Gaur, Mohan Bhandari, Bhadwal Singh Shikhar, NZ Jhanjhi, Mohammad Shorfuzzaman, Mehedi Masud: Explanation-Driven HCI Model to Examine the Mini-Mental State for Alzheimer's Disease. 41:1-41:16
- Mi Li, Wei Zhang, Bin Hu, Jiaming Kang, Yuqi Wang, Shengfu Lu: Automatic Assessment of Depression and Anxiety through Encoding Pupil-wave from HCI in VR Scenes. 42:1-42:22
- Abdul Qayyum, Imran Razzak, Muhammad Tanveer, Moona Mazher: Spontaneous Facial Behavior Analysis Using Deep Transformer-based Framework for Child-computer Interaction. 43:1-43:17
- Xiaowei Chen, Xiao Jiang, Lishuang Zhan, Shihui Guo, Qunsheng Ruan, Guoliang Luo, Minghong Liao, Yipeng Qin: Full-body Human Motion Reconstruction with Sparse Joint Tracking Using Flexible Sensors. 44:1-44:19
- Shanbao Qiao, Neal N. Xiong, Yongbin Gao, Zhijun Fang, Wenjun Yu, Juan Zhang, Xiaoyan Jiang: Self-Supervised Learning of Depth and Ego-Motion for 3D Perception in Human Computer Interaction. 45:1-45:21
- Yan Kang, Bin Pu, Yongqi Kou, Yun Yang, Jianguo Chen, Khan Muhammad, Po Yang, Lida Xu, Mohammad Hijji: A Deep Graph Network with Multiple Similarity for User Clustering in Human-Computer Interaction. 46:1-46:20
- Bahar Uddin Mahmud, Guan Y. Hong, Bernard Fong: A Study of Human-AI Symbiosis for Creative Work: Recent Developments and Future Directions in Deep Learning. 47:1-47:21
- Xiaoling Gu, Jie Huang, Yongkang Wong, Jun Yu, Jianping Fan, Pai Peng, Mohan S. Kankanhalli: PAINT: Photo-realistic Fashion Design Synthesis. 48:1-48:23
- Qingfeng Dai, Yongkang Wong, Guofei Sun, Yanwei Wang, Zhou Zhou, Mohan S. Kankanhalli, Xiangdong Li, Weidong Geng: Unsupervised Domain Adaptation by Causal Learning for Biometric Signal-based HCI. 49:1-49:18
- Yi Xiao, Tong Liu, Yu Han, Yue Liu, Yongtian Wang: Realtime Recognition of Dynamic Hand Gestures in Practical Applications. 50:1-50:17
- Jianping Gou, Liyuan Sun, Baosheng Yu, Shaohua Wan, Dacheng Tao: Hierarchical Multi-Attention Transfer for Knowledge Distillation. 51:1-51:20
- Subhrajyoti Deb, Abhilash Kumar Das, Nirmalya Kar: An Applied Image Cryptosystem on Moore's Automaton Operating on δ (qk)/𝔽2. 52:1-52:20
- Sisi You, Yukun Zuo, Hantao Yao, Changsheng Xu: Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene. 53:1-53:19
- Shiqi Sun, Danlan Huang, Xiaoming Tao, Chengkang Pan, Guangyi Liu, Changwen Chen: Boosting Scene Graph Generation with Contextual Information. 54:1-54:24
- Jianwei Zheng, Yu Liu, Yuchao Feng, Honghui Xu, Meiyu Zhang: Contrastive Attention-guided Multi-level Feature Registration for Reference-based Super-resolution. 55:1-55:21
- Shangxi Wu, Jitao Sang, Kaiyan Xu, Guanhua Zheng, Changsheng Xu: Adaptive Adversarial Logits Pairing. 56:1-56:16
- Ying Chen, Rui Yao, Yong Zhou, Jiaqi Zhao, Bing Liu, Abdulmotaleb El-Saddik: Black-box Attack against Self-supervised Video Object Segmentation Models with Contrastive Loss. 57:1-57:21
- Shuang Liang, Wentao Ma, Chi Xie: Relation with Free Objects for Action Recognition. 58:1-58:19
- Qiaolin He, Zhijie Zheng, Haifeng Hu: A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identification. 59:1-59:20
- Wuliang Huang, Yiqiang Chen, Xinlong Jiang, Teng Zhang, Qian Chen: GJFusion: A Channel-Level Correlation Construction Method for Multimodal Physiological Signal Fusion. 60:1-60:23
Volume 20, Number 3, March 2024
- Chengji Shen, Zhenjiang Liu, Xin Gao, Zunlei Feng, Mingli Song: Self-Adaptive Clothing Mapping Based Virtual Try-on. 61:1-61:26
- Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo: Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features. 62:1-62:24
- Yan Wang, Peize Li, Qingyi Si, Hanwen Zhang, Wenyu Zang, Zheng Lin, Peng Fu: Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering. 63:1-63:22
- Qiang Guo, Zhi Zhang, Mingliang Zhou, Hong Yue, Huayan Pu, Jun Luo: Image Defogging Based on Regional Gradient Constrained Prior. 64:1-64:17
- Jintao Guo, Lei Qi, Yinghuan Shi, Yang Gao: PLACE Dropout: A Progressive Layer-wise and Channel-wise Dropout for Domain Generalization. 65:1-65:23
- Yuan Xiong, Jingru Wang, Zhong Zhou: VirtualLoc: Large-scale Visual Localization Using Virtual Images. 66:1-66:19
- Yiheng Zhang, Ting Yao, Zhaofan Qiu, Tao Mei: Explaining Cross-domain Recognition with Interpretable Deep Classifier. 67:1-67:21
- Ruimin Wang, Fasheng Wang, Yiming Su, Jing Sun, Fuming Sun, Haojie Li: Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection. 68:1-68:22
- Jemily Rime, Alan Archer-Boyd, Tom Collins: How Will You Pod? Implications of Creators' Perspectives for Designing Innovative Podcasting Tools. 69:1-69:25
- Ming Cheung: Learning from the Past: Fast NAS for Tasks and Datasets. 70:1-70:18
- Xinyue Li, Haiyong Xu, Gangyi Jiang, Mei Yu, Ting Luo, Xuebo Zhang, Hongwei Ying: Underwater Image Quality Assessment from Synthetic to Real-world: Dataset and Objective Method. 71:1-71:23
- Sujuan Hou, Jiacheng Li, Weiqing Min, Qiang Hou, Yanna Zhao, Yuanjie Zheng, Shuqiang Jiang: Deep Learning for Logo Detection: A Survey. 72:1-72:23
- Yunjie Peng, Jinlin Wu, Boqiang Xu, Chunshui Cao, Xu Liu, Zhenan Sun, Zhiqiang He: Deep Learning Based Occluded Person Re-Identification: A Survey. 73:1-73:27
- Muhammad Arslan Manzoor, Sarah Albarri, Ziting Xian, Zaiqiao Meng, Preslav Nakov, Shangsong Liang: Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications. 74:1-74:34
- Yanyan Shi, Shaowu Yang, Wenjing Yang, Dianxi Shi, Xuehui Li: Boosting Few-shot Object Detection with Discriminative Representation and Class Margin. 75:1-75:19
- Harry Cheng, Yangyang Guo, Tianyi Wang, Qi Li, Xiaojun Chang, Liqiang Nie: Voice-Face Homogeneity Tells Deepfake. 76:1-76:22
- Jin Ye, Meng Dan, Wenchao Jiang: A Visual Sensitivity Aware ABR Algorithm for DASH via Deep Reinforcement Learning. 77:1-77:22
- Jian Wang, Xiao Wang, Guosheng Zhao: Task Recommendation via Heterogeneous Multi-modal Features and Decision Fusion in Mobile Crowdsensing. 78:1-78:20
- Si-chao Lei, Yue-Jiao Gong, Xiaolin Xiao, Yicong Zhou, Jun Zhang: Boosting Diversity in Visual Search with Pareto Non-Dominated Re-Ranking. 79:1-79:23
- Huijie Zhang, Pu Li, Xiaobai Liu, Xianfeng Terry Yang, Li An: An Iterative Semi-supervised Approach with Pixel-wise Contrastive Loss for Road Extraction in Aerial Images. 80:1-80:21
- Jing Fang, Yinbo Yu, Zhongyuan Wang, Xin Ding, Ruimin Hu: An Image Arbitrary-Scale Super-Resolution Network Using Frequency-domain Information. 81:1-81:23
- Xiao Luo, Wei Ju, Yiyang Gu, Yifang Qin, Siyu Yi, Daqing Wu, Luchen Liu, Ming Zhang: Toward Effective Semi-supervised Node Classification with Hybrid Curriculum Pseudo-labeling. 82:1-82:19
- Wen Guo, Wuzhou Quan, Junyu Gao, Tianzhu Zhang, Changsheng Xu: Feature Disentanglement Network: Multi-Object Tracking Needs More Differentiated Features. 83:1-83:22
- Mohammed Khaleel, Azeez Idris, Wallapak Tavanapong, Jacob Pratt, Jung-Hwan Oh, Piet C. de Groen: VisActive: Visual-concept-based Active Learning for Image Classification under Class Imbalance. 84:1-84:21
- Honghua Chen, Zhiqi Li, Mingqiang Wei, Jun Wang: Geometric and Learning-Based Mesh Denoising: A Comprehensive Survey. 85:1-85:28
- Ning Han, Yawen Zeng, Chuhao Shi, Guangyi Xiao, Hao Chen, Jingjing Chen: BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval. 86:1-86:21
- Yuan Feng, Yaojun Hu, Pengfei Fang, Sheng Liu, Yanhong Yang, Shengyong Chen: Asymmetric Dual-Decoder U-Net for Joint Rain and Haze Removal. 87:1-87:23
- Yurui Xie, Ling Guan: Sparsity-guided Discriminative Feature Encoding for Robust Keypoint Detection. 88:1-88:22
- Nicolas Beuve, Wassim Hamidouche, Olivier Déforges: Hierarchical Learning and Dummy Triplet Loss for Efficient Deepfake Detection. 89:1-89:18
- Suncheng Xiang, Dahong Qian, Jingsheng Gao, Zirui Zhang, Ting Liu, Yuzhuo Fu: Rethinking Person Re-Identification via Semantic-based Pretraining. 90:1-90:17
Volume 20, Number 4, April 2024
- Min Peng, Xiaohu Shao, Yu Shi, Xiangdong Zhou: Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering. 91:1-91:22
- Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Philip Torr, Nicu Sebe: Cloth Interactive Transformer for Virtual Try-On. 92:1-92:20
- Xiushan Nie, Yang Shi, Ziyu Meng, Jin Huang, Weili Guan, Yilong Yin: Complex Scenario Image Retrieval via Deep Similarity-aware Hashing. 93:1-93:24
- Jiawei Tan, Hongxing Wang, Junsong Yuan: Characters Link Shots: Character Attention Network for Movie Scene Segmentation. 94:1-94:23
- Mingliang Zhou, Xinwen Zhao, Futing Luo, Jun Luo, Huayan Pu, Tao Xiang: Robust RGB-T Tracking via Adaptive Modality Weight Correlation Filters and Cross-modality Learning. 95:1-95:20
- Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai: Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images. 96:1-96:22
- Shuvendu Roy, Ali Etemad: Contrastive Learning of View-invariant Representations for Facial Expressions Recognition. 97:1-97:22
- Jun Liu, Jiantao Zhou, Haiwei Wu, Weiwei Sun, Jinyu Tian: Generating Robust Adversarial Examples against Online Social Networks (OSNs). 98:1-98:26
- Tao Yao, Yiru Li, Ying Li, Yingying Zhu, Gang Wang, Jun Yue: Cross-modal Semantically Augmented Network for Image-text Matching. 99:1-99:18
- Ahmed Telili, Sid Ahmed Fezza, Wassim Hamidouche, Hanene Brachemi Meftah: 2BiVQA: Double Bi-LSTM-based Video Quality Assessment of UGC Videos. 100:1-100:22
- Hongzhou Chen, Haihan Duan, Maha Abdallah, Yufeng Zhu, Yonggang Wen, Abdulmotaleb El-Saddik, Wei Cai: Web3 Metaverse: State-of-the-Art and Vision. 101:1-101:42
- Lilong Wang, Yunhui Shi, Jin Wang, Shujun Chen, Baocai Yin, Nam Ling: Graph Based Cross-Channel Transform for Color Image Compression. 102:1-102:25
- Kai Han, Yu Liu, Rukai Wei, Ke Zhou, Jinhui Xu, Kun Long: Supervised Hierarchical Online Hashing for Cross-modal Retrieval. 103:1-103:23
- Fengyi Fu, Shancheng Fang, Weidong Chen, Zhendong Mao: Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting. 104:1-104:24
- Yuxiang Peng, Chong Fu, Guixing Cao, Wei Song, Junxin Chen, Chiu-Wing Sham: JPEG-compatible Joint Image Compression and Encryption Algorithm with File Size Preservation. 105:1-105:20
- Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Zichuan Xu, Haozhao Wang, Xing Di, Weining Lu, Yu Cheng: Transform-Equivariant Consistency Learning for Temporal Sentence Grounding. 106:1-106:19
- Yijie Hu, Bin Dong, Kaizhu Huang, Lei Ding, Wei Wang, Xiaowei Huang, Qiu-Feng Wang: Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment. 107:1-107:20
- Rongjiao Liang, Shichao Zhang, Wenzhen Zhang, Guixian Zhang, Jinyun Tang: Nonlocal Hybrid Network for Long-tailed Image Classification. 108:1-108:22
- Piao Shi, Min Hu, Xuefeng Shi, Fuji Ren: Deep Modular Co-Attention Shifting Network for Multimodal Sentiment Analysis. 109:1-109:23
- Jing Zhang, Dan Guo, Xun Yang, Peipei Song, Meng Wang: Visual-linguistic-stylistic Triple Reward for Cross-lingual Image Captioning. 110:1-110:23
- Zhaoyang Jia, Yan Lu, Houqiang Li: Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis. 111:1-111:20
- Sheng Zhou, Dan Guo, Xun Yang, Jianfeng Dong, Meng Wang: Graph Pooling Inference Network for Text-based VQA. 112:1-112:21
- Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian: One-Bit Supervision for Image Classification: Problem, Solution, and Beyond. 113:1-113:22
- Hang Yuan, Wei Gao, Siwei Ma, Yiqiang Yan: Divide-and-conquer-based RDO-free CU Partitioning for 8K Video Compression. 114:1-114:20
- Mingyu Li, Tao Zhou, Zhuo Huang, Jian Yang, Jie Yang, Chen Gong: Dynamic Weighted Adversarial Learning for Semi-Supervised Classification under Intersectional Class Mismatch. 115:1-115:24
- Hui Huang, Di Xiao, Jia Liang: Secure Low-complexity Compressive Sensing with Preconditioning Prior Regularization Reconstruction. 116:1-116:22
- Nathan Clement, Alan Schoen, Arnold P. Boedihardjo, Andrew Jenkins: Synthetic Data and Hierarchical Object Detection in Overhead Imagery. 117:1-117:20
- Jiang Bian, Xuhong Li, Tao Wang, Qingzhong Wang, Jun Huang, Chen Liu, Jun Zhao, Feixiang Lu, Dejing Dou, Haoyi Xiong: P2ANet: A Large-Scale Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos. 118:1-118:23
- Jifan Yang, Zhongyuan Wang, Guangcheng Wang, Baojin Huang, Yuhong Yang, Weiping Tu: Auxiliary Information Guided Self-attention for Image Quality Assessment. 119:1-119:23
- Zhanzhou Feng, Jiaming Xu, Lei Ma, Shiliang Zhang: Efficient Video Transformers via Spatial-temporal Token Merging for Action Recognition. 120:1-120:21
Volume 20, Number 5, May 2024
- Shupei Zhang, Chenqiu Zhao, Anup Basu: Principal Component Approximation Network for Image Compression. 121:1-121:20
- Tianyu Zhang, Weiqing Min, Tao Liu, Shuqiang Jiang, Yong Rui: Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing. 122:1-122:21
- Yu Liu, Mingbo Zhao, Zhao Zhang, Yuping Liu, Shuicheng Yan: Arbitrary Virtual Try-on Network: Characteristics Preservation and Tradeoff between Body and Clothing. 123:1-123:23
- Shih-Wei Yang, Li-Hsiang Shen, Hong-Han Shuai, Kai-Ten Feng: CMAF: Cross-Modal Augmentation via Fusion for Underwater Acoustic Image Recognition. 124:1-124:25
- Yazhou Zhang, Yang Yu, Mengyao Wang, Min Huang, M. Shamim Hossain: Self-Adaptive Representation Learning Model for Multi-Modal Sentiment and Sarcasm Joint Analysis. 125:1-125:17
- Lei Qi, Peng Dong, Tan Xiong, Hui Xue, Xin Geng: DoubleAUG: Single-domain Generalized Object Detector in Urban via Color Perturbation and Dual-style Memory. 126:1-126:20
- Dan Shi, Lei Zhu, Jingjing Li, Guohua Dong, Huaxiang Zhang: Incomplete Cross-Modal Retrieval with Deep Correlation Transfer. 127:1-127:21
- Xianhua Zeng, Xinyu Wang, Yicai Xie: Multiple Pseudo-Siamese Network with Supervised Contrast Learning for Medical Multi-modal Retrieval. 128:1-128:23
- Sisi You, Hantao Yao, Bing-Kun Bao, Changsheng Xu: Multi-object Tracking with Spatial-Temporal Tracklet Association. 129:1-129:21
- Gülnaziye Bingöl, Simone Porcu, Alessandro Floris, Luigi Atzori: QoE Estimation of WebRTC-based Audio-visual Conversations from Facial and Speech Features. 130:1-130:23
- Heqian Qiu, Hongliang Li, Qingbo Wu, Hengcan Shi, Lanxiao Wang, Fanman Meng, Linfeng Xu: Learning Offset Probability Distribution for Accurate Object Detection. 131:1-131:24
- Alessandro Floris, Simone Porcu, Luigi Atzori: Controlling Media Player with Hands: A Transformer Approach and a Quality of Experience Assessment. 132:1-132:22
- Jingyu Li, Zhendong Mao, Hao Li, Weidong Chen, Yongdong Zhang: Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image Captioning. 133:1-133:23
- Zeyu Ma, Siwei Wang, Xiao Luo, Zhonghui Gu, Chong Chen, Jinxing Li, Xian-Sheng Hua, Guangming Lu: HARR: Learning Discriminative and High-Quality Hash Codes for Image Retrieval. 134:1-134:23
- Chengyang Zhang, Yong Zhang, Bo Li, Xinglin Piao, Baocai Yin: CrowdGraph: Weakly supervised Crowd Counting via Pure Graph Neural Network. 135:1-135:23
- Jie Wang, Guoqiang Li, Jie Shi, Jinwen Xi: Weighted Guided Optional Fusion Network for RGB-T Salient Object Detection. 136:1-136:20
- Yibo Zhang, Weiguo Lin, Junfeng Xu: Joint Audio-Visual Attention with Contrastive Learning for More General Deepfake Detection. 137:1-137:23
- Depei Wang, Ruifeng Xu, Lianglun Cheng, Zhuowei Wang: Knowledge-integrated Multi-modal Movie Turning Point Identification. 138:1-138:19
- Chunpu Liu, Guanglei Yang, Wangmeng Zuo, Tianyi Zang: DPDFormer: A Coarse-to-Fine Model for Monocular Depth Estimation. 139:1-139:21
- Yunyao Yan, Guoqing Xiang, Huizhu Jia, Jie Chen, Xiaofeng Huang, Xiaodong Xie: Two-Stage Perceptual Quality Oriented Rate Control Algorithm for HEVC. 140:1-140:20
- Zongyi Li, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Boyuan Liu, Runsheng Wang, Chengxin Zhao: Viewpoint Disentangling and Generation for Unsupervised Object Re-ID. 141:1-141:23
- Kuai Dai, Xutao Li, Huiwei Lin, Yin Jiang, Xunlai Chen, Yunming Ye, Di Xian: TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction. 142:1-142:24
- Yingnan Ma, Chenqiu Zhao, Bingran Huang, Xudong Li, Anup Basu: RAST: Restorable Arbitrary Style Transfer. 143:1-143:21
- Wei-Yen Hsu, Hsien-Wen Lin: Context-detail-aware United Network for Single Image Deraining. 144:1-144:18
- Yao Liu, Gangfeng Cui, Jiahui Luo, Xiaojun Chang, Lina Yao: Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition. 145:1-145:22
- Chengxin Chen, Pengyuan Zhang: Modality-collaborative Transformer with Hybrid Feature Reconstruction for Robust Emotion Recognition. 146:1-146:23
- Jiafeng Huang, Tianjun Zhang, Shengjie Zhao, Lin Zhang, Yicong Zhou: An Underwater Organism Image Dataset and a Lightweight Module Designed for Object Detection Networks. 147:1-147:23
- Jing Liu, Litao Shang, Yuting Su, Weizhi Nie, Xin Wen, Anan Liu: Privacy-preserving Multi-source Cross-domain Recommendation Based on Knowledge Graph. 148:1-148:18
- Xingyu Liu, Zhongyun Hua, Shuang Yi, Yushu Zhang, Yicong Zhou: Bi-directional Block Encoding for Reversible Data Hiding over Encrypted Images. 149:1-149:23
- Peng Yi, Zhongyuan Wang, Laigan Luo, Kui Jiang, Zheng He, Junjun Jiang, Tao Lu, Jiayi Ma: Omniscient Video Super-Resolution with Explicit-Implicit Alignment. 150:1-150:23
Volume 20, Number 6, June 2024
- Amit Kumar Singh, Deepa Kundur, Mauro Conti: Introduction to the Special Issue on Integrity of Multimedia and Multimodal Data in Internet of Things. 151:1-151:4
- Wenyuan Yang, Shaocong Wu, Jianwei Fei, Xianwang Zeng, Yuemin Ding, Zhihua Xia: A Bitcoin-based Secure Outsourcing Scheme for Optimization Problem in Multimedia Internet of Things. 152:1-152:23
- Qingzhi Liu, Yuchen Huang, Chenglu Jin, Xiaohan Zhou, Ying Mao, Cagatay Catal, Long Cheng: Privacy and Integrity Protection for IoT Multimodal Data Using Machine Learning and Blockchain. 153:1-153:18
- Simon Lucas Jonker, Malthe Jelstrup, Weizhi Meng, Brooke Lampe: Detecting Post Editing of Multimedia Images using Transfer Learning and Fine Tuning. 154:1-154:22
- Carmen Bisogni, Lucia Cascone, Michele Nappi, Chiara Pero: IoT-enabled Biometric Security: Enhancing Smart Car Safety with Depth-based Head Pose Estimation. 155:1-155:24
- Saif E. Nouma, Attila A. Yavuz: Trustworthy and Efficient Digital Twins in Post-Quantum Era with Hybrid Hardware-Assisted Signatures. 156:1-156:30
- Fan Li, Yanxiang Chen, Haiyang Liu, Zuxing Zhao, Yuanzhi Yao, Xin Liao: Vocoder Detection of Spoofing Speech Based on GAN Fingerprints and Domain Generalization. 157:1-157:20
- Jing Gao, Peng Li, Asif Ali Laghari, Gautam Srivastava, Thippa Reddy Gadekallu, Sidra Abbas, Jianing Zhang: Incomplete Multiview Clustering via Semidiscrete Optimal Transport for Multimedia Data Mining in IoT. 158:1-158:20
- Zhenyu Liu, Da Li, Xinyu Zhang, Zhang Zhang, Peng Zhang, Caifeng Shan, Jungong Han: Pedestrian Attribute Recognition via Spatio-temporal Relationship Learning for Visual Surveillance. 159:1-159:15
- Manvi Jha, Ashish Kumar Bhandari: NSDIE: Noise Suppressing Dark Image Enhancement Using Multiscale Retinex and Low-Rank Minimization. 160:1-160:22
- Wenhao Fang, Jiayuan Xie, Hongfei Liu, Jiali Chen, Yi Cai: Diverse Visual Question Generation Based on Multiple Objects Selection. 161:1-161:22
- Yichi Zhang, Dandan Ding, Zhan Ma, Zhu Li: A Reconfigurable Framework for Neural Network Based Video In-Loop Filtering. 162:1-162:20
- Ronglai Zuo, Brian Mak: Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal. 163:1-163:25
- Qinglin Liu, Quanling Meng, Xiaoqian Lv, Zonglin Li, Wei Yu, Shengping Zhang: Human Selective Matting. 164:1-164:23
- Shenshen Li, Xing Xu, Xun Jiang, Fumin Shen, Zhe Sun, Andrzej Cichocki: Cross-Modal Attention Preservation with Self-Contrastive Learning for Composed Query-Based Image Retrieval. 165:1-165:22
- Xizhong Wang, Rui Liu, Xin Yang, Qiang Zhang, Dongsheng Zhou: MCFNet: Multi-Attentional Class Feature Augmentation Network for Real-Time Scene Parsing. 166:1-166:17
- Yanzhe Chen, Jiahuan Zhou, Yuxin Peng: SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback. 167:1-167:17
- Huan Liu, Xiaolong Liu, Zichang Tan, Xiaolong Li, Yao Zhao: PADVG: A Simple Baseline of Active Protection for Audio-Driven Video Generation. 168:1-168:19
- Yadong Huo, Qibing Qin, Jiangyan Dai, Wenfeng Zhang, Lei Huang, Chengduan Wang: Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval. 169:1-169:23
- Yunyi Li, Fu Xiao, Wei Liang, Linqing Gui: Multiply Complementary Priors for Image Compressive Sensing Reconstruction in Impulsive Noise. 170:1-170:22
- Weichao Zhao, Hezhen Hu, Wengang Zhou, Li Li, Houqiang Li: Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video. 171:1-171:18
- Aashania Antil, Chhavi Dhiman: MF2ShrT: Multimodal Feature Fusion Using Shared Layered Transformer for Face Anti-spoofing. 172:1-172:21
- M. Shamim Hossain, Yixue Hao, Long Hu, Jia Liu, Gang Wei, Min Chen: Immersive Multimedia Service Caching in Edge Cloud with Renewable Energy. 173:1-173:23
- Ying Ying Zhang, Shuo Zhang, Ming Hui: Semantic-Consistency-guided Learning on Deep Features for Unsupervised Salient Object Detection. 174:1-174:23
- Xuelin Liu, Jiebin Yan, Liping Huang, Yuming Fang, Zheng Wan, Yang Liu: Perceptual Quality Assessment of Omnidirectional Images: A Benchmark and Computational Model. 175:1-175:24
- Yuhao Cheng, Yichao Yan, Wenhan Zhu, Ye Pan, Bowen Pan, Xiaokang Yang: Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation. 176:1-176:20
- Hao Chen, Yunlong Yu, Yonghan Dong, Zheming Lu, Yingming Li, Zhongfei Zhang: Multi-Content Interaction Network for Few-Shot Segmentation. 177:1-177:20
- Zicheng Zhang, Wei Sun, Haoning Wu, Yingjie Zhou, Chunyi Li, Zijian Chen, Xiongkuo Min, Guangtao Zhai, Weisi Lin: GMS-3DQA: Projection-Based Grid Mini-patch Sampling for 3D Model Quality Assessment. 178:1-178:19
- Jun Lyu, Guangming Wang, M. Shamim Hossain: Iterative Temporal-spatial Transformer-based Cardiac T1 Mapping MRI Reconstruction. 179:1-179:18
- Yuanjie Dang, Chunxia Huang, Peng Chen, Dongdong Zhao, Nan Gao, Ronghua Liang, Ruohong Huan: Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization. 180:1-180:21
- Qiong Chen, Tianlin Huang, Qingfa Liu: SWRM: Similarity Window Reweighting and Margin for Long-Tailed Recognition. 181:1-181:18
- Peiguang Jing, Xianyi Liu, Lijuan Zhang, Yun Li, Yu Liu, Yuting Su: Multimodal Attentive Representation Learning for Micro-video Multi-label Classification. 182:1-182:23
- Qingbao Huang, Pijian Li, Youji Huang, Feng Shuang, Yi Cai: Region-Focused Network for Dense Captioning. 183:1-183:20
- Lei Qi, Hongpeng Yang, Yinghuan Shi, Xin Geng: MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization. 184:1-184:21
- Yucheng Suo, Zhedong Zheng, Xiaohan Wang, Bang Zhang, Yi Yang: Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation. 185:1-185:18
Volume 20, Number 7, July 2024
- Roberto García, Ana Cediel, Mercè Teixidó, Rosa Gil: Semantics and Non-fungible Tokens for Copyright Management on the Metaverse and Beyond. 186:1-186:20
- Tianxiu Xie, Keke Gai, Liehuang Zhu, Shuo Wang, Zijian Zhang: RAC-Chain: An Asynchronous Consensus-based Cross-chain Approach to Scalable Blockchain for Metaverse. 187:1-187:24
- Yongjun Ren, Zhiying Lv, Neal N. Xiong, Jin Wang: HCNCT: A Cross-chain Interaction Scheme for the Blockchain-based Metaverse. 188:1-188:23
- Shuang-Min Chen, Rui Xu, Jian Xu, Shiqing Xin, Changhe Tu, Chenglei Yang, Lin Lu: QuickCSGModeling: Quick CSG Operations Based on Fusing Signed Distance Fields for VR Modeling. 189:1-189:18
- Qinnan Zhang, Zehui Xiong, Jianming Zhu, Sheng Gao, Wanting Yang: A Privacy-preserving Auction Mechanism for Learning Model as an NFT in Blockchain-driven Metaverse. 190:1-190:24
- Han Wang, Hui Li, Abla Smahi, Feng Zhao, Yao Yao, Ching Chuen Chan, Shiyu Wang, Wenyuan Yang, Shuo-Yen Robert Li: MIS: A Multi-Identifier Management and Resolution System in the Metaverse. 191:1-191:25
- Fei Peng

, Le Qin
, Min Long
, Jin Li
:
Detection of Adversarial Facial Accessory Presentation Attacks Using Local Face Differential. 192:1-192:28 - Xinjian Gao

, Ye Pang
, Yuyu Liu
, Maokun Han
, Jun Yu
, Wei Wang
, Yuanxu Chen
:
Multimodal Visual-Semantic Representations Learning for Scene Text Recognition. 193:1-193:18 - Si-chao Lei

, Yue-Jiao Gong
, Xiaolin Xiao
, Yicong Zhou
, Jun Zhang
:
Tensorial Evolutionary Optimization for Natural Image Matting. 194:1-194:23 - Jifan Yang

, Zhongyuan Wang
, Baojin Huang
, Jiaxin Ai
, Yuhong Yang
, Zixiang Xiong
:
Joint Distortion Restoration and Quality Feature Learning for No-reference Image Quality Assessment. 195:1-195:20 - Weiyao Lin

, Yufeng Zhang
, Wenrui Dai
, Huabin Liu
, John See
, Hongkai Xiong
:
Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations. 196:1-196:23 - Xiaofeng Qu

, Li Liu
, Lei Zhu
, Liqiang Nie
, Huaxiang Zhang
:
Instance-level Adversarial Source-free Domain Adaptive Person Re-identification. 197:1-197:22 - Runyu Yang

, Dong Liu
, Siwei Ma
, Feng Wu
, Wen Gao
:
Perceptual Quality-Oriented Rate Allocation via Distillation from End-to-End Image Compression. 198:1-198:22 - Liangzhe Chen

, Wei Li
, Xiaohui Cui
, Zhenyu Wang
, Stefano Berretti
, Shaohua Wan
:
MS-GDA: Improving Heterogeneous Recipe Representation via Multinomial Sampling Graph Data Augmentation. 199:1-199:23 - Lei Gao

, Zheng Guo
, Ling Guan
:
An Optimal Edge-weighted Graph Semantic Correlation Framework for Multi-view Feature Representation Learning. 200:1-200:23 - Xiaoping Liang

, Wanting Liu
, Xianquan Zhang
, Zhenjun Tang
:
Robust Image Hashing via CP Decomposition and DCT for Copy Detection. 201:1-201:22 - Feng Li

, Yixuan Wu
, Anqi Li
, Huihui Bai
, Runmin Cong
, Yao Zhao
:
Enhanced Video Super-Resolution Network towards Compressed Data. 202:1-202:21 - Penglei Gao

, Xi Yang
, Rui Zhang
, Kaizhu Huang
:
Continuous Image Outpainting with Neural ODE. 203:1-203:16 - Jaime Ruiz-Serra

, Jack White
, Stephen M. Petrie
, Tatiana Kameneva
, Chris McCarthy
:
Learning Scene Representations for Human-assistive Displays Using Self-attention Networks. 204:1-204:26 - Jinjia Peng

, Song Pengpeng
, Hui Li
, Huibing Wang
:
ReFID: Reciprocal Frequency-aware Generalizable Person Re-identification via Decomposition and Filtering. 205:1-205:20 - Carlos Cortés

, Irene Viola
, Jesús Gutiérrez
, Jack Jansen
, Shishir Subramanyam
, Evangelos Alexiou
, Pablo Pérez, Narciso García
, Pablo César
:
Delay Threshold for Social Interaction in Volumetric eXtended Reality Communication. 206:1-206:22 - JongBeom Jeong

, Soonbin Lee
, Eun-Seok Ryu
:
DATRA-MIV: Decoder-Adaptive Tiling and Rate Allocation for MPEG Immersive Video. 207:1-207:22 - Zheng Chen

, Jian Zhao
, Mingyu Yang
, Wengang Zhou
, Houqiang Li
:
Optimizing Camera Motion with MCTS and Target Motion Modeling in Multi-Target Active Object Tracking. 208:1-208:19 - Xiangming Gu

, Longshen Ou
, Wei Zeng
, Jianan Zhang
, Nicholas Wong
, Ye Wang
:
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing. 209:1-209:29 - Mingyu Deng

, Wanyi Zhang
, Jie Zhao
, Zhu Wang
, Mingliang Zhou
, Jun Luo
, Chao Chen
:
A Novel Framework for Joint Learning of City Region Partition and Representation. 210:1-210:23 - Xueqiang Han

, Biao Han
, Jinrong Li
, Congxi Song
:
Multi-agent DRL-based Multipath Scheduling for Video Streaming with QUIC. 211:1-211:23 - Wenxue Cui

, Xingtao Wang
, Xiaopeng Fan
, Shaohui Liu
, Xinwei Gao
, Debin Zhao
:
Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling. 212:1-212:22 - Wenxi Liu

, Jiaxin Cai
, Qi Li
, Chenyang Liao
, Jingjing Cao
, Shengfeng He
, Yuanlong Yu
:
Learning Nighttime Semantic Segmentation the Hard Way. 213:1-213:23 - Xiaoya Yu

, Kejun Wu
, You Yang
, Qiong Liu
:
WaRENet: A Novel Urban Waterlogging Risk Evaluation Network. 214:1-214:28 - Jiawei Tan

, Pingan Yang
, Lu Chen
, Hongxing Wang
:
Temporal Scene Montage for Self-Supervised Video Scene Boundary Detection. 215:1-215:19 - Jun Liu

, Jiantao Zhou
, Jinyu Tian
, Weiwei Sun
:
Recoverable Privacy-Preserving Image Classification through Noise-like Adversarial Examples. 216:1-216:27 - Xiaobo Hu

, Youfang Lin
, Hehe Fan
, Shuo Wang
, Zhihao Wu
, Kai Lv
:
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation. 217:1-217:22 - Baoli Sun

, Xinchen Ye
, Tiantian Yan
, Zhihui Wang
, Haojie Li
, Zhiyong Wang
:
Discriminative Segment Focus Network for Fine-grained Video Action Recognition. 218:1-218:20 - Tingting Han

, Quan Zhou
, Jun Yu
, Zhou Yu
, Jianhui Zhang
, Sicheng Zhao
:
Effective Video Summarization by Extracting Parameter-Free Motion Attention. 219:1-219:20 - Huisi Wu

, Zhaoze Wang
, Yifan Li
, Xueting Liu
, Tong-Yee Lee
:
Suitable and Style-Consistent Multi-Texture Recommendation for Cartoon Illustrations. 220:1-220:26 - Shizhan Liu

, Weiyao Lin
, Yihang Chen
, Yufeng Zhang
, Wenrui Dai
, John See
, Hongkai Xiong
:
A Unified Framework for Jointly Compressing Visual and Semantic Data. 221:1-221:24 - Yefei Sheng

, Ming Tao
, Jie Wang
, Bing-Kun Bao
:
ISF-GAN: Imagine, Select, and Fuse with GPT-Based Text Enrichment for Text-to-Image Synthesis. 222:1-222:17 - Haorao Gao

, Yiming Su
, Fasheng Wang
, Haojie Li
:
Heterogeneous Fusion and Integrity Learning Network for RGB-D Salient Object Detection. 223:1-223:24 - Xiruo Jiang

, Yazhou Yao
, Sheng Liu
, Fumin Shen
, Liqiang Nie
, Xian-Sheng Hua
:
Dual Dynamic Threshold Adjustment Strategy. 224:1-224:18 - Panpan Zhang

, Meng Liu
, Xuemeng Song
, Da Cao
, Zan Gao
, Liqiang Nie
:
Universal Relocalizer for Weakly Supervised Referring Expression Grounding. 225:1-225:23 - Xiaolong Shen

, Zhedong Zheng
, Yi Yang:
StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition. 226:1-226:19 - Kankana Roy

:
Multimodal Score Fusion with Sparse Low-rank Bilinear Pooling for Egocentric Hand Action Recognition. 227:1-227:22 - Huiyuan Fu

, Jin Liu
, Ting Yu
, Xin Wang
, Huadong Ma
:
Multi-Domain Image-to-Image Translation with Cross-Granularity Contrastive Learning. 228:1-228:21 - Hao Zhang

, Meng Liu
, Yuan Qi
, Ning Yang
, Shunbo Hu
, Liqiang Nie
, Wenyin Zhang
:
Efficient Brain Tumor Segmentation with Lightweight Separable Spatial Convolutional Network. 229:1-229:19
Volume 20, Number 8, August 2024
- Jinliang Liu

, Zhedong Zheng
, Zongxin Yang
, Yi Yang:
High Fidelity Makeup via 2D and 3D Identity Preservation Net. 230:1-230:24 - Junjian Huang

, Hao Ren
, Shulin Liu
, Yong Liu
, Chuanlu Lv
, Jiawen Lu
, Changyong Xie
, Hong Lu
:
Real-Time Attentive Dilated U-Net for Extremely Dark Image Enhancement. 231:1-231:19 - Mingfu Xiong

, Kaikang Hu
, Zhihan Lyu
, Fei Fang
, Zhongyuan Wang
, Ruimin Hu, Khan Muhammad
:
Inter-camera Identity Discrimination for Unsupervised Person Re-identification. 232:1-232:18 - Jiaqi Yu

, Jinhai Yang
, Hua Yang
, Renjie Pan
, Pingrui Lai
, Guangtao Zhai
:
Psychology-Guided Environment Aware Network for Discovering Social Interaction Groups from Videos. 233:1-233:23 - Qi Liu

, Xinchen Liu
, Kun Liu
, Xiaoyan Gu
, Wu Liu
:
SigFormer: Sparse Signal-guided Transformer for Multi-modal Action Segmentation. 234:1-234:22 - Jun Lyu

, Shouang Yan
, M. Shamim Hossain
:
DBGAN: Dual Branch Generative Adversarial Network for Multi-Modal MRI Translation. 235:1-235:22 - Dejun Zhang

, Mian Zhang
, Xuefeng Tan
, Jun Liu
:
Bridging the Domain Gap in Scene Flow Estimation via Hierarchical Smoothness Refinement. 236:1-236:21 - Ning Chen

, Zhipeng Cheng
, Xuwei Fan
, Zhang Liu
, Bangzhen Huang
, Yifeng Zhao
, Lianfen Huang
, Xiaojiang Du
, Mohsen Guizani
:
Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception. 237:1-237:28 - Jiayu Yang

, Chunhui Yang
, Fei Xiong
, Yongqi Zhai
, Ronggang Wang
:
Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement. 238:1-238:21 - Xiaoling Gu

, Junkai Zhu
, Yongkang Wong
, Zizhao Wu
, Jun Yu
, Jianping Fan
, Mohan S. Kankanhalli
:
Recurrent Appearance Flow for Occlusion-Free Virtual Try-On. 239:1-239:17 - Yuanjie Lyu

, Penggang Qin
, Tong Xu
, Chen Zhu
, Enhong Chen
:
InteractNet: Social Interaction Recognition for Semantic-rich Videos. 240:1-240:21 - Mrinmoy Bhattacharjee

, S. R. Mahadeva Prasanna
, Prithwijit Guha
:
Exploration of Speech and Music Information for Movie Genre Classification. 241:1-241:19 - Sara Sarto

, Marcella Cornia
, Lorenzo Baraldi
, Alessandro Nicolosi
, Rita Cucchiara
:
Towards Retrieval-Augmented Architectures for Image Captioning. 242:1-242:22 - Kaihui Yang

, Junwei Han
, Guangyu Guo
, Chaowei Fang
, Yingzi Fan
, Lechao Cheng
, Dingwen Zhang
:
Progressive Adapting and Pruning: Domain-Incremental Learning for Saliency Prediction. 243:1-243:19 - Lv Tang

, Xinfeng Zhang
:
High Efficiency Deep-learning Based Video Compression. 244:1-244:23 - Pedro Gomes

, Silvia Rossi
, Laura Toni
:
AGAR - Attention Graph-RNN for Adaptative Motion Prediction of Point Clouds of Deformable Objects. 245:1-245:25 - Jiabo Ye

, Junfeng Tian
, Ming Yan
, Haiyang Xu
, Qinghao Ye
, Yaya Shi
, Xiaoshan Yang
, Xuwu Wang
, Ji Zhang
, Liang He
, Xin Lin
:
UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet. 246:1-246:28 - Wei Zhou

, Qi Yang
, Wu Chen
, Qiuping Jiang
, Guangtao Zhai
, Weisi Lin
:
Blind Quality Assessment of Dense 3D Point Clouds with Structure Guided Resampling. 247:1-247:21 - Yuli Zhao

, Yin Zhang, Francis C. M. Lau
, Hai Yu
, Zhiliang Zhu
, Bin Zhang:
Expanding-Window Zigzag Decodable Fountain Codes for Scalable Multimedia Transmission. 248:1-248:24 - Xuanyu Jin

, Ni Li
, Wanzeng Kong
, Jiajia Tang
, Bing Yang
:
Unbiased Semantic Representation Learning Based on Causal Disentanglement for Domain Generalization. 249:1-249:20 - Bo Peng

, Lin Sun
, Jianjun Lei
, Bingzheng Liu
, Haifeng Shen
, Wanqing Li
, Qingming Huang
:
Self-Supervised Monocular Depth Estimation via Binocular Geometric Correlation Learning. 250:1-250:19 - Yang Yang

, Shuailong Qiu
, Lanling Zeng
, Zhigeng Pan
:
Detail-preserving Joint Image Upsampling. 251:1-251:23 - Xiao Kang

, Xingbo Liu
, Wen Xue
, Xiushan Nie
, Yilong Yin
:
Online Cross-modal Hashing With Dynamic Prototype. 252:1-252:18 - Yuqing Yang

, Boris Joukovsky
, José Oramas Mogrovejo
, Tinne Tuytelaars
, Nikos Deligiannis
:
SNIPPET: A Framework for Subjective Evaluation of Visual Explanations Applied to DeepFake Detection. 253:1-253:29 - Jinwang Pan

, Xianming Liu
, Yuanchao Bai
, Deming Zhai
, Junjun Jiang
, Debin Zhao
:
Illumination-Aware Low-Light Image Enhancement with Transformer and Auto-Knee Curve. 254:1-254:23 - Lohic Fotio Tiotsop

, Antonio Servetti
, Peter Pocta
, Glenn Van Wallendael
, Marcus Barkowsky
, Enrico Masala
:
Multiple Image Distortion DNN Modeling Individual Subject Quality Assessment. 255:1-255:27 - Yunhui Xu

, Youru Li
, Muhao Xu
, Zhenfeng Zhu
, Yao Zhao
:
HKA: A Hierarchical Knowledge Alignment Framework for Multimodal Knowledge Graph Completion. 256:1-256:19 - Li Zhou

, Zhenyu Liu
, Yutong Li
, Yuchi Duan
, Huimin Yu
, Bin Hu
:
Multi Fine-Grained Fusion Network for Depression Detection. 257:1-257:23 - Chenlei Lv

, Dan Zhang
, Shengling Geng
, Zhongke Wu
, Hui Huang
:
Color Transfer for Images: A Survey. 258:1-258:29 - Zhihao Zhang

, Jun Wang
, Shengjie Li
, Lei Jin
, Hao Wu
, Jian Zhao
, Bo Zhang
:
Review and Analysis of RGBT Single Object Tracking Methods: A Fusion Perspective. 259:1-259:27 - Muhammad Bilal Shaikh

, Douglas Chai
, Syed Mohammed Shamsul Islam
, Naveed Akhtar
:
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey. 260:1-260:24 - Yuankun Liu

, Xiang Yuan
, Haochen Li
, Zhijie Tan
, Jinsong Huang
, Jingjie Xiao
, Weiping Li
, Tong Mo
:
SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval. 261:1-261:28
Volume 20, Number 9, September 2024
- Bo Chen

, Zhisheng Yan
, Klara Nahrstedt
:
Context-aware Optimization for Bandwidth-Efficient Image Analytics Offloading. 262:1-262:22 - Quentin Guimard

, Lucile Sassatelli
, Francesco Marchetti
, Federico Becattini
, Lorenzo Seidenari
, Alberto Del Bimbo
:
Deep Variational Learning for 360° Adaptive Streaming. 263:1-263:25 - Yu Zheng

, Wenchao Zhang
, Wei Song
, Xiuhua Wang
, Chong Fu
:
Encrypted Video Search with Single/Multiple Writers. 264:1-264:23 - Haihan Duan

, Junhua Liao
, Lehao Lin
, Abdulmotaleb El-Saddik
, Wei Cai
:
Meetor: A Human-Centered Automatic Video Editing System for Meeting Recordings. 265:1-265:23 - Na Li

, Yao Liu
:
VertexShuffle-Based Spherical Super-Resolution for 360-Degree Videos. 266:1-266:17 - Guilherme de A. P. Marques, José Matheus Carvalho Boaro, Antonio José G. Busson, Álan L. V. Guedes, Julio Cesar Duarte, Sérgio Colcher:

Action Segmentation through Self-Supervised Video Features and Positional-Encoded Embeddings. 267:1-267:23 - Sara Vlahovic

, Ivan Slivar
, Matko Silic
, Lea Skorin-Kapov
, Mirko Suznjevic
:
Exploring the Facets of the Multiplayer VR Gaming Experience. 268:1-268:24 - Bekir Oguzhan Turkkan

, Ting Dai
, Adithya Raman
, Tevfik Kosar
, Changyou Chen
, Muhammed Fatih Bulut
, Jaroslav Zola
, Daby Sow
:
GreenABR+: Generalized Energy-Aware Adaptive Bitrate Streaming. 269:1-269:24 - Zhiming Hu

, Mete Kemertas
, Lan Xiao
, Caleb Phillips
, Iqbal Mohomed
, Afsaneh Fazly
:
Realizing Efficient On-Device Language-based Image Retrieval. 270:1-270:18 - Amit Hirway

, Yuansong Qiao
, Niall Murray
:
A Quality of Experience and Visual Attention Evaluation for 360° Videos with Non-spatial and Spatial Audio. 271:1-271:20 - Cheonjin Park

, Chinmaey Shende
, Subhabrata Sen
, Bing Wang
:
C2: ABR Streaming in Cognizant of Consumption Context for Improved QoE and Resource Usage Tradeoffs. 272:1-272:27
Volume 20, Number 10, October 2024
- Walayat Hussain

, Honghao Gao
, Rafiul Karim
, Abdulmotaleb El-Saddik
:
Seventeen Years of the ACM Transactions on Multimedia Computing, Communications and Applications: A Bibliometric Overview. 297:1-297:22
- Bowen Yuan

, Jiahao Lu
, Sisi You
, Bing-Kun Bao
:
Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-Identification. 298:1-298:20 - Sixian Chan

, Xianpeng Zeng
, Xinhua Wang
, Jie Hu
, Cong Bai
:
Auxiliary Feature Fusion and Noise Suppression for HOI Detection. 299:1-299:18 - Yefan Li

, Fuqing Duan
, Ke Lu
:
Gated Multi-Modal Edge Refinement Network for Light Field Salient Object Detection. 300:1-300:20 - Dongze Hao

, Qunbo Wang
, Xinxin Zhu
, Jing Liu
:
HCCL: Hierarchical Counterfactual Contrastive Learning for Robust Visual Question Answering. 301:1-301:21 - Jun Jia

, Zhongpai Gao
, Yiwei Yang
, Wei Sun
, Dandan Zhu
, Xiaohong Liu
, Xiongkuo Min
, Guangtao Zhai
:
Hidden Barcode in Sub-Images with Invisible Locating Marker. 302:1-302:24 - Junxin Lu

, Yongbin Gao
, Jieyu Chen
, Jenq-Neng Hwang
, Hamido Fujita
, Zhijun Fang
:
Monocular Depth and Ego-motion Estimation with Scale Based on Superpixel and Normal Constraints. 303:1-303:26 - Zhenjiang Guo

, Xiaohai He
, Yu Yang
, Linbo Qing
, Honggang Chen
:
DAG-YOLO: A Context-Feature Adaptive fusion Rotating Detection Network in Remote Sensing Images. 304:1-304:24 - Yong Zhou

, Zeming Xie
, Jiaqi Zhao
, Wen-Liang Du
, Rui Yao
, Abdulmotaleb El-Saddik
:
Multi-Modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception. 305:1-305:20 - Yuanyuan Wang

, Meng Liu
, Xuemeng Song
, Liqiang Nie
:
Harnessing Representative Spatial-Temporal Information for Video Question Answering. 306:1-306:20 - Guibiao Liao

, Wei Gao
:
Rethinking Feature Mining for Light Field Salient Object Detection. 307:1-307:24 - Chao Liang

, Linchao Zhu
, Zongxin Yang
, Wei Chen
, Yi Yang:
Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data. 308:1-308:19 - Yitao Peng

, Lianghua He
, Die Hu
, Yihang Liu
, Longzhen Yang
, Shaohua Shang
:
Decoupling Deep Learning for Enhanced Image Recognition Interpretability. 309:1-309:24 - Baoli Sun

, Yanjun Guo
, Tiantian Yan
, Xinchen Ye
, Zhihui Wang
, Haojie Li
, Zhiyong Wang
:
Digging into Depth and Color Spaces: A Mapping Constraint Network for Depth Super-Resolution. 310:1-310:20 - Michael Seufert

, Marius Spangenberger
, Fabian Poignée
, Florian Wamser
, Werner Robitza
, Christian Timmerer
, Tobias Hoßfeld
:
COBIRAS: Offering a Continuous Bit Rate Slide to Maximize DASH Streaming Bandwidth Utilization. 311:1-311:24 - Zhangyong Tang

, Tianyang Xu
, Xiao-Jun Wu
, Josef Kittler
:
Multi-Level Fusion for Robust RGBT Tracking via Enhanced Thermal Representation. 312:1-312:24 - Hanyue Tu

, Li Li
, Wengang Zhou
, Houqiang Li
:
Reconstruction-Free Image Compression for Machine Vision via Knowledge Transfer. 313:1-313:19 - Gai Zhang

, Xinfeng Zhang
, Lv Tang
:
Unified and Scalable Deep Image Compression Framework for Human and Machine. 314:1-314:22 - Fengyong Li

, Huajun Zhai
, Teng Liu
, Xinpeng Zhang
, Chuan Qin
:
Learning Compressed Artifact for JPEG Manipulation Localization Using Wide-Receptive-Field Network. 315:1-315:23 - Shukang Yin

, Sirui Zhao
, Hao Wang
, Tong Xu
, Enhong Chen
:
Exploiting Instance-level Relationships in Weakly Supervised Text-to-Video Retrieval. 316:1-316:21 - Kayhan Latifzadeh

, Nima Gozalpour
, V. Javier Traver
, Tuukka Ruotsalo
, Aleksandra Kawala-Sterniuk
, Luis A. Leiva
:
Efficient Decoding of Affective States from Video-elicited EEG Signals: An Empirical Investigation. 317:1-317:24 - Ziyue Wu

, Junyu Gao
, Shucheng Huang
, Changsheng Xu
:
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding. 318:1-318:22 - Daniele Lorenzi

, Farzad Tashtarian
, Hermann Hellwagner
, Christian Timmerer
:
MEDUSA: A Dynamic Codec Switching Approach in HTTP Adaptive Streaming. 319:1-319:23 - Ruoyan Pi

, Peng Wu
, Xiangteng He
, Yuxin Peng
:
EOGT: Video Anomaly Detection with Enhanced Object Information and Global Temporal Dependency. 320:1-320:21 - Shengbin Yue

, Yunbin Tu
, Liang Li
, Shengxiang Gao
, Zhengtao Yu
:
Multi-Grained Representation Aggregating Transformer with Gating Cycle for Change Captioning. 321:1-321:23 - Jingjing Wu

, Xi Zhou
, Xiaohong Li
, Hao Liu
, Meibin Qi
, Richang Hong
:
Asymmetric Deformable Spatio-temporal Framework for Infrared Object Tracking. 322:1-322:24 - Zhenyu Li

, Shanshan Gao
, Deqian Mao
, Shouwen Song
, Lei Li
, Yuanfeng Zhou
:
Deep Plug-and-Play Non-Iterative Cluster for 3D Global Feature Extraction. 323:1-323:18 - Mingfu Xue

, Yinghao Wu
, Leo Yu Zhang
, Dujuan Gu
, Yushu Zhang
, Weiqiang Liu
:
SSAT: Active Authorization Control and User's Fingerprint Tracking Framework for DNN IP Protection. 324:1-324:24 - Yongkang Li

, Qifan Liang
, Zhen Han
, Wenjun Mai
, Zhongyuan Wang
:
Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation. 325:1-325:24 - Shuqin Chen

, Xian Zhong
, Yi Zhang
, Lei Zhu
, Ping Li
, Xiaokang Yang
, Bin Sheng
:
Action-aware Linguistic Skeleton Optimization Network for Non-autoregressive Video Captioning. 326:1-326:24 - Yancun Yang

, Weiqing Min
, Jingru Song
, Guorui Sheng
, Lili Wang
, Shuqiang Jiang
:
Lightweight Food Recognition via Aggregation Block and Feature Encoding. 327:1-327:25 - Huaijin Liu

, Jixiang Du
, Yong Zhang
, Hongbo Zhang
, Jiandian Zeng
:
MSSA: Multi-Representation Semantics-Augmented Set Abstraction for 3D Object Detection. 328:1-328:23 - Vinicius Atsushi Sato Kawai

, Lucas Pascotti Valem
, Alexandro Baldassin
, Edson Borin
, Daniel Carlos Guimarães Pedronette
, Longin Jan Latecki
:
Rank-based Hashing for Effective and Efficient Nearest Neighbor Search for Image Retrieval. 329:1-329:19
Volume 20, Number 11, November 2024
- Ritesh Vyas

, Michele Nappi
, Alberto Del Bimbo
, Sambit Bakshi
:
Introduction to Special Issue on "Recent Trends in Multimedia Forensics". 330:1-330:7 - Vincenzo Carletti

, Pasquale Foggia
, Antonio Greco
, Alessia Saggese
, Mario Vento
:
Facial Soft-biometrics Obfuscation through Adversarial Attacks. 331:1-331:21 - Hanrui Wang

, Shuo Wang
, Cunjian Chen
, Massimo Tistarelli
, Zhe Jin
:
A Multi-Task Adversarial Attack against Face Authentication. 332:1-332:24 - Tian Wu

, Rongbo Zhu
, Shaohua Wan
:
Semantic Map Guided Identity Transfer GAN for Person Re-identification. 333:1-333:20 - Dhiran Kumar Mahto

, Amit Kumar Singh
, Kedar Nath Singh
, Om Prakash Singh
, Amrit Kumar Agrawal
:
Robust Copyright Protection Technique with High-embedding Capacity for Color Images. 334:1-334:12 - S. Shitharth

, Hariprasath Manoharan
, Alaa O. Khadidos
, Achyut Shankar
, Carsten Maple
, Adil Omar Khadidos
, Shahid Mumtaz
:
Improved Security for Multimedia Data Visualization using Hierarchical Clustering Algorithm. 335:1-335:21 - Youqiang Sun

, Jianyi Liu
, Ru Zhang
:
Generative Image Steganography Based on Guidance Feature Distribution. 336:1-336:18 - Paarth Neekhara

, Shehzeen Hussain
, Xinqiao Zhang
, Ke Huang
, Julian J. McAuley
, Farinaz Koushanfar
:
FaceSigns: Semi-fragile Watermarks for Media Authentication. 337:1-337:21 - Jing Zhao

, Hongwei Yang
, Hui He
, Jie Peng
, Weizhe Zhang
, Jiangqun Ni
, Arun Kumar Sangaiah
, Aniello Castiglione
:
Backdoor Two-Stream Video Models on Federated Learning. 338:1-338:20 - Farkhund Iqbal

, Ahmed Abbasi
, Abdul Rehman Javed
, Ahmad S. Almadhor
, Zunera Jalil
, Sajid Anwar
, Imad Rida
:
Data Augmentation-based Novel Deep Learning Method for Deepfaked Images Detection. 339:1-339:15 - Kaihan Lin

, Weihong Han
, Shudong Li
, Zhaoquan Gu
, Huimin Zhao
, Yangyang Mei
:
Detecting Deepfake Videos using Spatiotemporal Trident Network. 340:1-340:20 - Ijaz Ul Haq

, Khalid Mahmood Malik
, Khan Muhammad
:
Multimodal Neurosymbolic Approach for Explainable Deepfake Detection. 341:1-341:16 - Federico Becattini

, Carmen Bisogni
, Vincenzo Loia
, Chiara Pero
, Fei Hao
:
Head Pose Estimation Patterns as Deepfake Detectors. 342:1-342:24 - Luca Guarnera

, Oliver Giudice
, Sebastiano Battiato
:
Mastering Deepfake Detection: A Cutting-edge Approach to Distinguish GAN and Diffusion-model Images. 343:1-343:24 - Aakash Varma Nadimpalli

, Ajita Rattani
:
ProActive DeepFake Detection using GAN-based Visible Watermarking. 344:1-344:27 - Bachir Kaddar

, Sid Ahmed Fezza
, Zahid Akhtar
, Wassim Hamidouche
, Abdenour Hadid
, Joan Serra-Sagristà
:
Deepfake Detection Using Spatiotemporal Transformer. 345:1-345:21 - Shuai Xiao

, Zhuo Zhang
, Jiachen Yang
, Jiabao Wen
, Yang Li
:
Forgery Detection by Weighted Complementarity between Significant Invariance and Detail Enhancement. 346:1-346:20 - Paola Capasso

, Giuseppe Cattaneo
, Maria De Marsico
:
A Comprehensive Survey on Methods for Image Integrity. 347:1-347:34
- Yunfang Niu

, Lingxiang Wu
, Yufeng Zhang
, Yousong Zhu
, Guibo Zhu
, Jinqiao Wang
:
Multi-Model Style-Aware Diffusion Learning for Semantic Image Synthesis. 348:1-348:21 - Jingzheng Li

, Hailong Sun
, Lei Chai
, Jiyi Li
:
Target Structure Learning Framework for Unsupervised Multi-Class Domain Adaptation. 349:1-349:23 - Chih-Fan Hsu

, Yi-Chen Li
, Chung-Chi Tsai
, Jian-Kai Wang
, Cheng-Hsin Hsu
:
Federated Learning Using Multi-Modal Sensors with Heterogeneous Privacy Sensitivity Levels. 350:1-350:27 - Hengwei Li

, Wei Wang
, Xiao Wang
, Xin Yuan
, Xin Xu
:
Blind 3D Video Stabilization with Spatio-Temporally Varying Motion Blur. 351:1-351:23 - Shunan Mao

, Hao Chen
, Yaowei Wang
, Wei Zeng
, Shiliang Zhang
:
TPTE: Text-Guided Patch Token Exploitation for Unsupervised Fine-Grained Representation Learning. 352:1-352:18 - Aditya Panda

, Dipti Prasad Mukherjee
:
Knowledge Guided Transformer Network for Compositional Zero-Shot Learning. 353:1-353:25 - Wei-Yen Hsu

, Yu-Yu Hsu
:
Multi-Scale and Multi-Layer Lattice Transformer for Underwater Image Enhancement. 354:1-354:24 - Tengfei Shi

, Chenglizhao Chen
, Zhenyu Wu
, Aimin Hao
, Yuming Fang
:
Improving Image Aesthetic Assessment via Multiple Image Joint Learning. 355:1-355:24 - Gaurang Bansal

, Aditya Nawal
, Vinay Chamola
, Norbert Herencsar
:
Revolutionizing Visuals: The Role of Generative AI in Modern Image Generation. 356:1-356:22 - Yujie Li

, Xuekai Wei
, Xiaofeng Liao
, You Zhao
, Fan Jia
, Xu Zhuang
, Mingliang Zhou
:
A Deep Retinex-Based Low-Light Enhancement Network Fusing Rich Intrinsic Prior Information. 357:1-357:23 - Weimin Shi

, Dehong Gao
, Yuan Xiong
, Zhong Zhou
:
QR-CLIP: Introducing Explicit Knowledge for Location and Time Reasoning. 358:1-358:22 - Feiyang Liu

, Kun Li
, Zhun Zhong
, Wei Jia
, Bin Hu
, Xun Yang
, Meng Wang
, Dan Guo
:
Depth Matters: Spatial Proximity-Based Gaze Cone Generation for Gaze Following in Wild. 359:1-359:24 - Yonghui Wang

, Shaokai Liu
, Li Li
, Wengang Zhou
, Houqiang Li
:
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection. 360:1-360:20 - Xin Liu

, Chao Hao
, Zitong Yu
, Huanjing Yue
, Jingyu Yang
:
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation. 361:1-361:19 - Jiaxing Wen

, Aohong Shen
, Zhen Han
, Zhongyuan Wang
, Liang Chen
:
Cross-Modal Face Super-Resolution Based on Quasi-Siamese Domain Transfer Fusion Network. 362:1-362:23
Volume 20, Number 12, December 2024
- Hongbin Wang

, Rui Tang
, Fan Li
:
Hypercube Pooling for Visual Semantic Embedding. 363:1-363:17 - Fei Wang

, Liang Ding
, Jun Rao
, Ye Liu
, Li Shen
, Changxing Ding
:
Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? 364:1-364:22 - Caixia Liu

, Yali Chen
, Minhong Zhu
, Chenhui Hao
, Hai-Sheng Li
, Xiaochuan Wang
:
DEGAN: Detail-Enhanced Generative Adversarial Network for Monocular Depth-Based 3D Reconstruction. 365:1-365:17 - Dan Song

, Shumeng Huo
, Xinwei Fu
, Chu-Meng Zhang
, Wenhui Li
, An-An Liu
:
Cross-Modal Contrastive Learning with a Style-Mixed Bridge for Single Image 3D Shape Retrieval. 366:1-366:24 - Ting-Lan Lin

, Bing-Wei Su
, Po-Cheng Shen
, Ding-Yuan Chen
, Chi-Fu Liang
, Yan-Cheng Chen
, Yangming Wen
, Mohammad Shahid
:
Upsampling Algorithm for V-PCC-Coded 3D Point Clouds. 367:1-367:23 - Yuanzhi Wang

, Yong Li
, Xiaoya Zhang
, Xin Liu
, Anbo Dai
, Antoni B. Chan
, Zhen Cui
:
Edit Temporal-Consistent Videos with Image Diffusion Model. 368:1-368:16 - Luis Álvarez

, Agustín Trujillo
, Nelson Monzón
, Jean-Michel Morel
:
Generation and Editing of 2D Shapes Using a Branched Representation. 369:1-369:25 - Xingbo Liu

, Jiamin Li
, Xiushan Nie
, Xuening Zhang
, Yilong Yin
:
Fast Unsupervised Cross-Modal Hashing with Robust Factorization and Dual Projection. 370:1-370:21 - Yongheng Zhang

, Yuanqiang Cai
, Danfeng Yan
, Rongheng Lin
:
Real-World Scene Image Enhancement with Contrastive Domain Adaptation Learning. 371:1-371:23 - Chunqiang Yu

, Shichao Cheng
, Xianquan Zhang
, Xinpeng Zhang
, Zhenjun Tang
:
Reversible Data Hiding in Shared JPEG Images. 372:1-372:24 - Boqian Liu

, Haojie Li
, Zhihui Wang
, Tianfan Xue
:
Transparent Depth Completion Using Segmentation Features. 373:1-373:19 - Yongtang Bao

, Chunjian Su
, Yutong Qi
, Yanbing Geng
, Haojie Li
:
Category-Level Pose Estimation and Iterative Refinement for Monocular RGB-D Image. 374:1-374:20 - Kuiyuan Sun

, Xiaolong Liu, Xiaolong Li, Yao Zhao
, Wei Wang
:
Multi-Modal Driven Pose-Controllable Talking Head Generation. 375:1-375:23 - Bing Liu

, Jinfu Lu
, Mingming Liu
, Hao Liu
, Yong Zhou
, Dongping Yang
:
Diverse Image Captioning via Panoptic Segmentation and Sequential Conditional Variational Transformer. 376:1-376:17 - Veronika Stephanie

, Ibrahim Khalil
, Mohammed Atiquzzaman
:
Weight-Based Privacy-Preserving Asynchronous SplitFed for Multimedia Healthcare Data. 377:1-377:24 - Chuanhao Li

, Chenchen Jing
, Zhen Li
, Yuwei Wu
, Yunde Jia
:
Adversarial Sample Synthesis for Visual Question Answering. 378:1-378:24 - Shipeng Zhu

, Jun Fang
, Pengfei Fang
, Hui Xue
:
Improving Scene Text Retrieval via Stylized Middle Modality. 379:1-379:18 - Xiao Liang

, Erkun Yang
, Cheng Deng
, Yanhua Yang
:
CrossFormer: Cross-Modal Representation Learning via Heterogeneous Graph Transformer. 380:1-380:21 - Jiayu Lin

, Yuan-Gen Wang
:
TSFormer: Tracking Structure Transformer for Image Inpainting. 381:1-381:23 - Yixuan Li

, Peilin Chen
, Hanwei Zhu
, Keyan Ding
, Leida Li
, Shiqi Wang
:
Deep Shape-Texture Statistics for Completely Blind Image Quality Evaluation. 382:1-382:21 - Zhenyu Zhou

, Qing Liao
, Lei Luo
, Xinwang Liu
, En Zhu
:
ProtoRefine: Enhancing Prototypes with Similar Structure in Few-Shot Learning. 383:1-383:24 - Mengzhu Yu

, Zhenjun Tang
, Xiaoping Liang
, Xianquan Zhang
, Zhixin Li
, Xinpeng Zhang
:
Robust Hashing with Deep Features and Meixner Moments for Image Copy Detection. 384:1-384:23 - Jiabei Liu

, Weiming Zhuang
, Yuanyuan Liu
, Yonggang Wen
, Jun Huang
, Wei Lin
:
Personalized Federated Mutual Learning for Unsupervised Camera-Aware Person Re-Identification. 385:1-385:19 - Yiyang Ma

, Haowei Kuang
, Huan Yang
, Jianlong Fu
, Jiaying Liu
:
Prompt-Based Modality Bridging for Unified Text-to-Face Generation and Manipulation. 386:1-386:23 - Peilin Chen

, Shiqi Wang
, Zhu Li
:
Occupancy Map Guided Attributes Artifacts Removal for Video-Based Point Cloud Compression. 387:1-387:20 - Yunda Sun

, Lin Zhang
, Zhong Wang
, Yang Chen
, Shengjie Zhao
, Yicong Zhou
:
I2P Registration by Learning the Underlying Alignment Feature Space from Pixel-to-Point Similarities. 388:1-388:21 - Daniel Gebre

, Siem Hadish
, Aron Sbhatu
, Moayad Aloqaily
, Mohsen Guizani
:
Establishing Trust and Security in Decentralized Metaverse: A Web 3.0 Approach. 389:1-389:17 - Yangjun Mao

, Jun Xiao
, Dong Zhang
, Meng Cao
, Jian Shao
, Yueting Zhuang
, Long Chen
:
Improving Reference-Based Distinctive Image Captioning with Contrastive Rewards. 390:1-390:24 - Shenglan Li

, Rui Yao
, Yong Zhou
, Hancheng Zhu
, Jiaqi Zhao
, Zhiwen Shao
, Abdulmotaleb El-Saddik
:
Motion-Aware Self-Supervised RGBT Tracking with Multi-Modality Hierarchical Transformers. 391:1-391:23 - Jun Ling

, Han Xue
, Anni Tang
, Rong Xie
, Li Song
:
ViCoFace: Learning Disentangled Latent Motion Representations for Visual-Consistent Face Reenactment. 392:1-392:24 - Jiachen Li

, Qing Xie
, Xiaojun Chang
, Jinyu Xu
, Yongjian Liu
:
Mutually-Guided Hierarchical Multi-Modal Feature Learning for Referring Image Segmentation. 393:1-393:18 - Fatima Alshehri

, Ghulam Muhammad
:
Ischemic Stroke Segmentation by Transformer and Convolutional Neural Network Using Few-Shot Learning. 394:1-394:21
- Kamran Gholizadeh HamlAbadi

, Fedwa Laamarti
, Abdulmotaleb El-Saddik
:
Meta-Review on Brain-Computer Interface (BCI) in the Metaverse. 395:1-395:42
