


IEEE Transactions on Multimedia, Volume 25
Volume 25, 2023
- Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin: RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering. 1-12
- Yu Wang, Shiwei Chen: Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion. 13-23
- Jiayi Xie, Yaochen Zhu, Zhenzhong Chen: Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck. 24-37
- Zhicheng Guo, Jiaxuan Zhao, Licheng Jiao, Xu Liu, Fang Liu: A Universal Quaternion Hypergraph Network for Multimodal Video Question Answering. 38-49
- Xiao Lin, Shuzhou Sun, Wei Huang, Bin Sheng, Ping Li, David Dagan Feng: EAPT: Efficient Attention Pyramid Transformer for Image Processing. 50-61
- Zhi Li, Haoliang Li, Xin Luo, Yongjian Hu, Kwok-Yan Lam, Alex C. Kot: Asymmetric Modality Translation for Face Presentation Attack Detection. 62-76
- Wei Lu, Desheng Li, Liqiang Nie, Peiguang Jing, Yuting Su: Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification. 77-89
- Yun Wang, Tong Zhang, Chuanwei Zhou, Zhen Cui, Jian Yang: Instance-Aware Deep Graph Learning for Multi-Label Classification. 90-99
- Jae Young Choi, Bumshik Lee: Combining Deep Convolutional Neural Networks With Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild. 100-111
- Zerui Shao, Yifei Pu, Jiliu Zhou, Bihan Wen, Yi Zhang: Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling on-the-Fly for Moving Object Detection. 112-125
- Yajing Liu, Zhiwei Xiong, Ya Li, Xinmei Tian, Zheng-Jun Zha: Domain Generalization Via Encoding and Resampling in a Unified Latent Space. 126-139
- Hangwei Chen, Xiongli Chai, Feng Shao, Xuejin Wang, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho: Perceptual Quality Assessment of Cartoon Images. 140-153
- Yang Li, Shengbin Meng, Xinfeng Zhang, Meng Wang, Shiqi Wang, Yue Wang, Siwei Ma: User-Generated Video Quality Assessment: A Subjective and Objective Study. 154-166
- Yan Yang, Jun Yu, Jian Zhang, Weidong Han, Hanliang Jiang, Qingming Huang: Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation. 167-178
- Hancheng Zhu, Yong Zhou, Leida Li, Yaqian Li, Yandong Guo: Learning Personalized Image Aesthetics From Subjective and Objective Attributes. 179-190
- Jun Cheng, Fusheng Hao, Fengxiang He, Liu Liu, Qieshi Zhang: Mixer-Based Semantic Spread for Few-Shot Learning. 191-202
- Haojie Yuan, Qi Chu, Feng Zhu, Rui Zhao, Bin Liu, Nenghai Yu: AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks. 203-213
- Zefan Li, Bingbing Ni, Xiaokang Yang, Wenjun Zhang, Wen Gao: Residual Quantization for Low Bit-Width Neural Networks. 214-227
- Zhaoliang Chen, Jie Yao, Guobao Xiao, Shiping Wang: Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation. 228-242
- Tong Xue, Abdallah El Ali, Tianyi Zhang, Gangyi Ding, Pablo César: CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360$^\circ$ VR Videos. 243-255
- Gaosheng Liu, Huanjing Yue, Jiamin Wu, Jing-Yu Yang: Intra-Inter View Interaction Network for Light Field Image Super-Resolution. 256-266
- Zhihao Wu, Jie Wen, Yong Xu, Jian Yang, David Zhang: Multiple Instance Detection Networks With Adaptive Instance Refinement. 267-279
- Yanhua Yang, Xiaozhe Zhang, Muli Yang, Cheng Deng: Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning. 280-290
- Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu: Dual-Awareness Attention for Few-Shot Object Detection. 291-301
- Laizhong Cui, Erchao Ni, Yipeng Zhou, Zhi Wang, Lei Zhang, Jiangchuan Liu, Yuedong Xu: Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution. 302-314
- Sutong Wang, Jiacheng Zhu, Yunqiang Yin, Dujuan Wang, T. C. Edwin Cheng, Yanzhang Wang: Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal. 315-328
- Zhihao Zhang, Xianqiang Yang, Chao Xu: Natural Image Stitching With Layered Warping Constraint. 329-338
- Hao Tang, Guoshuai Zhao, Yuxia Wu, Xueming Qian: Multisample-Based Contrastive Loss for Top-K Recommendation. 339-351
- Ke Zhang, Chun Yuan, Yiming Zhu, Yong Jiang, Lishu Luo: Weakly Supervised Instance Segmentation by Exploring Entire Object Regions. 352-363
- Astha Verma, A. Venkata Subramanyam, Zheng Wang, Shin'ichi Satoh, Rajiv Ratn Shah: Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation. 364-377
- Carlos M. Lentisco, Luis Bellido, Andrés Cárdenas, Ricardo Flores Moyano, David Fernández: Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery. 378-388
- Huicong Wu, Liang Xiao, Le Sun, Byeungwoo Jeon: A Novel Video Stabilization Model With Motion Morphological Component Priors. 389-404
- Xuehao Gao, Yang Yang, Yimeng Zhang, Maosen Li, Jin-Gang Yu, Shaoyi Du: Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition. 405-417
- Cheng Xue, Xionghu Zhong, Minjie Cai, Hao Chen, Wenwu Wang: Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. 418-429
- Guang Han, Jinpeng Su, Yaoming Liu, Yuqiu Zhao, Sam Kwong: Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network. 430-442
- Lei Yu, Bishan Wang, Jingwei He, Gui-Song Xia, Wen Yang: Single Image Deraining With Continuous Rain Density Estimation. 443-456
- Jianjun Xiang, Gangyi Jiang, Mei Yu, Zhidi Jiang, Yo-Sung Ho: No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform. 457-472
- Mehdi Rahmati, Zhuoran Qi, Dario Pompili: Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems. 473-485
- Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Guodong Guo, Qixiang Ye, Jianbin Jiao, Jian Zhao, Zhenjun Han: Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. 486-500
- Yujie Huang, Ming-e Jing, Jinjia Zhou, Yuhao Liu, Yibo Fan: LCCStyle: Arbitrary Style Transfer With Low Computational Complexity. 501-514
- Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen: Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation. 515-528
- Luntian Mou, Chao Zhou, Pengtao Xie, Pengfei Zhao, Ramesh C. Jain, Wen Gao, Baocai Yin: Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion. 529-542
- Wenhui Li, Yan Wang, Yuting Su, Xuanya Li, An-An Liu, Yongdong Zhang: Multi-Scale Fine-Grained Alignments for Image and Sentence Matching. 543-556
- Yongqiang Kong, Yunhong Wang, Annan Li, Qiuyu Huang: Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection. 557-571
- Qinchuan Zhang, Yi Jiang, Qin Zhou, Yiru Zhao, Yao Liu, Hongtao Lu, Xian-Sheng Hua: Single Person Dense Pose Estimation via Geometric Equivariance Consistency. 572-583
- Kailun Zhou, Liping Zhao, Zigao Ye, Huihui Wang, Tao Lin, Sheng Feng, Yufen Yang: Equal Value String and Copy Above String Based String Prediction for SCC in AVS3. 584-592
- Maja Krivokuca, Ehsan Miandji, Christine Guillemot, Philip A. Chou: Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms. 593-607
- Xiaoqing Luo, Yuanhao Gao, Anqi Wang, Zhancheng Zhang, Xiaojun Wu: IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning. 608-623
- Shihao Xu, Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu: Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition. 624-634
- Huabing Zhou, Wei Wu, Yanduo Zhang, Jiayi Ma, Haibin Ling: Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network. 635-648
- Ming Li, Bin Fu, Zhengfu Zhang, Yu Qiao: Character-Aware Sampling and Rectification for Scene Text Recognition. 649-661
- Mingyue Su, Guanghua Gu, Xianlong Ren, Hao Fu, Yao Zhao: Semi-Supervised Knowledge Distillation for Cross-Modal Hashing. 662-675
- Lei Zhu, Xiaoqiang Wang, Ping Li, Xin Yang, Qing Zhang, Weiming Wang, Carola-Bibiane Schönlieb, C. L. Philip Chen: S$^3$Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. 676-689
- Xinjue Hu, Yuxuan Pan, Yumei Wang, Lin Zhang, Shervin Shirmohammadi: Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression. 690-705
- Le Wang, Qing Li, Sanping Zhou, Nanning Zheng: Multi-Panda Tracking. 706-720
- Changsheng Gao, Dong Liu, Li Li, Feng Wu: Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics. 721-735
- Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou, Mingliang Xu, Changsheng Xu: User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning. 736-749
- Xiao Tan, Huaian Chen, Kai Xu, Yi Jin, Changan Zhu: Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes. 750-763
- Zhen Bai, Zhi Liu, Gongyang Li, Yang Wang: Adaptive Group-Wise Consistency Network for Co-Saliency Detection. 764-776
- Chenghu Du, Feng Yu, Minghua Jiang, Ailing Hua, Xiong Wei, Tao Peng, Xinrong Hu: VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment. 777-791
- Shiji Zhou, Zhi Wang, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang, Chuan Wu, Wenwu Zhu: Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. 792-804
- Shuyi Li, Bob Zhang, Lunke Fei, Shuping Zhao, Yicong Zhou: Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition. 805-815
- Wenxue Cui, Shaohui Liu, Feng Jiang, Debin Zhao: Image Compressed Sensing Using Non-Local Neural Network. 816-830
- Nastaran Nourbakhsh Kaashki, Pengpeng Hu, Adrian Munteanu: Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction. 831-844
- Xiaoyan Cai, Sen Liu, Junwei Han, Libin Yang, Zhenguo Liu, Tianming Liu: ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization. 845-855
- Xuemeng Song, Shi-Ting Fang, Xiaolin Chen, Yinwei Wei, Zhongzhou Zhao, Liqiang Nie: Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling. 856-867
- Jie Nie, Zian Zhao, Lei Huang, Weizhi Nie, Zhiqiang Wei: Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion. 868-880
- Haimin Zhang, Min Xu: Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning. 881-891
- Fei Peng, Bo Long, Min Long: A Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy. 892-906
- Karam Park, Jae Woong Soh, Nam Ik Cho: A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution. 907-918
- Ming Li, Jun Liu, Ce Zheng, Xinming Huang, Ziming Zhang: Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. 919-929
- Liyuan Ma, Kejie Huang, Dongxu Wei, Zhaoyan Ming, Haibin Shen: FDA-GAN: Flow-Based Dual Attention GAN for Human Pose Transfer. 930-941
- Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V. S. Subrahmanian: M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion. 942-952
- Prasen Kumar Sharma, Arun Abraham, Vikram Nelvoy Rajendiran: A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics. 953-965
- Fan Zhao, Wenda Zhao, Huimin Lu, Yong Liu, Libo Yao, Yu Liu: Depth-Distilled Multi-Focus Image Fusion. 966-978
- Xuanhan Wang, Yuyu Guo, Jingkuan Song, Lianli Gao, Heng Tao Shen: AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences. 979-992
- Tiejian Zhang, Xinwang Liu, Lei Gong, Siwei Wang, Xin Niu, Li Shen: Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization. 993-1007
- Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao: Consistent Multiple Graph Embedding for Multi-View Clustering. 1008-1018
- Jingjing Xiong, Lai-Man Po, Wing Yin Yu, Yuzhi Zhao, Kwok-Wai Cheung: Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation. 1019-1032
- Wei Qin, Hanwang Zhang, Richang Hong, Ee-Peng Lim, Qianru Sun: Causal Interventional Training for Image Recognition. 1033-1044
- Shikun Li, Tongliang Liu, Jiyong Tan, Dan Zeng, Shiming Ge: Trustable Co-Label Learning From Multiple Noisy Annotators. 1045-1057
- Jiebo Luo: Editorial. 1058-1059
- Yonggang Wen: Editorial. 1060
- Wenqian Wang, Faliang Chang, Chunsheng Liu, Guangxin Li, Bin Wang: GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition. 1061-1073
- Qifan Wang, Yinwei Wei, Jianhua Yin, Jianlong Wu, Xuemeng Song, Liqiang Nie: DualGNN: Dual Graph Neural Network for Multimedia Recommendation. 1074-1084
- Xiaoping Liang, Zhenjun Tang, Jingli Wu, Zhixin Li, Xinpeng Zhang: Robust Image Hashing With Isomap and Saliency Map for Copy Detection. 1085-1097
- Shuping Zhao, Lunke Fei, Jie Wen, Jigang Wu, Bob Zhang: Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering. 1098-1110
- Shixiang Wu, Chao Dong, Yu Qiao: Blind Image Restoration Based on Cycle-Consistent Network. 1111-1124
- Jose Jaena Mari Ople, Tai-Ming Huang, Ming-Chih Chiu, Yi-Ling Chen, Kai-Lung Hua: Adjustable Model Compression Using Multiple Genetic Algorithm. 1125-1132
- Le Wang, Mo Zhou, Zhenxing Niu, Qilin Zhang, Nanning Zheng: Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding. 1133-1147
- Weide Liu, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin: Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation. 1148-1160
- Ziqiang Wang, Zhi Liu, Gongyang Li, Yang Wang, Tianhong Zhang, Lihua Xu, Jijun Wang: Spatio-Temporal Self-Attention Network for Video Saliency Prediction. 1161-1174
- Rui Wang, Jun Liu, Qiuhong Ke, Duo Peng, Yinjie Lei: Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition. 1175-1189
- Cheng Wang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen: Person Search by a Bi-Directional Task-Consistent Learning Model. 1190-1203
- Jipeng Wu, Rongrong Ji, Qiang Wang, Shengchuan Zhang, Xiaoshuai Sun, Yan Wang, Mingliang Xu, Feiyue Huang: Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. 1204-1216
- Di Wang, Caiping Zhang, Quan Wang, Yumin Tian, Lihuo He, Lin Zhao: Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval. 1217-1229
- Min Cao, Cong Ding, Chen Chen, Hao Dou, Xiyuan Hu, Junchi Yan: Progressive Context-Aware Graph Feature Learning for Target Re-Identification. 1230-1242
- Yuting Su, Wei Zhao, Peiguang Jing, Liqiang Nie: Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions. 1243-1255
- Gaoang Wang, Yizhou Wang, Renshu Gu, Weijie Hu, Jenq-Neng Hwang: Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking. 1256-1268
- Qiao Liu, Di Yuan, Nana Fan, Peng Gao, Xin Li, Zhenyu He: Learning Dual-Level Deep Representation for Thermal Infrared Tracking. 1269-1281
- Wenhao Li, Hong Liu, Runwei Ding, Mengyuan Liu, Pichao Wang, Wenming Yang: Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation. 1282-1293
- Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang: Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification. 1294-1305
- Zhe Tang, Yi Yang, Wen Li, Defu Lian, Lixin Duan: Deep Cross-Attention Network for Crowdfunding Success Prediction. 1306-1319
- Kun Zhang, Zhendong Mao, An-An Liu, Yongdong Zhang: Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching. 1320-1332
- Dongnan Liu, Chaoyi Zhang, Yang Song, Heng Huang, Chenyu Wang, Michael Barnett, Tom Weidong Cai: Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement. 1333-1344
- Bin Chen, Kunhong Liu, Yong Xu, Qingqiang Wu, Junfeng Yao: Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition. 1345-1358
- Yingjian Li, Zheng Zhang, Bingzhi Chen, Guangming Lu, David Zhang: Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition. 1359-1373
- Jianjun Sun, Yan Zhao, Shigang Wang, Jian Wei: 3D Holoscopic Image Compression Based on Gaussian Mixture Model. 1374-1389
- Huan Liu, Wentao Liu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang: Fast Human Pose Estimation in Compressed Videos. 1390-1400
- Yujian Feng, Yimu Ji, Fei Wu, Guangwei Gao, Yang Gao, Tianliang Liu, Shangdong Liu, Xiao-Yuan Jing, Jiebo Luo: Occluded Visible-Infrared Person Re-Identification. 1401-1413
- Haoyu Zhao, Qi Wang, Guowei Zhan, Weidong Min, Yi Zou, Shimiao Cui: Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes. 1414-1426
- Jianjun Qian, Shumin Zhu, Chaoyu Zhao, Jian Yang, Wai Keung Wong: OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation. 1427-1438
- Tianyu Shen, Deqi Li, Fei-Yue Wang, Hua Huang: Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations. 1439-1451
- Qianqian Yu, Keqi Fan, Yuhui Zheng: Domain Adaptive Transformer Tracking Under Occlusions. 1452-1461
- Zhihao Liu, Yuanyuan Shang, Timing Li, Guanlin Chen, Yu Wang, Qinghua Hu, Pengfei Zhu: Robust Multi-Drone Multi-Target Tracking to Resolve Target Occlusion: A Benchmark. 1462-1476
- Zhijing Yang, Junyang Chen, Yukai Shi, Hao Li, Tianshui Chen, Liang Lin: OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup. 1477-1488
- Kunyu Peng, Alina Roitberg, Kailun Yang, Jiaming Zhang, Rainer Stiefelhagen: Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions. 1489-1504
- Guangwei Gao, Lei Tang, Fei Wu, Huimin Lu, Jian Yang: JDSR-GAN: Constructing an Efficient Joint Learning Network for Masked Face Super-Resolution. 1505-1512
- Puning Zhang, Fengyi Huang, Dapeng Wu, Boran Yang, Zhigang Yang, Lei Tan: Device-Edge-Cloud Collaborative Acceleration Method Towards Occluded Face Recognition in High-Traffic Areas. 1513-1520
- Qun Li, Ziyi Zhang, Feng Zhang, Fu Xiao: HRNeXt: High-Resolution Context Network for Crowd Pose Estimation. 1521-1528
- Chunjie Ma, Li Zhuo, Jiafeng Li, Yutong Zhang, Jing Zhang: Cascade Transformer Decoder Based Occluded Pedestrian Detection With Dynamic Deformable Convolution and Gaussian Projection Channel Attention Mechanism. 1529-1537
- Rui Wang, Yixue Hao, Long Hu, Jincai Chen, Min Chen, Di Wu: Self-Supervised Learning With Data-Efficient Supervised Fine-Tuning for Crowd Counting. 1538-1546
- Yun Lan, Ruimin Hu, Xin Xu, Dengshi Li, Chao Wang, Xiaochen Wang: From Collective Attribute Association of Groups to Precise Attribute Association of Individuals. 1547-1554
- Xingyu Yang, Mengya Han, Yong Luo, Han Hu, Yonggang Wen: Two-Stream Prototype Learning Network for Few-Shot Face Recognition Under Occlusions. 1555-1563
- Qinyang Zeng, Chengju Liu, Ming Liu, Qijun Chen: Contrastive 3D Human Skeleton Action Representation Learning via CrossMoCo With Spatiotemporal Occlusion Mask Data Augmentation. 1564-1574
- Jianping Gou, Xia Yuan, Baosheng Yu, Jiali Yu, Zhang Yi: Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for Visual Recognition. 1575-1583
- Zheng Cao, Liming Xu, Danny Z. Chen, Honghao Gao, Jian Wu: A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning. 1584-1591
- Junzhu Mao, Yazhou Yao, Zeren Sun, Xingguo Huang, Fumin Shen, Heng Tao Shen: Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device. 1592-1599
- Yun Li, Zhe Liu, Lina Yao, Xiaojun Chang: Attribute-Modulated Generative Meta Learning for Zero-Shot Learning. 1600-1610
- Mingjie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao: Cycle-Free Weakly Referring Expression Grounding With Self-Paced Learning. 1611-1621
- Yang Chen, Lin Zhang, Ying Shen, Brian Nlong Zhao, Yicong Zhou: Extrinsic Self-Calibration of the Surround-View System: A Weakly Supervised Approach. 1622-1635
- Rui Gao, Xingsong Hou, Jie Qin, Yuming Shen, Yang Long, Li Liu, Zhao Zhang, Ling Shao: Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning. 1649-1664
- Rui Wang, Zuxuan Wu, Zejia Weng, Jingjing Chen, Guo-Jun Qi, Yu-Gang Jiang: Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation. 1665-1673
- Peng Wu, Xiaotao Liu, Jing Liu: Weakly Supervised Audio-Visual Violence Detection. 1674-1685
- Jinlong Li, Zequn Jie, Xu Wang, Yu Zhou, Xiaolin Wei, Lin Ma: Weakly Supervised Semantic Segmentation Via Progressive Patch Learning. 1686-1699
- Yucheng Shu, Hengbo Li, Bin Xiao, Xiuli Bi, Weisheng Li: Cross-Mix Monitoring for Medical Image Segmentation With Limited Supervision. 1700-1712
- Bin Fan, Yuzhu Yang, Wensen Feng, Fuchao Wu, Jiwen Lu, Hongmin Liu: Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features. 1713-1726
- Tao Chen, Yazhou Yao, Lei Zhang, Qiong Wang, Guo-Sen Xie, Fumin Shen: Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation. 1727-1737
- Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao: Learning to Minimize the Remainder in Supervised Learning. 1738-1748
- Yuhang Zhang, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Haohang Xu, Qi Tian: Semi-Supervised Contrastive Learning With Similarity Co-Calibration. 1749-1759
- Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu: Weakly Supervised Regional and Temporal Learning for Facial Action Unit Recognition. 1760-1772
- Anran Zhang, Yandan Yang, Jun Xu, Xianbin Cao, Xiantong Zhen, Ling Shao: Latent Domain Generation for Unsupervised Domain Adaptation Object Counting. 1773-1783
- Pedro H. T. Gama, Hugo N. Oliveira, José Marcato Junior, Jefersson A. dos Santos: Weakly Supervised Few-Shot Segmentation via Meta-Learning. 1784-1797
- Xing Lan, Qinghao Hu, Jian Cheng: ATF: An Alternating Training Framework for Weakly Supervised Face Alignment. 1798-1809
- Xiaoliang Qian, Yinfeng Zeng, Wei Wang, Qiuwen Zhang: Co-Saliency Detection Guided by Group Weakly Supervised Learning. 1810-1818
- Zhigang Tu, Jiaxu Zhang, Hongyan Li, Yujin Chen, Junsong Yuan: Joint-Bone Fusion Graph Convolutional Network for Semi-Supervised Skeleton Action Recognition. 1819-1831
- Guoliang Hua, Hong Liu, Wenhao Li, Qian Zhang, Runwei Ding, Xin Xu: Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network. 1832-1843
- Zhuo Huang, Jian Yang, Chen Gong: They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning. 1844-1857
- Peipei Song, Dan Guo, Jun Cheng, Meng Wang: Contextual Attention Network for Emotional Video Captioning. 1858-1867
- Huifang Li, Yidong Li, Yuanzhouhan Cao, Yushan Han, Yi Jin, Yunchao Wei: Weakly Supervised Object Detection With Class Prototypical Network. 1868-1878
- Guangwei Gao, Yi Yu, Huimin Lu, Jian Yang, Dong Yue: Context-Patch Representation Learning With Adaptive Neighbor Embedding for Robust Face Image Super-Resolution. 1879-1889
- Yufei Yin, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li: FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection. 1890-1902
- Jun Kong, Xuefeng Tao, Min Jiang, Tianshan Liu: Weakly Supervised Distribution Discrepancy Minimization Learning With State Information for Person Re-Identification. 1903-1915
- Xiao Dong, Gengwei Zhang, Xunlin Zhan, Yi Ding, Yunchao Wei, Minlong Lu, Xiaodan Liang: Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization. 1916-1927
- Guodong Ding, Angela Yao: Temporal Action Segmentation With High-Level Complex Activity Labels. 1928-1939
- Cheng Qi, Zhiyong Feng, Meng Xing, Yong Su, Jinqing Zheng, Yiming Zhang: Energy-Based Temporal Summarized Attentive Network for Zero-Shot Action Recognition. 1940-1953
- Yuke Li, Pin Wang, Ching-Yao Chan: RESTEP Into the Future: Relational Spatio-Temporal Learning for Multi-Person Action Forecasting. 1954-1963
- Jialun Pei, Tianyang Cheng, He Tang, Chuanbo Chen: Transformer-Based Efficient Salient Instance Segmentation Networks With Orientative Query. 1964-1978
- Xian Zhong, Cheng Gu, Mang Ye, Wenxin Huang, Chia-Wen Lin: Graph Complemented Latent Representation for Few-Shot Image Classification. 1979-1990
- Yu Qiu, Yun Liu, Yanan Chen, Jianwen Zhang, Jinchao Zhu, Jing Xu: A2SPPNet: Attentive Atrous Spatial Pyramid Pooling Network for Salient Object Detection. 1991-2006
- Li Li, Zhu Li, Shan Liu, Houqiang Li: Plenoptic Point Cloud Compression Using Multiview Extension of High Efficiency Video Coding. 2007-2021
- Siwang Zhou, Xiaoning Deng, Chengqing Li, Yonghe Liu, Hongbo Jiang: Recognition-Oriented Image Compressive Sensing With Deep Learning. 2022-2032
- Zipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guo-Xin Zhang, Yong-Jin Liu: Audio-Driven Talking Face Video Generation With Dynamic Convolution Kernels. 2033-2046
- Chen Li, Li Song, Shuai Chen, Rong Xie, Wenjun Zhang: Deep Online Video Stabilization Using IMU Sensors. 2047-2060
- Yufan Hu, Junyu Gao, Changsheng Xu: Learning Scene-Aware Spatio-Temporal GNNs for Few-Shot Early Action Prediction. 2061-2073
- Mingjie Wang, Hao Cai, Xian-Feng Han, Jun Zhou, Minglun Gong: STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting. 2074-2084
- Ming Lu, Tong Chen, Zhenyu Dai, Dong Wang, Dandan Ding, Zhan Ma: Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement. 2097-2110
- Hongguang Zhang, Hongdong Li, Piotr Koniusz: Multi-Level Second-Order Few-Shot Learning. 2111-2126
- Wenli Song, Lei Zhang, Xinbo Gao: Compound Projection Learning for Bridging Seen and Unseen Objects. 2127-2139
- Yunxin Li, Qian Yang, Qingcai Chen, Baotian Hu, Xiaolong Wang, Yuxin Ding, Lin Ma: Fast and Robust Online Handwritten Chinese Character Recognition With Deep Spatial and Contextual Information Fusion Network. 2140-2152
- Jin Xie, Yanwei Pang, Jing Nie, Jiale Cao, Jungong Han: Latent Feature Pyramid Network for Object Detection. 2153-2163
- Min Wang, Wengang Zhou, Qi Tian, Houqiang Li: Deep Graph Convolutional Quantization Networks for Image Retrieval. 2164-2175
- Zelong Zeng, Zheng Wang, Fan Yang, Shin'ichi Satoh: Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval. 2176-2188
- Yu Pang, Chengdong Wu, Hao Wu, Xiaosheng Yu: Unsupervised Multi-Subclass Saliency Classification for Salient Object Detection. 2189-2202
- Haimin Zhang, Min Xu: Multiscale Emotion Representation Learning for Affective Image Recognition. 2203-2212
- Jiahao Zheng, Sen Zhang, Zilu Wang, Xiaoping Wang, Zhigang Zeng: Multi-Channel Weight-Sharing Autoencoder Based on Cascade Multi-Head Attention for Multimodal Emotion Recognition. 2213-2225
- Nan Jiang, Bin Sheng, Ping Li, Tong-Yee Lee: PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion. 2226-2238
- Xiao Li, Dong Zhang, Ming Li, Dah-Jye Lee: Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network. 2239-2251
- Yalan Ye, Tongjie Pan, Tonghoujun Luo, Jingjing Li, Heng Tao Shen: Learning MLatent Representations for Generalized Zero-Shot Learning. 2252-2265
- Min Meng, Mengcheng Lan, Jun Yu, Jigang Wu, Ligang Liu: Dual-Level Adaptive and Discriminative Knowledge Transfer for Cross-Domain Recognition. 2266-2279
- Debashri Roy, Yuanyuan Li, Tong Jian, Peng Tian, Kaushik Roy Chowdhury, Stratis Ioannidis: Multi-Modality Sensing and Data Fusion for Multi-Vehicle Detection. 2280-2295
- Zhejing Hu, Yan Liu, Gong Chen, Yongxu Liu: Can Machines Generate Personalized Music? A Hybrid Favorite-Aware Method for User Preference Music Transfer. 2296-2308
- Nayyer Aafaq, Ajmal Mian, Naveed Akhtar, Wei Liu, Mubarak Shah: Dense Video Captioning With Early Linguistic Information Fusion. 2309-2322
- Han Yan, Haijun Zhang, Linlin Liu, Dongliang Zhou, Xiaofei Xu, Zhao Zhang, Shuicheng Yan: Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators. 2323-2338
- Jiayao Shan, Sifan Zhou, Yubo Cui, Zheng Fang: Real-Time 3D Single Object Tracking With Transformer. 2339-2353
- Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao: STAM: A SpatioTemporal Attention Based Memory for Video Prediction. 2354-2367
- Dezhi Peng, Lianwen Jin, Weihong Ma, Canyu Xie, Hesuo Zhang, Shenggao Zhu, Jing Li: Recognition of Handwritten Chinese Text by Segmentation: A Segment-Annotation-Free Approach. 2368-2381
- Junke Wang, Shaoxiang Chen, Zuxuan Wu, Yu-Gang Jiang: FT-TDR: Frequency-Guided Transformer and Top-Down Refinement Network for Blind Face Inpainting. 2382-2392
- Yi-Xing Peng, Jile Jiao, Xuetao Feng, Wei-Shi Zheng: Consistent Discrepancy Learning for Intra-Camera Supervised Person Re-Identification. 2393-2403
- Lintai Wu, Yong Xu, Junhui Hou, C. L. Philip Chen, Cheng-Lin Liu: A Two-Level Rectification Attention Network for Scene Text Recognition. 2404-2414
- Hang Liu, Menghan Hu, Yuzhen Chen, Qingli Li, Guangtao Zhai, Simon X. Yang, Xiao-Ping Zhang, Xiaokang Yang: Angel's Girl for Blind Painters: An Efficient Painting Navigation System Validated by Multimodal Evaluation Approach. 2415-2429
- Huakui Zhang, Yi Cai, Haopeng Ren, Qing Li: Multimodal Topic Modeling by Exploring Characteristics of Short Text Social Media. 2430-2445
- Mengyang Sun, Wei Suo, Peng Wang, Yanning Zhang, Qi Wu: A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention. 2446-2458
- Jinkun You, Yuan-Gen Wang, Guopu Zhu, Ligang Wu, Hongli Zhang, Sam Kwong: Estimating the Secret Key of Spread Spectrum Watermarking Based on Equivalent Keys. 2459-2473
- Ziqiang Zheng, Yi Bin, Xiaoou Lv, Yang Wu, Yang Yang, Heng Tao Shen: Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation. 2474-2487
- Weihe Li, Jiawei Huang, Shiqi Wang, Chuliang Wu, Sen Liu, Jian-xin Wang: An Apprenticeship Learning Approach for Adaptive Video Streaming Based on Chunk Quality and User Preference. 2488-2502
- Xiaoya Zhang, Shumin Zhang, Zhen Cui, Zechao Li, Jin Xie, Jian Yang: Tube-Embedded Transformer for Pixel Prediction. 2503-2514
- Zhi Jin, Junjia Huang, Wenjin Wang, Aolin Xiong, Xiaojun Tan: Estimating Human Weight From a Single Image. 2515-2527
- Chuan Qin, Jinchuan Hu, Fengyong Li, Zhenxing Qian, Xinpeng Zhang: JPEG Image Encryption With Adaptive DC Coefficient Prediction and RS Pair Permutation. 2528-2542
- Lisha Wang, Chenglin Li, Wenrui Dai, Shaohui Li, Junni Zou, Hongkai Xiong: QoE-Driven Adaptive Streaming for Point Clouds. 2543-2558
- Mengmeng Jing, Lichao Meng, Jingjing Li, Lei Zhu, Heng Tao Shen: Adversarial Mixup Ratio Confusion for Unsupervised Domain Adaptation. 2559-2572
- Shaocan Liu, Xin Ma: Attention-Driven Appearance-Motion Fusion Network for Action Recognition. 2573-2584
- Farzad Tashtarian, Abdelhak Bentaleb, Alireza R. Erfanian, Hermann Hellwagner, Christian Timmerer, Roger Zimmermann: $\mathsf{HxL3}$: Optimized Delivery Architecture for HTTP Low-Latency Live Streaming. 2585-2600
- Xiaomei Zhang, Yingying Chen, Ming Tang, Jinqiao Wang, Xiangyu Zhu, Zhen Lei: Human Parsing With Part-Aware Relation Modeling. 2601-2612
- Chengrun Qiu, Dongheng Zhang, Yang Hu, Houqiang Li, Qibin Sun, Yan Chen: Radio-Assisted Human Detection. 2613-2623
- Pengfei Wang, Changxing Ding, Wentao Tan, Mingming Gong, Kui Jia, Dacheng Tao: Uncertainty-Aware Clustering for Unsupervised Domain Adaptive Object Re-Identification. 2624-2635
- Liyang Sun, Yixiang Mao, Tongyu Zong, Yong Liu, Yao Wang: Live 360 Degree Video Delivery Based on User Collaboration in a Streaming Flock. 2636-2647
- Han Fang, Zhaoyang Jia, Hang Zhou, Zehua Ma, Weiming Zhang: Encoded Feature Enhancement in Watermarking Network for Distortion in Real Scenes. 2648-2660
- Wei Wang, Junyu Gao, Xiaoshan Yang, Changsheng Xu: Many Hands Make Light Work: Transferring Knowledge From Auxiliary Tasks for Video-Text Retrieval. 2661-2674
- A. Sophia Koepke, Andreea-Maria Oncescu, João F. Henriques, Zeynep Akata, Samuel Albanie: Audio Retrieval With Natural Language Queries: A Benchmark Study. 2675-2685
- En Yu, Zhuoling Li, Shoudong Han, Hongwei Wang: RelationTrack: Relation-Aware Multiple Object Tracking With Decoupled Representation. 2686-2697
- Yi Dong, Xinghao Jiang, Zhaohong Li, Tanfeng Sun, Zhenzhen Zhang: Multi-Channel HEVC Steganography by Minimizing IPM Steganographic Distortions. 2698-2709
- Liang Chen, Jun Liu, Weidong Chen, Bo Du: A GLRT-Based Multi-Pixel Target Detector in Hyperspectral Imagery. 2710-2722
- Anyi Rao

, Linning Xu, Zhizhong Li
, Qingqiu Huang, Zhanghui Kuang, Wayne Zhang
, Dahua Lin:
A Coarse-to-Fine Framework for Automatic Video Unscreen. 2723-2733 - Shuo Liu

, Weize Quan
, Chaoqun Wang, Yuan Liu, Bin Liu
, Dong-Ming Yan
:
Dense Modality Interaction Network for Audio-Visual Event Localization. 2734-2748 - Shaokun Wang

, Tian Gan
, Yuan Liu
, Jianlong Wu
, Yuan Cheng, Liqiang Nie
:
Micro-Influencer Recommendation by Multi-Perspective Account Representation Learning. 2749-2760 - Desheng Cai, Shengsheng Qian

, Quan Fang, Jun Hu, Wenkui Ding
, Changsheng Xu
:
Heterogeneous Graph Contrastive Learning Network for Personalized Micro-Video Recommendation. 2761-2773 - Lingfeng Ma

, Hongtao Xie
, Chuanbin Liu
, Yongdong Zhang
:
Learning Cross-Channel Representations for Semantic Segmentation. 2774-2787 - Zhenxiao Luo

, Zelong Wang
, Miao Hu
, Yipeng Zhou
, Di Wu
:
LiveSR: Enabling Universal HD Live Video Streaming With Crowdsourced Online Learning. 2788-2798 - Qiyao Deng

, Qi Li
, Jie Cao
, Yunfan Liu
, Zhenan Sun
:
Semantic-Aware Noise Driven Portrait Synthesis and Manipulation. 2799-2811 - Yuanjie Dang

, Chong Huang, Peng Chen
, Ronghua Liang
, Xin Yang
, Kwang-Ting Cheng
:
Path-Analysis-Based Reinforcement Learning Algorithm for Imitation Filming. 2812-2824 - Feng Li, Yixuan Wu, Huihui Bai

, Weisi Lin
, Runmin Cong
, Yao Zhao
:
Learning Detail-Structure Alternative Optimization for Blind Super-Resolution. 2825-2838 - Zhangyu Chang

, S.-H. Gary Chan
:
Bi-Criteria Approximation for a Multi-Origin Multi-Channel Auto-Scaling Live Streaming Cloud. 2839-2850 - Yaxin Liu

, Jianlong Wu
, Leigang Qu, Tian Gan
, Jianhua Yin
, Liqiang Nie
:
Self-Supervised Correlation Learning for Cross-Modal Retrieval. 2851-2863 - Fan Chen

, Yaolin Yang, Hongjie He
, Yuan Yuan:
Adaptive Coding and Ordered-Index Extended Scrambling Based RDH in Encrypted Images. 2864-2875 - Ce Wang

, Dejia Xu
, Renjie Wan
, Bin He
, Boxin Shi
, Ling-Yu Duan
:
Background Scene Recovery From an Image Looking Through Colored Glass. 2876-2887 - Yongri Piao

, Wei Wu
, Miao Zhang
, Yongyao Jiang
, Huchuan Lu
:
Noise-Sensitive Adversarial Learning for Weakly Supervised Salient Object Detection. 2888-2897 - Laure Prétet

, Gaël Richard
, Clément Souchier, Geoffroy Peeters:
Video-to-Music Recommendation Using Temporal Alignment of Segments. 2898-2911 - Simeng Sun

, Tao Yu
, Jiahua Xu
, Wei Zhou
, Zhibo Chen
:
GraphIQA: Learning Distortion Graph Representations for Blind Image Quality Assessment. 2912-2925 - Cong Yu, Zhi Wu, Dongheng Zhang, Zhi Lu, Yang Hu, Yan Chen:

RFGAN: RF-Based Human Synthesis. 2926-2938 - Jie Li

, Cong Zhang
, Zhi Liu
, Richang Hong, Han Hu
:
Optimal Volumetric Video Streaming With Hybrid Saliency Based Tiling. 2939-2953 - Dechao Meng

, Liang Li
, Xuejing Liu
, Lin Gao
, Qingming Huang
:
Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID. 2954-2965 - Depeng Wang

, Zhenzhen Hu
, Yuanen Zhou
, Richang Hong
, Meng Wang
:
A Text-Guided Generation and Refinement Model for Image Captioning. 2966-2977 - Jie Huang

, Xueyang Fu
, Zeyu Xiao
, Feng Zhao
, Zhiwei Xiong
:
Low-Light Stereo Image Enhancement. 2978-2992 - Youguang Yu

, Wei Zhang
, Fuzheng Yang
, Ge Li
:
Rate-Distortion Optimized Geometry Compression for Spinning LiDAR Point Cloud. 2993-3005 - Kenan E. Ak

, Ying Sun
, Joo Hwee Lim
:
Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning. 3006-3016 - Yuzhi Zhao

, Lai-Man Po
, Wing Yin Yu
, Yasar Abbas Ur Rehman
, Mengyang Liu
, Yujia Zhang
, Weifeng Ou
:
VCGAN: Video Colorization With Hybrid Generative Adversarial Network. 3017-3032 - Pengfei Zhu

, Xinjie Yao
, Yu Wang
, Meng Cao, Binyuan Hui
, Shuai Zhao, Qinghua Hu
:
Latent Heterogeneous Graph Network for Incomplete Multi-View Learning. 3033-3045 - Na Li

, Xinbo Zhao
:
A Strong and Robust Skeleton-Based Gait Recognition Method With Gait Periodicity Priors. 3046-3058 - Yuchen Zhang

, Wenrui Dai
, Yong Li, Chenglin Li
, Junhui Hou
, Junni Zou
, Hongkai Xiong
:
Light Field Compression With Graph Learning and Dictionary-Guided Sparse Coding. 3059-3072 - Cheng-Hao Wu

, Chih-Fan Hsu
, Tzu-Kuan Hung, Carsten Griwodz
, Wei Tsang Ooi, Cheng-Hsin Hsu
:
Quantitative Comparison of Point Cloud Compression Algorithms With PCC Arena. 3073-3088 - Cunyi Lin

, Xianwei Rong
, Xiaoyan Yu
:
MSAFF-Net: Multiscale Attention Feature Fusion Networks for Single Image Dehazing and Beyond. 3089-3100 - Xiaoming Zhao

, Xingming Wu
, Jinyu Miao
, Weihai Chen
, Peter C. Y. Chen
, Zhengguo Li
:
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction. 3101-3112 - Yuxia Wu

, Lizi Liao
, Gangyi Zhang, Wenqiang Lei, Guoshuai Zhao
, Xueming Qian
, Tat-Seng Chua
:
State Graph Reasoning for Multimodal Conversational Recommendation. 3113-3124 - Xianxu Hou

, Xiaokang Zhang
, Hanbang Liang
, Linlin Shen
, Zhong Ming
:
Lifelong Age Transformation With a Deep Generative Prior. 3125-3139 - Yiming Li, Xiaoshan Yang, Xuhui Huang, Zhe Ma, Changsheng Xu

:
Zero-Shot Predicate Prediction for Scene Graph Parsing. 3140-3153 - Pengfei Wang

, Changxing Ding
, Zhiyin Shao, Zhibin Hong, Shengli Zhang
, Dacheng Tao
:
Quality-Aware Part Models for Occluded Person Re-Identification. 3154-3165 - Shu-Yu Chen

, Yu-Kun Lai
, Shihong Xia
, Paul L. Rosin
, Lin Gao
:
3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction. 3166-3179 - Shaojie Li

, Mingbao Lin
, Yan Wang
, Fei Chao
, Ling Shao
, Rongrong Ji
:
Learning Efficient GANs for Image Translation via Differentiable Masks and Co-Attention Distillation. 3180-3189 - Yutong Gao

, Liqian Liang
, Congyan Lang
, Songhe Feng
, Yidong Li
, Yunchao Wei
:
Clicking Matters: Towards Interactive Human Parsing. 3190-3203 - Yangbo Feng, Junyu Gao

, Changsheng Xu
:
Learning Dual-Routing Capsule Graph Neural Network for Few-Shot Video Classification. 3204-3216 - Ni Zhang

, Nian Liu
, Junwei Han
, Kaiyuan Wan, Ling Shao
:
Face De-Occlusion With Deep Cascade Guidance Learning. 3217-3229 - Xiaoke Li

, Zufan Zhang
, Chenquan Gan
, Yong Xiang
:
Multi-Label Speech Emotion Recognition via Inter-Class Difference Loss Under Response Residual Network. 3230-3244 - Peter Szabó, Anderson Augusto Simiscuka

, Stefano Masneri
, Mikel Zorrilla
, Gabriel-Miro Muntean
:
A CNN-Based Framework for Enhancing 360° VR Experiences With Multisensorial Effects. 3245-3258 - Guangwei Gao

, Guoan Xu, Juncheng Li
, Yi Yu
, Huimin Lu
, Jian Yang
:
FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation. 3273-3283 - Zeren Sun

, Yazhou Yao
, Xiu-Shen Wei, Fumin Shen
, Jian Zhang
, Xian-Sheng Hua
:
Boosting Robust Learning Via Leveraging Reusable Samples in Noisy Web Data. 3284-3295 - Nayu Liu

, Xian Sun
, Hongfeng Yu, Fanglong Yao
, Guangluan Xu
, Kun Fu
:
Abstractive Summarization for Video: A Revisit in Multistage Fusion Network With Forget Gate. 3296-3310 - Mehwish Ghafoor

, Arif Mahmood
:
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework. 3311-3318 - Lei Zhang

, Yingjun Du
, Jiayi Shen, Xiantong Zhen
:
Learning to Learn With Variational Inference for Cross-Domain Image Classification. 3319-3328 - Jian Xiong

, Hao Gao
, Miaohui Wang
, Hongliang Li
, King Ngi Ngan
, Weisi Lin
:
Efficient Geometry Surface Coding in V-PCC. 3329-3342 - Yahui Liu

, Yajing Chen, Linchao Bao
, Nicu Sebe
, Bruno Lepri
, Marco De Nadai
:
ISF-GAN: An Implicit Style Function for High-Resolution Image-to-Image Translation. 3343-3353 - Mingjie Sun

, Jimin Xiao
, Eng Gee Lim
, Yao Zhao
:
Starting Point Selection and Multiple-Standard Matching for Video Object Segmentation With Language Annotation. 3354-3363 - Lei Jin

, Xiaojuan Wang, Xuecheng Nie
, Luoqi Liu, Yandong Guo, Jian Zhao
:
Grouping by Center: Predicting Centripetal Offsets for the Bottom-up Human Pose Estimation. 3364-3374 - Tong Zhu

, Leida Li
, Jufeng Yang
, Sicheng Zhao
, Hantao Liu
, Jiansheng Qian:
Multimodal Sentiment Analysis With Image-Text Interaction Network. 3375-3385 - Kaiwen Yang

, Xinmei Tian
:
Domain-Class Correlation Decomposition for Generalizable Person Re-Identification. 3386-3396 - Weilun Wang

, Wengang Zhou
, Jianmin Bao, Houqiang Li
:
Coherent Image Animation Using Spatial-Temporal Correspondence. 3397-3408 - Xianxu Hou

, Xiaokang Zhang
, Yudong Li
, Linlin Shen
:
TextFace: Text-to-Style Mapping Based Face Generation and Manipulation. 3409-3419 - Qing Li

, Changqing Zhang
, Qinghua Hu
, Huazhu Fu
, Pengfei Zhu
:
Confidence-Aware Fusion Using Dempster-Shafer Theory for Multispectral Pedestrian Detection. 3420-3431 - Zhuangzi Li

, Ge Li
, Thomas H. Li, Shan Liu
, Wei Gao
:
Semantic Point Cloud Upsampling. 3432-3442 - Qi Liang

, Qiang Li, Weizhi Nie
, An-An Liu
:
Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval. 3443-3455 - Yunhao Zhou

, Yi Wang
, Lap-Pui Chau
:
Moving Towards Centers: Re-Ranking With Attention and Memory for Re-Identification. 3456-3468 - Lei Zhang

, Hua Huang
:
Image Stitching With Manifold Optimization. 3469-3482 - Wujie Zhou

, Enquan Yang, Jingsheng Lei, Jian Wan, Lu Yu:
PGDENet: Progressive Guided Fusion and Depth Enhancement Network for RGB-D Indoor Scene Parsing. 3483-3494 - Yi-Jen Shih

, Shih-Lun Wu
, Frank Zalkow
, Meinard Müller
, Yi-Hsuan Yang
:
Theme Transformer: Symbolic Music Generation With Theme-Conditioned Transformer. 3495-3508 - Jiayi Ma

, Yang Wang, Aoxiang Fan
, Guobao Xiao
, Riqing Chen
:
Correspondence Attention Transformer: A Context-Sensitive Network for Two-View Correspondence Learning. 3509-3524 - Fatemeh Nikoonezhad, Mohammed Ghanbari

:
PRAM: Penalized Resource Allocation Method for Video Services. 3525-3533 - Di Hu

, Zheng Wang
, Feiping Nie
, Rong Wang
, Xuelong Li
:
Self-Supervised Learning for Heterogeneous Audiovisual Scene Analysis. 3534-3545 - Songsong Wu

, Hao Tang
, Xiao-Yuan Jing, Haifeng Zhao
, Jianjun Qian
, Nicu Sebe
, Yan Yan:
Cross-View Panorama Image Synthesis. 3546-3559 - Shihao Zou

, Xinxin Zuo
, Sen Wang
, Yiming Qian, Chuan Guo
, Li Cheng
:
Human Pose and Shape Estimation From Single Polarization Images. 3560-3572 - Long Ma

, Risheng Liu
, Yiyang Wang
, Xin Fan
, Zhongxuan Luo:
Low-Light Image Enhancement via Self-Reinforced Retinex Projection Model. 3573-3586 - Chunhui Bao

, Qianru Sun
:
Generating Music With Emotions. 3602-3614 - Yunqing Li

, Jun Du
, Jianshu Zhang, Changjie Wu:
A Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction. 3615-3627 - Zichen Zhao

, Hai-Miao Hu
, Hongda Zhang
, Fei Chen, Qiang Guo
:
Improving Color Constancy Using Chromaticity-Line Prior. 3642-3656 - Chang Liu, Xudong Jiang

, Henghui Ding
:
Instance-Specific Feature Propagation for Referring Segmentation. 3657-3667 - Liang Han

, Zhaozheng Yin
:
Global Memory and Local Continuity for Video Object Detection. 3681-3693 - Md Mofijul Islam

, Mohammad Samin Yasar
, Tariq Iqbal
:
MAVEN: A Memory Augmented Recurrent Approach for Multimodal Fusion. 3694-3708 - Ercheng Pei

, Yong Zhao
, Meshia Cédric Oveneke
, Dongmei Jiang
, Hichem Sahli
:
A Bayesian Filtering Framework for Continuous Affect Recognition From Facial Images. 3709-3722 - Yiwei Ma

, Jiayi Ji
, Xiaoshuai Sun
, Yiyi Zhou
, Yongjian Wu, Feiyue Huang, Rongrong Ji
:
Knowing What it is: Semantic-Enhanced Dual Attention Transformer. 3723-3736 - Yuzhi Zhao

, Lai-Man Po
, Xuehui Wang
, Qiong Yan, Wei Shen
, Yujia Zhang
, Wei Liu, Chun Kit Wong, Chiu-Sing Pang, Weifeng Ou
, Wing Yin Yu
, Buhua Liu:
ChildPredictor: A Child Face Prediction Framework With Disentangled Learning. 3737-3752 - Tongtong Feng

, Qi Qi
, Jingyu Wang
, Jianxin Liao
, Jiangchuan Liu
:
Timely and Accurate Bitrate Switching in HTTP Adaptive Streaming With Date-Driven I-Frame Prediction. 3753-3762 - Tianyi Zhang

, Abdallah El Ali
, Alan Hanjalic
, Pablo César
:
Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals. 3773-3787 - Hengyue Bi

, Canhui Xu
, Cao Shi
, Guozhu Liu
, Yuteng Li
, Honghong Zhang
, Jing Qu
:
SRRV: A Novel Document Object Detector Based on Spatial-Related Relation and Vision. 3788-3798 - Minggang Gan, Yan Zhang

:
Temporal Attention-Pyramid Pooling for Temporal Action Detection. 3799-3810 - Xin Liu

, Jinhan Yi, Yiu-ming Cheung
, Xing Xu
, Zhen Cui
:
OMGH: Online Manifold-Guided Hashing for Flexible Cross-Modal Retrieval. 3811-3824 - Wujiang Xu

, Yifei Xu
, Genan Sang, Li Li, Aichen Wang
, Pingping Wei, Li Zhu
:
Recursive Multi-Relational Graph Convolutional Network for Automatic Photo Selection. 3825-3840 - Guanglei Yang

, Enrico Fini
, Dan Xu
, Paolo Rota
, Mingli Ding
, Hao Tang
, Xavier Alameda-Pineda
, Elisa Ricci
:
Continual Attentive Fusion for Incremental Learning in Semantic Segmentation. 3841-3854 - Li Li

, Zhu Li
, Shan Liu
, Houqiang Li
:
Frame-Level Rate Control for Geometry-Based LiDAR Point Cloud Compression. 3855-3867 - Hongrun Zhang

, Yanda Meng
, Yitian Zhao
, Xuesheng Qian
, Yihong Qiao
, Xiaoyun Yang, Yalin Zheng
:
3D Human Pose and Shape Reconstruction From Videos via Confidence-Aware Temporal Feature Aggregation. 3868-3880 - Weiming Yang

, Xianke Wang
, Bowen Tian
, Wei Xu
, Wenqing Cheng:
A Multi-Stage Automatic Evaluation System for Sight-Singing. 3881-3893 - Kehua Guo

, Changchun Shen
, Bin Hu
, Min Hu
, Xiaoyan Kui
:
RSNet: Relation Separation Network for Few-Shot Similar Class Recognition. 3894-3904 - Zhong Wang

, Lin Zhang
, Ying Shen
, Yicong Zhou
:
D-LIOM: Tightly-Coupled Direct LiDAR-Inertial Odometry and Mapping. 3905-3920 - Yunxiao Wang

, Meng Liu
, Yinwei Wei
, Zhiyong Cheng
, Yinglong Wang, Liqiang Nie
:
Siamese Alignment Network for Weakly Supervised Video Moment Retrieval. 3921-3933 - Tuxin Guan

, Chaofeng Li
, Ke Gu
, Hantao Liu
, Yuhui Zheng
, Xiaojun Wu
:
Visibility and Distortion Measurement for No-Reference Dehazed Image Quality Assessment via Complex Contourlet Transform. 3934-3949 - Tianwen Qian

, Jingjing Chen
, Shaoxiang Chen, Bo Wu
, Yu-Gang Jiang
:
Scene Graph Refinement Network for Visual Question Answering. 3950-3961 - Kejun Wu

, You Yang
, Qiong Liu
, Xiao-Ping Zhang
:
Focal Stack Image Compression Based on Basis-Quadtree Representation. 3975-3988 - Changwei Wang

, Rongtao Xu
, Shibiao Xu
, Weiliang Meng
, Xiaopeng Zhang
:
CNDesc: Cross Normalization for Local Descriptors Learning. 3989-4001 - Yao Xue

, Yu Cao
, Xubin Feng
, Meilin Xie, Ke Li
, Xingjun Zhang
, Xueming Qian
:
Towards Handling Sudden Changes in Feature Maps During Depth Estimation. 4002-4012 - Xiang Deng, Songhe Feng

, Gengyu Lyu, Tao Wang
, Congyan Lang
:
Beyond Word Embeddings: Heterogeneous Prior Knowledge Driven Multi-Label Image Classification. 4013-4025 - Chuntao Wang

, Tianjian Zhang, Hao Chen, Qiong Huang
, Jiangqun Ni
, Xinpeng Zhang
:
A Novel Encryption-Then-Lossy-Compression Scheme of Color Images Using Customized Residual Dense Spatial Network. 4026-4040 - Hengmin Zhang

, Feng Qian
, Bob Zhang
, Wenli Du
, Jianjun Qian
, Jian Yang
:
Incorporating Linear Regression Problems Into an Adaptive Framework With Feasible Optimizations. 4041-4051 - Jiafeng Li

, Yaopeng Li
, Li Zhuo
, Lingyan Kuang, Tianjian Yu:
USID-Net: Unsupervised Single Image Dehazing Network via Disentangled Representations. 3587-3601 - Bairong Li

, Biao Guo
, Yuesheng Zhu
, Jianfeng Yin, Xiangli Ji:
Superframe-Based Temporal Proposals for Weakly Supervised Temporal Action Detection. 3628-3641 - Jiaqi Zhao

, Hanzheng Wang
, Yong Zhou
, Rui Yao
, Silin Chen
, Abdulmotaleb El-Saddik
:
Spatial-Channel Enhanced Transformer for Visible-Infrared Person Re-Identification. 3668-3680 - Jiayi Ji

, Xiaoyang Huang
, Xiaoshuai Sun
, Yiyi Zhou
, Gen Luo
, Liujuan Cao
, Jianzhuang Liu
, Ling Shao
, Rongrong Ji
:
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning. 3962-3974 - Chengpei Xu

, Wenjing Jia
, Tingcheng Cui, Ruomei Wang
, Yuan-fang Zhang
, Xiangjian He
:
Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation. 4052-4066 - Wenlong Cheng

, Wei Tang, Yan Huang
, Yiwen Luo, Liang Wang
:
A Reconstruction-Based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval. 4067-4080 - Ming Li, Bin Fu

, Han Chen, Junjun He
, Yu Qiao
:
Dual Relation Network for Scene Text Recognition. 4094-4107 - Xin Deng

, Hao Wang, Mai Xu
, Li Li
, Zulin Wang:
Omnidirectional Image Super-Resolution via Latitude Adaptive Network. 4108-4120 - Sijie Mai

, Ying Zeng, Haifeng Hu
:
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations. 4121-4134 - Xiang Wen

, Shiwei Zhao
, Haobo Wang
, Runze Wu
, Manhu Qu, Tianlei Hu, Gang Chen, Jianrong Tao
, Changjie Fan:
Multi-Source Multi-Label Learning for User Profiling in Online Games. 4135-4147 - Yang Yang

, Hao Zheng
, Lanling Zeng, Xiangjun Shen
, Yongzhao Zhan
:
$L_{1}$-Regularized Reconstruction Model for Edge-Preserving Filtering. 4148-4162 - Zhengzheng Tu

, Yan Ma, Zhun Li
, Chenglong Li
, Jieming Xu, Yongtao Liu:
RGBT Salient Object Detection: A Large-Scale Dataset and Benchmark. 4163-4176 - Yu Zhou

, Weikang Gong, Yanjing Sun
, Leida Li
, Jinjian Wu
, Xinbo Gao
:
Pyramid Feature Aggregation for Hierarchical Quality Prediction of Stitched Panoramic Images. 4177-4186 - Lingxiang Yao

, Worapan Kusakunniran
, Peng Zhang, Qiang Wu
, Jian Zhang
:
Improving Disentangled Representation Learning for Gait Recognition Using Group Supervision. 4187-4198 - Chengpei Xu

, Wenjing Jia
, Ruomei Wang
, Xiaonan Luo
, Xiangjian He
:
MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection. 4199-4212 - Si Liu

, Renda Bao, Defa Zhu, Shaofei Huang
, Qiong Yan, Liang Lin
, Chao Dong
:
Fine-Grained Face Editing via Personalized Spatial-Aware Affine Modulation. 4213-4224 - Haodan Zhang

, Yixuan Ban
, Zongming Guo
, Ken Chen, Xinggong Zhang
:
RAM360: Robust Adaptive Multi-Layer 360° Video Streaming With Lyapunov Optimization. 4225-4239 - Hongyi Sun

, Wanhua Li
, Yueqi Duan
, Jie Zhou
, Jiwen Lu
:
Learning Adaptive Patch Generators for Mask-Robust Image Inpainting. 4240-4252 - Huiyu Duan

, Wei Shen
, Xiongkuo Min
, Yuan Tian
, Jae-Hyun Jung
, Xiaokang Yang
, Guangtao Zhai
:
Develop Then Rival: A Human Vision-Inspired Framework for Superimposed Image Decomposition. 4267-4281 - Bosheng Qin

, Haoji Hu
, Yueting Zhuang:
Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for Visual Question Answering. 4282-4295 - Shihui Zhang

, Dongxu Zuo
, Yongliang Yang
, Xiaowei Zhang
:
A Transferable Adversarial Belief Attack With Salient Region Perturbation Restriction. 4296-4306 - Dong Wei

, Xiaobo Shen
, Quansen Sun
, Xizhan Gao
, Zhenwen Ren
:
Sparse Representation Classifier Guided Grassmann Reconstruction Metric Learning With Applications to Image Set Analysis. 4307-4322 - Tongzhen Si

, Fazhi He
, Zhong Zhang
, Yansong Duan
:
Hybrid Contrastive Learning for Unsupervised Person Re-Identification. 4323-4334 - Xiao Wang

, Xiujun Shu
, Shiliang Zhang
, Bo Jiang
, Yaowei Wang
, Yonghong Tian
, Feng Wu:
MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking. 4335-4348 - Zhengyan Chen

, Hong Liu
, Linlin Zhang, Xin Liao
:
Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization. 4349-4360 - Jiacheng Chen

, Bin-Bin Gao
, Zongqing Lu, Jing-Hao Xue
, Chengjie Wang
, Qingmin Liao
:
APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation. 4361-4373 - Bowen Ma

, Tong Jia, Min Su, Xiaodong Jia, Dongyue Chen, Yichun Zhang:
Automated Segmentation of Prohibited Items in X-Ray Baggage Images Using Dense De-Overlap Attention Snake. 4374-4386 - Deyang Liu

, Yan Huang
, Yuming Fang
, Yifan Zuo
, Ping An
:
Multi-Stream Dense View Reconstruction Network for Light Field Image Compression. 4400-4414 - Liang Xu

, Cuiling Lan
, Wenjun Zeng
, Cewu Lu
:
Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition. 4415-4425 - Chaoqin Huang

, Qinwei Xu
, Yanfeng Wang, Yu Wang
, Ya Zhang
:
Self-Supervised Masking for Unsupervised Anomaly Detection and Localization. 4426-4438 - Xiaozhou Lei

, Zixiang Fei
, Wenju Zhou
, Huiyu Zhou
, Minrui Fei
:
Low-Light Image Enhancement Using the Cell Vibration Model. 4439-4454 - Kaijun Liu

, Shujing Lyu
, Yue Lu
:
Few-Shot Segmentation for Prohibited Items Inspection With Patch-Based Self-Supervised Learning and Prototype Reverse Validation. 4455-4463 - Aite Zhao

, Yue Wang, Jianbo Li:
Transferable Self-Supervised Instance Learning for Sleep Recognition. 4464-4477 - Souradeep Chakraborty

, Zijun Wei
, Conor Kelton, Seoyoung Ahn
, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras
:
Predicting Visual Attention in Graphic Design Documents. 4478-4493 - Sheng Liu

, Annan Li
, Jiahao Wang
, Yunhong Wang
:
Bidirectional Maximum Entropy Training With Word Co-Occurrence for Video Captioning. 4494-4507 - Sanchita Ghose

, John J. Prevost
:
FoleyGAN: Visually Guided Generative Adversarial Network-Based Synchronous Sound Generation in Silent Videos. 4508-4519 - Wentao Tan

, Lei Zhu
, Jingjing Li
, Huaxiang Zhang
, Junwei Han
:
Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval. 4520-4532 - Qi Liu

, Honglei Su
, Tianxin Chen
, Hui Yuan
, Raouf Hamzaoui
:
No-Reference Bitstream-Layer Model for Perceptual Quality Assessment of V-PCC Encoded Point Clouds. 4533-4546 - Jin Li

, Wanyun Li
, Zichen Xu
, Yuhao Wang
, Qiegen Liu
:
Wavelet Transform-Assisted Adaptive Generative Modeling for Colorization. 4547-4562 - Guangzhi Wang

, Yangyang Guo
, Ziwei Xu
, Yongkang Wong
, Mohan S. Kankanhalli
:
Semantic-Aware Triplet Loss for Image Classification. 4563-4572 - Sangwook Park, David K. Han

, Mounya Elhilali
:
Cross-Referencing Self-Training Network for Sound Event Detection in Audio Mixtures. 4573-4585 - Yusheng Tao

, Jian Zhang
, Jiajing Hong, Yuesheng Zhu
:
DREAMT: Diversity Enlarged Mutual Teaching for Unsupervised Domain Adaptive Person Re-Identification. 4586-4597 - Jingtao Xu

, Yali Li
, Shengjin Wang
:
AdaZoom: Towards Scale-Aware Large Scene Object Detection. 4598-4609 - Xuesong Wang

, Ke Jin
, Yi Kong
, C. L. Philip Chen
, Yuhu Cheng
:
Discriminator-Quality Evaluation GAN. 4081-4093 - Xiaolong Cheng

, Xuan Zheng
, Jialun Pei
, He Tang
, Zehua Lyu, Chuanbo Chen
:
Depth-Induced Gap-Reducing Network for RGB-D Salient Object Detection: An Interaction, Guidance and Refinement Approach. 4253-4266 - Xuena Ren

, Dongming Zhang
, Xiuguo Bao, Yongdong Zhang
:
S$^{2}$-Net: Semantic and Saliency Attention Network for Person Re-Identification. 4387-4399 - Pingyu Wang

, Zhicheng Zhao
, Fei Su
, Hongying Meng
:
LTReID: Factorizable Feature Generation With Independent Components for Long-Tailed Person Re-Identification. 4610-4622 - Wenbin Zou

, Liang Chen
, Yi Wu
, Yunchen Zhang, Yuxiang Xu, Jun Shao:
Joint Wavelet Sub-Bands Guided Network for Single Image Super-Resolution. 4623-4637 - Yihao Liu

, Jingwen He
, Xiangyu Chen
, Zhengwen Zhang, Hengyuan Zhao, Chao Dong
, Yu Qiao
:
Very Lightweight Photo Retouching Network With Conditional Sequential Modulation. 4638-4652 - Mingrui Zhang

, Mading Li, Jiahao Yu, Li Chen
:
Aesthetic Photo Collage With Deep Reinforcement Learning. 4653-4664 - Guanchen Ding

, Daiqin Yang
, Tao Wang, Sihan Wang, Yunfei Zhang
:
Crowd Counting via Unsupervised Cross-Domain Feature Adaptation. 4665-4678 - Qingping Sun

, Yi Xiao
, Jie Zhang, Shizhe Zhou
, Chi-Sing Leung
, Xin Su:
A Local Correspondence-Aware Hybrid CNN-GCN Model for Single-Image Human Body Reconstruction. 4679-4690 - Chuanyi Zhang

, Guosheng Lin
, Qiong Wang
, Fumin Shen
, Yazhou Yao
, Zhenmin Tang:
Guided by Meta-Set: A Data-Driven Method for Fine-Grained Visual Recognition. 4691-4703 - Yunan Li

, Huizhou Chen, Qiguang Miao
, Daohui Ge, Siyu Liang
, Zhuoqi Ma
, Bocheng Zhao:
Image Hazing and Dehazing: From the Viewpoint of Two-Way Image Translation With a Weakly Supervised Framework. 4704-4717 - Qitong Wang

, Bin Fu
, Ming Li, Junjun He
, Xi Peng
, Yu Qiao
:
Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion. 4718-4729 - Zhi Wu

, Dongheng Zhang
, Chunyang Xie
, Cong Yu
, Jinbo Chen
, Yang Hu
, Yan Chen
:
RFMask: A Simple Baseline for Human Silhouette Segmentation With Radio Signals. 4730-4741 - Hai Wang

, Wenming Yang
, Qingmin Liao
, Jie Zhou
:
Bi-RSTU: Bidirectional Recurrent Upsampling Network for Space-Time Video Super-Resolution. 4742-4751 - Hao Li

, Jinghui Qin
, Zhijing Yang
, Pengxu Wei
, Jinshan Pan
, Liang Lin
, Yukai Shi
:
Real-World Image Super-Resolution by Exclusionary Dual-Learning. 4752-4763 - Li Zhang, Tong Qiao

, Ming Xu
, Ning Zheng
, Shichuang Xie
:
Unsupervised Learning-Based Framework for Deepfake Video Detection. 4785-4799 - Xulun Ye

, Jieyu Zhao
:
Graph Convolutional Network With Unknown Class Number. 4800-4813 - Xiangyu Hu, Liquan Shen

, Mingxing Jiang
, Ran Ma
, Ping An
:
LA-HDR: Light Adaptive HDR Reconstruction Framework for Single LDR Image Considering Varied Light Conditions. 4814-4829 - Zhuo Chen, Fei Yin

, Qing Yang
, Cheng-Lin Liu
:
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic. 4830-4841 - Liping Nong

, Jie Peng
, Wenhui Zhang, Jiming Lin, Hongbing Qiu
, Junyi Wang
:
Adaptive Multi-Hypergraph Convolutional Networks for 3D Object Classification. 4842-4855 - Lei Qi

, Lei Wang
, Yinghuan Shi
, Xin Geng
:
A Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-Identification. 4856-4867 - Linhui Dai

, Xiang Song, Xiaohong Liu
, Chengqi Li, Zhihao Shi, Jun Chen
, Martin Brooks:
Enabling Trimap-Free Image Matting With a Frequency-Guided Saliency-Aware Network via Joint Learning. 4868-4879 - Junda Cheng, Xin Yang

, Yuechuan Pu
, Peng Guo
:
Region Separable Stereo Matching. 4880-4893 - Jiehang Xie

, Xuanbai Chen, Tianyi Zhang
, Yixuan Zhang, Shao-Ping Lu
, Pablo César
, Yulu Yang:
Multimodal-Based and Aesthetic-Guided Narrative Video Summarization. 4894-4908 - Di Wang

, Shuai Liu
, Quan Wang
, Yumin Tian
, Lihuo He
, Xinbo Gao
:
Cross-Modal Enhancement Network for Multimodal Sentiment Analysis. 4909-4921 - Wenfeng Pang

, Wei Xie
, Qianhua He
, Yanxiong Li
, Jichen Yang
:
Audiovisual Dependency Attention for Violence Detection in Videos. 4922-4932 - Chuangchuang Tan

, Guanghua Gu
, Tao Ruan
, Shikui Wei
, Yao Zhao
:
Dual-Gradients Localization Framework With Skip-Layer Connections for Weakly Supervised Object Localization. 4933-4942 - Kangjian He

, Xuejie Zhang
, Dan Xu
, Jian Gong
, Lisiqi Xie
:
Fidelity-Driven Optimization Reconstruction and Details Preserving Guided Fusion for Multi-Modality Medical Image. 4943-4957 - Hamed RahmaniKhezri

, Suhong Kim, Mohamed Hefeeda
:
Unsupervised Single-Image Reflection Removal. 4958-4971 - Lele Fu

, Zhaoliang Chen
, Yongyong Chen
, Shiping Wang
:
Unified Low-Rank Tensor Learning and Spectral Embedding for Multi-View Subspace Clustering. 4972-4985 - Dongliang Zhou

, Haijun Zhang
, Qun Li
, Jianghong Ma
, Xiaofei Xu:
COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette Masks and Fashion Styles. 4986-5001 - Jingjing Jiang

, Ziyi Liu
, Nanning Zheng
:
LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering. 5002-5013 - Yuchen Su

, Zhiwen Shao
, Yong Zhou
, Fanrong Meng, Hancheng Zhu
, Bing Liu
, Rui Yao
:
TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask. 5030-5042 - Tiesong Zhao

, Ying Fang, Kai Wang, Qian Liu
, Yuzhen Niu
:
High Efficiency Vibrotactile Codec Based on Gate Recurrent Network. 5043-5052 - Zhi Lu

, Yang Hu
, Cong Yu
, Yunchao Jiang, Yan Chen
, Bing Zeng
:
Personalized Fashion Recommendation With Discrete Content-Based Tensor Factorization. 5053-5064 - An-An Liu

, Heyu Zhou
, Xuanya Li
, Lanjun Wang
:
Vulnerability of Feature Extractors in 2D Image-Based 3D Object Retrieval. 5065-5076 - Sanaz Nami

, Farhad Pakdaman
, Mahmoud Reza Hashemi
, Shervin Shirmohammadi
:
BL-JUNIPER: A CNN-Assisted Framework for Perceptual Video Coding Leveraging Block-Level JND. 5077-5092 - Pengfei Guo

, Hantao Liu
, Delu Zeng
, Tao Xiang
, Leida Li
, Ke Gu
:
An Underwater Image Quality Assessment Metric. 5093-5106 - Zhulin Tao

, Xiaohao Liu
, Yewei Xia
, Xiang Wang
, Lifang Yang
, Xianglin Huang
, Tat-Seng Chua:
Self-Supervised Learning for Multimedia Recommendation. 5107-5116 - Devanshu Anand

, Mohammed Amine Togou
, Gabriel-Miro Muntean
:
A Machine Learning Solution for Video Delivery to Mitigate Co-Tier Interference in 5G HetNets. 5117-5129 - Weide Liu

, Chi Zhang, Henghui Ding
, Tzu-Yi Hung, Guosheng Lin
:
Few-Shot Segmentation With Optimal Transport Matching and Message Flow. 5130-5141 - Miao Zhang

, Shunyu Yao
, Beiqi Hu, Yongri Piao
, Wei Ji
:
C$^{2}$DFNet: Criss-Cross Dynamic Filter Network for RGB-D Salient Object Detection. 5142-5154 - Wei Zhai

, Yang Cao
, Haiyong Xie, Zheng-Jun Zha
:
Deep Texton-Coherence Network for Camouflaged Object Detection. 5155-5165 - Jiande Sun

, Fanfu Xue
, Jing Li, Lei Zhu
, Huaxiang Zhang
, Jia Zhang
:
TSINIT: A Two-Stage Inpainting Network for Incomplete Text. 5166-5177 - Haidong Qin

, Jing Li
, Yuqi Jiang
, Yanran Dai, Shikuan Hong
, Feng Zhou
, Zhijun Wang
, Tao Yang
:
Bullet-Time Video Synthesis Based on Virtual Dynamic Target Axis. 5178-5191 - Jiaqi Zhou

, Zehua Fu, Qiuyu Huang, Qingjie Liu
, Yunhong Wang
:
LgNet: A Local-Global Network for Action Recognition and Beyond. 5192-5205 - Xiaodi Guan

, Fan Li
, Yangfan Zhang, Pamela C. Cosman
:
End-to-End Blind Video Quality Assessment Based on Visual and Memory Attention Modeling. 5206-5221 - Yiqing Cai

, Zhenwei Ma, Changhong Lu, Changbo Wang
, Gaoqi He
:
Global Representation Guided Adaptive Fusion Network for Stable Video Crowd Counting. 5222-5233 - Junna Gao

, Dehui Kong
, Shaofan Wang
, Jinghua Li, Baocai Yin
:
DASI: Learning Domain Adaptive Shape Impression for 3D Object Reconstruction. 5248-5262 - Jingwen Hou

, Weisi Lin
, Guanghui Yue
, Weide Liu
, Baoquan Zhao
:
Interaction-Matrix Based Personalized Image Aesthetics Assessment. 5263-5278 - Nam Joon Kim, Hyun Kim

:
FP-AGL: Filter Pruning With Adaptive Gradient Learning for Accelerating Deep Convolutional Neural Networks. 5279-5290 - Hanqi Zhu

, Jiajun Deng
, Yu Zhang
, Jianmin Ji
, Qiuyu Mao, Houqiang Li
, Yanyong Zhang
:
VPFNet: Improving 3D Object Detection With Virtual Point Based LiDAR and Stereo Data Fusion. 5291-5304 - Lingyun Song

, Xuequn Shang
, Chen Yang
, Mingxuan Sun:
Attribute-Guided Multiple Instance Hashing Network for Cross-Modal Zero-Shot Hashing. 5305-5318 - Xianjing Han

, Xuemeng Song
, Xingning Dong, Yinwei Wei
, Meng Liu
, Liqiang Nie
:
DBiased-P: Dual-Biased Predicate Predictor for Unbiased Scene Graph Generation. 5319-5329 - Tianpeng Liu

, Jing Li
, Jia Wu
, Jun Chang
, Beihang Song, Bowen Yao:
Tracking With Mutual Attention Network. 5330-5343 - Yuhang Liu

, Wei Wei
, Daowan Peng, Xian-Ling Mao
, Zhiyong He, Pan Zhou
:
Depth-Aware and Semantic Guided Relational Attention Network for Visual Question Answering. 5344-5357 - Jianzhao Liu, Wei Zhou

, Xin Li
, Jiahua Xu
, Zhibo Chen
:
LIQA: Lifelong Blind Image Quality Assessment. 5358-5373 - Zhi Chen

, Yadan Luo
, Sen Wang
, Jingjing Li
, Zi Huang
:
GSMFlow: Generation Shifts Mitigating Flow for Generalized Zero-Shot Learning. 5374-5385 - Qi Zhang

, Jianchao Wei
, Shanshe Wang
, Siwei Ma
, Wen Gao:
RealVR: Efficient, Economical, and Quality-of-Experience-Driven VR Video System Based on MPEG OMAF. 5386-5399 - Lanxiao Wang

, Hongliang Li
, Wenzhe Hu
, Xiaoliang Zhang
, Heqian Qiu
, Fanman Meng
, Qingbo Wu
:
What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning. 5400-5412 - Wei Tang

, Fazhi He
, Yu Liu
:
YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer. 5413-5428 - Xianye Ben

, Chen Gong
, Tianhuan Huang
, Chuanye Li, Rui Yan, Yujun Li
:
Tackling Micro-Expression Data Shortage via Dataset Alignment and Active Learning. 5429-5443 - Ze Zhou

, Quansen Sun
, Hongjun Li
, Chaobo Li
, Zhenwen Ren
:
Regression-Selective Feature-Adaptive Tracker for Visual Object Tracking. 5444-5457 - Naishan Zheng

, Jie Huang
, Feng Zhao
, Xueyang Fu
, Feng Wu:
Unsupervised Underexposed Image Enhancement via Self-Illuminated and Perceptual Guidance. 5469-5484 - Xiaochuang Shu, Xiangdong Zhang

, Quanxue Gao
, Ming Yang
, Rong Wang
, Xinbo Gao:
Self-Weighted Anchor Graph Learning for Multi-View Clustering. 5485-5499 - Maregu Assefa

, Wei Jiang
, Kumie Gedamu
, Getinet Yilma
, Bulbula Kumeda
, Melese Ayalew
:
Self-Supervised Scene-Debiasing for Video Representation Learning via Background Patching. 5500-5515 - Zhi Lu

, Yang Hu
, Cong Yu
, Yan Chen
, Bing Zeng
:
Learning Fashion Compatibility With Context Conditioning Embedding. 5516-5526 - Xin Wei

, Yuyuan Yao, Haoyu Wang
, Liang Zhou
:
Perception-Aware Cross-Modal Signal Reconstruction: From Audio-Haptic to Visual. 5527-5538 - Chengliang Liu

, Zhihao Wu
, Jie Wen
, Yong Xu
, Chao Huang
:
Localized Sparse Incomplete Multi-View Clustering. 5539-5551 - Liming Zou

, Jing Li
, Wenbo Wan
, Q. M. Jonathan Wu
, Jiande Sun
:
Robust Coverless Image Steganography Based on Neglected Coverless Image Dataset Construction. 5552-5564 - Xixi Nie, Bo Hu

, Xinbo Gao
:
MLNet: A Multi-Domain Lightweight Network for Multi-Focus Image Fusion. 5565-5579 - Qianting Ma

, Yang Wang
, Tieyong Zeng
:
Retinex-Based Variational Framework for Low-Light Image Enhancement and Denoising. 5580-5588 - Huanjing Yue

, Yijia Cheng, Yan Mao, Cong Cao, Jing-Yu Yang
:
Recaptured Screen Image Demoiréing in Raw Domain. 5589-5600 - Jie Nie

, Chenglong Wang
, Shusong Yu
, Jinjin Shi, Xiaowei Lv, Zhiqiang Wei
:
MIGN: Multiscale Image Generation Network for Remote Sensing Image Semantic Segmentation. 5601-5613 - Lei Li

, Kai Fan
, Chun Yuan
:
StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks. 5614-5625 - Rui Li

, Danna Xue, Yu Zhu
, Hao Wu, Jinqiu Sun
, Yanning Zhang
:
Self-Supervised Monocular Depth Estimation With Frequency-Based Recurrent Refinement. 5626-5637 - Xian-Feng Han

, Yi-Fei Jin, Hui-Xian Cheng, Guoqiang Xiao
:
Dual Transformer for Point Cloud Analysis. 5638-5648 - Jiaheng Liu

, Jinyang Guo
, Dong Xu
:
GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition. 5649-5661 - Weihe Li

, Jiawei Huang
, Wenjun Lyu
, Baoshen Guo
, Wanchun Jiang
, Jianxin Wang
:
RAV: Learning-Based Adaptive Streaming to Coordinate the Audio and Video Bitrate Selections. 5662-5675 - Kuiyuan Zhang

, Zhongyun Hua
, Yuanman Li
, Yongyong Chen
, Yicong Zhou
:
AMS-Net: Adaptive Multi-Scale Network for Image Compressive Sensing. 5676-5689 - Kangle Wu

, Jun Chen
, Jiayi Ma
:
DMEF: Multi-Exposure Image Fusion Based on a Novel Deep Decomposition Method. 5690-5703 - Huaian Chen

, Jianfeng Wang
, Minghui Duan
, Yi Jin
, Yan Kan
, Changan Zhu
:
Video Denoising for Scenes With Challenging Motion: A Comprehensive Analysis and a New Framework. 5704-5719 - Xiaofeng Ding

, Tieyong Zeng
, Jian Tang
, Zhengping Che
, Yaxin Peng
:
SRRNet: A Semantic Representation Refinement Network for Image Segmentation. 5720-5732 - Kaihua Zhang

, Yang Wu
, Mingliang Dong
, Bo Liu, Dong Liu, Qingshan Liu
:
Deep Object Co-Segmentation and Co-Saliency Detection via High-Order Spatial-Semantic Network Modulation. 5733-5746 - Shaowei Weng

, Ye Zhou, Tiancong Zhang
, Mengyao Xiao
, Yao Zhao
:
General Framework to Reversible Data Hiding for JPEG Images With Multiple Two-Dimensional Histograms. 5747-5762 - Shule Deng

, Jin-Gang Yu
, Zihao Wu
, Hongxia Gao, Yansheng Li
, Yang Yang
:
Learning Relative Feature Displacement for Few-Shot Open-Set Recognition. 5763-5774 - Jian Jin

, Xingxing Zhang
, Lili Meng
, Weisi Lin
, Jie Liang
, Huaxiang Zhang
, Yao Zhao
:
Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding. 5775-5788 - Ginam Kim, Hyunsung Kim, Kyeongbo Kong, Jou Won Song

, Suk-Ju Kang
:
Human Body-Aware Feature Extractor Using Attachable Feature Corrector for Human Pose Estimation. 5789-5799 - Junwen Xiong

, Yu Zhou
, Peng Zhang
, Lei Xie
, Wei Huang
, Yufei Zha
:
Look&listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement. 5800-5812 - Haoyu Chen, Minggui Teng

, Boxin Shi
, Yizhou Wang
, Tie-Jun Huang
:
A Residual Learning Approach to Deblur and Generate High Frame Rate Video With an Event Camera. 5826-5839 - Frank Po Wen Lo

, Yao Guo
, Yingnan Sun
, Jianing Qiu
, Benny Lo
:
An Intelligent Vision-Based Nutritional Assessment Method for Handheld Food Items. 5840-5851 - Longrong Yang

, Hongliang Li
, Qingbo Wu
, Fanman Meng
, Heqian Qiu
, Linfeng Xu
:
Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation. 5852-5863 - Yanli Ji

, Shuo Ma, Xing Xu
, Xuelong Li
, Heng Tao Shen
:
Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation. 5864-5876 - Yushu Zhang

, Wentao Zhou, Ruoyu Zhao
, Xinpeng Zhang
, Xiaochun Cao:
F-TPE: Flexible Thumbnail-Preserving Encryption Based on Multi-Pixel Sum-Preserving Encryption. 5877-5891 - Muli Yang

, Chenghao Xu, Aming Wu, Cheng Deng
:
A Decomposable Causal View of Compositional Zero-Shot Learning. 5892-5902 - Tianxin Huang

, Hao Zou, Jinhao Cui, Jiangning Zhang
, Xuemeng Yang, Lin Li
, Yong Liu
:
Adaptive Recurrent Forward Network for Dense Point Cloud Completion. 5903-5915 - Lin Zhang, Mingxin Zhang

, Ran Song
, Ziying Zhao, Xiaolei Li:
Unsupervised Embedding Learning With Mutual-Information Graph Convolutional Networks. 5916-5926 - Jie Li

, Yong Xiang
, Hao Wu, Shaowen Yao
, Dan Xu
:
Optimal Transport-Based Patch Matching for Image Style Transfer. 5927-5940 - Jacob Chakareski

, Xavier Corbillon, Gwendal Simon, Viswanathan (Vishy) Swaminathan:
User Navigation Modeling, Rate-Distortion Analysis, and End-to-End Optimization for Viewport-Driven 360$^\circ$ Video Streaming. 5941-5956 - Jiaqi Zhang

, Yunrui Jian
, Suhong Wang
, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Textural and Directional Information Based Offset In-Loop Filtering in AVS3. 5957-5971 - Jun-Sang Yoo

, Dong-Wook Kim, Yucheng Lu
, Seung-Won Jung
:
RZSR: Reference-Based Zero-Shot Super-Resolution With Depth Guided Self-Exemplars. 5972-5983 - Majjed Al-Qatf

, Xingfu Wang
, Ammar Hawbani
, Amr Abdussalam
, Saeed Hamood Alsamhi
:
Image Captioning With Novel Topics Guidance and Retrieval-Based Topics Re-Weighting. 5984-5999 - Zhentan Zheng

, Jianyi Liu
, Nanning Zheng
:
P$^{2}$-GAN: Efficient Stroke Style Transfer Using Single Style Image. 6000-6012 - Ali Ak

, Abhishek Goswami
, Wolf Hauser, Patrick Le Callet
, Frédéric Dufaux
:
RV-TMO: Large-Scale Dataset for Subjective Quality Assessment of Tone Mapped Images. 6013-6025 - Pei Wang

, Yun Yang
, Yuelong Xia, Kun Wang, Xingyi Zhang
, Song Wang
:
Information Maximizing Adaptation Network With Label Distribution Priors for Unsupervised Domain Adaptation. 6026-6039 - Dingkang Liang

, Wei Xu
, Yingying Zhu
, Yu Zhou
:
Focal Inverse Distance Transform Maps for Crowd Localization. 6040-6052 - Sheikh Tania

, Gour C. Karmakar
, Shyh Wei Teng
, M. Manzur Murshed
:
A Robust Local Texture Descriptor in the Parametric Space of the Weibull Distribution. 6053-6066 - Huihui Yue, Jichang Guo

, Xiangjun Yin
, Yi Zhang, Sida Zheng:
Deep Label Prior: Pre-Training-Free Salient Object Detection Network Based on Label Learning. 6067-6078 - Jakub Nawala

, Lucjan Janowski
, Bogdan Cmiel, Krzysztof Rusek
, Pablo Pérez
:
Generalized Score Distribution: A Two-Parameter Discrete Distribution Accurately Describing Responses From Quality of Experience Subjective Experiments. 6090-6104 - Congcong Li

, Jing Li
, Yuguang Xie, Jiayang Nie, Tao Yang
, Zhaoyang Lu:
Calibration-Free Cross-Camera Target Association Using Interaction Spatiotemporal Consistency. 6105-6120 - Zhe Xu

, Kun Wei
, Xu Yang, Cheng Deng
:
Point-Supervised Video Temporal Grounding. 6121-6131 - Wenfeng Song, Xia Hou, Shuai Li, Chenglizhao Chen, Danyang Gao, Xian'e Wang, Yuzhe Sun, Jianxia Hou, Aimin Hao:

An Intelligent Virtual Standard Patient for Medical Students Training Based on Oral Knowledge Graph. 6132-6145 - Xu Yin

, Dongbo Min
, Yuchi Huo
, Sung-Eui Yoon
:
Contour-Aware Equipotential Learning for Semantic Segmentation. 6146-6156 - Junbao Zhuo

, Shuhui Wang
, Qingming Huang
:
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments. 6157-6170 - Zhiqi Pang

, Lingling Zhao, Qiuyang Liu, Chunyu Wang
:
Camera Invariant Feature Learning for Unsupervised Person Re-Identification. 6171-6182 - Jiahao Nie

, Zhiwei He
, Yuxiang Yang
, Mingyu Gao
, Zhekang Dong
:
Learning Localization-Aware Target Confidence for Siamese Visual Tracking. 6194-6206 - Chao Sun, Zhedong Zheng

, Xiaohan Wang
, Mingliang Xu
, Yi Yang:
Self-Supervised Point Cloud Representation Learning via Separating Mixed Shapes. 6207-6218 - Fuxiang Wu

, Liu Liu
, Fusheng Hao
, Fengxiang He, Jun Cheng
:
Language-Based Image Manipulation Built on Language-Guided Ranking. 6219-6231 - Yamin Sepehri

, Pedram Pad, Clément Kündig, Pascal Frossard
, L. Andrea Dunbar:
Privacy-Preserving Image Acquisition for Neural Vision Systems. 6232-6244 - Sweta Anmulwar

, Ning Wang
, Vu San Ha Huynh
, Stewart Bryant, Jinze Yang, Regius Rahim Tafazolli
:
HoloSync: Frame Synchronisation for Multi-Source Holographic Teleportation Applications. 6245-6257 - Jingzhao Xu

, Mengke Yuan
, Dong-Ming Yan
, Tieru Wu
:
Illumination Guided Attentive Wavelet Network for Low-Light Image Enhancement. 6258-6271 - Xixia Xu

, Qi Zou
, Xue Lin
:
Structure-Enriched Topology Learning For Cross-Domain Multi-Person Pose Estimation. 6272-6284 - Jun Chen

, Hui Duan
, Yuanxin Song
, Zemin Cai
, Guangguang Yang
:
Optical Flow Computation for Video Under the Dynamic Illumination. 6285-6300 - Jiandian Zeng

, Jiantao Zhou
, Tianyi Liu
:
Robust Multimodal Sentiment Analysis via Tag Encoding of Uncertain Missing Modalities. 6301-6314 - Wei Wang, Junyu Gao

, Changsheng Xu
:
Weakly-Supervised Video Object Grounding via Learning Uni-Modal Associations. 6329-6340 - Qing Li

, Ying Chen
, Aoyang Zhang, Yong Jiang
, Longhao Zou
, Zhimin Xu
, Gabriel-Miro Muntean
:
A Super-Resolution Flexible Video Coding Solution for Improving Live Streaming Quality. 6341-6355 - Hanyang Jin, Shenqi Lai, Qi Tang, Tianyu Zhu, Xueming Qian

:
MPPM: A Mobile-Efficient Part Model for Object re-ID. 6356-6370 - Qiangqiang Shen

, Shuangyan Yi
, Yongsheng Liang
, Yongyong Chen
, Wei Liu:
Bilateral Fast Low-Rank Representation With Equivalent Transformation for Subspace Clustering. 6371-6383 - Qiaokang Xie

, Zhenbo Lu, Wengang Zhou
, Houqiang Li
:
Improving Person Re-Identification With Multi-Cue Similarity Embedding and Propagation. 6384-6396 - Wenhong Duan

, Zhenhua Liu, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Differential Weight Quantization for Multi-Model Compression. 6397-6410 - Tiesong Zhao

, Yuhang Huang, Weize Feng, Yiwen Xu
, Sam Kwong
:
Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation. 6411-6421 - Yawen Cui

, Wanxia Deng
, Xin Xu
, Zhen Liu
, Zhong Liu, Matti Pietikäinen
, Li Liu
:
Uncertainty-Guided Semi-Supervised Few-Shot Class-Incremental Learning With Knowledge Distillation. 6422-6435 - Dongyan Nie

, Jialin Liu
, Hong Fei, Xiaoying Sun
:
Neuromorphic Similarity Measurement of Tactile Stimuli in Human-Machine Interface. 6436-6445 - Huaxin Pang

, Shikui Wei
, Gangjian Zhang
, Shiyin Zhang
, Shuang Qiu
, Yao Zhao
:
Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval. 6446-6457 - Chuang Yang

, Mulin Chen
, Yuan Yuan, Qi Wang
:
Reinforcement Shrink-Mask for Text Detection. 6458-6470 - Junjie Wu

, Changqun Xia
, Tianshu Yu
, Jia Li
:
View-Aware Salient Object Detection for $360^{\circ}$ Omnidirectional Image. 6471-6484 - Wendong Mao

, Shuai Yang
, Huihong Shi
, Jiaying Liu
, Zhongfeng Wang
:
Intelligent Typography: Artistic Text Style Transfer for Complex Texture and Structure. 6485-6498 - Guanghui Yue

, Di Cheng
, Leida Li
, Tianwei Zhou
, Hantao Liu
, Tianfu Wang
:
Semi-Supervised Authentically Distorted Image Quality Assessment With Consistency-Preserving Dual-Branch Convolutional Neural Network. 6499-6511 - Naiyu Fang

, Lemiao Qiu
, Shuyou Zhang
, Zili Wang
, Kerui Hu, Liangyu Dong:
A Novel Human Image Sequence Synthesis Method by Pose-Shape-Content Inference. 6512-6524 - Jinchao Zhu

, Xiaoyu Zhang, Xian Fang
, Yuxuan Wang
, Panlong Tan, Junnan Liu
:
Perception-and-Regulation Network for Salient Object Detection. 6525-6537 - Dehui Zhu

, Bo Du
, Yanni Dong
, Liangpei Zhang
:
Target Detection With Spatial-Spectral Adaptive Sample Generation and Deep Metric Learning for Hyperspectral Imagery. 6538-6550 - Yiming Wang

, Dongxia Chang
, Zhiqiang Fu
, Jie Wen
, Yao Zhao
:
Graph Contrastive Partial Multi-View Clustering. 6551-6562 - Changchong Sheng

, Li Liu
, Wanxia Deng
, Liang Bai, Zhong Liu, Songyang Lao
, Gangyao Kuang, Matti Pietikäinen
:
Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading. 6563-6574 - Huan Deng

, Zhenguo Yang
, Tianyong Hao
, Qing Li
, Wenyin Liu
:
Multimodal Affective Computing With Dense Fusion Transformer for Inter- and Intra-Modality Interactions. 6575-6587 - Gaosheng Liu

, Huanjing Yue
, Jiamin Wu
, Jing-Yu Yang
:
Efficient Light Field Angular Super-Resolution With Sub-Aperture Feature Learning and Macro-Pixel Upsampling. 6588-6600 - Yukun Qiu

, Fa-Ting Hong
, Wei-Hong Li
, Wei-Shi Zheng
:
Learning Relation Models to Detect Important People in Still Images. 6601-6615 - Congcong Zhu

, Xiaoqiang Li
, Jide Li
, Songmin Dai, Weiqin Tong
:
Multi-Sourced Knowledge Integration for Robust Self-Supervised Facial Landmark Tracking. 6616-6628 - Huibing Wang

, Guangqi Jiang, Jinjia Peng, Ruoxi Deng, Xianping Fu:
Towards Adaptive Consensus Graph: Multi-View Clustering via Graph Collaboration. 6629-6641 - Yong Li

, Qiang Hao
, Jianguo Hu, Xinmiao Pan
, Zechao Li
, Zhen Cui
:
3D3M: 3D Modulated Morphable Model for Monocular Face Reconstruction. 6642-6652 - Tingyu Weng

, Jun Xiao
, Feilong Yan, Haiyong Jiang
:
Context-Aware 3D Point Cloud Semantic Segmentation With Plane Guidance. 6653-6664 - Wei Xia

, Qianqian Wang
, Quanxue Gao
, Ming Yang
, Xinbo Gao:
Self-Consistent Contrastive Attributed Graph Clustering With Pseudo-Label Prompt. 6665-6677 - Xin Yao

, Min Wang
, Wengang Zhou
, Houqiang Li
:
Hash Bit Selection With Reinforcement Learning for Image Retrieval. 6678-6687 - Chen Ju

, Peisen Zhao
, Siheng Chen
, Ya Zhang
, Xiaoyun Zhang
, Yanfeng Wang
, Qi Tian
:
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization. 6688-6701 - Peipei Zhu

, Xiao Wang
, Yong Luo, Zhenglong Sun
, Wei-Shi Zheng
, Yaowei Wang
, Changwen Chen
:
Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition. 6702-6716 - Xinsheng Wang

, Qicong Xie, Jihua Zhu
, Lei Xie
, Odette Scharenborg
:
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Persons. 6717-6728 - Taro Narahara

, Toshihiko Yamasaki
:
Subjective Functionality and Comfort Prediction for Apartment Floor Plans and Its Application to Intuitive Online Property Search. 6729-6742 - Zhenrong Zhang

, Jiefeng Ma, Jun Du
, Licheng Wang, Jianshu Zhang:
Multimodal Pre-Training Based on Graph Attention Network for Document Understanding. 6743-6755 - Chunjie Zhang

, Huihui Bai
, Yao Zhao
:
Fine-Grained Image Classification by Class and Image-Specific Decomposition With Multiple Views. 6756-6766 - Zehua Sheng

, Xiongwei Liu
, Si-Yuan Cao, Hui-Liang Shen
, Huaqi Zhang:
Frequency-Domain Deep Guided Image Denoising. 6767-6781 - Kunpeng Niu

, Yanli Liu
, Enhua Wu, Guanyu Xing
:
A Boundary-Aware Network for Shadow Removal. 6782-6793 - Lirong Zheng

, Yanshan Li
, Kaihao Zhang
, Wenhan Luo
:
T-Net: Deep Stacked Scale-Iteration Network for Image Dehazing. 6794-6807 - Dengyan Luo

, Mao Ye
, Shuai Li
, Ce Zhu
, Xue Li
:
Spatio-Temporal Detail Information Retrieval for Compressed Video Quality Enhancement. 6808-6820 - Jian Zhu

, Qingwu Zhang, Lunke Fei
, Ruichu Cai
, Yuan Xie
, Bin Sheng
, Xiaokang Yang
:
FFFN: Frame-By-Frame Feedback Fusion Network for Video Super-Resolution. 6821-6835 - Shijia Ni

, Feng Shao
, Xiongli Chai
, Hangwei Chen
, Yo-Sung Ho
:
Composition-Guided Neural Network for Image Cropping Aesthetic Assessment. 6836-6851 - Yuanman Li

, Jiaxiang You, Jiantao Zhou
, Wei Wang
, Xin Liao
, Xia Li
:
Image Operation Chain Detection with Machine Translation Framework. 6852-6867 - Tong Zhu

, Leida Li
, Jufeng Yang
, Sicheng Zhao
, Xiao Xiao
:
Multimodal Emotion Classification With Multi-Level Semantic Reasoning Network. 6868-6880 - Pinzhuo Tian

, Shaorong Xie
:
An Adversarial Meta-Training Framework for Cross-Domain Few-Shot Learning. 6881-6891 - Minsoo Song

, Gi-Mun Um, Heekyung Lee
, Jeongil Seo
, Wonjun Kim
:
Dynamic Residual Filtering With Laplacian Pyramid for Instance Segmentation. 6892-6903 - Nannan Hu, Yue Ming

, Chunxiao Fan, Fan Feng
, Boyang Lyu:
TSFNet: Triple-Steam Image Captioning. 6904-6916 - Weimin Tan

, Ganghui Ru, Yueming Jiang, Jichun Li
, Bo Yan
:
Rethinking and Improving Few-Shot Segmentation From a Contour-Aware Perspective. 6917-6929 - Yuchun Fang

, Sirui Cai, Yiting Cao
, Zhengchen Li, Zhaoxiang Zhang
:
Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning. 6946-6957 - Haonan Zhang

, Longjun Liu
, Bingyao Kang, Nanning Zheng
:
Hierarchical Model Compression via Shape-Edge Representation of Feature Maps - an Enlightenment From the Primate Visual System. 6958-6970 - Runmin Cong

, Kepu Zhang
, Chen Zhang
, Feng Zheng
, Yao Zhao
, Qingming Huang
, Sam Kwong
:
Does Thermal Really Always Matter for RGB-T Salient Object Detection? 6971-6982 - Yadong Qu

, Hongtao Xie
, Shancheng Fang
, Yuxin Wang
, Yongdong Zhang
:
ADNet: Rethinking the Shrunk Polygon-Based Approach in Scene Text Detection. 6983-6996 - Aihua Mao

, Zhi Yang
, Ken Lin, Jun Xuan, Yong-Jin Liu
:
Positional Attention Guided Transformer-Like Architecture for Visual Question Answering. 6997-7009 - Haijin Zeng

, Jize Xue
, Hiep Luong, Wilfried Philips:
Multimodal Core Tensor Factorization and its Applications to Low-Rank Tensor Completion. 7010-7024 - Fengda Hao

, Jiaojiao Li
, Rui Song
, Yunsong Li
, Kailang Cao:
Structure-Aware Graph Convolution Network for Point Cloud Parsing. 7025-7036 - Wei Huang

, Yintao Zhou
, Yiu-ming Cheung
, Peng Zhang
, Yufei Zha
, Meng Pang
:
Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation. 7037-7050 - Wuyang Li

, Xinyu Liu
, Yixuan Yuan
:
SCAN++: Enhanced Semantic Conditioned Adaptation for Domain Adaptive Object Detection. 7051-7061 - Qingrong Cheng

, Keyu Wen
, Xiaodong Gu
:
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks. 7062-7075 - Cankun Zhong

, Wing W. Y. Ng
:
A Robust Frequency-Domain-Based Graph Adaptive Network for Parkinson's Disease Detection From Gait Data. 7076-7088 - Zhonghong Ou

, Zhongjie Chen
, Shengyi Shen
, Lina Fan
, Siyuan Yao
, Meina Song
, Pan Hui
:
Free$^{3}$Net: Gliding Free, Orientation Free, and Anchor Free Network for Oriented Object Detection. 7089-7100 - Yuchen Hong

, Youwei Lyu
, Si Li
, Gang Cao, Boxin Shi
:
Reflection Removal With NIR and RGB Image Feature Fusion. 7101-7112 - Lu Yang

, Qing Song
, Zhihui Wang, Zhiwei Liu, Songcen Xu, Zhihao Li
:
Quality-Aware Network for Human Parsing. 7128-7138 - SangEun Lee

, Chaeeun Ryu, Eunil Park
:
OSANet: Object Semantic Attention Network for Visual Sentiment Analysis. 7139-7148 - Fan Liu

, Huilin Chen, Zhiyong Cheng
, Anan Liu
, Liqiang Nie
, Mohan S. Kankanhalli
:
Disentangled Multimodal Representation Learning for Recommendation. 7149-7159 - Yuanwei Zhu

, Yakun Huang
, Xiuquan Qiao
, Zhijie Tan, Boyuan Bai, Huadong Ma
, Schahram Dustdar
:
A Semantic-Aware Transmission With Adaptive Control Scheme for Volumetric Video Service. 7160-7172 - Yue Zhang

, Chao Liang
, Longxiang Jiang
:
Confidence-Aware Active Feedback for Interactive Instance Search. 7173-7184 - Soushi Ueno

, Takuya Fujihashi
, Toshiaki Koike-Akino
, Takashi Watanabe
:
Point Cloud Soft Multicast for Untethered XR Users. 7185-7195 - Bianca Jansen Van Rensburg

, William Puech
, Jean-Pierre Pedeboy:
A Format Compliant Encryption Method for 3D Objects Allowing Hierarchical Decryption. 7196-7207 - Yuqi Bu

, Liuwu Li, Jiayuan Xie
, Qiong Liu
, Yi Cai
, Qingbao Huang
, Qing Li
:
Scene-Text Oriented Referring Expression Comprehension. 7208-7221 - Yan Wang

, Tongtong Su
, Yusen Li
, Jiuwen Cao
, Gang Wang
, Xiaoguang Liu
:
DDistill-SR: Reparameterized Dynamic Distillation Network for Lightweight Image Super-Resolution. 7222-7234 - Dawei Zhao

, Qingwei Gao
, Yixiang Lu
, Dong Sun
:
Non-Aligned Multi-View Multi-Label Classification via Learning View-Specific Labels. 7235-7247 - Lianli Gao

, Qike Zhao
, Junchen Zhu
, Sitong Su
, Lechao Cheng
, Lei Zhao
:
From External to Internal: Structuring Image for Text-to-Image Attributes Manipulation. 7248-7261 - Yangfan Sun

, Li Li
, Zhu Li
, Shizheng Wang, Shan Liu
, Ge Li
:
Learning a Compact Spatial-Angular Representation for Light Field. 7262-7273 - Yiyun Chen

, Yunmeng Liu
, Mingliang Chen, Zirui Wang
, Wenming Yang
, Qingmin Liao
:
Blind JPEG Compression Artifacts Removal by Integrating Channel Regulation With Exit Strategy. 7274-7286 - Yan Bai

, Jile Jiao
, Yihang Lou
, Shengsen Wu, Jun Liu
, Xuetao Feng, Ling-Yu Duan
:
Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning. 7287-7298 - Ruisong Zhang

, Weize Quan, Yong Zhang, Jue Wang
, Dong-Ming Yan
:
W-Net: Structure and Texture Interaction for Image Inpainting. 7299-7310 - Xihua Sheng

, Jiahao Li
, Bin Li
, Li Li
, Dong Liu
, Yan Lu:
Temporal Context Mining for Learned Video Compression. 7311-7322 - Zhening Xing

, Yuchen Wu
, Si Liu
, Shangzhe Di, Huimin Ma
:
Virtual Try-On With Garment Self-Occlusion Conditions. 7323-7336 - Jiaxiang Chen

, Jiayuan Fan
, Hancheng Ye
, Jie Li
, Yongbin Liao
, Tao Chen
:
Exploring Kernel-Based Texture Transfer for Pose-Guided Person Image Generation. 7337-7349 - Tong Qiao

, Jiasheng Wu, Ning Zheng
, Ming Xu
, Xiangyang Luo
:
FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing. 7350-7363 - Jun Jia

, Zhongpai Gao
, Dandan Zhu
, Xiongkuo Min
, Menghan Hu
, Guangtao Zhai
:
RIVIE: Robust Inherent Video Information Embedding. 7364-7377 - Biwei Cao

, Jiuxin Cao
, Jie Gui
, Jiayun Shen, Bo Liu
, Lei He, Yuan Yan Tang, James Tin-Yau Kwok
:
AlignVE: Visual Entailment Recognition Based on Alignment Relations. 7378-7387 - Guoqiang Gong

, Linchao Zhu
, Yadong Mu
:
Language-Guided Multi-Granularity Context Aggregation for Temporal Sentence Grounding. 7402-7414 - Fuxiang Huang, Lei Zhang

, Yuhang Zhou, Xinbo Gao
:
Adversarial and Isotropic Gradient Augmentation for Image Retrieval With Text Feedback. 7415-7427 - Chengyin Xu

, Zenghao Chai
, Zhengzhuo Xu
, Hongjia Li, Qiruyi Zuo
, Lingyu Yang, Chun Yuan
:
HHF: Hashing-Guided Hinge Function for Deep Hashing Retrieval. 7428-7440 - Ziming Liu

, Song Guo
, Jingcai Guo
, Yuanyuan Xu, Fushuo Huo
:
Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention. 7441-7455 - Pan Yang

, Xiong Luo
, Jiankun Sun
:
A Simple but Effective Method for Balancing Detection and Re-Identification in Multi-Object Tracking. 7456-7468 - Xiang Li

, Jinglu Wang
, Xiao Li
, Yan Lu:
Video Instance Segmentation by Instance Flow Assembly. 7469-7479 - Masum Shah Junayed

, Md Baharul Islam
:
Consistent Video Inpainting Using Axial Attention-Based Style Transformer. 7494-7504 - Jiaxiang Wang

, Chenglong Li
, Aihua Zheng
, Jin Tang
, Bin Luo
:
Looking and Hearing Into Details: Dual-Enhanced Siamese Adversarial Network for Audio-Visual Matching. 7505-7516 - Xiang Fang

, Daizong Liu
, Pan Zhou
, Yuchong Hu
:
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval. 7517-7532 - Kai Yang

, Haijun Zhang
, Feng Gao, Jianyang Shi, Yanfeng Zhang, Q. M. Jonathan Wu
:
DETA: A Point-Based Tracker With Deformable Transformer and Task-Aligned Learning. 7545-7558 - Hezhen Hu, Junfu Pu

, Wengang Zhou
, Houqiang Li
:
Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework. 7559-7570 - Han Fang

, Zhaoyang Jia, Yupeng Qiu
, Jiyi Zhang, Weiming Zhang
, Ee-Chien Chang
:
De-END: Decoder-Driven Watermarking Network. 7571-7581 - Yiran Yang

, Xian Sun
, Wenhui Diao
, Xuee Rong
, Shiyao Yan, Dongshuo Yin
, Xinming Li:
Optimal Partition Assignment for Universal Object Detection. 7582-7593 - Zhao Xie

, Jiansong Chen
, Kewei Wu
, Dan Guo
, Richang Hong
:
Global Temporal Difference Network for Action Recognition. 7594-7606 - Yucheng Zhu

, Yunhao Li
, Wei Sun
, Xiongkuo Min
, Guangtao Zhai
, Xiaokang Yang
:
Blind Image Quality Assessment via Cross-View Consistency. 7607-7620 - Wenhui Zhou

, Hua Zhang, Zhengmao Yan
, Weisheng Wang, Lili Lin
:
DecoupledPoseNet: Cascade Decoupled Pose Learning for Unsupervised Camera Ego-Motion Estimation. 1636-1648 - Xiaobao Guo

, Adams Wai-Kin Kong
, Alex C. Kot
:
Deep Multimodal Sequence Fusion by Regularized Expressive Representation Distillation. 2085-2096 - Sheng Huang

, Yunhe Zhang
, Lele Fu
, Shiping Wang
:
Learnable Multi-View Matrix Factorization With Graph Embedding and Flexible Loss. 3259-3272 - Xiao Fu

, Hangyu Deng, Xin Yuan, Jinglu Hu
:
Generating High Coherence Monophonic Music Using Monte-Carlo Tree Search. 3763-3772 - Jixiang Gao

, Jingjing Chen
, Huazhu Fu
, Yu-Gang Jiang
:
Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition. 4764-4773 - Haichao Yao

, Rongrong Ni
, Hadi Amirpour
, Christian Timmerer
, Yao Zhao
:
Detection and Localization of Video Transcoding From AVC to HEVC Based on Deep Representations of Decoded Frames and PU Maps. 5014-5029 - Yangyang Li

, Wei Zhai
, Yang Cao
, Zheng-Jun Zha
:
Location-Free Camouflage Generation Network. 5234-5247 - Bin Wang

, Chunsheng Liu
, Faliang Chang
, Wenqian Wang
, Nanjun Li:
AE-Net: Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding. 5458-5468 - Xiaopeng Li

, Xiaojie Guo
:
SPN2D-GAN: Semantic Prior Based Night-to-Day Image-to-Image Translation. 7621-7634 - Xinhui Li

, Mingjia Li
, Xiaopeng Li
, Xiaojie Guo
:
Learning Generalized Knowledge From a Single Domain on Urban-Scene Segmentation. 7635-7646 - Yujian Feng

, Jian Yu
, Feng Chen
, Yimu Ji
, Fei Wu
, Shangdong Liu
, Xiao-Yuan Jing
:
Visible-Infrared Person Re-Identification via Cross-Modality Interaction Transformer. 7647-7659 - Shuwei Shao

, Ran Li
, Zhongcai Pei
, Zhong Liu
, Weihai Chen
, Wentao Zhu
, Xingming Wu
, Baochang Zhang
:
Towards Comprehensive Monocular Depth Estimation: Multiple Heads are Better Than One. 7660-7671 - Yipo Huang

, Leida Li
, Yuzhe Yang
, Yaqian Li
, Yandong Guo:
Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning. 7672-7685 - Ashutosh Kulkarni

, Prashant W. Patil
, Subrahmanyam Murala
, Sunil Gupta
:
Unified Multi-Weather Visibility Restoration. 7686-7698 - Zhong Ji

, Junhua Hu, Deyin Liu
, Lin Yuanbo Wu
, Ye Zhao
:
Asymmetric Cross-Scale Alignment for Text-Based Person Search. 7699-7709 - Jiahao Hong, Wei Zhang

, Zhiwei Feng, Wenqiang Zhang
:
Dual Cross-Attention for Video Object Segmentation via Uncertainty Refinement. 7710-7725 - Jian Wang

, Xinyue Li
, Zhichao Zhang
, Wei Song
, Weiqi Guo:
Ranked Similarity Weighting and Top-nk Sampling in Deep Metric Learning. 7726-7735 - Yiming Bao

, Xu Zhao
, Dahong Qian
:
FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation. 7736-7746 - Fan Luo

, Shaoxiang Chen, Jingjing Chen
, Zuxuan Wu, Yu-Gang Jiang
:
Self-Supervised Learning for Semi-Supervised Temporal Language Grounding. 7747-7757 - Zheren Fu

, Zhendong Mao
, Bo Hu
, An-An Liu
, Yongdong Zhang
:
Intra-Class Adaptive Augmentation With Neighbor Correction for Deep Metric Learning. 7758-7771 - Han Fang, Pengfei Xiong, Luhui Xu

, Wenhan Luo
:
Transferring Image-CLIP to Video-Text Retrieval via Temporal Relations. 7772-7785 - Yulan Guo

, Yun Wang
, Longguang Wang
, Zi Wang
, Chen Cheng:
CVCNet: Learning Cost Volume Compression for Efficient Stereo Matching. 7786-7799 - Zhishe Wang

, Wenyu Shao
, Yanlin Chen
, Jiawei Xu
, Xiaoqin Zhang
:
Infrared and Visible Image Fusion via Interactive Compensatory Attention Adversarial Learning. 7800-7813 - Dong Chen

, Yueting Zhuang, Zijin Shen
, Carl Yang
, Guoming Wang
, Siliang Tang
, Yi Yang:
Cross-Modal Data Augmentation for Tasks of Different Modalities. 7814-7824 - Rahul Sharma

, Krishna Somandepalli
, Shrikanth Narayanan
:
Cross Modal Video Representations for Weakly Supervised Active Speaker Localization. 7825-7836 - Yonghao Xu

, Fengxiang He, Bo Du
, Dacheng Tao
, Liangpei Zhang
:
Self-Ensembling GAN for Cross-Domain Semantic Segmentation. 7837-7850 - Fengyong Li

, Zhenjia Pei, Xinpeng Zhang
, Chuan Qin
:
Image Manipulation Localization Using Multi-Scale Feature Fusion and Adaptive Edge Supervision. 7851-7866 - Nan Xu

, Junyan Wang, Yuan Tian, Ruike Zhang, Wenji Mao
:
AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification. 7867-7880 - Weiwei Xing

, Jie Yao
, Zixia Liu, Weibin Liu
, Shunli Zhang, Liqiang Wang:
Contrastive JS: A Novel Scheme for Enhancing the Accuracy and Robustness of Deep Models. 7881-7893 - Zhu Liu

, Teng Wang
, Jinrui Zhang
, Feng Zheng
, Wenhao Jiang
, Ke Lu
:
Show, Tell and Rephrase: Diverse Video Captioning via Two-Stage Progressive Training. 7894-7905 - Lingwei Wei

, Dou Hu
, Wei Zhou
, Songlin Hu
:
Modeling Both Intra- and Inter-Modality Uncertainty for Multimodal Fake News Detection. 7906-7916 - Ziyi Tang, Ruimao Zhang

, Zhanglin Peng
, Jinrui Chen, Liang Lin
:
Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person Re-Identification. 7917-7929 - Jing Chen

, Linlin Chen, Huanqiang Zeng
, Chih-Hsien Hsia, Tianlei Wang
, Kai-Kuang Ma
:
3D-Gradient Guided Rate Control Model for Screen Content Video Coding. 7930-7942 - Fei Wu

, Qingzhong Wang
, Jiang Bian, Ning Ding, Feixiang Lu, Jun Cheng, Dejing Dou
, Haoyi Xiong
:
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications. 7943-7966 - Yang Zhang

, Xian Zhang
, Canghong Shi, Xi Wu
, Xiaojie Li
, Jing Peng, Kunlin Cao, Jiancheng Lv
, Jiliu Zhou:
Pluralistic Face Inpainting With Transformation of Attribute Information. 7967-7979 - Man Zhang, Yong Zhou

, Bing Liu
, Jiaqi Zhao
, Rui Yao
, Zhiwen Shao
, Hancheng Zhu
:
Weakly Supervised Few-Shot Semantic Segmentation via Pseudo Mask Enhancement and Meta Learning. 7980-7991 - Xiaowei Chen

, Guoliang Fan
:
Indoor Camera Pose Estimation From Room Layouts and Image Outer Corners. 7992-8005 - Renjie Wan

, Boxin Shi
, Wenhan Yang
, Bihan Wen
, Ling-Yu Duan
, Alex C. Kot
:
Purifying Low-Light Images via Near-Infrared Enlightened Image. 8006-8019 - Jiarun Song

, Xionghui Mao, Fuzheng Yang
:
The Impact of Black Edge Artifact on QoE of the FOV-Based Cloud VR Services. 8020-8035 - Yongqing Zhu

, Xiangyang Li
, Mao Zheng, Jiahao Yang
, Zihan Wang, Xiaoqian Guo
, Zifeng Chai, Yuchen Yuan, Shuqiang Jiang
:
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training. 8036-8050 - Hui Zhang

, Junkun Tang
, Yihong Cao
, Yurong Chen
, Yaonan Wang, Q. M. Jonathan Wu
:
Cycle Consistency Based Pseudo Label and Fine Alignment for Unsupervised Domain Adaptation. 8051-8063 - Xichuan Zhou

, Rui Ding, Yuxiao Wang, Wenjia Wei, Haijun Liu
:
Cellular Binary Neural Network for Accurate Image Classification and Semantic Segmentation. 8064-8075 - Jing Xiao

, Kangmin Xu, Mengshun Hu
, Liang Liao
, Zheng Wang
, Chia-Wen Lin
, Mi Wang
, Shin'ichi Satoh
:
Progressive Motion Boosting for Video Frame Interpolation. 8076-8090 - Xiaokai Yi, Hanli Wang

, Sam Kwong
, C.-C. Jay Kuo
:
Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization. 8091-8102 - Kangle Wu

, Jun Chen
, Yang Yu, Jiayi Ma
:
ACE-MEF: Adaptive Clarity Evaluation-Guided Network With Illumination Correction for Multi-Exposure Image Fusion. 8103-8118 - Chen Zhou

, Min Jiang
, Jun Kong
:
BGTracker: Cross-Task Bidirectional Guidance Strategy for Multiple Object Tracking. 8132-8144 - Tianshu Song

, Leida Li
, Jinjian Wu
, Yuzhe Yang
, Yaqian Li
, Yandong Guo, Guangming Shi
:
Knowledge-Guided Blind Image Quality Assessment With Few Training Samples. 8145-8156 - Xuesong Wang

, Ke Jin
, Kun Yu
, Yuhu Cheng
:
Asymmetric Training in RealnessGAN. 8157-8169 - Bin Cui

, Zhuang Shao
, Wei Tao, Hui Zhao
:
Hole Inpainting Algorithm for Half-Organized Point Cloud Obtained by Structured-Light Section System. 8170-8182 - Peng Li

, Jing Gao
, Jianing Zhang
, Shan Jin, Zhikui Chen
:
Deep Reinforcement Clustering. 8183-8193 - Qiaosong Qi, Aixi Zhang

, Yue Liao
, Wenyu Sun, Yongliang Wang, Xiaobo Li
, Si Liu
:
Simultaneously Training and Compressing Vision-and-Language Pre-Training Model. 8194-8203 - Hoda Roodaki

, Mahdi Nazm Bojnordi
:
Compressed Geometric Arrays for Point Cloud Processing. 8204-8211 - Ke Zhang

, Miao Long, Jie Chen, Mingzhu Liu
, Jingjing Li
:
CFPNet: A Denoising Network for Complex Frequency Band Signal Processing. 8212-8224 - Hao Cheng

, Joey Tianyi Zhou
, Wee Peng Tay
, Bihan Wen
:
Graph Neural Networks With Triple Attention for Few-Shot Learning. 8225-8239 - Bo Seok Shim

, Jae Hong Choe, Jong-Uk Hou
:
Source Identification of 3D Printer Based on Layered Texture Encoders. 8240-8252 - Yibo Zhao

, Hua Zhang, Zan Gao
, Wen Gao, Meng Wang
, Shengyong Chen
:
A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization. 8253-8266 - Xiaowei Zhao

, Xianglong Liu
, Yuqing Ma
, Shihao Bai, Yifan Shen
, Zeyu Hao, Aishan Liu
:
Temporal Speciation Network for Few-Shot Object Detection. 8267-8278 - Jun Cheng

, Fuxiang Wu
, Liu Liu
, Qieshi Zhang
, Leszek Rutkowski
, Dacheng Tao
:
InDecGAN: Learning to Generate Complex Images From Captions via Independent Object-Level Decomposition and Enhancement. 8279-8293 - Jie Li

, Qi Song
, Xiaohu Yan, Yongquan Chen
, Rui Huang
:
From Front to Rear: 3D Semantic Scene Completion Through Planar Convolution and Attention-Based Network. 8294-8307 - Binglu Wang

, Kang Yang, Yongqiang Zhao
, Teng Long, Xuelong Li
:
Prototype-Based Intent Perception. 8308-8319 - Wanjie Li

, Hongxia Wang, Yijing Chen
, Sani M. Abdullahi
, Jie Luo
:
Constructing Immunized Stego-Image for Secure Steganography via Artificial Immune System. 8320-8333 - Xiang-Jun Shen

, Yanan Cai
, Stanley Ebhohimhen Abhadiomhen
, Zhifeng Liu
, Yongzhao Zhan
, Jianping Fan
:
Deep Robust Low Rank Correlation With Unifying Clustering Structure for Cross Domain Adaptation. 8334-8345 - Yahui Xu

, Yi Bin
, Jiwei Wei
, Yang Yang
, Guoqing Wang
, Heng Tao Shen
:
Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval. 8346-8357 - Xiufang Li

, Qigong Sun
, Licheng Jiao
, Fang Liu
, Xu Liu
, Lingling Li
, Puhua Chen, Yi Zuo:
D³K: Dynastic Data-Free Knowledge Distillation. 8358-8371 - Yun Li

, Zhe Liu
, Xiaojun Chang
, Julian J. McAuley, Lina Yao
:
Diversity-Boosted Generalization-Specialization Balancing for Zero-Shot Learning. 8372-8382 - Jun Rao

, Liang Ding
, Shuhan Qi
, Meng Fang, Yang Liu
, Li Shen
, Dacheng Tao
:
Dynamic Contrastive Distillation for Image-Text Retrieval. 8383-8395 - Jianxin Lin

, Lianying Yin
, Yijun Wang
:
Steformer: Efficient Stereo Image Super-Resolution With Transformer. 8396-8407 - Xiaofeng Yang

, Fayao Liu
, Guosheng Lin
:
Effective End-to-End Vision Language Pretraining With Semantic Visual Loss. 8408-8417 - Pierre R. Lebreton

, Kazuhisa Yamagishi
:
Quitting Ratio-Based Bitrate Ladder Selection Mechanism for Adaptive Bitrate Video Streaming. 8418-8431 - Tengfei Liang

, Yi Jin
, Wu Liu
, Yidong Li
:
Cross-Modality Transformer With Modality Mining for Visible-Infrared Person Re-Identification. 8432-8444 - Guoqing Ma

, Yalong Bai
, Wei Zhang
, Ting Yao
, Basem Shihada
, Tao Mei
:
Boosting Generic Visual-Linguistic Representation With Dynamic Contexts. 8445-8457 - Zhiqing Guo

, Gaobo Yang
, Jiyou Chen
, Xingming Sun:
Exposing Deepfake Face Forgeries With Guided Residuals. 8458-8470 - Xingyan Chen

, Mu Wang
, Changqiao Xu
, Yu Zhao
, Shujie Yang, Ke Jiang, Qing Li, Lujie Zhong
, Gabriel-Miro Muntean
:
FedLive: A Federated Transmission Framework for Panoramic Livecast With Reinforced Variational Inference. 8471-8486 - Yang Yu

, Xiaohui Zhao
, Rongrong Ni
, Siyuan Yang
, Yao Zhao
, Alex C. Kot
:
Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection. 8487-8498 - Wentao Tan

, Lei Zhu
, Jingjing Li
, Zheng Zhang
, Huaxiang Zhang
:
Partial Multi-Modal Hashing via Neighbor-Aware Completion Learning. 8499-8510 - Qi Mao

, Siwei Ma
:
Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning. 8511-8526 - Haoyu Tian

, Xin Ma
, Xiang Li
, Yibin Li
:
Skeleton-Based Action Recognition With Select-Assemble-Normalize Graph Convolutional Networks. 8527-8538 - Daizong Liu

, Xiang Fang
, Wei Hu
, Pan Zhou
:
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding. 8539-8553 - Qiqi Bao

, Yunmeng Liu
, Bowen Gang
, Wenming Yang
, Qingmin Liao
:
SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution. 8554-8565 - Zhaoyang Wang

, Guanghua Liu
, Dongchen Zhang, Xinhai Hua, Lingmin Xu
, Peng Gao, Tao Jiang
:
Edge-Assisted Massive Video Delivery Over Cell-Free Massive MIMO. 8566-8579 - Zhihao Wu

, Xincan Lin
, Zhenghong Lin
, Zhaoliang Chen
, Yang Bai
, Shiping Wang
:
Interpretable Graph Convolutional Network for Multi-View Semi-Supervised Learning. 8593-8606 - Yaolin Yang

, Hongjie He
, Fan Chen
, Yuan Yuan
, Ningxiong Mao:
Reversible Data Hiding in Encrypted Images Based on Time-Varying Huffman Coding Table. 8607-8619 - Hongchen Tan

, Baocai Yin
, Kun Wei
, Xiuping Liu
, Xin Li
:
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis. 8620-8631 - Huaiwen Zhang

, Yang Yang
, Fan Qi, Shengsheng Qian
, Changsheng Xu
:
Robust Video-Text Retrieval Via Noisy Pair Calibration. 8632-8645 - Zheng Wang

, Xing Xu
, Guoqing Wang
, Yang Yang
, Heng Tao Shen
:
Quaternion Relation Embedding for Scene Graph Generation. 8646-8656 - Binxin Yang

, Xuejin Chen
, Chaoqun Wang
, Chi Zhang
, Zihan Chen, Xiaoyan Sun:
Semantics-Preserving Sketch Embedding for Face Generation. 8657-8671 - Mingde Yao

, Dongliang He
, Xin Li, Zhihong Pan, Zhiwei Xiong
:
Bidirectional Translation Between UHD-HDR and HD-SDR Videos. 8672-8686 - Chen Pang

, Xuequan Lu
, Lei Lyu
:
Skeleton-Based Action Recognition Through Contrasting Two-Stream Spatial-Temporal Networks. 8699-8711 - Zhenhua Tang

, Jia Li
, Yanbin Hao
, Richang Hong
:
MLP-JCG: Multi-Layer Perceptron With Joint-Coordinate Gating for Efficient 3D Human Pose Estimation. 8712-8724 - Yunhao Du

, Zhicheng Zhao
, Yang Song
, Yanyun Zhao
, Fei Su
, Tao Gong, Hongying Meng
:
StrongSORT: Make DeepSORT Great Again. 8725-8737 - Shaowei Weng

, Ye Zhou, Tiancong Zhang
, Mengyao Xiao
, Yao Zhao
:
Reversible Data Hiding for JPEG Images With Adaptive Multiple Two-Dimensional Histogram and Mapping Generation. 8738-8752 - Zhuang Shao

, Jungong Han
, Kurt Debattista
, Yanwei Pang
:
Textual Context-Aware Dense Captioning With Diverse Words. 8753-8766 - Huibin Lin

, Chun-Yang Zhang
, Shiping Wang
, Wenzhong Guo
:
A Probabilistic Contrastive Framework for Semi-Supervised Learning. 8767-8779 - Aswathy Madhu

, Suresh Kumaraswamy
:
RQNet: Residual Quaternion CNN for Performance Enhancement in Low Complexity and Device Robust Acoustic Scene Classification. 8780-8792 - Hao Liu

, Yanni Ma
, Qingyong Hu
, Yulan Guo
:
CenterTube: Tracking Multiple 3D Objects With 4D Tubelets in Dynamic Point Clouds. 8793-8804 - Guoguang Hua, Muxin Liao

, Shishun Tian
, Yuhang Zhang
, Wenbin Zou
:
Multiple Relational Learning Network for Joint Referring Expression Comprehension and Segmentation. 8805-8816 - Rui Ma

, Qingbo Wu
, King Ngi Ngan, Hongliang Li
, Fanman Meng
, Linfeng Xu
:
Forgetting to Remember: A Scalable Incremental Learning Framework for Cross-Task Blind Image Quality Assessment. 8817-8827 - Shengbin Yue

, Yunbin Tu
, Liang Li
, Ying Yang, Shengxiang Gao
, Zhengtao Yu
:
I3N: Intra- and Inter-Representation Interaction Network for Change Captioning. 8828-8841 - Baptiste Chopin

, Hao Tang
, Naima Otberdout
, Mohamed Daoudi
, Nicu Sebe
:
Interaction Transformer for Human Reaction Generation. 8842-8854 - Yongli Chang

, Sumei Li
, Anqi Liu
, Jie Jin
, Wei Xiang
:
Coarse-to-Fine Feedback Guidance Based Stereo Image Quality Assessment Considering Dominant Eye Fusion. 8855-8867 - Depu Meng

, Changqian Yu
, Jiajun Deng, Deheng Qian, Houqiang Li
, Dongchun Ren
:
Hybrid Motion Representation Learning for Prediction From Raw Sensor Data. 8868-8879 - Yuxuan Liu

, Jianxin Yang, Xiao Gu
, Yijun Chen, Yao Guo
, Guang-Zhong Yang
:
EgoFish3D: Egocentric 3D Pose Estimation From a Fisheye Camera via Self-Supervised Learning. 8880-8891 - Weicheng Xie, Wenya Lu, Zhibin Peng

, Linlin Shen
:
Consistency Preservation and Feature Entropy Regularization for GAN Based Face Editing. 8892-8905 - Jiayu Jiao

, Yu-Ming Tang
, Kun-Yu Lin
, Yipeng Gao, Andy J. Ma
, Yaowei Wang
, Wei-Shi Zheng
:
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition. 8906-8919 - Shuo Wang

, Zhihao Wu, Xiaobo Hu
, Youfang Lin
, Kai Lv
:
Skill-Based Hierarchical Reinforcement Learning for Target Visual Navigation. 8920-8932 - Chen Chen

, Dan Wang
, Bin Song
, Hao Tan:
Inter-Intra Modal Representation Augmentation With DCT-Transformer Adversarial Network for Image-Text Matching. 8933-8945 - Xiaoqi Wang

, Jian Xiong
, Weisi Lin
:
Visual Interaction Perceptual Network for Blind Image Quality Assessment. 8958-8971 - Jun Zhang

, Licheng Jiao
, Wenping Ma
, Fang Liu
, Xu Liu
, Lingling Li
, Puhua Chen
, Shuyuan Yang
:
Transformer Based Conditional GAN for Multimodal Image Fusion. 8988-9001 - Zhiwu Qing

, Ziyuan Huang
, Shiwei Zhang
, Mingqian Tang, Changxin Gao
, Rong Jin, Marcelo H. Ang
, Nong Sang
:
ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning. 9002-9014 - Qin Xu

, Jiahui Wang
, Bo Jiang
, Bin Luo
:
Fine-Grained Visual Classification via Internal Ensemble Learning Transformer. 9015-9028 - Chuang Yang

, Mulin Chen
, Yuan Yuan, Qi Wang
:
Text Growing on Leaf. 9029-9043 - Zhihua Wang

, Qiuping Jiang
, Shanshan Zhao, Wensen Feng, Weisi Lin
:
Deep Blind Image Quality Assessment Powered by Online Hard Example Mining. 4774-4784 - Qiang Zhai

, Fan Yang
, Xin Li
, Guo-Sen Xie
, Hong Cheng
, Zicheng Liu
:
Co-Communication Graph Convolutional Network for Multi-View Crowd Counting. 5813-5825 - Xiaohan Wang

, Linchao Zhu
, Zhedong Zheng
, Mingliang Xu
, Yi Yang:
Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision. 6079-6089 - Sadbhawna Thakur

, Vinit Jakhetiya
, Badri N. Subudhi
, Sunil Prasad Jaiswal
, Leida Li
, Weisi Lin
:
Context Region Identification Based Quality Assessment of 3D Synthesized Views. 6183-6193 - Ran Yi

, Zipeng Ye
, Zhiyao Sun
, Juyong Zhang
, Guo-Xin Zhang, Pengfei Wan
, Hujun Bao
, Yong-Jin Liu
:
Predicting Personalized Head Movement From Short Video and Speech Signal. 6315-6328 - Abdelhak Bentaleb

, Mehmet N. Akcay, May Lim
, Ali C. Begen
, Roger Zimmermann
:
BoB: Bandwidth Prediction for Real-Time Communications Using Heuristic and Reinforcement Learning. 6930-6945 - Jun Chen

, Meng Yang
, Wenping Gong
, Yang Yu:
Multi-Neighborhood Guided Kendall Rank Correlation Coefficient for Feature Matching. 7113-7127 - Chen Du

, Sarah Graham, Colin Depp, Truong Q. Nguyen
:
View-Invariant Center-of-Pressure Metrics Estimation With Monocular RGB Camera. 7388-7401 - Xunquan Chen

, Xuexin Xu
, Jinhui Chen
, Zhihong Zhang
, Tetsuya Takiguchi
, Edwin R. Hancock
:
Speaker-Independent Emotional Voice Conversion via Disentangled Representations. 7480-7493 - Wei Chen

, Haoyang Xu
, Nan Pu
, Yu Liu
, Mingrui Lao, Weiping Wang
, Li Liu
, Michael S. Lew:
Lifelong Fine-Grained Image Retrieval. 7533-7544 - Ying Fu

, Zichun Wang, Tao Zhang
, Jun Zhang
:
Low-Light Raw Video Denoising With a High-Quality Realistic Motion Dataset. 8119-8131 - Huafeng Liu

, Pai Peng, Tao Chen
, Qiong Wang
, Yazhou Yao
, Xian-Sheng Hua
:
FECANet: Boosting Few-Shot Semantic Segmentation With Feature-Enhanced Context-Aware Network. 8580-8592 - Zehua Ma

, Xi Yang
, Han Fang
, Weiming Zhang
, Nenghai Yu
:
OAcode: Overall Aesthetic 2D Barcode on Screen. 8687-8698 - Rong-Cheng Tu

, Xian-Ling Mao
, Qinghong Lin, Wenjin Ji, Weize Qin, Wei Wei
, Heyan Huang
:
Unsupervised Cross-Modal Hashing via Semantic Text Mining. 8946-8957 - Jun Xiao

, Xinyang Jiang
, Ningxin Zheng, Huan Yang
, Yifan Yang
, Yuqing Yang
, Dongsheng Li
, Kin-Man Lam
:
Online Video Super-Resolution With Convolutional Kernel Bypass Grafts. 8972-8987 - Quanling Meng

, Shengping Zhang
, Zonglin Li
, Chenyang Wang
, Weigang Zhang
, Qingming Huang
:
Automatic Shadow Generation via Exposure Fusion. 9044-9056 - Yukun Zuo

, Hantao Yao
, Liansheng Zhuang
, Changsheng Xu
:
Dual Structural Knowledge Interaction for Domain Adaptation. 9057-9070 - Xue-Ying Ding

, Xiao-Qian Liu
, Xin Luo
, Xin-Shun Xu
:
DOC: Text Recognition via Dual Adaptation and Clustering. 9071-9081 - Kaiyi Luo

, Chao Zhang
, Huaxiong Li
, Xiuyi Jia
, Chunlin Chen
:
Adaptive Marginalized Semantic Hashing for Unpaired Cross-Modal Retrieval. 9082-9095 - Yuxiang Yang

, Xing Tian
, Wing W. Y. Ng
, Ying Gao
:
Knowledge Distillation Hashing for Occluded Face Retrieval. 9096-9107 - Dongyun Lin

, Yiqun Li, Yi Cheng
, Shitala Prasad, Aiyuan Guo, Yanpeng Cao
:
Multi-Range View Aggregation Network With Vision Transformer Feature Fusion for 3D Object Retrieval. 9108-9119 - Peiguang Jing

, Kai Cui
, Weili Guan
, Liqiang Nie
, Yuting Su:
Category-Aware Multimodal Attention Network for Fashion Compatibility Modeling. 9120-9131 - Yuan Zhang

, Lingjun Pu
, Tao Lin
, Jinyao Yan
:
QoE-Oriented Mobile Virtual Reality Game in Distributed Edge Networks. 9132-9146 - Hao Chen

, Xiu-Shen Wei
, Liang Xiao:
Prototype Learning for Automatic Check-Out. 9147-9160 - Yanzhao Xie, Rukai Wei

, Jingkuan Song
, Yu Liu
, Yangtao Wang
, Ke Zhou:
Label-Affinity Self-Adaptive Central Similarity Hashing for Image Retrieval. 9161-9174 - Guanyu Zhu

, Yong Zhou
, Rui Yao
, Hancheng Zhu
:
Cross-Class Bias Rectification for Point Cloud Few-Shot Segmentation. 9175-9188 - Jie Guo

, Meiting Wang, Yan Zhou
, Bin Song
, Yuhao Chi
, Wei Fan, Jianglong Chang:
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval. 9189-9202 - Shunxin Xiao

, Shide Du
, Zhaoliang Chen
, Yunhe Zhang
, Shiping Wang
:
Dual Fusion-Propagation Graph Neural Network for Multi-View Clustering. 9203-9215 - Tiankai Hang

, Huan Yang
, Bei Liu
, Jianlong Fu
, Xin Geng
, Baining Guo
:
Language-Guided Face Animation by Recurrent StyleGAN-Based Generator. 9216-9227 - Wenda Zhao

, Fei Wei, Haipeng Wang
, You He
, Huchuan Lu
:
Full-Scene Defocus Blur Detection With DeFBD+ via Multi-Level Distillation Learning. 9228-9240 - Yanxiong Li

, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He:
Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction. 9241-9253 - Qiang Zhou

, Chaohui Yu
:
Object Detection Made Simpler by Eliminating Heuristic NMS. 9254-9262 - Yingjie Song

, Zhi Liu
, Gongyang Li
, Dan Zeng
, Tianhong Zhang, Lihua Xu, Jijun Wang:
RINet: Relative Importance-Aware Network for Fixation Prediction. 9263-9277 - Fuyun Wang, Xingyu Gao

, Zhenyu Chen
, Lei Lyu
:
Contrastive Multi-Level Graph Neural Networks for Session-Based Recommendation. 9278-9289 - Shuwei Huo

, Yuan Zhou
, Ruolin Wang
, Wei Xiang
, Sun-Yuan Kung
:
Semantic Relevance Learning for Video-Query Based Video Moment Retrieval. 9290-9301 - Pu Li

, Marie A. Roch, Holger Klinck
, Erica Fleishman, Douglas Gillespie
, Eva-Marie Nosal
, Yu Shiu
, Xiaobai Liu
:
Learning Stage-Wise GANs for Whistle Extraction in Time-Frequency Spectrograms. 9302-9314 - Ziqiang Wu

, Bingpeng Ma
, Hong Chang
, Shiguang Shan
:
Refined Knowledge Transfer for Language-Based Person Search. 9315-9329 - Wenbin Wang

, Maurice Pagnucco, Chengpei Xu
, Yang Song
:
InterREC: An Interpretable Method for Referring Expression Comprehension. 9330-9342 - Kang Liu

, Feng Xue
, Dan Guo
, Peijie Sun
, Shengsheng Qian
, Richang Hong
:
Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation. 9343-9355 - Jieyan Liu, Hongcai He, Mingzhu Liu

, Jingjing Li
, Ke Lu
:
Manifold Regularized Joint Transfer for Open Set Domain Adaptation. 9356-9369 - Jingyuan Zhu

, Huimin Ma
, Jiansheng Chen
, Jian Yuan
:
MotionVideoGAN: A Novel Video Generator Based on the Motion Space Learned From Image Pairs. 9370-9382 - Weizhi Nie

, Chuanqi Jiao, Rihao Chang
, Lei Qu
, An-An Liu
:
CPG3D: Cross-Modal Priors Guided 3D Object Reconstruction. 9383-9396 - Zikang Yuan, Junda Cheng, Xin Yang

:
CR-LDSO: Direct Sparse LiDAR-Assisted Visual Odometry With Cloud Reusing. 9397-9409 - Xiao Lv

, Tao Xiang
, Ying Yang
, Hantao Liu
:
Blind Dehazed Image Quality Assessment: A Deep CNN-Based Approach. 9410-9424 - Kun Xia

, Le Wang
, Yichao Shen, Sanping Zhou
, Gang Hua
, Wei Tang
:
Exploring Action Centers for Temporal Action Localization. 9425-9436 - Jiawei Liu

, Qiang Wang
, Huijie Fan
, Wentao Li, Liangqiong Qu
, Yandong Tang
:
A Decoupled Multi-Task Network for Shadow Removal. 9449-9463 - Xiaobao Guo

, A. C. Kot
, Adams Wai-Kin Kong
:
Pace-Adaptive and Noise-Resistant Contrastive Learning for Multimodal Feature Fusion. 9437-9448 - Yangbo Feng, Junyu Gao

, Shicai Yang
, Changsheng Xu
:
Spatial-Temporal Exclusive Capsule Network for Open Set Action Recognition. 9464-9478 - Yongchun Chen, Min Liu

, Xueping Wang
, Fei Wang, An-An Liu
, Yaonan Wang
:
Refining Noisy Labels With Label Reliability Perception for Person Re-Identification. 9479-9490 - Sebastiano Verde

, Cecilia Pasquini
, Federica Lago
, Alessandro Goller, Francesco G. B. De Natale
, Alessandro Piva
, Giulia Boato
:
Multi-Clue Reconstruction of Sharing Chains for Social Media Images. 9491-9505 - Shideng Lin, Fan Tang

, Weiming Dong
, Xingjia Pan
, Changsheng Xu
:
SMNet: Synchronous Multi-Scale Low Light Enhancement Network With Local and Global Concern. 9506-9517 - Yunbin Tu

, Liang Li
, Li Su
, Ke Lu
, Qingming Huang
:
Neighborhood Contrastive Transformer for Change Captioning. 9518-9529 - Xiaoqing Liu

, Huanqiang Zeng
, Yifan Shi
, Jianqing Zhu
, Chih-Hsien Hsia, Kai-Kuang Ma
:
Deep Cross-Modal Hashing Based on Semantic Consistent Ranking. 9530-9542 - Zhou Yu

, Zitian Jin
, Jun Yu
, Mingliang Xu
, Hongbo Wang
, Jianping Fan
:
Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering. 9543-9556 - Nian Hu

, Xiangdong Huang
, Wenhui Li
, Xuanya Li
, An-An Liu
:
Cross-Domain Image-Object Retrieval Based on Weighted Optimal Transport. 9557-9571 - Chen Wan

, Fangjun Huang
, Xianfeng Zhao
:
Average Gradient-Based Adversarial Attack. 9572-9585 - Hao Liu

, Mei Ma
, Zixian Gao, Zongyong Deng
, Fengjun Li, Zhendong Li
:
Siamese Graph Learning for Semi-Supervised Age Estimation. 9586-9596 - Xiaodong Wang

, Zhedong Zheng
, Yang He
, Fei Yan, Zhiqiang Zeng
, Yi Yang:
Progressive Local Filter Pruning for Image Retrieval Acceleration. 9597-9607
