


default search action
ICIP 2024: Abu Dhabi, UAE
- IEEE International Conference on Image Processing, ICIP 2024, Abu Dhabi, United Arab Emirates, october 27-30, 2024. IEEE 2024, ISBN 979-8-3503-4940-5

- Jeongwoo Park, Je Hyeong Hong:

HoloGesture: A Multimodal Dataset For Hand Gesture Recognition Robust To Hand Textures On Head-Mounted Mixed-Reality Devices. 1-7 - Shuai Guo, Houqiang Zhong

, Qiuwen Wang, Ziyu Chen, Yijie Gao, Jiajing Yuan, Chenyu Zhang, Rong Xie, Li Song:
A New People-Object Interaction Dataset and NVS Benchmarks. 8-14 - Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu

, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai:
Thqa: A Perceptual Quality Assessment Database for Talking Heads. 15-21 - Chengyan Zhang, Rahul Chaudhari:

VR-Based Generation of Photorealistic Synthetic Data for Training Hand-Object Tracking Models. 22-28 - Fengbo Lan, Chang Wen Chen:

Removing Reflective Flare in Real-World Conditions. 29-33 - Lukas Ewecker, Florian Schiffel

, Robin Schwager, Tim Brühl, Tin Stribor Sohn, Thomas Villmann:
PVDN-Urban - A Dataset for Provident Vehicle Detection at Night in Urban Scenarios. 34-40 - Alexander Jaus, Constantin Seibold, Kelsey Hermann, Negar Shahamiri, Alexandra Walter

, Kristina Giske
, Johannes Haubold, Jens Kleesiek, Rainer Stiefelhagen:
Towards Unifying Anatomy Segmentation: Automated Generation of a Full-Body CT Dataset. 41-47 - Cheng-Han Lee, Maniratnam Mandal, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik:

Subjective Portrait Region Cropping On Landscape Video Study. 48-54 - Shenlu Jiang, Yuxin Bian

, Yiran Wang, Xufeng Li, Zhankeng Liu, Yi Ren, Yunxuan Zhao:
EarthquakeNet: A High-Resolution UAV-Based Dataset for Earthquake Damage Assessment. 55-61 - Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh:

Bri3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception. 62-68 - Karen Sanchez, Carlos Hinojosa

, Olinto Mieles, Chen Zhao, Bernard Ghanem
, Henry Arguello:
Co2Wounds-V2: Extended Chronic Wounds Dataset from Leprosy Patients. 69-75 - Anuja Vats, Bilal Ahmad, Pål Anders Floor, Ahmed Kedir Mohammed, Marius Pedersen, Øistein Hovde:

CAPTIV8: A Comprehensive Large Scale Capsule Endoscopy Dataset For Integrated Diagnosis. 76-82 - Bowen Chen, Zaixi Shang, Alan C. Bovik, Jae Won Chung, David Lerner:

A Real-World Satellite Video Subjective QOE Database. 83-88 - Mariusz Wisniewski, Loris Giulivi, Giacomo Boracchi:

SE3D: A Framework for Saliency Method Evaluation in 3D Imaging. 89-95 - Yilin Wang, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli:

Youtube SFV+HDR Quality Dataset. 96-102 - Ziang Shi, Yang Xiao, Da Yan, Min-Te Sun, Wei-Shinn Ku, Bo Hui:

Bmt-Bench: A Benchmark Sports Dataset For Video Generation. 103-109 - Risa Shinoda, Kaede Shiohara:

OpenAnimalTracks: A Dataset for Animal Track Recognition. 110-116 - Ali Ak, Emin Zerman, Maurice Quach, Aladine Chetouani, Giuseppe Valenzise, Patrick Le Callet:

A Toolkit to Benchmark Point Cloud Quality Metrics with Multi-Track Evaluation Criteria. 117-123 - Paula Moral

, Álvaro García-Martín, José M. Martínez:
Long-Term Geo-Positioned Re-Identification Dataset of Urban Elements. 124-130 - Ahmed Telili, Ibrahim Farhat, Wassim Hamidouche, Hadi Amirpour:

ODVISTA: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement Tasks. 131-136 - Javier Montalvo, Pablo Carballeira, Álvaro García-Martín:

Synthmanticlidar: A Synthetic Dataset For Semantic Segmentation On Lidar Imaging. 137-143 - Alessio Barbaro Chisari

, Alessandro Ortis, Luca Guarnera, Wladimiro Carlo Patatu, Rosaria Ausilia Giandolfo, Emanuele Spampinato, Sebastiano Battiato, Mario Valerio Giuffrida:
On the Cloud Detection from Backscattered Images Generated from a Lidar-Based Ceilometer: Current State and Opportunities. 144-150 - Daniel Pisani, Dylan Seychell

, Carl James Debono, Michael Schembri:
SODA: A Dataset for Small Object Detection in UAV Captured Imagery. 151-157 - Daniel Batrakhanov, Tuomas Eerola, Kaisa Kraft, Lumi Haraguchi, Lasse Lensu, Sanna Suikkanen, María Teresa Camarena-Gómez, Jukka Seppälä, Heikki Kälviäinen

:
DAPlankton: Benchmark Dataset For Multi-Instrument Plankton Recognition Via Fine-Grained Domain Adaptation. 158-164 - Pierre R. Lebreton, Patrick Le Callet, Neil Birkbeck, Yilin Wang, Balu Adsumilli:

A Dataset for Understanding Open UGC Video Datasets. 165-171 - Bideaux Maxence, Phe Alice, Mohamed Chaouch, Luvison Bertrand, Quoc-Cuong Pham:

3D-COCO: Extension of MS-COCO Dataset for Scene Understanding and 3D Reconstruction. 172-178 - Nikhil Kumar, Avinash Upadhyay, Shreya Sharma, Manoj Sharma, Pravendra Singh:

MWIRSTD: A MWIR Small Target Detection Dataset. 179-185 - Avinash Upadhyay, Bhipanshu Dhupar, Manoj Sharma, Ankit Shukla

, Ajith Abraham:
LWIRPOSE: A Novel Long Wave Infrared Thermal Image Pose Dataset and Benchmark. 186-192 - Niccolò Bisagno, Antonio Luigi Stefani, Nicola Garau, Francesco G. B. De Natale, Nicola Conci:

Unicrowd Simulator: Visual and Behavioral Fidelity For The Generation of Crowd Datasets. 193-199 - Zhilong Li, Kejun Wu, Junhao Liu, Qiong Liu, You Yang:

Multi-View Multi-Focus Image Fusion: A Novel Benchmark Dataset and Method. 200-206 - Onur Keles, A. Murat Tekalp:

Paon: A New Neuron Model Using Padé Approximants. 207-213 - Joao O. Parracho, Eduardo A. B. da Silva, Lucas A. Thomaz

, Luis M. N. Tavora, Sérgio M. M. Faria
:
Non-Separablewavelet Transform Using Learnable Convolutional Lifting Steps. 214-220 - Théo Rudkiewicz, Mohamed Ouerfelli, Riccardo Finotello, Zakariya Chaouai, Mohamed Tamaazousti:

Robustness of Tensor Decomposition-Based Neural Network Compression. 221-227 - Yavuz Yarici, Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib:

Explaining Representation Learning With Perceptual Components. 228-234 - Meghna P. Ayyar, Jenny Benois-Pineau, Akka Zemmari:

ET: Explain to Train: Leveraging Explanations to Enhance the Training of A Multimodal Transformer. 235-241 - Aniket Singh, Anoop M. Namboodiri:

Saliency As A Schedule: Intuitive Image Attribution. 242-248 - Shaurya Gupta, Neil Gautam, Anurag Malyala

:
ATAC-NET: Zoomed View Works Better for Anomaly Detection. 249-255 - Chengdao Pu, Jun Yu, Wen Su, Tianyu Liu:

Rotated R-CNN: A Two-Stage Object Detection Method Adapted To Oriented Bounding Boxes. 256-262 - Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Syed Sameed Husain, Muhammad Awais:

Masked Momentum Contrastive Learning for Semantic Understanding by Observation. 263-269 - Kiran Kokilepersaud, Yavuz Yarici, Mohit Prabhushankar, Ghassan AlRegib:

Taxes are All You Need: Integration Of Taxonomical Hierarchy Relationships Into the Contrastive Loss. 270-276 - Rui Yang

, Emmanuel Dellandréa, Matthieu Grard, Liming Chen:
Imbalanced Data Robust Online Continual Learning Based on Evolving Class Aware Memory Selection and Built-In Contrastive Representation Learning. 277-283 - Cheng Feng, Chaoliang Zhong, Jie Wang, Jun Sun, Yasuto Yokota:

Conditional Past Experience Generation for Dark Continual Learning. 284-290 - Kebin Liu

, Chuang Zhu:
Unsupervised Domain Adaptive Semantic Segmentation Based on Clip-Guided Prototypical Contrastive Learning. 291-297 - Zhipeng Zhang

, Wenting Ma, Xiaohang Yuan, Yuan Hao, Meng Guo, Hongyi Tang, Zhiheng Zhou, Zhenjie Yao:
Instance-Aware Uncertainty for Active Learning in Object Detection. 298-304 - Takumi Karasawa, Nakamasa Inoue, Rei Kawakami:

Spatiality-Aware Prompt Tuning for Few-Shot Small Object Detection. 305-311 - Jiyong Jang, Hayeon Lee, Younkwan Lee:

Disentangled Knowledge Distillation for Unified Multi-Class Anomaly Detection. 312-318 - Georgios-Fotios Angelis

, Alexandros Emvoliadis, Anastasios Drosou, Dimitrios Tzovaras:
MMAQ: A Multi-Modal Self-Supervised Approach For Estimating Air Quality From Remote Sensing Data. 319-325 - Zihao Li, Ning Luo, Xiwen Zhang, Ziliang Guo, Xingqi Fang, Yu Qiao:

Crowdassign: A Label Assignment Scheme for Pedestrian Detection in Crowded Scenes. 326-331 - Taiga Yamane, Satoshi Suzuki, Ryo Masumura, Shotaro Tora:

MVAFormer: RGB-Based Multi-View Spatio-Temporal Action Recognition with Transformer. 332-338 - Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko:

Prune Channel And Distill: Discriminative Knowledge Distillation For Semantic Segmentation. 339-345 - Wenrui Hu, Yuan Xie, Wei Yu:

TDAD: Trident Distillations for Anomaly Detection. 346-352 - Farhad G. Zanjani, Hong Cai, Yinhao Zhu, Leyla Mirvakhabova, Fatih Porikli

:
Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding. 353-359 - Xueyuan Chen, Baojiang Zhong:

Contrast-Guided Wireframe Parsing. 360-366 - Ezgi Paket, Inci M. Baytas:

Adversarial Robustness for Deep Metric Learning. 367-373 - Pengfeng Lu, Sei-Ichiro Kamata, Mengyunqiu Zhang, Weilian Zhou:

Adversarial Detection Transformer For Kuzushiji Recognition. 374-380 - Nada Baili, Hichem Frigui:

Improving Automatic Target Recognition With Infrared Imagery Using Vision Transformers and Focused Data Augmentation. 381-387 - Hiroaki Tani:

Graph Convolutional Networks With Minimal Appearance Information For Action Recognition. 388-394 - Jamil Ahmad, Wail Gueaieb, Abdulmotaleb El-Saddik, Giulia De Masi, Fakhri Karray:

Knowledge-Infused Learning for Fine-Grained Plant Disease Recognition. 395-401 - Yaobin Huang, Hongxia Gao, Xiaomeng Li:

Adaptxray: Vision Transformer And Adapter In X-Ray Images For Prohibited Items Detection. 402-408 - Zehai Wu, Lijie Sheng, Songnian Zhang, Qiguang Miao:

Fusion of Independent and Interactive Features for Human-Object Interaction Detection. 409-415 - Amrutha Machireddy, Ranganath Krishnan, Athmanarayanan Lakshmi Narayanan, Omesh Tickoo:

Source-Free Continual Adaptive Learning With Limited Labels on Evolving Data Drifts. 416-422 - Youwei Zhang, Jing Jiang, Yuying Zhao, Kongming Liang:

SLNL: Soft Label Regularization For Semi-Supervised Facial Expression Recognition With Negative Label Learning. 423-429 - Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:

Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question Answering. 430-436 - Stanislav Dereka, Ivan Karpukhin, Maksim Zhdanov

, Sergey Kolesnikov:
Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy. 437-443 - Jesus Franco-Robles, Jorge E. Avilés-Mejia, Ouiddad Labbani-Igbida:

Crocos-V1: Enhancing Mask Leakage and Bounding Box Localization for Real-Time Crop/Weed Instance Segmentation. 444-450 - Yiming Chen, Nan He, Lifeng Sun:

FedAWA: Aggregation Weight Adjustment in Federated Domain Generalization. 451-457 - Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos:

Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD. 458-464 - Saif Hassan, Mohib Ullah, Ali Shariq Imran, Ghulam Mujtaba, Muhammad Mudassar Yamin, Ehtesham Hashmi

, Faouzi Alaya Cheikh, Azeddine Beghdadi:
A Self-Supervised Diffusion Framework For Facial Emotion Recognition. 465-471 - Bo Hu, Yuheng Bu, José C. Príncipe:

Learning Orthonormal Features in Self-Supervised Learning using Functional Maximal Correlation. 472-478 - Tanapat Ratchatorn

, Masayuki Tanaka:
Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization. 479-485 - Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:

Reinforcing Pre-Trained Models Using Counterfactual Images. 486-492 - Giovanni Bellitto, Renato Sortino

, Paolo Spadaro, Simone Palazzo, Federica Proietto Salanitri
, Giuseppe Fiameni, Efstratios Gavves, Concetto Spampinato:
Vito: Vision Transformer Optimization Via Knowledge Distillation On Decoders. 493-499 - Shuyun Lu, Jian Jiao, Lanxiao Wang, Heqian Qiu, Xingtao Lin, Hefei Mei, Hongliang Li:

Video Class-Incremental Learning With Clip Based Transformer. 500-506 - Michihiro Kuroki, Toshihiko Yamasaki:

Explaining 3D Object Detection Through Shapley Value-Based Attribution Map. 507-513 - Pasquale Coscia, Angelo Genovese

, Fabio Scotti
, Vincenzo Piuri:
Features Disentanglement For Explainable Convolutional Neural Networks. 514-520 - Seyedalireza Khoshsirat, Chandra Kambhamettu:

Embedding Attention Blocks For Answer Grounding. 521-527 - Bhushan Atote, Victor Sanchez:

Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes. 528-534 - Philip Keller, Daniel Jost, Arne Roennau

, Rüdiger Dillmann:
Interactive Teaching For Fine-Granular Few-Shot Object Recognition Using Vision Transformers. 535-541 - Jing Ma, Meng Lin, Gang Zhou, Zhenhong Jia:

Joint Image Restoration For Domain Adaptive Object Detection In Foggy Weather Condition. 542-548 - Hyunjun Kim, Dahye Lee, Sungjune Park, Yong Man Ro:

Weather-Aware Drone-View Object Detection Via Environmental Context Understanding. 549-555 - Xi Tao, Ke Qi, Peijia Chen, Wenhao Xu, Yutao Qi:

Sparse Transformer Refinement Similarity Map for Aerial Tracking. 556-562 - Chih-Chung Hsu

, Ming-Hsuan Wu, En-Chao Liu:
LFGN: Low-Level Feature-Guided Network For Adversarial Defense. 563-567 - Huiwang Liu, Yan Huang, Linlin Zeng, Ya Li:

Towards Robust Person Re-Identification Via Efficient and Generalized Adversarial Training. 568-574 - Zheng Wang, Yingjie Gao, Qingjie Liu, Yunhong Wang:

Semantic Enhanced Few-Shot Object Detection. 575-581 - Mariia Khan, Yue Qiu, Yuren Cong, Bodo Rosenhahn, Jumana Abu-Khalaf, David Suter:

Segment Any Object Model (SAOM): Real-To-Simulation Fine-Tuning Strategy For Multi-Class Multi-Instance Segmentation. 582-588 - Abhishek Saini, Sajjad Moazeni:

Accelerating Cascade Classifier Training with Genetic Algorithms for Edge ML Applications. 589-595 - Satoshi Suzuki, Shotaro Tora, Ryo Masumura:

Scene Generalized Multi-View Pedestrian Detection with Rotation-Based Augmentation and Regularization. 596-602 - Aratrik Chattopadhyay, Harshita Soni, Shuaib Ahmed:

Lercpose: Learned Ranking and Contrastive Loss for Robust Head Pose Estimation. 603-609 - Erik Brorsson

, Knut Åkesson, Lennart Svensson, Kristofer Bengtsson:
ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation. 610-616 - Efe Ozturk, Mohit Prabhushankar, Ghassan AlRegib:

Intelligent Multi-View Test Time Augmentation. 617-623 - Danyang Sun

, Fadi Dornaika, Vinh Truong Hoang
, Nagore Barrena:
Superpixel Mixing: A Data Augmentation Technique For Robust Deep Visual Recognition Models. 624-630 - Ling Yue, Lin Feng, Qiuping Shuai, Lingxiao Xu, Zihao Li:

Diversified Task Augmentation with Redundancy Reduction for Cross-Domain Few-Shot Learning. 631-637 - Minju Kang, Taehun Kong, Tae-Kyun Kim:

Semi-Supervised 3D Object Detection With Channel Augmentation Using Transformation Equivariance. 638-644 - Shihao Zeng, Xinghong Liu, Yi Zhou:

Decoupling Domain Invariance and Variance With Tailored Prompts for Open-Set Domain Adaptation. 645-651 - Daniel Brignac, Abhijit Mahalanobis:

Cascading Unknown Detection With Known Classification For Open Set Recognition. 652-658 - Kun Dong, Jian Xue, Xing Lan, Ke Lu:

3Dlaneformer: Rethinking Learning Views for 3D Lane Detection. 659-665 - Amira Guesmi, Ioan Marius Bilasco, Muhammad Shafique

, Ihsen Alouani
:
AdvART: Adversarial Art for Camouflaged Object Detection Attacks. 666-672 - Qi Zeng, Chongren Zhao, Pengfei He, Hongchao Gao:

LSDM-PCB: A Lightweight Small Defect Detection Model for Printed Circuit Board. 673-679 - Yu-Ming Zhang

, Jun-Wei Hsieh, Yu-Hsiu Chang, Xin Li, Ming-Ching Chang, Chun-Chieh Lee, Kuo-Chin Fan:
Set-Nas: Sample-Efficient Training For Neural Architecture Search With Strong Predictor And Stratified Sampling. 680-686 - Tingzhang Luo, Mingxuan Du, Jiatao Shi, Xinxiang Chen, Bingchen Zhao, Shaoguang Huang:

Contextuality Helps Representation Learning for Generalized Category Discovery. 687-693 - Kateryna Chumachenko, Alexandros Iosifidis

, Moncef Gabbouj
:
Uimt: A Framework for Improving Unimodal Inference via Multimodal Training. 694-700 - Jingyi Liao, Xun Xu, Chuan-Sheng Foo, Lile Cai:

Box-Level Class-Balanced Sampling For Active Object Detection. 701-707 - Hasib Zunair, Md Shakib Khan, A. Ben Hamza:

Rsud20K: a Dataset for Road Scene Understanding in Autonomous Driving. 708-714 - Lixing Xiao, Ruixiao Shi, Xiaoyang Tang, Yi Zhou:

Multimodal-Enhanced Objectness Learner For Corner Case Detection In Autonomous Driving. 715-721 - Francesco Barbato, Elena Camuffo, Simone Milani, Pietro Zanuttigh:

Continual Road-Scene Semantic Segmentation Via Feature-Aligned Symmetric Multi-Modal Network. 722-728 - Sheng Luo, Yi Zhou:

Open World Object Detection Via Cooperative Foundation Models for Driving Scenes. 729-735 - Keigo Kunikata, Amane Kashino, Yota Yamamoto, Yukinobu Taniguchi, Yoko Sogabe, Ayumi Matsumoto, Masaki Kitahara, Go Irie:

Pose-Invariant Learning for Efficient Person Identification from Hyperspectral Hand Images. 736-740 - Kexuan Wang, Chenhua Liu, Huiguang Wei, Li Jing, Rongfu Zhang:

RFNET: Refined Fusion Three-Branch RGB-D Salient Object Detection Network. 741-746 - Mohamad Alansari, Ahmed Abughali, Obadah Habash, Khaled Alnuaimi, Sajid Javed, Naoufel Werghi:

Integrating Vision-Language Supervision for Uniform Appearance Tracking. 747-752 - Abdelfatah Hassan Ahmed

, Divya Velayudhan
, Mahmoud Elmezain, Muaz Al Radi, Abderrahmene Boudiaf, Taimur Hassan, Mohamed Deriche, Mohammed Bennamoun
, Naoufel Werghi:
CLIFS: Clip-Driven Few-Shot Learning for Baggage Threat Classification. 753-759 - Pengfei Li, Muaz Al Radi, Mahmoud Said Elmezain, Abdelfatah Hassan Ahmed

, Abderrahmene Boudiaf, Said Boumaraf, Jorge Dias, Hamad Karki, Sajid Javed, Khalid Yousef Al Awadhi, Naoufel Werghi:
SMO-CLIP: Enhancing Anomalous Smoke Density Assessment Using A Hybrid LLM-VLM Approach. 760-765 - Hanan Gani, Nada Saadi, Noor Hussein, Karthik Nandakumar:

Multi-Attribute Vision Transformers are Efficient and Robust Learners. 766-772 - Wentao Hu, Jiarun Liu

, Jiawei Wang, Hui Tian:
Meta-DM: Applications of Diffusion Models on Few-Shot Learning. 773-779 - Qun Zhao, Yuan-Gen Wang:

Universal Black-Box Adversarial Patch Attack with Optimized Genetic Algorithm. 780-786 - Junshuai Zheng

, Yichao Zhou, Xiyuan Hu, Zhenmin Tang:
Deepfake Detection With Combined Unsupervised-Supervised Contrastive Learning. 787-793 - Thomas Gittings, Steve Schneider, John P. Collomosse:

SegGuard: Defending Scene Segmentation Against Adversarial Patch Attack. 794-800 - Akshay Agarwal, Nalini K. Ratha:

Face Morphing Detection in Social Media Content. 801-806 - Xu Tan, Junqi Chen, Jiawei Yang, Sylwan Rahardja, Mou Wang, Susanto Rahardja

:
Ensemble of Deep Variational Mixture Models for Unsupervised Clustering. 807-813 - Lili Zhao, Zhili Liu, Qian Yin

, Lei Yang
, Meng Guo:
Towards Robust Visual Localization Using Multi-View Images and HD Vector Map. 814-820 - Wenjing Yang

, Abd-Krim Seghouane, Pavel Krupskiy
:
An α-Divergence Approach To Robust Canonical Correlation Analysis. 821-827 - Fatemeh Amerehi

, Patrick Healy:
VF-Net: Robustness Via Understanding Distortions and Transformations. 828-834 - Yirui Yang, Xubin Lin, Li He, Yisheng Guan, Hong Zhang:

Factorized Embedding Graph Matching Network For Learning Lawler's Quadratic Assignment Problem. 835-841 - Shota Sugawara

, Ryuji Imamura:
PUAD: Frustratingly Simple Method for Robust Anomaly Detection. 842-848 - Jacob Fein-Ashley, Sachini Wickramasinghe, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna:

A Single Graph Convolution is All You Need: Efficient Grayscale Image Classification. 849-855 - Gan Yang, Zhaohui Wang:

Light-Weight Self-Supervised Contrastive Learning Network For Small Sample Hyperspectral Image Classification. 856-861 - Long Yu, Jun Li, Li Zhuo:

MSSPG-AL: Few-Shot Hyperspectral Image Classification with Active Learning Updated Multi-Scale Superpixel Graph Fusion. 862-867 - Francesco Longobardi

, Daniel Riccio:
Graphic - Graph-Based Representation for Analyzing People's High-Level Interactions in Crowds. 868-874 - Xuezhi Xiang, Yiming Chen

, Denis Ombati, Lei Zhang, Xiantong Zhen:
Deep Optical Flow Learning With Deformable Large-Kernel Cross-Attention. 875-879 - Vahid Jebraeeli, Bo Jiang

, Derya Cansever, Hamid Krim
:
Koopcon: A new approach towards smarter and less complex learning. 880-886 - Behnam Rahmati, Shahram Shirani, Zahra Keshavarz-Motamed:

Medical Knowledge-Guided Semi-Supervised Bi-Ventricular Segmentation. 887-893 - Ketan Kotwal, Tanay Deshmukh, Preeti Gopal:

Latent Enhancing Autoencoder for Occluded Image Classification. 894-900 - Po-Hsuan Huang, Chia-Ching Lin, Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen:

Learning With Instance-Dependent Noisy Labels By Anchor Hallucination And Hard Sample Label Correction. 901-907 - Aditya Humnabadkar, Arindam Sikdar

, Huaizhong Zhang
, Tanveer Hussain, Ardhendu Behera
:
Driving Through Graphs: a Bipartite Graph for Traffic Scene Analysis. 908-914 - Xingtao Lin, Chuanyang Gong, Lanxiao Wang, Heqian Qiu, Shengyu Tong, Hongliang Li:

A Text Detector Based on the Specific Text Prompt. 915-921 - Maryam Rahnemoonfar:

Deep Spectral Siamese Network For Heterogeneous Object Verification In Amazon Robotic Warehouse. 922-928 - Nandish Chattopadhyay, Amira Guesmi, Muhammad Shafique

:
Anomaly Unveiled: Securing Image Classification against Adversarial Patch Attacks. 929-935 - Umamaheswaran Raman Kumar

, Patrick Vandewalle:
Similarity-Weighted IoU (sIOU): A Comprehensive Metric for Evaluating Model Performance Through Similarity-Weighted Class Overlaps. 936-942 - Honori Udo, Takafumi Koshinaka:

Reading is Believing: Revisiting Language Bottleneck Models for Image Classification. 943-949 - Jun Chen, Yiwei Wang, Haiyan Zhang:

Norm-Integrated Softmax Loss For Deep Face Recognition. 950-956 - Haoang Ren, Mengke Tian, Guanwen Zhang, Wei Zhou:

A Multi-Scale Feature Fusion Network for Chip Surface Defect Detection. 957-962 - Jiahao Wang

, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang:
Power-Llava: Large Language and Vision Assistant for Power Transmission Line Inspection. 963-969 - Chanyeong Park, Junbo Jang, Heegwang Kim, Joonki Paik:

Enhanced Detection of Small Objects in Aerial Imagery: A High-Resolution Neural Network Approach With Amplified Feature Pyramid and Sigmoid Re-Weighting. 970-976 - Berker Demirel, Huseyin Ozkan:

Decompl: Decompositional Learning with Attention Pooling for Group Activity Recognition from a Single Volleyball Image. 977-983 - Chi-Han Chen, Chieh-Ming Chen, Wen-Huang Cheng, Ching-Chun Huang:

Aerial View River Landform Video Segmentation: A Weakly Supervised Context-Aware Temporal Consistency Distillation Approach. 984-990 - Boon Yin Yin, Nurul Japar

:
Salient Guided Text Detection in E-Commerce Images. 991-997 - Jen-Hao Cheng, Sheng-Yao Kuan, Hou-I Liu, Hugo Latapie, Gaowen Liu, Jenq-Neng Hwang:

CenterRadarNet: Joint 3D Object Detection and Tracking Framework Using 4D FMCW Radar. 998-1004 - Hyungtae Lee, Yan Zhang, Heesung Kwon, Shuvra S. Bhattacharyya:

Exploring the Potential of Synthetic Data to Replace Real Data. 1005-1011 - Yi-Kuan Hsieh, Jun-Wei Hsieh, Ying-Yu Chen:

Class-Specific Channel Attention For Few Shot Learning. 1012-1018 - Jun Chen, Wei Yu, Xin Tian, Jun Huang, Jiayi Ma:

Mdbfusion: A Visible And Infrared Image Fusion Framework Capable For Motion Deblurring. 1019-1025 - Gangqi Chen, Zhaoyong Mao, Junge Shen:

Advanced Object Detection in Multibeam Forward-Looking Sonar Images Using Linear Cross-Attention Techniques. 1026-1031 - Zhigang Yang, Yiming Liu, Zehao Gao, Jiayue He, Tao Chen, Wei Emma Zhang:

Attention Enhancement With Parallel Groups for Remote Sensing Object Detection. 1032-1036 - Bilal Faye, Hanane Azzag

, Mustapha Lebbah, Djamel Bouchaffra:
Adaptative Context Normalization: A Boost for Deep Learning in Image Processing. 1037-1043 - Nan Yang, Zihan Li, Zhen Long, Xiaolin Huang, Ce Zhu, Yipeng Liu:

Efficient Black-Box Adversarial Attack on Deep Clustering Models. 1044-1049 - Jeongjin Shin

:
Mask-Based Invisible Backdoor Attacks on Object Detection. 1050-1056 - Indu Solomon, Aye Phyu Phyu Aung, Uttam Kumar, Senthilnath Jayavelu

:
U-Tell: Unsupervised Task Expert Lifelong Learning. 1057-1063 - Kuan Zhou, Zhenyu Xu, Qieshi Zhang, Jun Cheng, Ziliang Ren, Xiangyang Gao:

AAGF: An Efficient Transformer With Mix-Features For Visual Place Recognition. 1064-1070 - Jiwon Yoo, Jangwon Lee, Gyeonghwan Kim:

A Decoding Scheme With Successive Aggregation of Multi-Level Features For Light-Weight Semantic Segmentation. 1071-1077 - Nazanin Moradinasab

, Hassan Jafarzadeh, Donald E. Brown:
Gengmm: Generalized Gaussian-Mixture-Based Domain Adaptation Model for Semantic Segmentation. 1078-1084 - Koki Mukai, Soichiro Kumano, Nicolas Michel, Ling Xiao

, Toshihiko Yamasaki:
Adversarially Robust Continual Learning with Anti-Forgetting Loss. 1085-1091 - Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu:

Density-Guided Dense Pseudo Label Selection for Semi-Supervised Oriented Object Detection. 1092-1098 - Maria Tzelepi, Vasileios Mezaris:

Online Anchor-Based Training For Image Classification Tasks. 1099-1105 - Blazej Leporowski, Arian Bakhtiarnia, Nicole Bonnici, Adrian Muscat

, Luca Zanella, Yiming Wang
, Alexandros Iosifidis
:
MAVAD: Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos. 1106-1112 - Yunxiang Cao, Li Chen

, Yubo Wang, Zhida Feng, Xiaoming Liu:
Unleashing Fine-Coarse Curve Perception Via Trunk-Branch Perturbation. 1113-1119 - Junqi Chen, Xu Tan, Jiawei Yang, Sylwan Rahardja, Susanto Rahardja

:
FlexAE: A Self-Conditioned Detector To Prevent Model Overfitting For Unsupervised Video Anomaly Detection. 1120-1125 - Wanting Zhang

, Libao Zhang:
Dynamic Activation Function Based on the Branching Process and its Application in Image Classification. 1126-1132 - Liyana Sahir, Anwesha Banerjee, Soma Biswas:

Adaprompt: Prompt Tuning with Adaptive Neighbours for Generalized Category Discovery. 1133-1138 - Linhan Cao, Wei Sun, Xiongkuo Min, Jun Jia, Zicheng Zhang, Zijian Chen, Yucheng Zhu, Lizhou Liu, Qiubo Chen, Jing Chen, Guangtao Zhai:

SG-JND: Semantic-Guided Just Noticeable Distortion Predictor for Image Compression. 1139-1145 - Farhad Pakdaman, Sanaz Nami

, Moncef Gabbouj
:
Perceptual Learned Image Compression via End-to-End JND-Based Optimization. 1146-1151 - Maniratnam Mandal, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik:

Legit: Text Legibility For User-Generated Media. 1152-1158 - Ali Ak, Abhishek Gera, Denise Noyes, Hassene Tmar, Ioannis Katsavounidis, Patrick Le Callet:

Comparison of Crowdsourcing And Laboratory Settings for Subjective Assessment of Video Quality and Acceptability & Annoyance. 1159-1164 - Wael Suliman, Mohamed Deriche, Naoufel Werghi, Azeddine Beghdadi:

A Fusion-Based Approach for Blind Contrast-Enhanced Image Ranking. 1165-1171 - Chanda Grover Kamra, Indra Deep Mastan, Nitin Kumar, Debayan Gupta:

Simsam: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation. 1172-1178 - Ankur Singh, Senthilnath Jayavelu

:
Robust Representation Learning With Self-Distillation For Domain Generalization. 1179-1185 - Tzu-Han Huang, Wen-Jiin Tsai:

An Anchor-Free Contour-Based Method For Instance Segmentation. 1186-1192 - Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais:

Investigating Self-Supervised Methods for Label-Efficient Learning. 1193-1199 - S. Ebrahimkhani, J. Zheng, A. C. Y. Ngo, N.-M. Cheung:

Masked Signal Modeling for Plastic Waste Resin Classification. 1200-1206 - Long Tang

, Liang Yuan, Guoquan Zheng, Zesheng Wang, Guangtao Zhai:
DTSN: No-Reference Image Quality Assessment via Deformable Transformer and Semantic Network. 1207-1211 - Guanghui Yue, Lixin Zhang, Jinxia Zhang, Zhaofei Xu, Shuigen Wang, Tianwei Zhou, Yuanhao Gong, Wei Zhou:

Subjective Quality Assessment of Thermal Infrared Images. 1212-1217 - Weixia Zhang, Chengguang Zhu, Jingnan Gao, Yichao Yan, Guangtao Zhai, Xiaokang Yang:

A Comparative Study of Perceptual Quality Metrics For Audio-Driven Talking Head Videos. 1218-1224 - Duc V. Nguyen, Tran Thuy Hien, Truong Thu Huong:

A Subjective Quality Evaluation of 3D Mesh With Dynamic Level of Detail in Virtual Reality. 1225-1231 - Jianxun Lou, Xinbo Wu

, Yingying Wu, Padraig Corcoran, Gualtiero Colombo, Roger M. Whitaker, Hantao Liu:
A Benchmark of Variance of Opinion Scores in Image Quality Assessment. 1232-1238 - Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu

, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet:
AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images. 1239-1245 - Yajing Pei, Shiyu Huang, Yiting Lu, Xin Li

, Zhibo Chen:
Priorformer: A UGC-VQA Method With Content and Distortion Priors. 1246-1252 - Borhen-Eddine Dakkar, Azeddine Beghdadi, Stefania Colonnese, Naveed Iqbal, Azzedine Zerguine:

Assessing Video Shakiness: A Novel Data And Protocols Framework. 1253-1259 - Mohammed Alsaafin, Musab Alsheikh, Saeed Anwar, Muhammad Usman:

Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency For Blind Image Quality Assessment. 1260-1266 - Abderrezzaq Sendjasni, Mohamed-Chaker Larabi:

Enhancing Perceptual Quality Assessment for 360-Degree Images Based on Adaptive Patch Labeling and Multi-Label Learning. 1267-1273 - Hochang Rhee, Haesoo Chung, Junho Jo, Eunji Lee, Nam Ik Cho:

SANERV: Scene-Adaptive Neural Representation for Videos. 1274-1280 - Bingke Zhu, Hao Li, Changlin Chen, Liujie Hua, Jinqiao Wang:

Estate: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection. 1281-1287 - Lovre Antonio Budimir

, Marko Subasic, Zoran Kalafatic, Sven Loncaric:
Scalable Hypersphere Embedding For Semantic Metric Learning. 1288-1294 - Nattapong Kurpukdee, Adrian G. Bors:

Temporal Transformer Encoder for Video Class Incremental Learning. 1295-1301 - Wonil Song, Kwanghoon Sohn, Dongbo Min:

Improving Self-Supervised Vision Transformers for Visual Control. 1302-1308 - Renkai Zhang, Nong Sang:

Multi-Task Affinity Propagation Based Natural Image Matting. 1309-1315 - Mengyu Yang, Ye Tian, Lanshan Zhang, Xiao Liang, Xuming Ran, Wendong Wang:

AdaViPro: Region-Based Adaptive Visual Prompt For Large-Scale Models Adapting. 1316-1322 - Ruhao Zhao, Xian Zhong, Liang Liao, Wenxuan Liu, Wenxin Huang, Zheng Wang:

Localization of Image Splicing Under Segment Anything Model With Integrated Compression and Edge Artifacts. 1323-1329 - Monikka Roslianna Busto, Shohei Enomoto, Takeharu Eda

:
Collaborative Intelligence For Vision Transformers: A Token Sparsity-Driven Edge-Cloud Framework. 1330-1335 - Shiyang Ye, Yuan Fang, Hong Liu, Hu Chen, Wenchao Du, Hongyu Yang:

Dtpose: Learning Disentangled Token Representation For Effective Human Pose Estimation. 1336-1342 - Tony Zhang, Robert P. Dick:

A Context-Oriented Multi-Scale Neural Network for Fire Segmentation. 1343-1349 - Chih-Chung Hsu

, Yun-Zhong Jiang, Wei-Hao Huang:
VCDSet: A New Vehicle Collision Dataset In Asia Countries For Anticipating Accidents. 1350-1356 - Rémi Cogranne:

Detectability of Defects in the Presence of Linear Nuisance Parameters and Images Signal-Dependent Noise. 1357-1363 - Yuxi Lu, Zhuming Zhang, Shiming Lin, Dengpan Zhang, Haibin Ma, Zengchang Qin:

S3GCN: Sport Scoring Siamese Graph Convolution Network. 1364-1370 - Jian Ma, Xiuhong Li, Yuye Zhang, Boyuan Li, Dangxuan Wu, Zhenhong Jia:

U-Convnext Network for Infrared Small Target Detection. 1371-1376 - Jinhui Zhao, Hongxia Gao, Tongtong Liu:

Surface Anomaly Detection With Anomalous Feature Restriction And Difference-Aware Enhancement. 1377-1383 - Mohamed Sanim Akremi, Najett Neji, Hedi Tabia:

Temporal-Spatial SPDAGG Network For Skeleton-Based Human Action Recognition From Aerial Perspectives. 1384-1390 - Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Jordan Vice, Ajmal Mian:

A Statistical Image Realism Score For Deepfake Detection. 1391-1396 - Maboud F. Kaloorazi, Salman Ahmadi-Asl

, Susanto Rahardja
:
Low-Rank Matrix and Tensor Decomposition Using Randomized Two-Sided Subspace Iteration With Application to Video Reconstruction. 1397-1402 - Muhammad Nor Azzafri Nor-Azman, Usman Ullah Sheikh, Mohammed Sultan Mohammed

, Jeevan Sirkunan, Muhammad Nadzir Marsono:
Correlation-Aware Joint Pruning-Quantization using Graph Neural Networks. 1403-1409 - Rahul Palnitkar, Jeová Farias Sales Rocha Neto:

A Sparse Graph Formulation for Efficient Spectral Image Segmentation. 1410-1416 - Hamadi Chihaoui, Paolo Favaro:

When Self-Supervised Pre-Training Meets Single Image Denoising. 1417-1423 - Chenxiao Zhang

, Xin Deng, Hongpeng Sun, Jingyi Xu
, Mai Xu:
SN-NET: Semismooth Newton Driven Lightweight Network for Real-World Image Denoising. 1424-1430 - Seyed Alireza Hosseini, Tam Thuc Do, Gene Cheung, Yuichi Tanaka:

Constructing an Interpretable Deep Denoiser by Unrolling Graph Laplacian Regularizer. 1431-1437 - Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li:

Unsupervised Coordinate-Based Video Denoising. 1438-1444 - Zhuang Sun, Li Chen

, Zhida Feng, Xiaoming Liu:
B-Walk: Bernoulli Principle Guided Biased Random Walk for Curve Connection. 1445-1451 - Yuhong He, Aiwen Jiang, Lingfang Jiang, Long Peng, Zhifeng Wang, Lu Wang:

Dual-Path Coupled Image Deraining Network Via Spatial-Frequency Interaction. 1452-1458 - Parham Eftekhar, Gene Cheung, Tim Eadie:

Declouding of Satellite Images for Crop Growth Monitoring Via Unrolling of Gradient Graph Laplacian Regularizer. 1459-1465 - Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos:

Real-World Atmospheric Turbulence Correction Via Domain Adaptation. 1466-1472 - JingShuo Guan, Na Qi, Qing Zhu, Liang Chen:

UTrCGAN: Uncertainty-Driven Cycle-Consistent Generative Adversarial Network for Low-Light Image Enhancement. 1473-1479 - Ruirui Lin, Nantheera Anantrasirichai, Alexandra Malyugina, David Bull

:
A Spatio-Temporal Aligned SUNet Model For Low-Light Video Enhancement. 1480-1486 - Yuhang He

, Senmao Tian, Jian Zhang, Shunli Zhang:
Dual Attention Enhanced Transformer for Image Defocus Deblurring. 1487-1493 - Uditangshu Aurangabadkar, Anil C. Kokaram:

A Dictionary Based Approach for Removing Out-of-Focus Blur. 1494-1499 - Francisco M. Castro-Macías, Fernando Pérez-Bueno, Miguel Vega, Javier Mateos, Rafael Molina, Aggelos K. Katsaggelos:

Bayesian Blind Image Deconvolution using an Hyperbolic-Secant prior. 1500-1506 - Jiahao Tian, Ziyang Zheng, Xinyu Peng, Yong Li, Wenrui Dai, Hongkai Xiong:

DCCM: Dual Data Consistency Guided Consistency Model for Inverse Problems. 1507-1513 - Hsuan Yuan, Shao-Yu Weng, I-Hsuan Lo, Wei-Chen Chiu, Yu-Syuan Xu, Hao-Chien Hsueh, Jen-Hui Chuang, Ching-Chun Huang:

Two Heads Better Than One: Dual Degradation Representation for Blind Super-Resolution. 1514-1520 - Keuntek Lee, Jaehyun Park

, Gu Yong Park, Nam Ik Cho:
RFG-HDR: Representative Feature-Guided Transformer For Multi-Exposure High Dynamic Range Imaging. 1521-1527 - Ziyad Alswaidan, M. Hashem Shullar, Khalil Chikhaoui, Motaz Alfarraj:

Object-Aware Adaptive Image Retargeting Via Importance Map Fusion. 1528-1533 - Fangzheng Yuan, Xiaoyue Jiang, Xiaoyi Feng, Moncef Gabbouj

:
Intrinsic Image Decomposition Based on Quantized Prior Codebook. 1534-1539 - Zhangke Wang, Na Qi, Xiyuan Zhao, Wei Xu, Jingzhong Qi, Qing Zhu:

Coarse-To-Fine Spatio-Temporal Luminance-Aware Reconstruction For High-Speed Motion Scene. 1540-1546 - Yanick Christian Tchenko, Hicham Hadj-Abdelkader, Hedi Tabia:

Draft - Distilled Recurrent All-Pairs Field Transforms For Optical Flow. 1547-1553 - Yuanhao Gong, Guanghui Yue:

Start-Tv: A Closed-Form Initialization For Total Variation Models. 1554-1559 - Junhao Huang, Fang Zhang, Meiliang Liu

, Zhengye Si, Zhiwen Zhao:
A Novel Architecture for Image Vectorization with Increasing Granularity. 1560-1566 - Mir Sazzat Hossain

, AKM Mahbubur Rahman, Md. Ashraful Amin, Amin Ahsan Ali:
Lightweight Recurrent Neural Network for Image Super-Resolution. 1567-1573 - Jiahuan Ji, Baojiang Zhong, Kai-Kuang Ma, Fuhui Zhou, Qihui Wu:

An Image Decomposition-Guided Network for Image Interpolation. 1574-1580 - Karim El Khoury, Tiffanie Godelaine, Simon Delvaux, Sébastien Lugan, Benoît Macq:

Streamlined Hybrid Annotation Framework Using Scalable Codestream for Bandwidth-Restricted UAV Object Detection. 1581-1587 - Hideyuki Ogura, Shinya Ezumi, Masaaki Ikehara:

Face Drawing GAN by Channel Attention and Matrix Product Attention. 1588-1594 - Yuanlin Wang, Ruiqin Xiong, Jing Zhao, Tiejun Huang:

Reconstruct Dynamic Scene for Spike Camera Based on 3D Space Time Similarity. 1595-1601 - Takaki Ikeda, Takafumi Iwaguchi, Diego Thomas, Hiroshi Kawasaki:

A Practical Calibration Method for Cameras and Multiple Line-Lasers in Light Sectioning Systems for Underwater Environments. 1602-1608 - Ibrar Amin, Ruiyuan Kang, Hasan Al-Marzouqi, Zeyar Aung, Panos Liatsis:

Convolutional Neural Network With Learnable Masks For EIT Based Tactile Sensing. 1609-1615 - Yanmeng Liu, Libao Zhang:

Remote Sensing Image Uneven Haze Removal Based On Haze Density Estimation and Saliency-Driven Dual Channel Fusion. 1616-1622 - Ryushiro Matsumoto, Mashiho Mukaida, Takanori Koga, Noriaki Suetake:

A Hue-Preserving Contrast Enhancement Method Using Histogram Specification for Each RGB Component. 1623-1628 - Zihao Ye, Jaehoon Cho, Changjae Oh:

Improving Image De-Raining Using Reference-Guided Transformers. 1629-1634 - Jingxuan Zhang, Libao Zhang:

Clouds and Haze Co-Removal Based on Weight-Tuned Overlap Refinement Diffusion Model for Remote Sensing Images. 1635-1641 - Zhibo Du, Long Peng, Yang Wang, Yang Cao, Zheng-Jun Zha:

FC3DNET: A Fully Connected Encoder-Decoder for Efficient Demoiréing. 1642-1648 - Jin Zhang, Haiyan Jin, Haonan Su, Yuanlin Zhang, Zhaolin Xiao, Bin Wang:

A Cnn-Transformer Network Based Snr Guided High Frequency Reconstruction for Low Light Image Enhancement. 1649-1655 - David Reixach, Josep Ramon Morros

:
Fast Unsupervised Tensor Restoration via Low-Rank Deconvolution. 1656-1662 - Ziqiang Shi, Rujie Liu:

Project, Skate, and Refresh: Improved Schrödinger Bridge Sampler for Image Restoration. 1663-1669 - Yuelong Zhuo, Weiling Li, Beibei Yang, Yan Fang, Huaqiang Yuan:

Counting Repetitive Actions in Event Stream. 1670-1675 - Hiroyuki Deguchi, Mana Masuda, Takuya Nakabayashi, Hideo Saito:

E2GS: Event Enhanced Gaussian Splatting. 1676-1682 - Le Thi Hue Dao, An Gia Vien, Jooyoung Lee, Seyoon Jeong, Naeun Yang, Chul Lee:

Content-Aware Supervision For Diffusion-Based Restoration of Extremely Compressed Background For VCM. 1683-1689 - Zheng-Hui Huang, Tse-Yan Lee, Li-Jen Chang, Yong-Wei Chen, Ping-Jui Chiang, Jo-Fan Wu, Yung-Yu Chuang:

Semantic-Region Specific Lookup Tables for Image Enhancement Via Unpaired Learning. 1690-1696 - May Thet Tun, Yosuke Sugiura, Tetsuya Shimamura:

Lightweight Underwater Image Enhancement via Impulse Response of Low-Pass Filter Based Attention Network. 1697-1703 - Polina Karpikova, Andrei Spiridonov

, Anna Vorontsova, Anastasia Yaschenko, Ekaterina Radionova, Igor Medvedev, Alexander Limonov:
Super: Selfie Undistortion and Head Pose Editing with Identity Preservation. 1704-1710 - Florin-Alexandru Vasluianu, Zongwei Wu, Radu Timofte:

SFNet - A Spatial-Frequency Domain Neural Network For Image Lens Flare Removal. 1711-1717 - Mengjiao Zhao, Mengting Ma, Ao Gao, Wei Zhang:

Frequency-Spatial Domain Information Fusion Network for Pan-Sharpening. 1718-1724 - Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte:

Toward Efficient Deep Blind Raw Image Restoration. 1725-1731 - Jiahui Liu, Chunling Yang:

A Dual-Domain Collaboration Network for VCS Reconstruction. 1732-1738 - Sabeethan Kanagasingham

, Andrew R. Mills, Visakan Kadirkamanathan
:
Computationally Efficient Kalman Filter Framework for Intra-Frame Image Reconstruction with a Rolling Shutter Camera. 1739-1745 - Wendi Liang, Yihan Wen, Zewei Wang, Jianuo Jiang, Tat-Ming Lok, Guanchong Niu:

Enhanced Facial Restoration with Misinformation-Filtered Guide-Denoising Diffusion Probabilistic Models. 1746-1752 - Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup:

Efficient Learned Wavelet Image and Video Coding. 1753-1759 - Mustafa Akin Yilmaz, O. Ugur Ulas, Ahmet Bilican, A. Murat Tekalp:

Motion-Adaptive Inference for Flexible Learned B-Frame Compression. 1760-1766 - Shiyu Qin, Yi-Min Zhou, Jin-Peng Wang, Bin Chen, Baoyi An, Tao Dai, Shu-Tao Xia:

Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression. 1767-1773 - H. Burak Dogaroglu

, Ahmet Burakhan Koyuncu, Atanas Boev, Elena Alshina, Eckehard G. Steinbach:
Adapting Learned Image Codecs To Screen Content Via Adjustable Transformations. 1774-1780 - Qiaoxi Chen, Changsheng Gao, Dong Liu:

End-to-End Learned Scalable Multilayer Feature Compression For Machine Vision Tasks. 1781-1787 - Lingyu Zhu

, Binzhe Li
, Riyu Lu
, Peilin Chen, Qi Mao, Zhao Wang, Wenhan Yang, Shiqi Wang:
Learned Image Compression for Both Humans and Machines via Dynamic Adaptation. 1788-1794 - Oguzhan Güngördü

, A. Murat Tekalp:
Saliency-Aware End-to-End Learned Variable-Bitrate 360-Degree Image Compression. 1795-1801 - Gabriele Spadaro, Alberto Presta, Enzo Tartaglione, Jhony H. Giraldo, Marco Grangetto, Attilio Fiandrotti:

Gabic: Graph-Based Attention Block for Image Compression. 1802-1808 - Chih-Yu Lai, Dung N. Tran, Kazuhito Koishida:

Learned Image Compression With Text Quality Enhancement. 1809-1815 - Runyu Yang

, Dong Liu, Feng Wu, Wen Gao:
Neural Radiance Field-Assisted Static-Scene Video Coding. 1816-1822 - Christian R. Helmrich, Valeri George, Vignesh V. Menon

, Adam Wieckowski, Benjamin Bross, Detlev Marpe:
Fast Constant-Quality Video Encoding Using VVENC With Rate Capping Based On Pre-Analysis Statistics. 1823-1828 - Vignesh V. Menon

, Christian R. Helmrich, Adam Wieckowski, Benjamin Bross, Detlev Marpe:
Convex-Hull Estimation using Xpsnr for Versatile Video Coding. 1829-1835 - Sebastian Schwarz, Miska M. Hannuksela, Döne Bugdayci Sansli:

IN-Loop Filter for Object Mask Coding in Versatile Video Coding. 1836-1842 - Haruhisa Kato

, Yoshitaka Kidani, Kei Kawamura:
Extended Multiple Cross-Component Linear Models With Adaptive Thresholding and Overlapped Averaging Beyond VVC. 1843-1849 - Lei Zhao, Kai Zhang, Li Zhang:

Subblock-Based Combined Inter and Intra Prediction Beyond VVC. 1850-1856 - Marc Windsheimer, Fabian Brand, André Kaup:

ON Annotation-Free Optimization of Video Coding for Machines. 1857-1863 - Yi Peng, Zixiang Zhang, Li Yu:

MFLFC: Multi-Frame Fusion Based Low-Resolution Feature Compression For Object Tracking. 1864-1869 - Zifu Zhang, Shengxi Li, Tie Liu, Mai Xu, Tao Xu, Zhenyu Guan, Zhuoyi Lv:

Hybrid Single Input and Multiple Output Method For Compressing Features Towards Machine Vision Tasks. 1870-1876 - Honglei Zhang, Jukka I. Ahonen, Nam Le, Ruiying Yang, Francesco Cricri:

Competitive Learning For Achieving Content-Specific Filters In Video Coding For Machines. 1877-1882 - Xuelin Shen, Haoqiao Ou, Wenhan Yang:

Image Coding For Machine Via Analytics-Driven Appearance Redundancy Reduction. 1883-1889 - Alessandro Gnutti, Stefano Della Fiore, Mattia Savardi, Yi-Hsin Chen, Riccardo Leonardi, Wen-Hsiao Peng:

Lidar Depth Map Guided Image Compression Model. 1890-1896 - Nicolas Neumann, Priyanka Das, Tim Classen, Mathias Wien:

Fast Template Matching-Based Reference Picture Padding for Video Coding. 1897-1902 - Dayong Wang, Junyi Yu, Xin Lu, Frédéric Dufaux, Hongwei Guo, Hui Guo, Ce Zhu:

Fast Coding Mode Prediction for Intra Prediction in VVC SCC. 1903-1909 - Bharath Vishwanath, Wenyi Wang, Yingzhan Xu, Kai Zhang, Li Zhang:

Sample Domain Prediction and Transform Skip for Region Adaptive Hierarchical Transform in Geometric Point Cloud Compression. 1910-1915 - Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard G. Steinbach:

Real-Time Semantic Video Communication of General Scenes. 1916-1921 - Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, Yao Wang:

Standard Compliant Video Coding Using Low Complexity, Switchable Neural Wrappers. 1922-1928 - Keiichi Chono, Naoya Niwa

, Hiroe Iwasaki:
Picture Partitioning Design of Neural Network-Based Intra Coding For Video Coding For Machines. 1929-1934 - Xue Wu, Tong Tang, Zhiyuan Zhu, Hong Zou:

Feature Enhanced Learning Image Compression With Recurrent Criss-Cross Attention. 1935-1939 - Zenghui Duan, Cheolkon Jung, Yang Liu, Ming Li:

Learned Image Compression Using A Long and Short Attention Module. 1940-1946 - Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo:

Generalized Nested Latent Variable Models For Lossy Coding Applied To Wind Turbine Scenarios. 1947-1953 - Lucas S. Lopes, Ricardo L. de Queiroz, Philip A. Chou:

Rate-Complexity Optimization in Lossless Neural-Based Image Compression. 1954-1959 - Zong-Lin Gao, Sang NguyenQuang, Wen-Hsiao Peng, Xiem HoangVan:

Omra: Online Motion Resolution Adaptation To Remedy Domain Shift in Learned Hierarchical B-Frame Coding. 1960-1966 - Xiaojie Wu, Ping Wang, Xinhong Wang:

ROI-DVC: A Region-of-Interest Based Deep Video Coding Framework. 1967-1972 - Lorenzo Catania, Dario Allegra:

Redefining Visual Quality: The Impact of Loss Functions on INR-Based Image Compression. 1973-1979 - Haobo Lei, Zhisong Bie, Zhao Jing, Hongxia Bie:

Talking-Head Video Compression With Motion Semantic Enhancement Model. 1980-1986 - Christian D. Rask, Daniel E. Lucani:

Rage for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications. 1987-1993 - Yoshitaka Kidani, Haruhisa Kato, Kei Kawamura:

Bi-Predictive Intra Block Copy for Enhanced Video Coding Beyond VVC. 1994-2000 - Xidan Zhang, Jianing Wei, Atsunori Moteki, Yoshie Kobayashi, Genta Suzuki, Zhiming Tan:

MSD-CRFS: Multi-Scale Dual Aggregation Conditional Random Fields for Monocular Depth Estimation. 2001-2007 - Ming-Zheng Peng, Hao-Chung Cheng, Phuong-Thi Le, Cheng-Chun Wang, Chien-Yao Wang, Jia-Ching Wang:

Scene Text Recognition Using Progressive Rectification Network And Spelling Error Correction Language Model. 2008-2014 - Shota Hirose, Kazuki Kotoyori, Kasidis Arunruangsirilert, Fangzheng Lin, Heming Sun, Jiro Katto:

Real-Time Video Prediction With Fast Video Interpolation Model and Prediction Training. 2015-2021 - Qiangqiang He, Jie Zhang, Shuwei Qian, Chongjun Wang:

Some Can Be Better than All: Multimodal Star Transformer for Visual Dialog. 2022-2026 - Yifan Qiang, Naian Liu:

Fast Inter Mode Decision with Resolution Sampling For VVC 360-Degree Video Coding. 2027-2033 - Marcos V. Conde, Andy Bigos, Radu Timofte:

Streaming Neural Images. 2034-2040 - Hongyi Cai, Anna Zhu:

Cross-Modal Alignment of Local and Global Features for Zero-Shot Chinese Character Recognition. 2041-2047 - Zhifeng Wang

, Kaihao Zhang, Ramesh S. Sankaranarayana:
Lrdif: Diffusion Models For Under-Display Camera Emotion Recognition. 2048-2054 - Ye Lu, Jianjun Gao, Chen Cai, Ruoyu Wang, Duc Tri Phan, Kim-Hui Yap:

Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose Lifting. 2055-2061 - Hongyuan He, Daming Wang, Md. Rakibul Hasan

, Tom Gedeon, Md. Zakir Hossain:
TCA-NET: Triplet Concatenated-Attentional Network for Multimodal Engagement Estimation. 2062-2068 - Junpei Honma, Akisato Kimura, Go Irie:

Estimating Indoor Scene Depth Maps From Ultrasonic Echoes. 2069-2073 - Jimut B. Pal

, Suyash P. Awate:
A Hard Convex-Shape Constraint In Dnns For Object Segmentation. 2074-2080 - Huu-Phong Luong, Hoang-Son Bui

, Nam-Khanh Nguyen, Thi-Loan Pham, Gia-Minh Pham, Sy-Hoang Tran, Thanh-Hai Tran, Thi-Lan Le:
SovaSeg-Net: Scale Invariant Ovarian Tumors Segmentation from Ultrasound Images. 2081-2087 - Gouverneur François, Pourjavan Sayeh, Macq Benoit:

Deep Convolutional Neural Network Prediction For Glaucoma Detection Using OCT and OCT-Angiography Disc-and Macula-Centered Images and Their Combined Power. 2088-2094 - Ayush Somani

, Anshul Gupta, Arif Ahmed Sekh, Krishna Agarwal, Dilip K. Prasad:
Blend & Predict: Domain-Adaptable Few-Shot Learning for Microscopy Imaging. 2095-2100 - Yufan Liu, Ziyang Wang

, Tianxiang Chen, Zi Ye
:
Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations. 2101-2107 - Anurag Goel, Angshul Majumdar:

Semi-Supervised Graphical Deep Dictionary Learning for Hyperspectral Image Classification From Limited Samples. 2108-2114 - Frank Sippel, Jürgen Seiler, André Kaup:

Fast Edge-Aware Occlusion Detection In The Context of Multispectral Camera Arrays. 2115-2120 - Katja Kossira, David Schön, Jürgen Seiler, André Kaup:

Conditional Optimal Filter Selection For Multispectral Object Classification. 2121-2127 - Gyeong-Eun Youm, Tae-Sung Park, Jong-Ok Kim:

Cross-Fusion of Band-Specific Spectral Features For Multi-Band NIR Colorization. 2128-2134 - Jamie Koerner, Vivienne Sze:

ClearDepth: Addressing Depth Distortions Caused By Eyelashes For Accurate Geometric Gaze Estimation On Mobile Devices. 2135-2141 - Aakansha Mishra, Aditya Agarwala, Utsav Tiwari, Vikram Nelvoy Rajendiran, Srinivas Soumitri Miriyala:

Efficient Visual Question Answering on Embedded Devices: Cross-Modality Attention With Evolutionary Quantization. 2142-2148 - Yasmine Hachani, Patrick Bouthemy, Elisa Fromont, Sylvie Ruffini, Ludivine Laffont, Alline de Paula Reis

:
Early Prediction Of The Transferability Of Bovine Embryos From Videomicroscopy. 2149-2155 - Jincheng Yang, Lishun Wang, Miao Cao, Huan Wang, Yinping Zhao, Xin Yuan:

Coarse-Fine Spectral-Aware Deformable Convolution for Hyperspectral Image Reconstruction. 1-7 - Andreas Unterberger, Cheau Tyan Foo, Zachary Adrian Emuang, Fabio J. W. A. Martins, Khadijeh Mohri:

Metaheuristic Camera Calibration for Optical Tomographic Imaging in Industrial Environments. 2163-2169 - Xirang Zhang, Yongyi Yang, Jovan G. Brankov, P. Hendrik Pretorius, Michael A. King:

Temporal Regularization for Robust Motion Compensation in Reduced Dose Cardiac-Gated Spect Images. 2170-2174 - Bipin Gaikwad

, Abani Patra, Carl R. Crawford, Eric L. Miller:
Self-Supervised Anomaly Detection and a New Benchmark for X-Ray Cargo Images. 2175-2181 - Yu Mitsuzumi, Akisato Kimura, Go Irie, Atsushi Nakazawa

:
Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature Removal. 2182-2186 - Yanan Luo, Jinhui Yi, Yazan Abu Farha, Moritz Wolter

, Juergen Gall:
Rethinking Temporal Self-Similarity For Repetitive Action Counting. 2187-2193 - Tharsan Senthivel, Ngoc-Son Vu:

Subgroups For Detection Transformer. 2194-2200 - Suyuan Huang, Haoxin Zhang, Yanyu Xu, Yan Gao, Yao Hu, Zengchang Qin:

Caseg: Clip-Based Action Segmentation With Learnable Text Prompt. 2201-2207 - Nicolas Bizzozzero, Ihab Bendidi, Olivier Risser-Maroix:

Prompt Performance Prediction For Image Generation. 2208-2214 - Anna Sokolova, Anna Vorontsova, Bulat Gabdullin, Alexander Limonov:

FAWN: Floor-and-Walls Normal Regularization for Direct Neural TSDF Reconstruction. 2215-2221 - Kyujin Shim, Kangwook Ko, Jubi Hwang, Changick Kim:

Adaptrack: Adaptive Thresholding-Based Matching for Multi-Object Tracking. 2222-2228 - Dongni Lu, Jiaxuan Chen, Haiyan Chen, Ziyi Peng, Rong Quan, Jie Qin:

Camouflaged Object Detection Via Style Transfer-Based Data Augmentation. 2229-2235 - Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap:

CM2-Net: Continual Cross-Modal Mapping Network For Driver Action Recognition. 2236-2242 - Mikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander Limonov:

Medea: Multi-View Efficient Depth Adjustment. 2243-2249 - Luyang Tang, Yongqi Zhai, Ronggang Wang:

Compression-Aware Tuning for Compressing Volumetric Radiance Fields. 2250-2256 - Qianxi Lu, Yi He, Shilin Wang:

Personatalk: Preserving Personalized Dynamic Speech Style In Talking Face Generation. 2257-2263 - Wei-Chian Liang, Chieh-Yun Chen, Hong-Han Shuai:

Toward Low Artifact Virtual Try-On Via Pre-Warping Partitioned Clothing Alignment. 2264-2270 - Hao-Yun Chang, Wen-Jiin Tsai:

Shadow-Aware Makeup Transfer with Lighting Adaptation. 2271-2277 - Yaping Zhao, Pei Zhang, Chutian Wang, Edmund Y. Lam:

Controllable Unsupervised Event-Based Video Generation. 2278-2284 - Muhammad Ahmad, Muhammad Usama, Salvatore Distefano, Manuel Mazzara:

Hyperspectral Image Classification With Fuzzy Spatial-Spectral Class Discriminate Information. 2285-2291 - Santiago Rivier, Carlos Hinojosa

, Silvio Giancola, Bernard Ghanem
:
Efficient Semantic Segmentation For Aerial Imagery Using Query Points and Superpixel Supervision. 2292-2298 - Tamara R. Lenhard, Andreas Weinmann

, Stefan Jäger, Tobias Koch
:
YOLO-Feder Fusionnet: A Novel Deep Learning Architecture for Drone Detection. 2299-2305 - Ziyang Zheng, Ziliang Ren, Zhanhao Liang

, Gulin Wang, Qieshi Zhang:
MSGAT: Multi-Stage Graph Attention Network For Human Motion Prediction. 2306-2312 - Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer:

Footbots: A Transformer-Based Architecture for Motion Prediction in Soccer. 2313-2319 - Yichen Shi, Feifei Zhang, Wenming Yang, Guijin Wang, Nan Su

:
Agent-Guided Gaze Estimation Network by Two-Eye Asymmetry Exploration. 2320-2326 - Robert Gabriel Popescu

, Nantheera Anantrasirichai, Juliet Biggs
:
Anomaly Detection for the Identification of Volcanic Unrest in Satellite Imagery. 2327-2333 - Quan Zhao, Siying Wu

, Yueyi Zhang, Xiaoyan Sun:
Semantic-Enhanced Point-Box Joint Prompting for Video Object Segmentation. 2334-2340 - Xianbin Hu, Wei Wu, Zhu Li, Xueliang Luo, Zhengfeng Chen:

Two-Stage Tripletnet: Light Weight Remote Sensing Scene Classification. 2341-2346 - Syed Tahir Hussain Rizvi, Øyvind Meinich-Bache

, Vilde Kolstad, Siren Rettedal, Sara Brunner, Kjersti Engan:
Semi-Supervised Action Recognition From Newborn Resuscitation Videos. 2347-2353 - Yiwei Chen, Jiaqian Yu, Siyang Pan, Sangil Jung, Wu Bi, Seung-In Park, Qiang Wang, ByungIn Yoo:

Gradtrans: Transformer-Based Gradient Guidance for Image Generation. 2354-2360 - Ayush Dubey, Shiv Ram Dubey, Satish Kumar Singh, Wei-Ta Chu:

Transformer-Based Clipped Contrastive Quantization Learning For Unsupervised Image Retrieval. 2361-2367 - Langning Miao, Ryo Kakimoto, Kaoru Ohishi, Yoshihiro Watanabe:

Improving Real-Time Near-Infrared Face Alignment With a Paired VIS-NIR Dataset and Data Augmentation Through Image-to-Image Translation. 2368-2374 - Yu Wei Chen, Huu-Phu Do, Chia-Wei Kuo, Hsuan-Tung Liu, Ching-Chun Huang:

Lipface: Lipschitz-Conditioned For Resolution Robust Face Recognition. 2375-2381 - Yusuke Sekikawa, Chingwei Hsu, Satoshi Ikehata, Rei Kawakami, Ikuro Sato:

Gumbel-NeRF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields. 2382-2388 - Jian Lin, Xueting Liu, Chengze Li, Minshan Xie

, Tien-Tsin Wong:
SKETCH2MANGA: Shaded Manga Screening from Sketch with Diffusion Models. 2389-2395 - Vuong D. Nguyen, Pranav Mantini

, Shishir K. Shah:
ACML: Attention-Based Cross-Modality Learning For Cloth-Changing and Occluded Person Re-Identification. 2396-2402 - Chaowei Han, Gaofeng Meng, Chunlei Huo:

SFD: Similar Frame Dataset for Content-Based Video Retrieval. 2403-2409 - Haijun Xiong

, Yunze Deng, Bin Feng, Xinggang Wang, Wenyu Liu:
Gaitgs: Temporal Feature Learning in Granularity And Span Dimension for Gait Recognition. 2410-2416 - Ying Zhang, Hyunhee Park, Hanchao Jia, Fan Wang, Jianxing Zhang, Xiangyu Kong:

Adaptively Hierarchical Quantization Variational Autoencoder Based on Feature Decoupling and Semantic Consistency for Image Generation. 2417-2423 - Yunze Deng, Haijun Xiong

, Bin Feng:
Licaf: Lidar-Camera Asymmetric Fusion For Gait Recognition. 2424-2430 - Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:

Zero-Shot Composed Image Retrieval Considering Query-Target Relationship Leveraging Masked Image-Text Pairs. 2431-2437 - Tayeba Qazi, Brejesh Lall:

Thermal Videodiff (TVD): A Diffusion Architecture For Thermal Video Synthesis. 2438-2444 - Hichem Sahbi:

One-Shot Multi-Rate Pruning Of Graph Convolutional Networks For Skeleton-Based Recognition. 2445-2451 - Patrik Patera

, Yie-Tarng Chen, Wen-Hsien Fang:
Spatio-Temporal Adaptation With Dilated Neighbourhood Attention For Accident Anticipation. 2452-2458 - Tiancheng Ying, Rong Quan, Peng Zheng

, Yichao Yan, Jie Qin:
MTA-PS: Towards Practical Person Search in Videos. 2459-2465 - Chun-Ting Fang, Tsung-Jung Liu, Kuan-Hsien Liu:

Micro-Expression Recognition Based On 3DCNN Combined With GRU and New Attention Mechanism. 2466-2472 - Zhihao Liu, Yi Zhang, Wenhui Huang, Yan Liu, Mengyang Pu, Chao Deng, Junlan Feng:

Learning Temporal Cues for Fine-Grained Action Recognition. 2473-2479 - Juhyeong Seon, Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon:

Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual Segmentation. 2480-2486 - Taehoon Kim, Jaemin Na, Joong-Won Hwang, Wonjun Hwang

:
Stay Focus on Object: Cross-Domain Detection Using Domain-Invariant Object Representation. 2487-2493 - Yi-Chia Chen, Wei-Hua Li

, Chu-Song Chen:
Open-Vocabulary Panoptic Segmentation Using Bert Pre-Training of Vision-Language Multiway Transformer Model. 2494-2500 - Priscilla Indira Osa

, Josiane Zerubia, Zoltan Kato:
Gabor Feature Network for Transformer-Based Building Change Detection Model in Remote Sensing. 2501-2507 - Subham Das, C. Chandra Sekhar:

Leveraging Generated Image Captions for Visual Commonsense Reasoning. 2508-2514 - Mayssa Zaier, Hazem Wannous, Hassen Drira, Jacques Boonaert:

Motion-Lie Transformer: Geometric Attention For 3D Human Pose Motion Prediction. 2515-2521 - Cheng Long

, Sayantika Nag, Adrian Barbu:
PCA-UNET for Object Segmentation. 2522-2528 - Zaber Ibn Abdul Hakim, Rasman Mubtasim Swargo

, Muhammad Abdullah Adnan:
Exploring Attention Mechanisms in Integration of Multi-Modal Information for Sign Language Recognition and Translation. 2529-2535 - Yongpeng Chang, Guangchun Gao:

Spatial-Channel Collaborated Attention for Cross-Scale Crowd Counting. 2536-2542 - Zhenhua Wang, Linwei Ye:

Referring Image Segmentation with Two-Stage Multi-Modal Interaction. 2543-2549 - Antoine Chaffin, Ewa Kijak, Vincent Claveau:

Distinctive Image Captioning: Leveraging Ground Truth Captions in Clip Guided Reinforcement Learning. 2550-2556 - Marcella Astrid, Enjie Ghorbel, Djamila Aouada:

Statistics-Aware Audio-Visual Deepfake Detector. 2557-2563 - Yaozong Gan

, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama:
Cross-Domain Few-Shot In-Context Learning For Enhancing Traffic Sign Recognition. 2564-2570 - Jiasheng Wang, Zhenhua Wang, Jifeng Ning:

Edge-Reserved Knowledge Distillation for Image Matting. 2571-2577 - Zhiwen Chen, Wei Wu, Zhengfeng Chen:

Learning A Rain-Invariant Network For Instance Segmentation In The Rain. 2578-2584 - Ruoyu Feng, Tao Yu, Xin Jin, Xiaoyuan Yu, Lei Xiao, Zhibo Chen:

Rethinking Domain Adaptation and Generalization in the ERA Of Clip. 2585-2591 - Muhammad Ali

, Mamoona Javaid, Mubashir Noman, Mustansar Fiaz, Salman Khan:
Fanet: Feature Amplification Network for Semantic Segmentation in Cluttered Background. 2592-2598 - Yajie Liu, Pu Ge, Haoxiang Ma, Shichao Fan, Qingjie Liu, Di Huang, Yunhong Wang:

Towards Generalizable Referring Image Segmentation Via Target Prompt And Visual Coherence. 2599-2605 - Theodora Kyprianidi, Effrosyni Doutsi, George Tzagkarakis, Panagiotis Tsakalides:

Exploring the Potential of Recurrence Quantification Analysis for Video Analysis and Motion Detection. 2606-2612 - Shubhabrata Mukherjee, Cory C. Beard, Zhu Li:

MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO. 2613-2619 - Bissmella Bahaduri, Zuheng Ming, Fangchen Feng, Anissa Mokraoui:

Multimodal Transformer Using Cross-Channel Attention For Object Detection In Remote Sensing Images. 2620-2626 - Wangzhi Xing, Diqi Chen, Mohammad Aminul Islam

, Jun Zhou:
Bidfuse: Harnessing Bi-Directional Attention with Modality-Specific Encoders for Infrared-Visible Image Fusion. 2627-2633 - Guohua Lv, Xinyue Fu, Chaoqun Sima, Yanlong Xu, Baodong Zhang, Hanju Bao:

Illumination-Enhanced Infrared and Low-Light Visible Image Fusion. 2634-2640 - Takumi Watanabe, Rei Kawakami, Masayuki Tanaka, Masatoshi Okutomi:

Object Detection Framework Using Multiple Tone Mappings on High-Dynamic-Range Images. 2641-2647 - Mengyao Ji, Cheolkon Jung:

Deep Fusion of Visible and Near Infrared Images for Registration and Defogging Using Cross Modal Transformer. 2648-2654 - Guohua Lv, Xiyan Wang, Yongbiao Gao, Yi Zhai, Guixin Zhao, Guangxiao Ma:

Rafmnet: Reinforced Attention Fusion and Multiscale Network For Noisy Infrared and Visible Image Fusion. 2655-2661 - Gahyeon Kim, An Gia Vien, Duong Hai Nguyen, Chul Lee:

Feature Decomposition Transformers for Infrared and Visible Image Fusion. 2662-2668 - Xinyue Fan, Libao Zhang:

Land Use Classification Via Multi-Modal Complementary Feature Fusion and Context Information Enhancement For Optical and Sar Images. 2669-2675 - Yunfei Li, Jun Li:

Investigating and Reducing the Impairment of Point Spread Effect For Spatiotemporal Fusion Of Remote Sensing Imagery. 2676-2682 - Kévin Riou, Kaiwen Dong, Yujie Huang

, Kévin Subrin, Patrick Le Callet, Yanjing Sun:
Evaluating 3D Human Pose Estimation in Occluded Multi-Sensor Scenarios: Dataset and Annotation Approach. 2683-2689 - Prasad Theeda

, Chee-Ming Ting, Arghya Pal
, Hernando Ombao
:
A Preconditioning Approach To Optimizing Sensing Matrix For Improved Compressed Sensing CT Reconstruction. 2690-2695 - Karen O. Egiazarian

, Vladimir Katkovnik
:
3F-PNP: Compressive Sensing Using Nonlocal Self-Similarity and Deep Learning Priors. 2696-2701 - Yun Li, Hao Xie, Jun Xiao, Cong Zhang, Tianshan Liu, Kin-Man Lam:

Hierarchical Vertex-Wise Intensification Graph Convolution for Skeleton-Based Activity Recognition. 2702-2708 - Qijun Yang

, Hujun Yin
:
Fourier Ptychography With Information Entropy Based No-Reference Image Quality Assessment Learning. 2709-2715 - Giulia Martinelli, Nicola Garau, Niccolò Bisagno, Nicola Conci:

All Skeletons are Created Equal! A Domain Adaptation Transformer to Handle Multiple Topologies. 2716-2722 - Abolfazl Meyarian, Xiaohui Yuan, Zhinan Qiao:

Spatial Plaid Attention Decoder for Semantic Segmentation. 2723-2729 - Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Abhinav Dhall, Kalin Stefanov

:
Histohdr-Net: Histogram Equalization for Single LDR to HDR Image Translation. 2730-2736 - Umut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj

:
Pixel-Wise Color Constancy Via Smoothness Techniques In Multi-Illuminant Scenes. 2737-2743 - Lianwei Yang, Zhikai Li, Junrui Xiao, Haisong Gong, Qingyi Gu:

MGRQ: Post-Training Quantization For Vision Transformer With Mixed Granularity Reconstruction. 2744-2750 - Dixin Yang, Mariko Isogawa:

Efficient Circular and Confocal Non-Line-Of-Sight Imaging With Transient Sinogram Super Resolution. 2751-2757 - Omar Elezabi, Marcos V. Conde, Radu Timofte:

Simple Image Signal Processing using Global Context Guidance. 2758-2764 - Lu Xu, Chao Zhang, Yasi Wang, Qiang Wang:

Generate DSLR-Like Image With Global Information and Prior Guided ISP. 2765-2771 - Shuo Zhang

, Xinyu Yang, Xiwen Bai, Yu Li:
Clip-Based Composition-Aware Image Cropping. 2772-2778 - Wenbin Luo, Takafumi Iwaguchi, Ryusuke Sagawa, Hiroshi Kawasaki:

Multi-Path Interference Mitigation For Indirect Time-of-Flight Camera By the Distortion of Coding Curve. 2779-2785 - Chris Henry, Paras Maharjan, Zhu Li, George York:

E2SIFT: Neuromorphic SIFT via Direct Feature Pyramid Recovery from Events. 2786-2792 - Jinhao Qiao, Jiang Liu, Heng Yu, Yi Xiao, Hongshan Yu, Yan Zheng, Sihan Li:

VAG: Voxel Attenuation Grid For Sparse-View CBCT Reconstruction. 2793-2799 - Chee-Ming Ting, Fuad Noman, Raphaël C.-W. Phan, Hernando Ombao

:
Dynamic MRI Reconstruction Using Low-Rank Plus Sparse Decomposition With Smoothness Regularization. 2800-2806 - Tariq M. Khan, Shahzaib Iqbal

, Syed Saud Naqvi, Imran Razzak, Erik Meijering:
LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network For Multifeatures Segmentation. 2807-2813 - Aymen Sadraoui, Astrid Laurent-Bellue, Mounir Kaaniche, Amel Benazza-Benyahia, Catherine Guettier, Jean-Christophe Pesquet:

Unrolled Projected Gradient Algorithm For Stain Separation In Digital Histopathological Images. 2814-2819 - K. Pavan Kumar Reddy, Kunal N. Chaudhury:

Deep Regularization For Scale-Agnostic Superresolution of MR Images. 2820-2826 - Min Xiao

, Zi Wang, Jiefeng Guo, Di Guo, Xiaobo Qu:
A 1D Plug-and-Play Synthetic Data Deep Learning For Undersampled Magnetic Resonance Image Reconstruction. 2827-2832 - Adel Oulefki

, Abbes Amira, Fatih Kurugollu, Thaweesak Trongtirakul, Sos S. Agaian, Menen Kassim Mohammed, Mohammad Alshoweky:
Enhancing Intubation Accuracy: Advanced Tracheal Segmentation Techniques In Video Endoscopy. 2833-2838 - Yanyi Li, Jianping Yin:

Adaptive Sampling Method for Whole-Body Low-Dose Pet Reconstruction Based on Reconstruction Difficulty. 2839-2845 - Juliana Do Nascimento Damurie Da Silva, Patrick Horain:

Fourier Ptychography Microscopy With Integrated Positional Misalignment Correction. 2846-2851 - Satoshi Ito, Yuki Sato, Naoya Endo, Shohei Ouchi:

Deep-Learning-Based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image Decoding. 2852-2857 - Kazuki Yamato, Satoshi Ito:

Improvement of Image Reconstruction for MRI Using Phase-Scrambling Fourier Transform and Dual-Domain Strategy. 2858-2864 - Vazim Ibrahim, Joseph Suresh Paul:

A Cross Domain Generative Network for Accelerated MRI. 2865-2870 - Jiayue He, Nan Su, Yanping Liao, Yiming Yan, Shou Feng, Chunhui Zhao:

A Multi-Modality Feature Enhancement Method Based On Feature Disentanglement For Sar Image Target Detection. 2871-2877 - Shuang Li

, Ganggang Dong:
A Learnable Radar Imaging Paradigm Driven by Deep Generative Model. 2878-2884 - Fatima Zaidi, Hira Hameed, Muhammad Farooq

, Aisha Fatima, Kamran Arshad, Khaled Assaleh, Qammer H. Abbasi:
Privacy-Preserving Visual Cues Communication for Hearing-Impaired People Using Deep Learning. 2885-2888 - Sabri Mustafa Kahya, Boran Hamdi Sivrikaya, Muhammet Sami Yavuz, Eckehard G. Steinbach:

Food: Facial Authentication And Out-Of-Distribution Detection With Short-Range FMCW Radar. 2889-2894 - Hao-Chiang Shao

, Tse-Yu Tseng, Yuan-Rong Liao, Chi-Chun Chen, Chung-Yang Hung, Ming-Hsin Liang:
Detecting Biomedical Copy-Move Forgery by Attention-Based Multiscale Deep Descriptors. 2895-2901 - Zelin Li

, Zhaoke Huang, Zhen Zhu, Sicheng You, Zhongying Zhao
, Hong Yan:
Directional And Topological Transformer With Topology Priors For 4D Cellular Image Segmentation. 2902-2908 - Martin Blanchard, Olivier Delézay

, Christophe Ducottet, Damien Muselet:
Delving into the Explainability of Prototype-Based CNN for Biological Cell Analysis. 2909-2915 - Sayan Acharya, Aditya Ganguly, Ram Sarkar, Abin Jose:

Cell Cycle State Prediction Using Graph Neural Networks. 2916-2922 - Ammar Chouchane, Abdelmalik Ouamane, El Ouanas Belabbaci

, Yassine Himeur, Abbes Amira:
Deep Learning-Based Leaf Image Analysis for Tomato Plant Disease Detection and Classification. 2923-2929 - Anindita Mohanta, Sourav Dey Roy

, Niharika Nath, Mrinal Kanti Bhowmik:
Novel Meta Attention Guided Framework for Breast Abnormality Classification With Combination of FSL and DA. 2930-2936 - Zequn Song, Lingfeng Wang:

Dual Multi-Modal Feature Fusion Network for the Evaluation of Osteosarcoma. 2937-2943 - Niladri Chakraborti, Deepak Ranjan Nayak:

MCT-Net: a Lightweight Multiscale Convolutional Transformer Network for Polyp Segmentation. 2944-2950 - Hossam Magdy Balaha, Mayada Elgendy, Ahmed Alksas

, Mohamed Shehata, Norah Saleh Alghamdi, Fatma Taher, Mohammed Ghazal, Mahitab Ghoneim, Eslam Hamed
, Fatma Sherif, Ahmed Elgarayhi, Mohammed Sallah, Mohamed Abdelbadie Salem, Elsharawy Kamal, Harpal Sandhu, Ayman El-Baz:
A Neuroimaging Yolov8-Based Cad Framework for Anosmia Grading in Covid-19. 2951-2956 - Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto:

Physiological Modeling With Multispectral Imaging for Heart Rate Estimation. 2957-2963 - Ahmed Sharafeldeen

, Adel Khelifi, Mohammed Ghazal, Maha Yaghi, Sohail Contractor
, Ayman El-Baz:
Automated Segmentation of Lung Regions in 3D CT Scans Using Hybrid Unsupervised-Supervised Models. 2964-2969 - Ming Kang, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan:

Cafct-Net: A Cnn-Transformer Hybrid Network With Contextual And Attentional Feature Fusion For Liver Tumor Segmentation. 2970-2974 - Syed Muhammad Anwar, Abhijeet Parida, Sara Atito, Muhammad Awais, Gustavo Nino, Josef Kittler, Marius George Linguraru:

SS-CXR: Self-Supervised Pretraining Using Chest X-Rays Towards A Domain Specific Foundation Model. 2975-2981 - Song Wang, Zhong Zhang, Huan Yan, Ming Xu, Guanghui Wang:

Mix-Domain Contrastive Learning For Unpaired H&E-to-IHC Stain Translation. 2982-2988 - Qianyu Du, Baojiang Zhong, Kai-Kuang Ma:

ATU-NET: An Adaptive Transformation-Based U-NET for Medical Image Segmentation. 2989-2995 - Kai-Jun See, Chee-Ming Ting, Fuad Noman, Junn Yong Loo, Yee-Fan Tan, Hernando Ombao

, Raphaël C.-W. Phan:
Deep Multi-Graph Embedded Clustering for Community Detection in FMRI Functional Brain Networks Across Individuals. 2996-3002 - Likai Wang, Tao Zhu, Yipu Zhang:

An Interpretable Deep Graph Neural Network Based On Attentional Multi-Scale Feature Fusion for FMRI Analysis. 3003-3009 - Muhammad Uzair Zahid, Aysen Degerli

, Fahad Sohrab
, Serkan Kiranyaz, Tahir Hamid, Rashid Mazhar, Moncef Gabbouj
:
Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification. 3010-3016 - Agata M. Wijata, Bartlomiej Pycinski, Jakub Nalepa:

A Needle In A (Medical) Haystack: Detecting A Biopsy Needle In Ultrasound Images Using Vision Transformers. 3017-3023 - Ming Kang, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan:

CST-Yolo: A Novel Method For Blood Cell Detection Based On Improved Yolov7 And CNN-Swin Transformer. 3024-3029 - Meryem Amaouche, Ouassim Karrakchou

, Mounir Ghogho, Anouar El Ghazzaly, Mohamed Alami, Ahmed Ameur:
Redefining Cystoscopy With AI: Bladder Cancer Diagnosis Using an Efficient Hybrid CNN-Transformer Model. 3030-3036 - Nagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye:

M3T: Multi-Modal Medical Transformer To Bridge Clinical Context With Visual Insights For Retinal Image Medical Description Generation. 3037-3043 - Elodie Germani

, Elisa Fromont, Camille Maumet:
Uncovering Communities Of Pipelines in the Task-FMRI Analytical Space. 3044-3050 - Mohamed Yousuf, Samir Harb

, Islam Alkabbany
, Asem M. Ali, Salwa Elshazley, Aly A. Farag:
Multi-View Network for Colorectal Polyps Detection in CT Colonography. 3051-3056 - Phuong Thao Nguyen, Hiroshi Watanabe

:
GEEG-YOLOv8: Gaussian Enhanced Euclidean Norm Ghost Attention for Real-Time Polyp Detection. 3057-3063 - Avinash Gaikwad, Anjali Gautam

:
Segmentation of Hard Exudates And Hemorrhages from Diabetic Retinopathy Images Using Residual U-Net with Squeeze and Excite Blocks. 3064-3069 - Damian Kucharski

, Agata M. Wijata, Lu Fu, Weidong Lin, Yumei Xue, Jacek Kawa, Yalin Zheng, Gregory Yoke Hong Lip, Jakub Nalepa:
Giraffe: A Genetic Programming Algorithm To Build Deep Learning Ensembles For Ecg Arrhythmia Classification. 3070-3076 - Ammar Ahmed

, Ali Shariq Imran, Mohib Ullah, Zenun Kastrati, Sher Muhammad Daudpota:
Navigating Limitations With Precision: A Fine-Grained Ensemble Approach To Wrist Pathology Recognition On A Limited X-Ray Dataset. 3077-3083 - Zhengyong Huang, Yao Sui:

Contour-Weighted Loss For Class-Imbalanced Image Segmentation. 3084-3090 - Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan:

Multi-Modal Medical Image Fusion for Non-Small Cell Lung Cancer Classification. 3091-3097 - Teja Krishna Cherukuri, Nagur Shareef Shaik, Dong Hye Ye:

Guided Context Gating: Learning To Leverage Salient Lesions in Retinal Fundus Images. 3098-3104 - Zhongyuan Jing

, Hongyan Xiang, Ruyan Wang:
FEDMI: A Federated Learning Framewoek for Secure Sharing of Medical Images. 3105-3111 - Bingzhen Hou, Guimei Zhang, Huiqun Liu, Yipeng Qin, Ying Chen:

Dcctnet: Kidney Tumors Segmentation Based On Dual-Level Combination Of Cnn And Transformer. 3112-3116 - Yawei Zhang, Bo Li, Xin Li, Yuhan Huang, Hui Ding

:
Wavelet-Enhanced CNN for Depression Classification Based on MRI Images. 3117-3123 - Khalil Chikhaoui, Motaz Alfarraj:

Advancing Colorectal Polyp Segmentation With Watershed Algorithm-Enhanced Parallel Self-Supervised Learning. 3124-3130 - Dhouha Attia, Amel Benazza-Benyahia:

Multiclassification Of Vocal Folds Disorders From Videos By Spatio-Temporal Deep Features. 3131-3136 - Sung-Hyeon Kim, Tae-Min Choi

, Sun-Kyung Lee, Minhee Kim, Jae Gwan Kim, Jong-Hwan Kim:
Event-Specific EEG-FNIRS Feature Fusion FOR Alzheimer's Disease Classification. 3137-3143 - Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen

, Hongbin Liu, Zhen Lei:
PWISeg: Weakly-Supervised Surgical Instrument Instance Segmentation. 3144-3150 - Israa Sharaby, Ahmed Alksas

, Hossam Magdy Balaha, Ali Mahmoud, Mohammed Ali Badawy, Mohamed Abou El-Ghar, Ashraf Khalil, Mohammed Ghazal, Sohail Contractor
, Ayman El-Baz:
A Novel Approach for 3D Renal Segmentation Using a Modified GAN Model and Texture Analysis. 3151-3157 - Sadaf Khademi, Anastasia Oikonomou, Konstantinos N. Plataniotis, Arash Mohammadi:

Nyctale: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction. 3158-3164 - Lei Zhang, Xiaoke Wang, Edward H. Herskovits, Elias R. Melhem, Linda Chang, Ze Wang, Thomas Ernst:

Reducing Motion Artifacts in Brain MRI Using Vision Transformers and Self-Supervised Learning. 3165-3171 - Joohi Chauhan

, Paul L. Rosin, Puneet Goyal:
Burnsnet: Burn Region Segmentation Network From Color Images With Two-Way CNN. 3172-3178 - Chitimireddy Sindhura, Phaneendra K. Yalavarthy, Subrahmanyam Gorthi:

SINO-CT-Fusion-Net: A Lightweight Deep Learning Framework for Detection and Classification of Intracranial Hemorrhages. 3179-3185 - Jules Collenne, Rabah Iguernaissi, Séverine Dubuisson, Djamal Merad:

Reset: A Residual Set-Transformer Approach to Tackle the Ugly-Duckling Sign in Melanoma Detection. 3186-3191 - Baptiste Schall, Rodolphe Anty, Lionel Fillatre:

One-Hot Logistic Regression for Radiomics-Based Classification. 3192-3198 - Devi Prasad Maharathy

, Prabhala Sandhya Gayatri, Angshuman Paul:
Attention-Based Few-Shot Diagnosis of Chest X-Rays Using Semantic Signatures. 3199-3204 - Muhammad Owais, Muhammad Zubair, Taimur Hassan, Divya Velayudhan, Irfan Hussain, Naoufel Werghi:

Recurrent 3-D Multi-Level Visual Transformer For Joint Classification of Heterogeneous 2-d AND 3-D Radiographic Data. 3205-3211 - Samir Harb

, A. Elsayed, Mohamed Yousuf, Islam Alkabbany
, Asem M. Ali, Salwa Elshazley, Aly A. Farag:
Accurate Colon Segmentation Using 2D Convolutional Neural Networks With 3D Contextual Information. 3212-3218 - Ju-Hyeon Nam, Seo-Hyung Park, Su Jung Kim, Sang-Chul Lee:

Vizecgnet: Visual ECG Image Network for Cardiovascular Diseases Classification With Multi-Modal Training and Knowledge Distillation. 3219-3223 - Ahmad Hassanpour, Yasamin Kowsari, Hatef Otroshi-Shahreza, Bian Yang, Sébastien Marcel:

Chatgpt and Biometrics: an Assessment of Face Recognition, Gender Detection, and Age Estimation Capabilities. 3224-3229 - Muhammad Mohzary

, Baek-Young Choi, Sejun Song:
A Trustworthy Authentication Against Visual Master Face Dictionary Attacks (Trauma). 3230-3235 - Anudeep Vurity

, Emanuela Marasco, Raghavendra Ramachandra, Duoduo Liao:
Interpreting the Fraudulence Level of Different Finger Photo Presentation Attack Instruments. 3236-3242 - Sahar Husseini, Jean-Luc Dugelay:

Alignface: Enhancing Face Verification Models Through Adaptive Alignment Of Pose, Expression, and Illumination. 3243-3249 - Léo Nicollier, Marc Michel Pic

, Enric Meinhardt-Llopis, Gabriele Facciolo:
A New Fingerprinting Technique for Engraved Binary Matrix Authentication. 3250-3256 - Joshua Krinsky, Alan Bettis, Qiuyu Tang, Daniel Moreira

, Aparna Bharati:
Exploring Saliency Bias in Manipulation Detection. 3257-3263 - Lin Lu

, Yunhong Wang, Wenqi Zhuo, Liang Zhang, Guangshuai Gao, Yuanfang Guo:
Deepfake Detection Via Separable Self-Consistency Learning. 3264-3270 - Chen Chen

, Xingjun Wang:
A Large-Capacity Data Hiding Scheme in Encrypted VVC Video. 3271-3277 - Youngin Park, Seungtae Nam, Cheul-Hee Hahm, Eunbyung Park:

FREQ-MIP-AA: Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields. 3278-3284 - Jianfeng Xu, Haruhisa Kato, Kei Kawamura:

Temporal Scalable Coding For Dynamic Meshes. 3285-3291 - Zihan Zheng, Houqiang Zhong

, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang:
JOINTRF: End-To-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression. 3292-3298 - Tam Thuc Do, Philip A. Chou, Gene Cheung:

Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression. 3299-3305 - Preeti Meena, Himanshu Kumar, Sandeep Kumar Yadav:

An Indoor Scene Localization Method Using Graphical Summary of Multi-View RGB-D Images. 3306-3312 - Ryosuke Watanabe

, Keisuke Nonaka, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega:
Full-Reference Point Cloud Quality Assessment Using Spectral Graph Wavelets. 3313-3319 - Eric Lei, Muhammad Asad Lodhi, Jiahao Pang, Junghyun Ahn, Dong Tian:

WrappingNet: Mesh Autoencoder Via Deep Sphere Deformation. 3320-3326 - Zehao Yan, Lin Zhang, Zhong Wang, Shenjie Zhao:

IMU-Assisted Target-Free Extrinsic Calibration of Heterogeneous Lidars Based on Continuous-Time Optimization. 3327-3333 - Birendra Kathariya, Zhu Li, Geert Van der Auwera:

TSF-NET3D: TSF-NET for 3D Point Cloud Attribute Compression Artifacts Removal. 3334-3340 - Jiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou:

LiSD: An Efficient Multi-Task Learning Framework For Lidar Segmentation and Detection. 3341-3347 - Mona Alawadh

, Mahesan Niranjan, Hansung Kim:
3D Semantic Scene Completion From A Depth Map With Unsupervised Learning For Semantics Prioritisation. 3348-3354 - Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, André Kaup:

End-to-End Learned Lossy Dynamic Point Cloud Attribute Compression. 3355-3360 - Hitoshi Nishimura, Haruhisa Kato, Kei Kawamura:

Quantization After Inter Prediction in Displacement Coding of Dynamic Meshes. 3361-3367 - Koki Kishimoto, Kei Kawamura, Haruhisa Kato:

Minimization of Submesh Boundary Errors In Dynamic Mesh Coding. 3368-3374 - Mahshad MahdaviMoghadam, Stéphane Coulombe, Carlos Vázquez

, Mohammadreza Jamali, Ahmad Vakili:
Enhancing TMIV Performance Through Proximity-Aware Grouping and Preservation of Small Clusters. 3375-3381 - Mohammadreza Ghafari, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira:

Learning-Based Point Cloud Decoding with Independent and Scalable Reduced Complexity. 3382-3388 - Illia Oleksiienko

, Alexandros Iosifidis
:
Uncertainty-Aware AB3DMOT by Variational 3D Object Detection. 3389-3395 - Conghao Lv, Ping Jiang, Meng Wang, Lixin Lin, Xuechen Chen, Xiaoheng Deng:

Rdssd: 3D Single Stage Object Detector For Roadside Lidar Sensors. 3396-3402 - Efthymios Koukoulis, Gerasimos Arvanitis, Konstantinos Moustakas:

Unleashing the Power of Generalized Iterative Closest Point for Swift and Effective Point Cloud Registration. 3403-3409 - Daniele Mari, André F. R. Guarda, Nuno M. M. Rodrigues, Simone Milani, Fernando Pereira:

Point Cloud Geometry Scalable Coding with a Quality-Conditioned Latents Probability Estimator. 3410-3416 - Shengyang Zhao, Xin Jin:

An Explainable Spectral Analysis For Light Field Image Quality Assessment. 3417-3423 - Yu-Hsiang Huang, Wei Wang, Homer H. Chen:

Super-Resolution for Near-Eye Light Field Display in Fourier Space. 3424-3430 - Vinh Van Duong, Thuc Nguyen Huu, Jonghoon Yim, Byeungwoo Jeon:

Two-Level Intra Prediction Using High-Order Macropixel Neighbors For Plenoptic Video Coding. 3431-3435 - Rômulo Marconato Stringhini, Thiago S. Lermen, Thiago L. T. da Silveira, Cláudio R. Jung:

Single-Panorama Classification of 3D Objects Using Horizontally Stacked Dilated Convolutions. 3436-3442 - Christian Benz, Volker Rodehorst

:
MVCrackViT: Robust Multi-View Crack Detection For Point Cloud Segmentation Using View Attention. 3443-3449 - Jaewoo Park, Jaeguk Kim, Nam Ik Cho:

Multi-Reference Flow-Guided Cross-Domain Reconstruction For General Object 6D Pose Estimation. 3450-3456 - Xudong Jin

, Jianfeng Xu, Kei Kawamura:
Partial Inter-Frame Coding for Dynamic Meshes. 3457-3463 - Cheng Feng

, Congxuan Zhang
, Zhen Chen, Weiming Hu, Liyue Ge:
Real-Time Monocular Depth Estimation on Embedded Systems. 3464-3470 - Alejandro Casanova, Antonio Agudo:

Uncalibrated and Unsupervised Photometric Stereo with Piecewise Regularizer. 3471-3476 - Xinghui Li, Yuchen Ji, Xiansong Lai, Wanting Zhang, Long Zeng:

Fine-Detailed Neural Indoor Scene Reconstruction Using Multi-Level Importance Sampling And Multi-View Consistency. 3477-3483 - Zhiyu Liu, Baojiang Zhong:

DALSM: A Direction-Aware Line Segment Matching Method. 3484-3490 - Junhong Min, Youngpil Jeon:

Confidence Aware Stereo Matching for Realistic Cluttered Scenario. 3491-3497 - Muhammad Waleed, Abdul Rauf, Murtaza Taj:

Camera Calibration Through Geometric Constraints from Rotation and Projection Matrices. 3498-3504 - Huizhu Pan, Ling Li, Senjian An, Hui Xie

:
Combining Raft-Based Stereo Disparity and Optical Flow Models For Scene Flow Estimation. 3505-3511 - Hikaru Chikugo, Kento Arai, Sarthak Pathak, Kazunori Umeda:

Fisheye Stereo Camera Using Fisheye Vertical Stereo Method. 3512-3518 - Yutong Zhang, Wenbo Zhao, Daxin Li, Junjun Jiang, Xianming Liu:

Context-Adaptive Entropy Model With Adapters For Lossless Point Cloud Geometry Compression. 3519-3525 - Masahiro Yamaguchi, Kyota Higa, Toshinori Hosoi, Takashi Shibata:

Robust 3D Semantic Segmentation With Incomplete Point Clouds Based on Sequential Frame Sampling. 3526-3532 - Bowen Liu, Wei Liu, Siang Chen, Pengwei Xie, Guijin Wang:

Category-Agnostic Pose Estimation for Point Clouds. 3533-3539 - Sajid Umair, Birendra Kathariya, Zhu Li, Anique Akhtar, Geert Van der Auwera:

ResNeRF-PCAC: Super Resolving Residual Learning NeRF for High Efficiency Point Cloud Attributes Coding. 3540-3546 - Remco Royen, Kostas Pataridis, Ward van der Tempel, Adrian Munteanu:

RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds. 3547-3553 - Monyneath Yim, Jui-Chiu Chiang:

Mamba-PCGC: Mamba-Based Point Cloud Geometry Compression. 3554-3560 - Liangjing Shao

, Benshuang Chen, Xinrong Chen:
3D Clothed Human Reconstruction From One In-the-Wild RGB Image. 3561-3567 - Lintao Xiang, Hujun Yin

:
Self-Supervised Multi-View Stereo with Adaptive Depth Priors. 3568-3574 - Hyung Kyu Kim, Sangmin Lee

, Hak Gu Kim:
Analyzing Visible Articulatory Movements in Speech Production For Speech-Driven 3D Facial Animation. 3575-3579 - Jianhua Zhang, Huiyu Zhou, Na Lv:

Adaptive Spatial-Temporal Modelling For Human Motion Prediction. 3580-3586 - Taeyun Woo, Tae-Kyun Kim, Jinah Park:

Hand-Object Reconstruction Via Interaction-Aware Graph Attention Mechanism. 3587-3593 - Julian Strohmayer, Martin Kampel:

Directional Antenna Systems for Long-Range Through-Wall Human Activity Recognition. 3594-3599 - Ryota Kondo, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:

Binary-Decomposed Vision Transformer: Compressing and Accelerating Vision Transformer by Binary Decomposition. 3600-3605 - Zicong Hu, Jian Cao, Weichen Xu, Ruilong Ren, Tianhao Fu, Xinxin Xu, Xing Zhang:

Empirical Research On Quantization For 3D Multi-Modal Vit Models. 3606-3612 - Nathan Maurice, Julien Sopena, Lionel Lacassagne:

A New Efficient Split & Merge Algorithm for Embedded Systems. 3613-3619 - Tingting Hu, Ryuji Fuchikami, Shigekiyo Nosaka:

Temporal Clustering and Temporal Reference Based Specular Detection For 1-MS Visual Feedback System. 3620-3626 - Xiao Jiang, Fei Zhou:

Characterization Of Dim Light Response In DVS Pixel: Discontinuity of Event Triggering Time. 3627-3632 - Ranhao Zhang, Mingtao Huang, Xueming Li, Yuan Shen:

Adaptive Tilt-Series Alignment With Feature Resampling in Cryo-Electron Tomography. 3633-3639 - Bohan Lei, Yueting Zhuang, Xiaoyin Xu, Min Zhang:

An Optimal Transport-Based Method For Medical Image Generation. 3640-3646 - Angeliki Katsenou, Xinyi Wang, Daniel Schien, David Bull:

Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming? 3647-3653 - Christian Herglotz, Steven Le Moan, Alexandre Mercat

:
Energy Reduction Opportunities in HDR Video Encoding. 3654-3660 - Steven Le Moan, Mitra Amiri, Christian Herglotz:

Exploiting Change Blindness to Reduce Bitrate and Display Luminance in Video Streaming. 3661-3666 - Xuelin Liu, Haoyun Zhang, Jiebin Yan, Hao Zhang, Yuming Fang, Shiqi Wang:

Quality of Experience of Viewport Adaptive Omnidirectional Video Streaming. 3667-3673 - Yichi Zhang

, Zhihao Duan, Fengqing Zhu:
On Efficient Neural Network Architectures for Image Compression. 3674-3680 - Xuanye Zhang, Zhaobin Zhang, Yaojun Wu, Semih Esenlik, Xiaoyan Sun, Kai Zhang, Li Zhang:

Optimized Decoupled Structure with Non-Local Attention for Deep Image Compression. 3681-3687 - Florian Borzechowski, Michael Schäfer

, Heiko Schwarz, Jonathan Pfaff, Detlev Marpe, Thomas Wiegand:
Optimizing Learned Image Compression On Scalar and Entropy-Constraint Quantization. 3688-3694 - Tianma Shen, Ying Liu:

Parallel Task-Prompts ICM: A Versatile Feature Codec for Machine Vision. 3695-3701 - Takahiro Shindo, Kein Yamada, Taiju Watanabe, Hiroshi Watanabe

:
Image Coding For Machines With Edge Information Learning Using Segment Anything. 3702-3708 - Bolin Chen

, Shanzhi Yin
, Peilin Chen, Shiqi Wang
, Yan Ye:
Generative Visual Compression: A Review. 3709-3715 - Mateen Ulhaq, Ivan V. Bajic:

Learned Compression of Encoding Distributions. 3716-3722 - Mingyi Yang, Xionghui Mao, Yujie Yin, Zhiwei Zhu, Defa Wang, Shuai Wan, Fuzheng Yang:

Learning-Based Video Compression with Continuously Variable Bitrate Coding. 3723-3729 - Md Adnan Faisal Hossain, Fengqing Zhu:

Structured Pruning and Quantization for Learned Image Compression. 3730-3736 - Du Liu, Jacob Ström, Mitra Damghanian, Per Wennersten:

NN-Based In-Loop Filtering With Inputs Transformed. 3737-3743 - Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Foessel, André Kaup:

A Study on the Effect of Color Spaces in Learned Image Compression. 3744-3750 - Marwa Tarchouli, Thomas Guionnet, Marc Rivière, Wassim Hamidouche, Meriem Outtas, Olivier Déforges:

Res-NeRV: Residual Blocks For A Practical Implicit Neural Video Decoder. 3751-3757 - Yuxin Xie, Li Yu, Farhad Pakdaman, Moncef Gabbouj

:
Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-Onns. 3758-3764 - Yichen Zhou, Xinfeng Zhang, Yingzhan Xu, Kai Zhang, Li Zhang:

Adaptive Downsampling and Spatial Upconversion for Point Cloud Compression. 3765-3770 - Kei Iino, Shunsuke Akamatsu, Hiroshi Watanabe

, Shohei Enomoto, Akira Sakamoto, Takeharu Eda
:
Improving Image Coding for Machines Through Optimizing Encoder Via Auxiliary Loss. 3771-3777 - Yuhang Lu, Touradj Ebrahimi:

Towards the Detection of AI-Synthesized Human Face Images. 3778-3784 - Nikolaos Fotos

, Jaime Delgado:
Towards Privacy-Enhancing Provenance Annotations for Images. 3785-3791 - Yuqing Yang

, Charuka Moremada, Nikos Deligiannis
:
On The Detection Of Images Generated From Text. 3792-3798 - Deepayan Bhowmik, Sabrina B. Caldwell

, Jaime Delgado, Touradj Ebrahimi, Nikolaos Fotos
, Xiaojun Gu, Ziyuan Hu, Xin Kang, Fernando Pereira, Leonard Rosenthol, Frederik Temmermans
, Haibo Zhou:
An International Standard For Assessing Trustworthiness In Media. 3799-3805 - Orazio Pontorno

, Luca Guarnera, Sebastiano Battiato:
On the Exploitation of DCT-Traces in the Generative-AI Domain. 3806-3812 - Razaib Tariq, Shahroz Tariq

, Simon S. Woo:
Exploring the Impact of Moire Pattern on Deepfake Detectors. 3813-3819 - Qurat Ul Ain, Ali Javed, Khalid Mahmood Malik, Aun Irtaza:

Exposing the Limits of Deepfake Detection using novel Facial mole attack: A Perceptual Black- Box Adversarial Attack Study. 3820-3826 - Zihang Lyu, Jun Xiao, Cong Zhang, Kin-Man Lam:

AI-Generated Image Detection With Wasserstein Distance Compression and Dynamic Aggregation. 3827-3833 - Nora Hofer:

Increasing Trust in Image Analysis by Detecting Trellis Quantization in JPEG Images. 3834-3840 - Halil Ismail Helvaci, Chen-Nee Chuah, Sally Ozonoff, Sen-Ching Samson Cheung

:
Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder. 3841-3847 - Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shah:

Codamal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes. 3848-3853 - Honghui Chen, Baoquan Zhao, Guanghui Yue, Weide Liu, Chenlei Lv, Ruomei Wang, Fan Zhou:

Clip-Medfake: Synthetic Data Augmentation With AI-Generated Content for Improved Medical Image Classification. 3854-3860 - Maroof Abdul Aziz, Fatemeh Javadian, Sherin Susheel Mathew, Avinash Gopal, Johannes Stegmaier, Sonit Singh

, Abin Jose:
Deep Learning Approach for Renal Cell Carcinoma Detection, Subtyping, And Grading. 3861-3867 - Ufaq Khan, Umair Nawaz, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El-Saddik:

Deepskinformer: Skin Lesion Segmentation Using Hierarchical Transformers And Edge Enhancement. 3868-3874 - Savas Özkan, Mete Özay:

Towards Better Control Of Latent Spaces For Face Editing. 3875-3881 - Mariano Rivera:

How to Train Your VAE. 3882-3888 - Guanji Li, Hongxia Gao:

Apnet: Generating Precise Anomaly Prior Information for Mixed-Supervised Defect Detection. 3889-3895 - Lukas Strack, Futa Waseda, Huy H. Nguyen, Yinqiang Zheng, Isao Echizen:

Defending Against Physical Adversarial Patch attacks On Infrared Human Detection. 3896-3902 - Cansu Korkmaz, Ege Çirakman, A. Murat Tekalp, Zafer Dogan:

Trustworthy Sr: Resolving Ambiguity In Image Super-Resolution Via Diffusion Models And Human Feedback. 3903-3909 - AprilPyone MaungMaung, Huy H. Nguyen, Hitoshi Kiya, Isao Echizen:

Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation. 3910-3916 - Lyes Saad Saoud, Zhenwei Niu, Lakmal D. Seneviratne, Irfan Hussain:

Real-Time and Resource-Efficient Multi-Scale Adaptive Robotics Vision for Underwater Object Detection and Domain Generalization. 3917-3923 - Mehvish Nissar, Badri Narayan Subudhi, Vinit Jakhetiya, Amit Kumar Mishra:

Underwater Change Detection Using Multiple Sampling-Based Probabilistic Learner and Feature Preservance Discriminator. 3924-3930 - Yuehui Fan, Baoyao Yang, Meng Shen, Fei Lyu:

Domain Dilation for Single Domain Generalization. 3931-3937 - Prithwijit Chowdhury, Mohit Prabhushankar, Ghassan AlRegib, Mohamed Deriche:

Are Objective Explanatory Evaluation Metrics Trustworthy? An Adversarial Analysis. 3938-3944 - Lixin Liu, Zhibo Liu, Xiaozhen Lu, Yanling Bu, Bin Han, Liang Xiao:

Reinforcement Learning-Based Secure Video Transmission For IOV Systems. 3945-3950 - Meha Hachani, Azza Ouled Zaid:

JPEG Image Ciphering Based on Chaotic Encryption. 3951-3957 - Weixuan Chen, Qianqian Yang, Zhaohui Yang

, Yiping Duan, Zhaoyang Zhang:
Pilot-Free Semantic Communication Over Multi-User Mimo Fading Channels. 3958-3964 - Dong Han

, Yufan Jiang
, Yong Li, Ricardo Mendes, Joachim Denzler:
Robust Skin Color Driven Privacy-Preserving Face Recognition Via Function Secret Sharing. 3965-3971 - Anupam Borthakur, Apoorva Srivastava, Avik Kar, Dipayan Dewan, Debdoot Sheet:

Fantom: Federated Adversarial Network for Training Multi-Sequence Magnetic Resonance Imaging in Semantic Segmentation. 3972-3978 - Yohann Perron, Eric Bezzam, Martin Vetterli:

A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction. 3979-3985 - Igor Shevkunov

, Mykola Ponomarenko
, Jere Heimo, Karen O. Egiazarian
:
Lensless Phase Retrieval With Regularization By Blind Noise Map Estimation and Denoising. 3986-3992 - Leon Suarez-Rodriguez, Roman Jacome, Henry Arguello:

Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation Approach. 3993-3999 - Julian Strohmayer, Rafael Sterzinger, Christian Stippel, Martin Kampel:

Through-Wall Imaging Based On WiFi Channel State Information. 4000-4006 - Vatsala Sharma

, Suyash P. Awate:
Adversarial EM For Partially-Supervised Image-Quality Enhancement: Application To Low-Dose Pet Imaging. 4007-4013 - Islam I. Osman, Mohamed S. Shehata:

Learn By An Example Transformer For Domain Generalization In Video Object Segmentation. 4014-4020 - Qingwang Wang, Xin Qu, Liyao Zhou, Pengcheng Jin, Chengbiao Fu, Tao Shen:

Edge-Guided Pixel Level Connected Component Assisted Camouflaged Object Detection. 4021-4027 - Rahul Tekchandani, Ritik Maheshwari, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi

, Subrahmanyam Murala:
Luminate: Linguistic Understanding and Multi-Granularity Interaction for Video Object Segmentation. 4028-4034 - H. Çagriota Bilgi, A. Aydiotan Alatan:

Bi-Directional Tracklet Embedding for Multi-Object Tracking. 4035-4041 - Kyujin Shim, Jubi Hwang, Kangwook Ko, Changick Kim:

A Confidence-Aware Matching Strategy For Generalized Multi-Object Tracking. 4042-4048

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














