


default search action
ACCV 2024: Hanoi, Vietnam - Part VI
- Minsu Cho

, Ivan Laptev, Du Tran, Angela Yao
, Hongbin Zha:
Computer Vision - ACCV 2024 - 17th Asian Conference on Computer Vision, Hanoi, Vietnam, December 8-12, 2024, Proceedings, Part VI. Lecture Notes in Computer Science 15477, Springer 2025, ISBN 978-981-96-0959-8
Applications of Computer Vision
- Lingya Li

, Zhixing Hou
, Ming Ma, Jing Xiang, Chuangxin Yuan, Guihua Xia:
Spotlight on Small-Scale Ship Detection: Empowering YOLO with Advanced Techniques and a Novel Dataset. 3-17 - Minse Ha, Wan-Gi Bae

, Geunyoung Bae, Jong Taek Lee
:
ELLAR: An Action Recognition Dataset for Extremely Low-Light Conditions with Dual Gamma Adaptive Modulation. 18-35 - Qi Chen

, Yutong Xie
, Biao Wu, Xiaomin Chen, James Ang, Minh-Son To
, Xiaojun Chang
, Qi Wu:
Act Like a Radiologist: Radiology Report Generation Across Anatomical Regions. 36-52 - Jiahao Ma

, Zicheng Duan
, Liang Zheng
, Chuong Nguyen
:
Multiview Detection with Cardboard Human Modeling. 53-70 - Trong-Thang Pham, Ngoc-Vuong Ho, Nhat-Tan Bui, Thinh Phan, Patel Brijesh Patel, Donald A. Adjeroh, Gianfranco Doretto, Anh Nguyen, Carol C. Wu, Hien Nguyen, Ngan Le:

FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Generation. 71-88 - Shuhong Chen

, Matthias Zwicker
:
Match-Free Inbetweening Assistant (MIBA): A Practical Animation Tool Without User Stroke Correspondence. 89-103 - Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu:

High-Quality Visually-Guided Sound Separation from Diverse Categories. 104-122 - Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu:

Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation. 123-139 - Dongliang Zhang, Yunfei Li, Jiaran Zhou, Yuezun Li:

DPL: Cross-Quality DeepFake Detection via Dual Progressive Learning. 140-156 - Le Wang

, Shigang Li
:
Learning Neural Radiance Field from Quasi-uniformly Sampled Spherical Image for Immersive Virtual Reality. 157-171 - Devank, Jayateja Kalla, Soma Biswas:

CoVLM: Leveraging Consensus from Vision-Language Models for Semi-supervised Multi-modal Fake News Detection. 172-189 - Bach-Hoang Ngo

, Si-Tri Ngo, Phu-Duc Le, Quang-Minh Phan, Minh-Triet Tran
, Trung-Nghia Le
:
CrossPAR: Enhancing Pedestrian Attribute Recognition with Vision-Language Fusion and Human-Centric Pre-training. 190-205 - Qianqian Zhang

, Linwei Qiu
, Li Zhou
, Junshe An:
ESM-YOLO: Enhanced Small Target Detection Based on Visible and Infrared Multi-modal Fusion. 206-221 - Zhanyi Lu, Yue Zhou, Ao Chen:

Enhancing Photo Animation: Augmented Stylistic Modules and Prior Knowledge Integration. 222-238 - Cairong Yan

, Meng Ma, Yanting Zhang
, Yongquan Wan
:
Dual-Path Multimodal Optimal Transport for Composed Image Retrieval. 239-254 - Alvaro Budria

, Adrián López Rodríguez
, Òscar Lorente, Francesc Moreno-Noguer
:
InstantGeoAvatar: Effective Geometry and Appearance Modeling of Animatable Avatars from Monocular Video. 255-277 - Jaekyeong Lee, Geonung Kim, Sunghyun Cho:

RNA: Video Editing with ROI-Based Neural Atlas. 278-293 - Hongda Liu, Longguang Wang, Weijun Guan, Ye Zhang, Yulan Guo:

Pluggable Style Representation Learning for Multi-style Transfer. 294-312 - Bingzhi Duan, Xiaoyue Wan, Xu Zhao:

FSGait: Fine-Grained Self-supervised Gait Abnormality Detection. 313-329 - Chiheng Zhou, Yongxia Zhou, Chen Pan:

FocusNet: Cascaded Lightweight Networks and Ascending Feature Enhancement for Efficient Salient Object Detection. 330-345 - Jingchong Weng

, Boyang Li
, Kai Huang
:
Event-Based Image Enhancement Under High Dynamic Range Scenarios. 346-360 - Chi Dai Tran

, Long Hoang Pham
, Duong Nguyen-Ngoc Tran
, Quoc Pham-Nam Ho
, Jae Wook Jeon
:
Dual Memory Networks Guided Reverse Distillation for Unsupervised Anomaly Detection. 361-378 - Wenbin Tian, Qingmiao Jiang

, Lu Chen, Haolin Li, Jinyao Yan
:
Enhanced Asymmetric Invertible Network for Neural Video Delivery. 379-394 - Tharsan Senthivel, Ngoc-Son Vu:

QR-DETR: Query Routing for Detection Transformer. 395-412 - Tsung-Han Chou

, Brian Wang
, Wei-Chen Chiu
, Jun-Cheng Chen
:
A Recipe for CAC: Mosaic-Based Generalized Loss for Improved Class-Agnostic Counting. 413-428 - Xu Guo, Yujin Zheng

, Dingwen Wang:
PMTrack: Multi-object Tracking with Motion-Aware. 429-444 - Mohammadreza Salehi, Nikolaos Apostolikas, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano:

Redefining Normal: A Novel Object-Level Approach for Multi-object Novelty Detection. 445-461 - Jiwon Kim, Byeongho Heo, Sangdoo Yun, Seungryong Kim, Dongyoon Han:

Match Me If You Can: Semi-supervised Semantic Correspondence Learning with Unpaired Images. 462-479 - Marina Khoroshiltseva

, Luca Palmieri
, Sinem Aslan
, Sebastiano Vascon
, Marcello Pelillo
:
Nash Meets Wertheimer: Using Good Continuation in Jigsaw Puzzles. 480-495

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














