


default search action
23rd MMSP 2021: Tampere, Finland
- 23rd International Workshop on Multimedia Signal Processing, MMSP 2021, Tampere, Finland, October 6-8, 2021. IEEE 2021, ISBN 978-1-6654-3288-7

- Jens Brandenburg, Adam Wieckowski, Anastasia Henkel, Benjamin Bross, Detlev Marpe:

Pareto-optimized coding configurations for VVenC, a fast and efficient VVC encoder. 1-6 - Shahab Pasha, Arman Arian, Jan Lundgren

:
Machine-learnt Beamforming for Large Aperture 3D Microphone Arrays, An Industrial Application. 1-6 - Alper Koz

, Baris Demirkiliç, Yunus Bilge Kurt, Ahmet Oguz Akyüz
, Sinan Kalkan, A. Aydin Alatan, Alan Chalmers
:
HDR Image Construction from Trifocal Multiexposure Images. 1-5 - Xiaoya Zhang, Yuanzhi Yao

, Nenghai Yu:
Convolutional Neural Network-driven Optimal Prediction for Image Reversible Data Hiding. 1-6 - Viktoria Heimann, Andreas Spruck, André Kaup:

Frequency-Selective Mesh-to-Mesh Resampling for Color Upsampling of Point Clouds. 1-6 - Michael Buron Yuen, Carlos Vázquez

:
Human Subject Distance Estimation Using the Pupillary Distance and Head Orientation. 1-6 - Wentao Yu, Steffen Zeiler, Dorothea Kolossa:

Large-vocabulary Audio-visual Speech Recognition in Noisy Environments. 1-6 - Negar Heidari, Alexandros Iosifidis

:
Progressive Spatio-Temporal Bilinear Network with Monte Carlo Dropout for Landmark-based Facial Expression Recognition with Uncertainty Estimation. 1-6 - Lohic Fotio Tiotsop, Tomas Mizdos, Enrico Masala, Marcus Barkowsky, Peter Pocta

:
How to Train No Reference Video Quality Measures for New Coding Standards using Existing Annotated Datasets? 1-6 - Ho Tan Nguyen, Chi Do-Kim Pham, Jinjia Zhou:

SpeedDeblur: A Framework to speed up CNN-based Deblurring for HEVC compressed video. 1-6 - Waqas Ellahi, Toinon Vigier, Patrick Le Callet:

A machine-learning framework to predict TMO preference based on image and visual attention features. 1-6 - Maxim Verwilst, Nina Zizakic, Lingchen Gu, Aleksandra Pizurica

:
Deep image hashing based on twin-bottleneck hashing with variational autoencoders. 1-6 - María Santamaría

, Vinod Kumar Malamal Vadakital, Lukasz Kondrad, Antti Hallapuro, Miska M. Hannuksela:
Coding of volumetric content with MIV using VVC subpictures. 1-6 - Hannes Fassold:

Detecting speaking persons in video. 1 - Anubhav Jain, Pavel Korshunov, Sébastien Marcel:

Improving Generalization of Deepfake Detection by Training for Attribution. 1-6 - Yingqi Tang, Xiang Zhang, Donghang Chen, Zhizhuo Zhang, Haifei Yu:

Motion-augmented Change Detection for Video Surveillance. 1-6 - Anthony Trioux

, Giuseppe Valenzise, Marco Cagnazzo
, Michel Kieffer
, François-Xavier Coudoux, Patrick Corlay, Mohamed Gharbi:
A Perceptual Study of the Decoding Process of the SoftCast Wireless Video Broadcast Scheme. 1-6 - Simoni Panayi, Alessandro Artusi:

Hazing or Dehazing: the big dilemma for object detection. 1-9 - Bohan Li, Lauren Partin, Jingning Han, Yaowu Xu:

A Temporal Filtering Approach Based on Optical Flow Estimation for Video Coding. 1-6 - Gerasimos Arvanitis

, Aris S. Lalos, Konstantinos Moustakas:
Fast Spatio-temporal Compression of Dynamic 3D Meshes. 1-6 - Shangyin Gao, Lev Markhasin, Bi Wang:

Spatial Cross-Attention RGB-D Fusion Module for Object Detection. 1-6 - Kelvin Chelli, Roopak R. Tamboli, Thorsten Herfet:

Deep Learning-based Semantic Analysis of Sparse Light Field Ray Sets. 1-6 - Mehryar Abbasi, Parvaneh Saeedi, Jason Au, Jon Havelock:

Timed Data Incrementation: A Data Regularization Method for IVF Implantation Outcome Prediction from Length Variant Time-lapse Image Sequences. 1-5 - Antonio Jesús Muñoz-Montoro

, Julio J. Carabias-Orti
, Pedro Vera-Candeas
:
Ambisonics domain Singing Voice Separation combining Deep Neural Network and Direction Aware Multichannel NMF. 1-6 - Sardar Basiri, Kaiwen Zhang, Stéphane Coulombe:

An Action-Aware Combat Model for Efficient Video Compression of Massively Multiplayer Online Role-playing Games on Cloud Gaming Platforms. 1-6 - Toby Godwin, Georgios Rizos, Alice Baird, Najla D. Al Futaisi, Vincent Brisse, Björn W. Schuller:

Evaluating Deep Music Generation Methods Using Data Augmentation. 1-6 - Deeraj Nagothu, Ronghua Xu, Yu Chen, Erik Blasch, Alexander J. Aved:

DeFake: Decentralized ENF-Consensus Based DeepFake Detection in Video Conferencing. 1-6 - Yuki Sugimoto, Shoko Imaizumi:

A Lossless Image Processing Method with Contrast and Saturation Enhancement. 1-6 - Marc Górriz Blanch, Issa Khalifeh, Noel E. O'Connor

, Marta Mrak:
Attention-based Stylisation for Exemplar Image Colourisation. 1-6 - Mateusz Guzik

, Mieszko Fras, Konrad Kowalczyk
:
Incorporation of Localization Information for Sound Source Separation in Spherical Harmonic Domain. 1-6 - Ilyass Abouelaziz, Aladine Chetouani, Mohammed El Hassouni, Hocine Cherifi:

No-Reference Mesh Visual Quality Assessment Using Graph-Based Deep Learning. 1-6 - Haruhisa Kato

, Tatsuya Kobayashi, Masaru Sugano, Sei Naito:
Split Rendering of the Transparent Channel for Cloud AR. 1-6 - Wang Peng, Liping Yang, Xiaohua Gu:

Convolutional Receptive Field Dual Selection Mechanism for Acoustic Scene Classification. 1-6 - Vignesh V. Menon

, Hadi Amirpour, Christian Timmerer, Mohammed Ghanbari:
INCEPT: Intra CU Depth Prediction for HEVC. 1-6 - Ismael Seidel, Vanio Rodrigues Filho, Mateus Grellert

, Luciano Volcan Agostini, José Luís Güntzel:
SAD or SATD? How the Distortion Metric Impacts a Fractional Motion Estimation VLSI Architecture. 1-6 - Julitta Bartolewska

, Konrad Kowalczyk
:
Frame-based Maximum a Posteriori Estimation of Second-Order Statistics for Multichannel Speech Enhancement in Presence of Noise. 1-6 - Bilal Hassan, Ebroul Izquierdo:

ApparelNet: Person Verification Encompassing Auxiliary Attachments Variation. 1-6 - Yuzhuo Ren, Braeden Syrnyk, Niranjan Avadhanam:

Dual Attention Network for Heart Rate and Respiratory Rate Estimation. 1-6 - Andrey Makrushin

, Mark Trebeljahr, Stefan Seidlitz, Jana Dittmann:
On feasibility of GAN-based fingerprint morphing. 1-6 - Erion-Vasilis M. Pikoulis, Christos Mavrokefalidis, Aris S. Lalos:

A data-aware dictionary-learning based technique for the acceleration of deep convolutional networks. 1-5 - Conggui Liu, Yoshinao Sato:

Enhancing Block-Online Speech Separation using Interblock Context Flow. 1-6 - Federico Simonetta

, Stavros Ntalampiras
, Federico Avanzini:
Audio-to-Score Alignment Using Deep Automatic Music Transcription. 1-6 - Srividya Tirunellai Rajamani, Kumar T. Rajamani, Björn W. Schuller:

Towards an Efficient Deep Learning Model for Emotion and Theme Recognition in Music. 1-5 - Mikko Parviainen, Pasi Pertilä:

Time Difference of Arrival Estimation of Multiple Simultaneous Speakers Using Deep Clustering Neural Networks. 1-6 - Da-Yoon Nam, Hae-Kwang Kim, Jong-Ki Han:

Efficient View Synthesis Algorithm Using View Selection for Generating 6DoF Images. 1-6 - Davi Lazzarotto, Evangelos Alexiou

, Touradj Ebrahimi
:
Benchmarking of objective quality metrics for point cloud compression. 1-6 - Simon Grosche

, Fabian Brand, André Kaup:
A Novel End-To-End Network for Reconstruction of Non-Regularly Sampled Image Data Using Locally Fully Connected Layers. 1-6 - Farid Alijani

, Esa Rahtu
:
Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition. 1-6 - Ugur Alican Alma

, Pablo Alvarez Romeo
, Mehmet Ercan Altinsoy
:
Preliminary Study of Upper-Body Haptic Feedback Perception on Cinematic Experience. 1-6 - Dorsaf Sebai:

Multi-rate deep semantic image compression with quantized modulated autoencoder. 1-6 - David Heise, Helen L. Bear:

Visually Exploring Multi-Purpose Audio Data. 1-6 - Steve Göring, Alexander Raake

:
Rule of Thirds and Simplicity for Image Aesthetics using Deep Neural Networks. 1-6 - Theyab A. Alotaibi, Farid Bourennani, Ishtiaq Rasool Khan

:
Assessing the Performance of Image Quality Assessment Metrics. 1-6 - Sarah Fachada

, Armand Losfeld, Takanori Senoh, Gauthier Lafruit, Mehrdad Teratani:
A Calibration Method for Subaperture Views of Plenoptic 2.0 Camera Arrays. 1-6 - Zeman Shao, Shaobo Fang, Runyu Mao, Jiangpeng He, Janine L. Wright, Deborah A. Kerr

, Carol J. Boushey, Fengqing Zhu:
Towards Learning Food Portion From Monocular Images With Cross-Domain Feature Adaptation. 1-6 - Steve Göring, Rakesh Rao Ramachandra Rao, Stephan Fremerey, Alexander Raake

:
AVrate Voyager: an open source online testing platform. 1-6 - Madhukar Bhat, Jean-Marc Thiesse, Patrick Le Callet:

VVC partitioning decision driven by machine learning for a comprehensive hardware encoder. 1-6 - Aladine Chetouani, Maurice Quach, Giuseppe Valenzise, Frédéric Dufaux

:
Convolutional Neural Network for 3D Point Cloud Quality Assessment with Reference. 1-6 - Minghong Mo, Fan Liang, Jun Wang:

An Optimization Algorithm for Color Table Generation of Palette Mode for VVC. 1-5 - Çaglar Aytekin, Sakari Alenius, Dmytro Paliy, Juuso Gren:

A Sub-band Approach to Deep Denoising Wavelet Networks and a Frequency-adaptive Loss for Perceptual Quality. 1-6 - Alireza Zare, Alireza Aminlou, Miska M. Hannuksela:

VVC Adaptive Loop Filter Optimization for Subpicture-based Viewport-adaptive Streaming. 1-6 - Shoken Kaneko, Hannes Gamper:

A Fast Forest Reverberator Using Single Scattering Cylinders. 1-5 - Hans-Jürgen Zepernick, Kerstin Pieper, Robert P. Spang, Ulrich Engelke, Matthias Hirth

, Babak Naderi:
On the Impact of COVID-19 on Subjective Digital Media Quality Assessment. 1-6 - Runyu Mao, Jiangpeng He, Luotao Lin, Zeman Shao, Heather A. Eicher-Miller, Fengqing Zhu:

Improving Dietary Assessment Via Integrated Hierarchy Food Classification. 1-6 - Jan Willem Kleinrouweler, Toni Dimitrovski

, Sjors Braam, Rick Hindriks
, Hans van den Berg, Lucia D'Acunto, Omar Niamut
:
Dynamic Edge Offloading for Real-time Video Processing Pipelines. 1 - Ashish Alex, Lin Wang

, Paolo Gastaldo, Andrea Cavallaro:
Mixup Augmentation for Generalizable Speech Separation. 1-6 - Vanio Rodrigues Filho, Marcio Monteiro, Ismael Seidel, Mateus Grellert

, José Luís Güntzel:
Hardware-Friendly Search Patterns for the Versatile Video Coding Fractional Motion Estimation. 1-6 - Andreas Papandreou, Andreas Kloukiniotis, Aris S. Lalos, Konstantinos Moustakas:

Deep multi-modal data analysis and fusion for robust scene understanding in CAVs. 1-6 - Minh Nguyen, Ekrem Çetinkaya

, Hermann Hellwagner, Christian Timmerer:
WISH: User-centric Bitrate Adaptation for HTTP Adaptive Streaming on Mobile Devices. 1-6 - Nesryne Mejri

, Konstantinos Papadopoulos, Djamila Aouada:
Leveraging High-Frequency Components for Deepfake Detection. 1-6 - Weiyan Chen

, Changjian Zhu, Shan Zhang:
Piecewise Segmentation Occlusion Model for Image-Based Plenoptic Spectral Analysis. 1-6 - Farid Alijani, Esa Rahtu:

Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition. 1 - Hyunse Yoon, Seongmin Lee, Jiwoo Kang, Sanghoon Lee:

Deep Chessboard Corner Detection Using Multi-task Learning. 1-6 - Nayna Jain, Karthik Nandakumar, Nalini K. Ratha

, Sharath Pankanti, Uttam Kumar:
Optimizing Homomorphic Encryption based Secure Image Analytics. 1-6 - Mert Seker, Anssi Männistö, Alexandros Iosifidis

, Jenni Raitoharju
:
Automatic Main Character Recognition for Photographic Studies. 1-6 - Xhenis Çoba, Fangchen Feng, Azeddine Beghdadi:

Blind image separation for document restoration using plug-and-play approach. 1-6 - Jian Cao, Yifan Jia, Fan Liang, Jun Wang:

Encounter CU Again: History-Based Complexity Reduction Strategy for VVC Intra-Frame Encoder. 1-6 - Chaofei Wang, Wenjie Zhu, Yingzhan Xu, Yiling Xu, Le Yang

:
Point-Voting based Point Cloud Geometry Compression. 1-5 - Yana Nehmé, Patrick Le Callet, Florent Dupont, Jean-Philippe Farrugia, Guillaume Lavoué

:
Exploring Crowdsourcing for Subjective Quality Assessment of 3D Graphics. 1-6 - Milan Stepanov, M. Umair Mukati, Giuseppe Valenzise, Søren Forchhammer

, Frédéric Dufaux
:
Learning-based lossless light field compression. 1-6 - Paritosh Parmar, Jaiden Reddy, Brendan Morris:

Piano Skills Assessment. 1-5 - Sotirios Papadopoulos, Charalampos Symeonidis, Ioannis Pitas:

Leader and breakaway detection in racing sports videos. 1-5 - Evgeny Belyaev:

Fast Decoding and Parameters Selection for CS-JPEG Video Codec. 1-6 - Yufei Zeng, Yanxiong Li, Zhenfeng Zhou, Ruiqi Wang, Difeng Lu:

Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network. 1-5 - Fang-Yi Chao, Cagri Ozcinar, Aljosa Smolic:

Transformer-based Long-Term Viewport Prediction in 360° Video: Scanpath is All You Need. 1-6 - Stephen Voran

:
Optimal Frame Duration for Oracle Audio Signal Separation is Determined by Joint Minimization of Two Antagonistic Artifacts. 1-6 - Christoph Gerhardt, Florian Weidner

, Wolfgang Broll:
OUTSIDE: Multi-Scale Semantic Segmentation of Universal Outdoor Scenes. 1-6 - Frank Sippel, Jürgen Seiler, André Kaup:

Hyperspectral Image Reconstruction from Multispectral Images Using Non-Local Filtering. 1-6 - Emre Can Kaya, Ioan Tabus

:
Neural Network Modeling of Probabilities for Coding the Octree Representation of Point Clouds. 1-6 - Arnaud Soulier

, Pauline Puteaux, Frédéric Comby, William Puech:
Lossless Satellite Data Compression for Real-Time Navigation of Autonomous Vehicles. 1-6 - Yoichi Matsuo

, Kazuhisa Yamagishi, Shoko Takahashi:
Shapley-value-based Quality Degradation Analysis Method for Adaptive Bitrate Streaming Services. 1-6 - Bishwo Adhikari, Xingyang Ni, Esa Rahtu

, Heikki Huttunen
:
Towards a Real-Time Facial Analysis System. 1-6 - Teck Kai Chan, Cheng Siong Chin:

Detecting Sound Events Using Convolutional Macaron Net With Pseudo Strong Labels. 1-6 - Alireza Javaheri, Catarina Brites

, Fernando Pereira, João Ascenso
:
A Point-to-Distribution Joint Geometry and Color Metric for Point Cloud Quality Assessment. 1-6 - Olfa Haggui, Hamza Bayd, Baptiste Magnier, Arezki Aberkane:

Human Detection in Moving Fisheye Camera using an Improved YOLOv3 Framework. 1-6 - Randy Frans Fela

, Nick Zacharov, Søren Forchhammer:
Perceptual Evaluation of 360 Audiovisual Quality and Machine Learning Predictions. 1-6 - Davide Berghi

, Adrian Hilton
, Philip J. B. Jackson:
Visually Supervised Speaker Detection and Localization via Microphone Array. 1-6 - Haoyu Chen

, Edward J. Delp, Amy R. Reibman
:
Estimating Image Quality for Person Re-Identification. 1-6 - Rita Fermanian, Mikael Le Pendu, Christine Guillemot

:
Regularizing the Deep Image Prior with a Learned Denoiser for Linear Inverse Problems. 1-6 - Ailbhe Gill, Mikael Le Pendu, Martin Alain, Emin Zerman, Aljosa Smolic:

Light Field Visual Attention Prediction Using Fourier Disparity Layers. 1-6 - Pramit Mazumdar

, Giuliano Arru, Marco Carli, Federica Battisti:
Analysis of the influence of human faces for the estimation of salience in omnidirectional images. 1-5 - Anastasios Vafeiadis, Ioannis Papadimitriou, Anastasis Papanagnou

, Dimitrios Giakoumis, Konstantinos Votis, Dimitrios Tzovaras
:
Evaluating Spectral Magnitude Representation and Spectral Energy for Audio-based Activity Detection. 1-6 - Zhenyu Lei, Yejing Xie, Suiyi Ling, Andreas Pastor, Junle Wang, Junyu Dong, Patrick Le Callet:

Multi-Modal Aesthetic Assessment for Mobile Gaming Image. 1-5 - Abhishek Goswami

, Ali Ak
, Wolf Hauser, Patrick Le Callet, Frédéric Dufaux
:
Reliability of Crowdsourcing for Subjective Quality Evaluation of Tone Mapping Operators. 1-6 - Joakim Edlund, Christine Guillemot, Mårten Sjöström

:
Analysis of Top-Down Connections in Multi-Layered Convolutional Sparse Coding. 1-6 - Stuart W. Perry

, Luís Alberto da Silva Cruz
, Emil Dumic
, Nhung Hong Thi Nguyen, António M. G. Pinheiro, Evangelos Alexiou
:
Comparison of Remote Subjective Assessment Strategies in the Context of the JPEG Pleno Point Cloud Activity. 1-6 - Meenakshi

, Seshan Srirangarajan:
Low-Rank Double Relaxed Regression for Discriminative Projection Learning. 1-6 - Hui Yuan, Raouf Hamzaoui, Ferrante Neri

, Shengxiang Yang, Tingting Wang:
Global Rate-distortion Optimization of Video-based Point Cloud Compression with Differential Evolution. 1-6

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














