SlideShare a Scribd company logo
All rights reserved. ©2020
All rights reserved. ©2020
Machine Learning Based Video Coding
Enhancements for HTTP Adaptive Streaming
ACM MMSys’21 Doctoral Symposium
September 30, 2021
Ekrem Çetinkaya
Christian Doppler Laboratory ATHENA | Alpen-Adria-Universität Klagenfurt | Austria
ekrem.cetinkaya@aau.at | athena.itec.aau.at
1
All rights reserved. ©2020
● Introduction
● Research Questions
● Methodology & Existing Results
● Ongoing & Future Work
● Q & A
Agenda
All rights reserved. ©2020
2
Introduction
All rights reserved. ©2020
3
All rights reserved. ©2020
Video Streaming
Share in the Internet Traffic
82%
Content Characteristics
1 Million
minutes
Video Streamed Every Second
As of 2021
* Cisco VNI Forecast Highlights (2021)
All rights reserved. ©2020
4
All rights reserved. ©2020
HTTP Adaptive Streaming (HAS)
Very Nice Video
Play
Play
5
240
kbps
Client HAS Server
1200
kbps
3500
kbps
480
kbps
2500
kbps
7000
kbps
All rights reserved. ©2020
Video Encoding
Block
Partitioning
Motion
Compensation
Transformation
& Quantization
Entropy Coding
Entropy
Decoding
Inverse
Transformation &
Inverse
Quantization
Inter or Intra
Prediction
Picture Buffer In-loop Filtering
6
All rights reserved. ©2020
Video Codecs
C. Feldmann, “State of Compression Standards - VVC”, 2020, https://bitmovin.com/compression-standards-vvc-2020/
Vanne et.al., “Comparative Rate-Distortion-Complexity Analysis of HEVC and AVC Video Codecs”, TCSVT, 2012
High Efficiency Video
Coding (HEVC)
2003
Advanced Video
Coding (AVC)
2013
Versatile Video
Coding (VVC)
2020
170 % 954 %
37 % 35 %
Block size 16x16
Quaternary tree
Supports up to 4K
Block size 64x64
Quaternary tree
Supports up to 8K
Block size 128x128
Multi-type tree
Supports up to
16K, 360° videos
7
All rights reserved. ©2020
Video Encoding with Machine Learning
Block
Partitioning
Motion
Compensation
Transformation
& Quantization
Entropy
Coding
Entropy
Decoding
Inverse
Transformation
& Inverse
Quantization
Inter or Intra
Prediction
Picture Buffer
In-loop
Filtering
Block
Partitioning
Decision
Prediction
Optical Flow
Detection
Mode
Prediction
Angular
Direction
Prediction
Deblocking
with ML
Denoising
with ML
Super-resolution
8
Research Questions
All rights reserved. ©2020
9
All rights reserved. ©2020
10
RQ-1 How to efficiently provide multi-rate
representations over a wide range of resolutions for HAS?
RQ-2 How to improve the performance of video codecs
using machine learning?
RQ-4 How to use machine learning to improve
perceptual quality assessment for videos?
RQ-3 How to improve the visual quality of videos using
machine learning?
Why?
🔋High-resolution content is getting more common,
required number of representations for HAS is increasing.
Literature
🗂 ML based approaches are utilized in video codecs to
speed up encoder decision.
🗂 Some attempts in end-to-end ML based video codecs.
Literature
🗂 ML based refinement techniques applied.
🗂 Post-processing in decoded frames to improve
quality.
🗂 Super-resolution for images and videos.
Literature
🗂 ML model is used in VMAF.
🗂 Several more attempts for non-reference perceptual
quality assessment.
Why?
🔋ML based image restoration methods are improving,
however video is mostly ignored. QoE can be increased.
Why?
🔋Finding a reliable metric for perceptual quality is
important as current objective metrics are problematic.
Why?
🔋More complex codecs, many possibilities to apply ML,
still much room for improvement.
Literature
🗂 Choose a reference representation and use its
information to speed up remaining encodings.
Methodology & Existing Results
All rights reserved. ©2020
11
All rights reserved. ©2020
Design and Abstraction Methodology
Design
Propose a solution (algorithm,
concept, protocol, etc.) for a
given problem
Implement
Prototype software
implementation using the
proposed solution
Analyze
Qualitative and quantitative analysis
of the solution
Repeat the cycle
to improve the
solutions
12
All rights reserved. ©2020
● State-of-the-art:
○ Encode the highest quality 1
or the lowest
quality 2
as the reference first then use these
information
● Proposed Method 3
:
○ Encode the highest quality first,
○ Use its information to encode the lowest
quality
○ Use information from both representations to
encode the remaining representations
○ Double bound for CTU search ranges
1
Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for
Video Technology 28.1 (2016): 143-157.
2
B. Guo, Y. Han, J. Wen, "Fast Block Structure Determination in AV1-based Multiple Resolutions Video Encoding," in 2018 IEEE International Conference on
Multimedia and Expo (ICME), San Diego, CA, USA, July 2018.
3
H. Amirpour, E. Çetinkaya, C. Timmerer and M. Ghanbari, "Fast Multi-rate Encoding for Adaptive HTTP Streaming," 2020 Data Compression Conference
(DCC), Snowbird, UT, USA, 2020, pp. 358-358
QP1
QPN
QPN-1
QP3
QP2
...
13
Fast Multi-rate Encoding (DCC’20)
All rights reserved. ©2020
14
Fast Multi-rate Encoding (DCC’20)
All rights reserved. ©2020
● State-of-the-art:
○ Encode the highest quality 1
or the lowest
quality 2
as the reference first then use these
information
● Proposed Method 3
:
○ Try different quality levels as the reference
representation to determine the best starting
point for parallel encoding
○ Encode the middle quality first and use its
information.
○ Upper or lower bound depending on the quality
level
Towards Optimal Multirate Encoding (MMM’21)
1
Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for
Video Technology 28.1 (2016): 143-157.
2
B. Guo, Y. Han, J. Wen, "Fast Block Structure Determination in AV1-based Multiple Resolutions Video Encoding," in 2018 IEEE International Conference on
Multimedia and Expo (ICME), San Diego, CA, USA, July 2018.
3
H. Amirpour, E. Çetinkaya, C. Timmerer and M. Ghanbari, "Towards Optimal Multirate Encoding for HTTP Adaptive Streaming," The International MultiMedia
Modeling Conference (MMM), Prague, Czech Republic, 2021
QPN/2
QPN
QP2
QP1
...
15
All rights reserved. ©2020
16
Towards Optimal Multirate Encoding (MMM’21)
All rights reserved. ©2020
● State-of-the-art:
○ Use a CNN to predict CTU depth decisions 1
● Proposed Method 2
:
○ Train a CNN with encoding information
obtained from the reference representation and
use its decision to encode dependent
representations.
○ Focus on parallel encoding, thus only apply for
bottleneck situations
○ Train different CNNs for different QP targets
1
Kim, Kyungah, and Won Woo Ro. "Fast CU depth decision for HEVC using neural networks." IEEE Transactions on Circuits and Systems for Video Technology
29.5 (2018): 1462-1473.
2
E. Çetinkaya, H. Amirpour, C. Timmerer and M. Ghanbari, “FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Learning,” 2020
IEEE International Conference on Visual Communications and Image Processing (VCIP), Macau, 2020, pp. 87-90.
QPN
CNN
QPN-1
QP1
QP2
...
HEVC
HEVC
HEVC
CNN
HEVC HEVC
17
Fast Multi-rate Encoding with ML (VCIP’20)
All rights reserved. ©2020
18
Fast Multi-rate Encoding with ML (VCIP’20)
All rights reserved. ©2020
● State-of-the-art:
○ Use the highest quality representation as the
reference 1
● Proposed Method 2
:
○ Train a CNN with encoding information obtained
from the reference representation (the highest
quality from the lowest resolution) and use its
decision to encode dependent representations
○ Improves parallel encoding as well as serial
encoding
○ Train different CNNs for different QP and resolution
targets
1
Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for
Video Technology 28.1 (2016): 143-157.
2
E. Çetinkaya, H. Amirpour, C. Timmerer and M. Ghanbari, "Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine
Learning," in IEEE Open Journal of Signal Processing, vol. 2, pp. 484-495, 2021, doi: 10.1109/OJSP.2021.3078657.
19
Fast Multi-rate and Multi-resolution
Encoding with ML (IEEE OJ-SP)
HEVC
QP1
HEVC
QP2
CNN
HEVC
QPN
CNN
HEVC
..
CNN
HEVC
QP2
CNN
HEVC
QPN
CNN
HEVC
..
CNN
HEVC
QP2
CNN
HEVC
QPN
CNN
HEVC
..
CNN
HEVC
QP1
CNN
HEVC
QP1
CNN
540p
540p
1080p
2160p
All rights reserved. ©2020
20
Fast Multi-rate and Multi-resolution Encoding with ML (IEEE OJ-SP)
Normalized
Encoding
time
HM 16.21 Lower Bound FaRes-ML
Ongoing & Future Work
All rights reserved. ©2020
21
All rights reserved. ©2020
Work Plan
2019
Q4
2020 2021 2022
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2
1. How to efficiently provide multi-bitrate representations over a wide
range of resolutions for HAS ?
2. How to improve performance of video codecs using machine learning ?
3. How to improve quality of videos using machine learning ?
4. How to use machine learning to improve perceptual quality assessment
for videos ?
Literature review
DCC’20 Paper
MMM’21 Paper
VCIP’20 Paper
Multi-rate and Multi-resolution Encoding
IEEE OJSP Paper
RQ1
RQ2
RQ3
RQ4
Super-resolution
Literature
Review
Perceptual Quality Assessment with ML
2023
Thesis
Bitrate Ladder Prediction
Literature
Review
Improvement in In-loop Filtering
with ML
Mobile Player Optimization with SR
22
Fast multi-rate encoding for adaptive http streaming
Towards optimal multirate encoding for HTTP
adaptive streaming
FaME-ML: Fast multirate encoding for HTTP adaptive
streaming using machine learning
Fast Multi-Resolution and Multi-Rate Encoding for
HTTP Adaptive Streaming Using Machine Learning
All rights reserved. ©2020
Thank you!
ekrem.cetinkaya@aau.at @ekremcetinkaya_ linkedin.com/in/ekrcet

More Related Content

What's hot (20)

PDF
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
Alpen-Adria-Universität
 
PPTX
Bandwidth Prediction in Low-Latency Chunked Streaming
Alpen-Adria-Universität
 
DOCX
Press Release of 131st WG11 (MPEG) Meeting
Alpen-Adria-Universität
 
PPTX
Where to Encode: A Performance Analysis of Intel x86 and Arm-based Amazon EC2...
Alpen-Adria-Universität
 
PDF
Video complexity analyzer (VCA) for streaming applications
Alpen-Adria-Universität
 
PDF
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
Alpen-Adria-Universität
 
PDF
PEMWN'21 - ANGELA
Jesus Aguilar
 
PDF
EADAS: Edge Assisted Adaptation Scheme for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
PDF
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
Minh Nguyen
 
PDF
Video Coding Enhancements for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
PDF
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Alpen-Adria-Universität
 
PDF
SLFC: Scalable Light Field Coding
Alpen-Adria-Universität
 
PDF
Docker-Based Evaluation Framework for Video Streaming QoE in Broadband Networks
Alpen-Adria-Universität
 
PDF
What’s new in MPEG?
Alpen-Adria-Universität
 
PDF
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Alpen-Adria-Universität
 
PPTX
Policy-driven Dynamic HTTP Adaptive Streaming Player Environment
Minh Nguyen
 
PPTX
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
Alpen-Adria-Universität
 
PDF
INCEPT: Intra CU Depth Prediction for HEVC
Alpen-Adria-Universität
 
PDF
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
Alpen-Adria-Universität
 
PDF
ComplexCTTP: Complexity Class Based Transcoding Time Prediction for Video Seq...
Alpen-Adria-Universität
 
CSDN: CDN-Aware QoE Optimization in SDN-Assisted HTTP Adaptive Video Streaming
Alpen-Adria-Universität
 
Bandwidth Prediction in Low-Latency Chunked Streaming
Alpen-Adria-Universität
 
Press Release of 131st WG11 (MPEG) Meeting
Alpen-Adria-Universität
 
Where to Encode: A Performance Analysis of Intel x86 and Arm-based Amazon EC2...
Alpen-Adria-Universität
 
Video complexity analyzer (VCA) for streaming applications
Alpen-Adria-Universität
 
FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Le...
Alpen-Adria-Universität
 
PEMWN'21 - ANGELA
Jesus Aguilar
 
EADAS: Edge Assisted Adaptation Scheme for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
EPIQ'21: Days of Future Past: An Optimization-based Adaptive Bitrate Algorith...
Minh Nguyen
 
Video Coding Enhancements for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
Video Coding for Large-Scale HTTP Adaptive Streaming Deployments: State of th...
Alpen-Adria-Universität
 
SLFC: Scalable Light Field Coding
Alpen-Adria-Universität
 
Docker-Based Evaluation Framework for Video Streaming QoE in Broadband Networks
Alpen-Adria-Universität
 
What’s new in MPEG?
Alpen-Adria-Universität
 
Quality Optimization of Live Streaming Services over HTTP with Reinforcement ...
Alpen-Adria-Universität
 
Policy-driven Dynamic HTTP Adaptive Streaming Player Environment
Minh Nguyen
 
A Channel Allocation Algorithm for Cognitive Radio Users Based on Channel Sta...
Alpen-Adria-Universität
 
INCEPT: Intra CU Depth Prediction for HEVC
Alpen-Adria-Universität
 
ES-HAS: An Edge- and SDN-Assisted Framework for HTTP Adaptive Video Streaming
Alpen-Adria-Universität
 
ComplexCTTP: Complexity Class Based Transcoding Time Prediction for Video Seq...
Alpen-Adria-Universität
 

Similar to Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming (20)

PDF
HTTP Adaptive Streaming – Quo Vadis?
Alpen-Adria-Universität
 
PDF
QoE- and Energy-aware Content Consumption for HTTP Adaptive Streaming
DanieleLorenzi6
 
PDF
Research@Lunch_Presentation.pdf
Vignesh V Menon
 
PDF
HTTP Adaptive Streaming – Quo Vadis (2024)
Alpen-Adria-Universität
 
PDF
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Ekrem Çetinkaya
 
PDF
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Alpen-Adria-Universität
 
PDF
Doctoral Symposium presentation.pdf
Vignesh V Menon
 
PDF
HTTP Adaptive Streaming – Quo Vadis? (2023)
Alpen-Adria-Universität
 
PPTX
MHV'22 - Super-resolution Based Bitrate Adaptation for HTTP Adaptive Streamin...
Minh Nguyen
 
PDF
Online Bitrate ladder prediction for Adaptive VVC Streaming
Vignesh V Menon
 
PDF
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
PDF
Convex-hull Estimation using XPSNR for Versatile Video Coding
Vignesh V Menon
 
PDF
OPTE: Online Per-title Encoding for Live Video Streaming
Alpen-Adria-Universität
 
PDF
OPTE: Online Per-title Encoding for Live Video Streaming.pdf
Vignesh V Menon
 
PDF
VCIP_MCBE_presentation.pdf
Vignesh V Menon
 
PDF
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Alpen-Adria-Universität
 
PDF
Immersive Video Delivery: From Omnidirectional Video to Holography
Alpen-Adria-Universität
 
PDF
Content_adaptive_video_coding_for_HTTP_Adaptive_Streaming.pdf
Vignesh V Menon
 
PDF
Content-adaptive Video Coding for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
PDF
Motion Vector Recovery for Real-time H.264 Video Streams
IDES Editor
 
HTTP Adaptive Streaming – Quo Vadis?
Alpen-Adria-Universität
 
QoE- and Energy-aware Content Consumption for HTTP Adaptive Streaming
DanieleLorenzi6
 
Research@Lunch_Presentation.pdf
Vignesh V Menon
 
HTTP Adaptive Streaming – Quo Vadis (2024)
Alpen-Adria-Universität
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Ekrem Çetinkaya
 
Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning
Alpen-Adria-Universität
 
Doctoral Symposium presentation.pdf
Vignesh V Menon
 
HTTP Adaptive Streaming – Quo Vadis? (2023)
Alpen-Adria-Universität
 
MHV'22 - Super-resolution Based Bitrate Adaptation for HTTP Adaptive Streamin...
Minh Nguyen
 
Online Bitrate ladder prediction for Adaptive VVC Streaming
Vignesh V Menon
 
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Convex-hull Estimation using XPSNR for Versatile Video Coding
Vignesh V Menon
 
OPTE: Online Per-title Encoding for Live Video Streaming
Alpen-Adria-Universität
 
OPTE: Online Per-title Encoding for Live Video Streaming.pdf
Vignesh V Menon
 
VCIP_MCBE_presentation.pdf
Vignesh V Menon
 
Energy-Efficient Multi-Codec Bitrate-Ladder Estimation for Adaptive Video Str...
Alpen-Adria-Universität
 
Immersive Video Delivery: From Omnidirectional Video to Holography
Alpen-Adria-Universität
 
Content_adaptive_video_coding_for_HTTP_Adaptive_Streaming.pdf
Vignesh V Menon
 
Content-adaptive Video Coding for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
Motion Vector Recovery for Real-time H.264 Video Streams
IDES Editor
 
Ad

More from Alpen-Adria-Universität (20)

PDF
Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Strea...
Alpen-Adria-Universität
 
PPTX
End-to-end Quality of Experience Evaluation for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
PDF
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
PDF
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
Alpen-Adria-Universität
 
PDF
GREEM: An Open-Source Energy Measurement Tool for Video Processing
Alpen-Adria-Universität
 
PDF
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Alpen-Adria-Universität
 
PDF
VEEP: Video Encoding Energy and CO₂ Emission Prediction
Alpen-Adria-Universität
 
PPTX
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Alpen-Adria-Universität
 
PPTX
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Alpen-Adria-Universität
 
PPTX
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Alpen-Adria-Universität
 
PPTX
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Alpen-Adria-Universität
 
PDF
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Alpen-Adria-Universität
 
PDF
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Alpen-Adria-Universität
 
PDF
Multi-access Edge Computing for Adaptive Video Streaming
Alpen-Adria-Universität
 
PPTX
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Alpen-Adria-Universität
 
PDF
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
Alpen-Adria-Universität
 
PDF
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Alpen-Adria-Universität
 
PDF
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Alpen-Adria-Universität
 
PDF
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Alpen-Adria-Universität
 
PDF
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
Alpen-Adria-Universität
 
Energy-Quality-aware Variable Framerate Pareto-Front for Adaptive Video Strea...
Alpen-Adria-Universität
 
End-to-end Quality of Experience Evaluation for HTTP Adaptive Streaming
Alpen-Adria-Universität
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
Alpen-Adria-Universität
 
GREEM: An Open-Source Energy Measurement Tool for Video Processing
Alpen-Adria-Universität
 
Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low ...
Alpen-Adria-Universität
 
VEEP: Video Encoding Energy and CO₂ Emission Prediction
Alpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Video...
Alpen-Adria-Universität
 
Empowerment of Atypical Viewers via Low-Effort Personalized Modeling of Vid...
Alpen-Adria-Universität
 
Optimizing Video Streaming for Sustainability and Quality: The Role of Prese...
Alpen-Adria-Universität
 
Machine Learning Based Resource Utilization Prediction in the Computing Conti...
Alpen-Adria-Universität
 
Evaluation of Quality of Experience of ABR Schemes in Gaming Stream
Alpen-Adria-Universität
 
Network-Assisted Delivery of Adaptive Video Streaming Services through CDN, S...
Alpen-Adria-Universität
 
Multi-access Edge Computing for Adaptive Video Streaming
Alpen-Adria-Universität
 
Policy-Driven Dynamic HTTP Adaptive Streaming Player Environment
Alpen-Adria-Universität
 
VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing In...
Alpen-Adria-Universität
 
Energy Consumption in Video Streaming: Components, Measurements, and Strategies
Alpen-Adria-Universität
 
Exploring the Energy Consumption of Video Streaming: Components, Challenges, ...
Alpen-Adria-Universität
 
Optimizing QoE and Latency of Live Video Streaming Using Edge Computing a...
Alpen-Adria-Universität
 
SARENA: SFC-Enabled Architecture for Adaptive Video Streaming Applications
Alpen-Adria-Universität
 
Ad

Recently uploaded (20)

PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PPTX
Darren Mills The Migration Modernization Balancing Act: Navigating Risks and...
AWS Chicago
 
PDF
Wojciech Ciemski for Top Cyber News MAGAZINE. June 2025
Dr. Ludmila Morozova-Buss
 
PPTX
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
PDF
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
PDF
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
PDF
July Patch Tuesday
Ivanti
 
PPTX
Extensions Framework (XaaS) - Enabling Orchestrate Anything
ShapeBlue
 
PDF
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
PDF
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
PDF
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Complete JavaScript Notes: From Basics to Advanced Concepts.pdf
haydendavispro
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
Darren Mills The Migration Modernization Balancing Act: Navigating Risks and...
AWS Chicago
 
Wojciech Ciemski for Top Cyber News MAGAZINE. June 2025
Dr. Ludmila Morozova-Buss
 
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
Smart Air Quality Monitoring with Serrax AQM190 LITE
SERRAX TECHNOLOGIES LLP
 
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
July Patch Tuesday
Ivanti
 
Extensions Framework (XaaS) - Enabling Orchestrate Anything
ShapeBlue
 
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 

Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming

  • 1. All rights reserved. ©2020 All rights reserved. ©2020 Machine Learning Based Video Coding Enhancements for HTTP Adaptive Streaming ACM MMSys’21 Doctoral Symposium September 30, 2021 Ekrem Çetinkaya Christian Doppler Laboratory ATHENA | Alpen-Adria-Universität Klagenfurt | Austria [email protected] | athena.itec.aau.at 1
  • 2. All rights reserved. ©2020 ● Introduction ● Research Questions ● Methodology & Existing Results ● Ongoing & Future Work ● Q & A Agenda All rights reserved. ©2020 2
  • 4. All rights reserved. ©2020 Video Streaming Share in the Internet Traffic 82% Content Characteristics 1 Million minutes Video Streamed Every Second As of 2021 * Cisco VNI Forecast Highlights (2021) All rights reserved. ©2020 4
  • 5. All rights reserved. ©2020 HTTP Adaptive Streaming (HAS) Very Nice Video Play Play 5 240 kbps Client HAS Server 1200 kbps 3500 kbps 480 kbps 2500 kbps 7000 kbps
  • 6. All rights reserved. ©2020 Video Encoding Block Partitioning Motion Compensation Transformation & Quantization Entropy Coding Entropy Decoding Inverse Transformation & Inverse Quantization Inter or Intra Prediction Picture Buffer In-loop Filtering 6
  • 7. All rights reserved. ©2020 Video Codecs C. Feldmann, “State of Compression Standards - VVC”, 2020, https://bitmovin.com/compression-standards-vvc-2020/ Vanne et.al., “Comparative Rate-Distortion-Complexity Analysis of HEVC and AVC Video Codecs”, TCSVT, 2012 High Efficiency Video Coding (HEVC) 2003 Advanced Video Coding (AVC) 2013 Versatile Video Coding (VVC) 2020 170 % 954 % 37 % 35 % Block size 16x16 Quaternary tree Supports up to 4K Block size 64x64 Quaternary tree Supports up to 8K Block size 128x128 Multi-type tree Supports up to 16K, 360° videos 7
  • 8. All rights reserved. ©2020 Video Encoding with Machine Learning Block Partitioning Motion Compensation Transformation & Quantization Entropy Coding Entropy Decoding Inverse Transformation & Inverse Quantization Inter or Intra Prediction Picture Buffer In-loop Filtering Block Partitioning Decision Prediction Optical Flow Detection Mode Prediction Angular Direction Prediction Deblocking with ML Denoising with ML Super-resolution 8
  • 9. Research Questions All rights reserved. ©2020 9
  • 10. All rights reserved. ©2020 10 RQ-1 How to efficiently provide multi-rate representations over a wide range of resolutions for HAS? RQ-2 How to improve the performance of video codecs using machine learning? RQ-4 How to use machine learning to improve perceptual quality assessment for videos? RQ-3 How to improve the visual quality of videos using machine learning? Why? 🔋High-resolution content is getting more common, required number of representations for HAS is increasing. Literature 🗂 ML based approaches are utilized in video codecs to speed up encoder decision. 🗂 Some attempts in end-to-end ML based video codecs. Literature 🗂 ML based refinement techniques applied. 🗂 Post-processing in decoded frames to improve quality. 🗂 Super-resolution for images and videos. Literature 🗂 ML model is used in VMAF. 🗂 Several more attempts for non-reference perceptual quality assessment. Why? 🔋ML based image restoration methods are improving, however video is mostly ignored. QoE can be increased. Why? 🔋Finding a reliable metric for perceptual quality is important as current objective metrics are problematic. Why? 🔋More complex codecs, many possibilities to apply ML, still much room for improvement. Literature 🗂 Choose a reference representation and use its information to speed up remaining encodings.
  • 11. Methodology & Existing Results All rights reserved. ©2020 11
  • 12. All rights reserved. ©2020 Design and Abstraction Methodology Design Propose a solution (algorithm, concept, protocol, etc.) for a given problem Implement Prototype software implementation using the proposed solution Analyze Qualitative and quantitative analysis of the solution Repeat the cycle to improve the solutions 12
  • 13. All rights reserved. ©2020 ● State-of-the-art: ○ Encode the highest quality 1 or the lowest quality 2 as the reference first then use these information ● Proposed Method 3 : ○ Encode the highest quality first, ○ Use its information to encode the lowest quality ○ Use information from both representations to encode the remaining representations ○ Double bound for CTU search ranges 1 Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for Video Technology 28.1 (2016): 143-157. 2 B. Guo, Y. Han, J. Wen, "Fast Block Structure Determination in AV1-based Multiple Resolutions Video Encoding," in 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA, July 2018. 3 H. Amirpour, E. Çetinkaya, C. Timmerer and M. Ghanbari, "Fast Multi-rate Encoding for Adaptive HTTP Streaming," 2020 Data Compression Conference (DCC), Snowbird, UT, USA, 2020, pp. 358-358 QP1 QPN QPN-1 QP3 QP2 ... 13 Fast Multi-rate Encoding (DCC’20)
  • 14. All rights reserved. ©2020 14 Fast Multi-rate Encoding (DCC’20)
  • 15. All rights reserved. ©2020 ● State-of-the-art: ○ Encode the highest quality 1 or the lowest quality 2 as the reference first then use these information ● Proposed Method 3 : ○ Try different quality levels as the reference representation to determine the best starting point for parallel encoding ○ Encode the middle quality first and use its information. ○ Upper or lower bound depending on the quality level Towards Optimal Multirate Encoding (MMM’21) 1 Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for Video Technology 28.1 (2016): 143-157. 2 B. Guo, Y. Han, J. Wen, "Fast Block Structure Determination in AV1-based Multiple Resolutions Video Encoding," in 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA, July 2018. 3 H. Amirpour, E. Çetinkaya, C. Timmerer and M. Ghanbari, "Towards Optimal Multirate Encoding for HTTP Adaptive Streaming," The International MultiMedia Modeling Conference (MMM), Prague, Czech Republic, 2021 QPN/2 QPN QP2 QP1 ... 15
  • 16. All rights reserved. ©2020 16 Towards Optimal Multirate Encoding (MMM’21)
  • 17. All rights reserved. ©2020 ● State-of-the-art: ○ Use a CNN to predict CTU depth decisions 1 ● Proposed Method 2 : ○ Train a CNN with encoding information obtained from the reference representation and use its decision to encode dependent representations. ○ Focus on parallel encoding, thus only apply for bottleneck situations ○ Train different CNNs for different QP targets 1 Kim, Kyungah, and Won Woo Ro. "Fast CU depth decision for HEVC using neural networks." IEEE Transactions on Circuits and Systems for Video Technology 29.5 (2018): 1462-1473. 2 E. Çetinkaya, H. Amirpour, C. Timmerer and M. Ghanbari, “FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Learning,” 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP), Macau, 2020, pp. 87-90. QPN CNN QPN-1 QP1 QP2 ... HEVC HEVC HEVC CNN HEVC HEVC 17 Fast Multi-rate Encoding with ML (VCIP’20)
  • 18. All rights reserved. ©2020 18 Fast Multi-rate Encoding with ML (VCIP’20)
  • 19. All rights reserved. ©2020 ● State-of-the-art: ○ Use the highest quality representation as the reference 1 ● Proposed Method 2 : ○ Train a CNN with encoding information obtained from the reference representation (the highest quality from the lowest resolution) and use its decision to encode dependent representations ○ Improves parallel encoding as well as serial encoding ○ Train different CNNs for different QP and resolution targets 1 Schroeder, Damien, et al. "Efficient multi-rate video encoding for HEVC-based adaptive HTTP streaming." IEEE Transactions on Circuits and systems for Video Technology 28.1 (2016): 143-157. 2 E. Çetinkaya, H. Amirpour, C. Timmerer and M. Ghanbari, "Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine Learning," in IEEE Open Journal of Signal Processing, vol. 2, pp. 484-495, 2021, doi: 10.1109/OJSP.2021.3078657. 19 Fast Multi-rate and Multi-resolution Encoding with ML (IEEE OJ-SP) HEVC QP1 HEVC QP2 CNN HEVC QPN CNN HEVC .. CNN HEVC QP2 CNN HEVC QPN CNN HEVC .. CNN HEVC QP2 CNN HEVC QPN CNN HEVC .. CNN HEVC QP1 CNN HEVC QP1 CNN 540p 540p 1080p 2160p
  • 20. All rights reserved. ©2020 20 Fast Multi-rate and Multi-resolution Encoding with ML (IEEE OJ-SP) Normalized Encoding time HM 16.21 Lower Bound FaRes-ML
  • 21. Ongoing & Future Work All rights reserved. ©2020 21
  • 22. All rights reserved. ©2020 Work Plan 2019 Q4 2020 2021 2022 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 1. How to efficiently provide multi-bitrate representations over a wide range of resolutions for HAS ? 2. How to improve performance of video codecs using machine learning ? 3. How to improve quality of videos using machine learning ? 4. How to use machine learning to improve perceptual quality assessment for videos ? Literature review DCC’20 Paper MMM’21 Paper VCIP’20 Paper Multi-rate and Multi-resolution Encoding IEEE OJSP Paper RQ1 RQ2 RQ3 RQ4 Super-resolution Literature Review Perceptual Quality Assessment with ML 2023 Thesis Bitrate Ladder Prediction Literature Review Improvement in In-loop Filtering with ML Mobile Player Optimization with SR 22 Fast multi-rate encoding for adaptive http streaming Towards optimal multirate encoding for HTTP adaptive streaming FaME-ML: Fast multirate encoding for HTTP adaptive streaming using machine learning Fast Multi-Resolution and Multi-Rate Encoding for HTTP Adaptive Streaming Using Machine Learning
  • 23. All rights reserved. ©2020 Thank you! [email protected] @ekremcetinkaya_ linkedin.com/in/ekrcet