KEYFRAME-BASED VIDEO
SUMMARIZATION DESIGNER
Carlos Ramos Caballero
Advisors: Horst Eidenberger and Xavier Giró i Nieto
Contents
 Introduction
 State of the art
 Methodology
 Results assessment
 Conclusions
 The application: Designer Master
DEMONSTRATION
Introduction
 Motivation
 Designer Master: keyframe-based video summarization interface
 Object Maps: system for automatic video summarization
[Diagram: Graphical User Interface (Designer Master) and Computer Vision Engine (Object Maps)]
Introduction
 Goals of the thesis
 Improving the keyframe extraction module
 Assessing the improvement
State of the art
 Shot segmentation
Hierarchical decomposition and representation of video content [1]
[1] http://www.scholarpedia.org/article/Video_Content_Structuring
State of the art
 Shot segmentation example
Shot boundary detection example [2].
[2] Martos, M. “Content-based Video Summarization to Object Maps”, Vienna University of Technology, Austria (2013).
State of the art
 Shot segmentation techniques
 Pixel-to-pixel methods
• Global pixel-to-pixel
• Cumulative pixel-to-pixel
 Histogram-based methods
• Simple histogram
• Maximum histogram
• Weighted histogram
 Hausdorff method
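To make the contrast between the first two families concrete, here is a minimal sketch, assuming grayscale frames stored as lists of rows of integer intensities (0 to 255). The function names are hypothetical and the definitions are one common textbook form, not necessarily the exact variants surveyed in the thesis.

```python
def pixel_difference(frame_a, frame_b):
    """Global pixel-to-pixel measure: sum of absolute intensity differences."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(frame_a, frame_b)
        for a, b in zip(row_a, row_b)
    )


def histogram(frame, levels=256):
    """Count how many pixels take each grey level."""
    counts = [0] * levels
    for row in frame:
        for value in row:
            counts[value] += 1
    return counts


def histogram_difference(frame_a, frame_b, levels=256):
    """Simple histogram measure: sum of absolute bin-count differences."""
    hist_a = histogram(frame_a, levels)
    hist_b = histogram(frame_b, levels)
    return sum(abs(x - y) for x, y in zip(hist_a, hist_b))


# Two frames with the same grey levels in swapped positions.
frame_a = [[0, 255], [0, 255]]
frame_b = [[255, 0], [255, 0]]
print(pixel_difference(frame_a, frame_b))      # large: every pixel changed
print(histogram_difference(frame_a, frame_b))  # zero: same intensity distribution
```

The pixel-to-pixel measure reacts to any spatial change, while the histogram measure ignores where intensities occur, which makes it less sensitive to motion within a shot.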
Methodology: Implemented solution
 System architecture overview
Methodology: Implemented solution
 Uniform sampling
𝑓𝑝𝑠𝑖: frame rate of the input video.
𝐿𝑖: total number of frames of the input video.
𝑁0: total number of frames we want to keep (𝑁0=100).
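The sampling formula on this slide is not preserved in the transcript. As a rough sketch, assuming the goal is simply to keep N0 evenly spaced frames out of the L_i input frames, the selection could look like this. The function names and the effective-frame-rate relation (fps_i · N0 / L_i) are assumptions for illustration.

```python
def uniform_sample_indices(total_frames, frames_to_keep=100):
    """Return evenly spaced frame indices (assumed scheme, not the thesis formula)."""
    if total_frames <= 0 or frames_to_keep <= 0:
        return []
    if frames_to_keep >= total_frames:
        # Short video: keep every frame.
        return list(range(total_frames))
    step = total_frames / frames_to_keep
    return [int(k * step) for k in range(frames_to_keep)]


def effective_frame_rate(input_fps, total_frames, frames_to_keep=100):
    """Frame rate of the sampled sequence if it spans the same duration (assumption)."""
    return input_fps * frames_to_keep / total_frames


indices = uniform_sample_indices(total_frames=2500, frames_to_keep=100)
print(len(indices), indices[:4])
print(effective_frame_rate(input_fps=25, total_frames=2500))
```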
Methodology: Implemented solution
 Gray scale domain
Color model transformation RGB to YIQ.
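In the YIQ model the Y component carries the luminance, so a grayscale frame can be obtained by keeping Y only. The sketch below uses the standard NTSC luma weights (0.299, 0.587, 0.114); the frame layout (rows of RGB tuples) and the function names are assumptions for illustration.

```python
def rgb_to_luma(r, g, b):
    """Y component of YIQ computed from 8-bit RGB values."""
    return 0.299 * r + 0.587 * g + 0.114 * b


def to_grayscale(frame_rgb):
    """Convert a frame given as rows of (R, G, B) tuples to rows of integer grey levels."""
    return [[int(round(rgb_to_luma(*pixel))) for pixel in row] for row in frame_rgb]


frame = [[(255, 255, 255), (0, 0, 0)], [(255, 0, 0), (0, 0, 255)]]
print(to_grayscale(frame))
```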
Methodology: Implemented solution
 Difference computation
Where 𝐼(𝑡,𝑖,𝑗) is the intensity value of frame 𝑡 at pixel (𝑖,𝑗).
X and Y are the width and height of the video frames, respectively.
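The equation on this slide did not survive in the transcript. A form consistent with the symbol definitions above, assuming the difference is taken between consecutive sampled frames and summed over all pixels (the global pixel-to-pixel scheme), would be:

```latex
d(t) = \sum_{i=1}^{X} \sum_{j=1}^{Y} \left| I(t, i, j) - I(t-1, i, j) \right|
```

The symbol d and the choice of comparing frame t with frame t-1 are assumptions made for illustration.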
Methodology: Implemented solution
 Normalization
Where 𝑑̂ is the normalized value, 256 is the number of grey levels, X and Y are the
width and height of the video frames, respectively.
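As a rough sketch of this step, assuming the raw difference is the sum of absolute intensity differences between two grayscale frames and that the normalized value divides it by 256 · X · Y, the computation could be written as follows. The function name and the frame layout (lists of rows of integers) are hypothetical.

```python
def normalized_difference(prev_frame, curr_frame, grey_levels=256):
    """Sum of absolute pixel differences divided by grey_levels * X * Y (assumed form)."""
    height = len(curr_frame)
    width = len(curr_frame[0]) if height else 0
    if height == 0 or width == 0:
        raise ValueError("frames must be non-empty")
    if len(prev_frame) != height or any(len(row) != width for row in prev_frame):
        raise ValueError("frames must have the same size")
    raw = sum(
        abs(c - p)
        for prev_row, curr_row in zip(prev_frame, curr_frame)
        for p, c in zip(prev_row, curr_row)
    )
    return raw / (grey_levels * width * height)


black = [[0, 0], [0, 0]]
white = [[255, 255], [255, 255]]
print(normalized_difference(black, white))  # close to 1: every pixel changed maximally
print(normalized_difference(black, black))  # 0.0: identical frames
```

Under these assumptions the value always lies in [0, 1), which makes a fixed threshold comparable across videos of different resolutions.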
Methodology: Implemented solution
 Decision making
The threshold value used in our application is 𝜏 = 0.1 (as defined in [2]).
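The decision rule itself is not preserved in the transcript. A minimal sketch, assuming a shot boundary is declared wherever an already-normalized frame difference strictly exceeds the threshold, might look like this. The function name, the strict comparison, and the sample values are assumptions, not data from the thesis.

```python
def detect_shot_boundaries(normalized_diffs, tau=0.1):
    """Return the positions whose normalized difference exceeds tau.

    Position t refers to the transition between sampled frames t and t + 1
    (an indexing convention chosen for this sketch).
    """
    return [t for t, d in enumerate(normalized_diffs) if d > tau]


# Hypothetical normalized differences between consecutive sampled frames.
diffs = [0.01, 0.02, 0.35, 0.03, 0.12]
print(detect_shot_boundaries(diffs))
```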
[2] Martos, M. “Content-based Video Summarization to Object Maps”, Vienna University of Technology, Austria (2013).
Methodology: Environment
 Environment
Results assessment
 TEST 1: Testing the applications + ‘in situ’ survey
 11 participants
 Test data: The Intouchables trailer
Results assessment
 Example: pair of summaries
[Figure: summaries from Designer Master v1 and Designer Master v2]
Results assessment
 TEST 2: web-based survey
 43 participants
 Test data: The Intouchables trailer
Results assessment
 EVALUATION
 Quality of the generated summaries
 Representativeness of the generated summaries
 Mean Opinion Score
• 1. Unacceptable
• 2. Poor
• 3. Good
• 4. Very good
• 5. Excellent
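For reference, a Mean Opinion Score is the arithmetic mean of the individual ratings on this five-point scale. The sketch below shows the computation and the per-score distribution; the ratings are made-up placeholders, not the survey data, and the function names are hypothetical.

```python
from collections import Counter

# Rating scale as listed on the slide.
LABELS = {1: "Unacceptable", 2: "Poor", 3: "Good", 4: "Very good", 5: "Excellent"}


def mean_opinion_score(ratings):
    """Arithmetic mean of ratings, each an integer from 1 to 5."""
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r not in LABELS for r in ratings):
        raise ValueError("ratings must be integers from 1 to 5")
    return sum(ratings) / len(ratings)


def score_distribution(ratings):
    """Number of participants who chose each score, including unused scores."""
    counts = Counter(ratings)
    return {score: counts.get(score, 0) for score in LABELS}


ratings = [3, 4, 4, 5]  # placeholder values for illustration only
print(mean_opinion_score(ratings))
print(score_distribution(ratings))
```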
Results assessment
 Quality of the generated summaries
“Please, rate summary 1”
“Please, rate summary 2”
Results assessment
 Quality of the generated summaries
[Charts: MOS; MOS score distribution]
Results assessment
 Representativeness of the summaries
“Which summary let you better recognize the video content?”
Results assessment
 Ease-of-use of the application
“Do you think the application is intuitive and easy to use?”
Results assessment
 Execution time
Conclusions
 Accomplishment of the initial goals
 Improving the keyframe extraction module by integrating both projects.
 Assessing the improvement.
 Our work has slightly improved Designer Master
 Users can create better video summaries more easily thanks to the better quality of the extracted keyframes.
 We hope to develop this work into a product for the Austrian Broadcasting Corporation (ORF).
Thank you very much for your attention!
Danke schön!
Moltes gràcies!