SlideShare a Scribd company logo
Guided By:-        Presented By:-
Ms. Savita Vijay   Kanika Rathore
                   B.tech (C.S.E) IIIyr
                   BTBTC10168
   Introduction
   What is the need?
   Aesthetic Aspects
   Video Aesthetic Features
   Audio Aesthetic Features
   Pivot Representation
   Advantages
   Applications
   Conclusion
   The PIVOT VECTOR SPACE APPROACH in audio mixing is
    a novel technique that automatically picks the best
    audio clip to mix with the given video shot.

   This technique uses pivot vector space mixing
    framework & High level perceptual descriptors of audio
    & video characteristics.

   It uses a Pivot Vector space mapping method that
    matches video shots with music segments based on
    aesthetic cinematographic heuristics .

   This automatic audio-video mixing technique is suited
    for Home videos.
   Most videos such as movies and sitcoms have several
    segments devoid of any speech. Adding carefully chosen
    music to such segments conveys emotions such as joy, tension
    ,or melancholy.

   In a typical professional video production, skilled audio-
    mixing artists aesthetically add appropriate audio to the given
    video shots. This process is tedious, time-consuming, and
    expensive.

   Many home video users would like to make their videos
    appear like professional productions before they share it with
    family and friends.
   Movies comprise :-
    Images
    Graphic traces
    Recorded speech , music and noises
    Sound effects

   Roles of music in movies :-
    Setting the scene
    Adding emotional meaning
    Serving as a background filler
    Creating continuity across shots or scenes
    Emphasizing climaxes
The table shows Aesthetic Features that correspond in video & music
Zettl based these proposed mixing rules on the following aspects :-

 Tonal matching

 Structural matching

 Thematic matching

 Historical-geographical matching
   A set of attributed features required to describe videos.

   This consists of features which required to describe videos.
     Light falloff :- refers to the brightness contrast between
    the light and shadow sides of an object

     Color features :- it consist of four features
        saturation
        hue
        brightness
        energy
     Motion vectors :- To measure the video segments’ motion
    intensity.
We obtained the mean and standard deviation for estimating the
confidence level of the Video & audio attributed features for any
test shot.
   Low level features :-
    Spectral centroid (brightness):- measure of a sound’s
    brightness.



    Zero crossing :- measure of the frequency content of the
    signal
Volume (loudness) :- represents the subjective measure ,
which depends on the human listener’s frequency response.
   Dynamics :- the volume of musical sound related to the
    music’s loudness or softness.

   Tempo features :- that makes the music flow unique and
    differentiates it from other types of audio signal is temporal
    organization . (beat rate)

   Perceptual pitch feature :- it has an important role in human
    hearing, and the auditory system apparently assigns a pitch to
    any thing that comes to its attention.
   A vector space P acts as a pivot between the audio and video
    representation.



   Independent of any media.



   This space is defined with some aesthetic features in which
    music M and videos V are mapped.
   We consider how to represent video and audio clips into
    their aesthetic spaces V or M

   In the two spaces, a dimension corresponds to an attributed
    feature,

   It includes brightness_high , brightness_low , and so on.

   One video shot is associated with one vector in the V space.

   Obtaining the values for each dimension resembles
    handling fuzzy variables
   The aesthetic feature playing the role of a fuzzy     variable
    and the attribute descriptor acting as a fuzzy value which is
    represented using diagram.
   The X-axis refers to the actual computed feature value and the
    Y-axis simultaneously indicates the aesthetic label and the
    confidence value.
In the below figure shows that
 a) Matching between the video L02_30 & the music T01_5
 b)Sample frame the video
pivot vector space approach in audio-video mixing
   Before the development of the PIVOT VECTOR SPACE
    APPROACH, audio-video mixing process can be carried out
    only by professional mixing artists.

   The Pivot vector space approach enables all the home video
    users and amateur video enthusiasts to give a professional look
    and feel to their videos.

   This technique also eliminates the need for professional
    mixing artists, thereby significantly reducing the cost, time
    and labour involved.
   A large amount of home video footage is being
    produced due to products such as Digital video
    camcorders , Handicams etc.

   Hence, this technique will be of great use to all the
    amateur video enthusiasts and home video users
   This is a technique that all amateur and home video
    artists can use in the creation of video footage that
    gives a professional look and feel.

   Since it is fully automatic, the user need not worry
    about his aesthetic capabilities.
   http://www-mrim.imag.fr/publications/2003/PM001/v_final.pdf
   http://ieeexplore.ieee.org
   www.edutalks.org
   www.scribd.com
pivot vector space approach in audio-video mixing

More Related Content

PPT
Voice morphing
sukhbeer2314
 
PPTX
Digital cinema
Naveen Sihag
 
PPTX
Voice Morping ppt
ciciapaul
 
PPTX
Digital Cinema
Sreenivas vasu
 
PPTX
Blue Eye Technology Seminar Presentation
Vaibhav Kumar
 
PDF
Video Compression Basics
Sanjiv Malik
 
DOC
Voice Morphing
Sayyed Z
 
PPTX
SPEECH BASED EMOTION RECOGNITION USING VOICE
VamshidharSingh
 
Voice morphing
sukhbeer2314
 
Digital cinema
Naveen Sihag
 
Voice Morping ppt
ciciapaul
 
Digital Cinema
Sreenivas vasu
 
Blue Eye Technology Seminar Presentation
Vaibhav Kumar
 
Video Compression Basics
Sanjiv Malik
 
Voice Morphing
Sayyed Z
 
SPEECH BASED EMOTION RECOGNITION USING VOICE
VamshidharSingh
 

What's hot (20)

PPT
Voice morphing ppt
himadrigupta
 
PPTX
Neuromorphic computing
SreekuttanJayakumar
 
PPTX
Silent sound technology
priya_trehan
 
PPTX
Sensory rehabilitation
NIVETA SINGH
 
PPTX
Silent sound technology NEW
Neha Tyagi
 
PDF
digital image processing, image processing
Kalyan Acharjya
 
PDF
SSVEP-BCI
Seyed Yahya Moradi
 
PPT
Brain computer interface by akshay parmar
Akshay Parmar
 
PPT
Voice morphing-
Navneet Sharma
 
DOCX
Voice morphing document
himadrigupta
 
PPTX
Affective Computing
Debabrata Chakraborty
 
PDF
Seminar report skinput techonology
Golam Murshid
 
PPTX
Introduction to Fog Computing
Er. Ajay Sirsat
 
PPTX
Compositing, Composing Worlds
Nelson Zagalo
 
PPT
Voicemorphingppt 110328163403-phpapp01
Madhu Babu
 
PPT
Speech recognition
Charu Joshi
 
PPTX
Digital Scent Technology
Jyoti Chintadi
 
PDF
Silentsound documentation
Raj Niranjan
 
PDF
EMOTION DETECTION USING AI
Aantariksh Developers
 
PPTX
SPEECH RECOGNITION USING NEURAL NETWORK
Kamonasish Hore
 
Voice morphing ppt
himadrigupta
 
Neuromorphic computing
SreekuttanJayakumar
 
Silent sound technology
priya_trehan
 
Sensory rehabilitation
NIVETA SINGH
 
Silent sound technology NEW
Neha Tyagi
 
digital image processing, image processing
Kalyan Acharjya
 
Brain computer interface by akshay parmar
Akshay Parmar
 
Voice morphing-
Navneet Sharma
 
Voice morphing document
himadrigupta
 
Affective Computing
Debabrata Chakraborty
 
Seminar report skinput techonology
Golam Murshid
 
Introduction to Fog Computing
Er. Ajay Sirsat
 
Compositing, Composing Worlds
Nelson Zagalo
 
Voicemorphingppt 110328163403-phpapp01
Madhu Babu
 
Speech recognition
Charu Joshi
 
Digital Scent Technology
Jyoti Chintadi
 
Silentsound documentation
Raj Niranjan
 
EMOTION DETECTION USING AI
Aantariksh Developers
 
SPEECH RECOGNITION USING NEURAL NETWORK
Kamonasish Hore
 
Ad

Viewers also liked (20)

PDF
Vector video standards
Misión Cristiana Nuevo Pacto
 
PPTX
Zig zag for website
Joan Treistman
 
PDF
Prezentace Inkscape a jeho použití v mojí praxi
Petr Šimčík
 
PDF
Featured Pattern Run Length Coding for Test Data Compression
Henry Shen
 
PDF
The ZIG ZAG
Noralina A.
 
PPT
ZIG ZAG FEEDER
Kudamm_Corporation
 
PDF
ประวัติแนะนำตนเอง
Kanokp Swn
 
PDF
Canon Hd
cratytiger
 
PPS
Inivitation to an exotic party
Joke Channel
 
PPT
Laboratorios California Janedy
guesta049f9
 
PDF
B800.1
performanceweb
 
PPTX
What is the definition of the 6 Sigma?
tnay chow
 
PPS
Photo Reflections
Joke Channel
 
PPS
Water Themed Pictures
Joke Channel
 
PPTX
Lenguaje Multimedia
Unab
 
PPT
Vasolix Nereyda
vivianbc83
 
DOCX
Jumarni p
jumarni pahuna
 
PPS
Living Quarters in China
Joke Channel
 
PPTX
Rhinehart emerson power point presentation
Andrew Rhinehart
 
PDF
Jumarni.p
jumarni pahuna
 
Vector video standards
Misión Cristiana Nuevo Pacto
 
Zig zag for website
Joan Treistman
 
Prezentace Inkscape a jeho použití v mojí praxi
Petr Šimčík
 
Featured Pattern Run Length Coding for Test Data Compression
Henry Shen
 
The ZIG ZAG
Noralina A.
 
ZIG ZAG FEEDER
Kudamm_Corporation
 
ประวัติแนะนำตนเอง
Kanokp Swn
 
Canon Hd
cratytiger
 
Inivitation to an exotic party
Joke Channel
 
Laboratorios California Janedy
guesta049f9
 
What is the definition of the 6 Sigma?
tnay chow
 
Photo Reflections
Joke Channel
 
Water Themed Pictures
Joke Channel
 
Lenguaje Multimedia
Unab
 
Vasolix Nereyda
vivianbc83
 
Jumarni p
jumarni pahuna
 
Living Quarters in China
Joke Channel
 
Rhinehart emerson power point presentation
Andrew Rhinehart
 
Jumarni.p
jumarni pahuna
 
Ad

Similar to pivot vector space approach in audio-video mixing (20)

PPTX
C programming
RagulTamil1
 
PPT
Applying Media Content Analysis to the Production of Musical Videos as Summar...
Chris Huang
 
PPTX
Identity fraud evaluation
J_Scott01
 
PDF
Music video codes and conventions. .pdf
MatijaSekulic
 
PDF
post-production-cmyk-2021-web.pdf
ElliotDaroczy
 
PDF
TVC production process
Mai Bằng
 
PDF
Performance Analysis of Audio and Video Synchronization using Spreaded Code D...
Eswar Publications
 
PPTX
COMP2 fundamental frequency of secondary succession season and unsure 🤔
aljhonAomay
 
PPTX
Identity fraud evaluation
J_Scott01
 
PPTX
Master the Cut: Learn the Basics of Video Editing for Beginners
CBitss Technologies
 
PPTX
Pod handler
TheaJennings2
 
PPTX
10 Ways to Enhance your Corporate Video on Edit Table
Media Designs
 
PPT
Video Data
Sanea
 
PPTX
Evaluation question 6
Ashleigh Greenaway
 
PPTX
Subtitle
Karppinen Ngoc Anh
 
PDF
Example-Based Remixing of Multimedia Contents
MediaMixerCommunity
 
PPTX
Forms and conventions of music videos
Sean Canning
 
PDF
How to Edit a Music Video Like a || GXYZ Radio.pdf
GXYZ Inc
 
DOCX
2nd editing sessionn
mollyallen19
 
C programming
RagulTamil1
 
Applying Media Content Analysis to the Production of Musical Videos as Summar...
Chris Huang
 
Identity fraud evaluation
J_Scott01
 
Music video codes and conventions. .pdf
MatijaSekulic
 
post-production-cmyk-2021-web.pdf
ElliotDaroczy
 
TVC production process
Mai Bằng
 
Performance Analysis of Audio and Video Synchronization using Spreaded Code D...
Eswar Publications
 
COMP2 fundamental frequency of secondary succession season and unsure 🤔
aljhonAomay
 
Identity fraud evaluation
J_Scott01
 
Master the Cut: Learn the Basics of Video Editing for Beginners
CBitss Technologies
 
Pod handler
TheaJennings2
 
10 Ways to Enhance your Corporate Video on Edit Table
Media Designs
 
Video Data
Sanea
 
Evaluation question 6
Ashleigh Greenaway
 
Example-Based Remixing of Multimedia Contents
MediaMixerCommunity
 
Forms and conventions of music videos
Sean Canning
 
How to Edit a Music Video Like a || GXYZ Radio.pdf
GXYZ Inc
 
2nd editing sessionn
mollyallen19
 

Recently uploaded (20)

PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
The Future of Artificial Intelligence (AI)
Mukul
 

pivot vector space approach in audio-video mixing

  • 1. Guided By:- Presented By:- Ms. Savita Vijay Kanika Rathore B.tech (C.S.E) IIIyr BTBTC10168
  • 2. Introduction  What is the need?  Aesthetic Aspects  Video Aesthetic Features  Audio Aesthetic Features  Pivot Representation  Advantages  Applications  Conclusion
  • 3. The PIVOT VECTOR SPACE APPROACH in audio mixing is a novel technique that automatically picks the best audio clip to mix with the given video shot.  This technique uses pivot vector space mixing framework & High level perceptual descriptors of audio & video characteristics.  It uses a Pivot Vector space mapping method that matches video shots with music segments based on aesthetic cinematographic heuristics .  This automatic audio-video mixing technique is suited for Home videos.
  • 4. Most videos such as movies and sitcoms have several segments devoid of any speech. Adding carefully chosen music to such segments conveys emotions such as joy, tension ,or melancholy.  In a typical professional video production, skilled audio- mixing artists aesthetically add appropriate audio to the given video shots. This process is tedious, time-consuming, and expensive.  Many home video users would like to make their videos appear like professional productions before they share it with family and friends.
  • 5. Movies comprise :- Images Graphic traces Recorded speech , music and noises Sound effects  Roles of music in movies :- Setting the scene Adding emotional meaning Serving as a background filler Creating continuity across shots or scenes Emphasizing climaxes
  • 6. The table shows Aesthetic Features that correspond in video & music
  • 7. Zettl based these proposed mixing rules on the following aspects :-  Tonal matching  Structural matching  Thematic matching  Historical-geographical matching
  • 8. A set of attributed features required to describe videos.  This consists of features which required to describe videos. Light falloff :- refers to the brightness contrast between the light and shadow sides of an object Color features :- it consist of four features saturation hue brightness energy Motion vectors :- To measure the video segments’ motion intensity.
  • 9. We obtained the mean and standard deviation for estimating the confidence level of the Video & audio attributed features for any test shot.
  • 10. Low level features :- Spectral centroid (brightness):- measure of a sound’s brightness. Zero crossing :- measure of the frequency content of the signal
  • 11. Volume (loudness) :- represents the subjective measure , which depends on the human listener’s frequency response.
  • 12. Dynamics :- the volume of musical sound related to the music’s loudness or softness.  Tempo features :- that makes the music flow unique and differentiates it from other types of audio signal is temporal organization . (beat rate)  Perceptual pitch feature :- it has an important role in human hearing, and the auditory system apparently assigns a pitch to any thing that comes to its attention.
  • 13. A vector space P acts as a pivot between the audio and video representation.  Independent of any media.  This space is defined with some aesthetic features in which music M and videos V are mapped.
  • 14. We consider how to represent video and audio clips into their aesthetic spaces V or M  In the two spaces, a dimension corresponds to an attributed feature,  It includes brightness_high , brightness_low , and so on.  One video shot is associated with one vector in the V space.  Obtaining the values for each dimension resembles handling fuzzy variables
  • 15. The aesthetic feature playing the role of a fuzzy variable and the attribute descriptor acting as a fuzzy value which is represented using diagram.  The X-axis refers to the actual computed feature value and the Y-axis simultaneously indicates the aesthetic label and the confidence value.
  • 16. In the below figure shows that a) Matching between the video L02_30 & the music T01_5 b)Sample frame the video
  • 18. Before the development of the PIVOT VECTOR SPACE APPROACH, audio-video mixing process can be carried out only by professional mixing artists.  The Pivot vector space approach enables all the home video users and amateur video enthusiasts to give a professional look and feel to their videos.  This technique also eliminates the need for professional mixing artists, thereby significantly reducing the cost, time and labour involved.
  • 19. A large amount of home video footage is being produced due to products such as Digital video camcorders , Handicams etc.  Hence, this technique will be of great use to all the amateur video enthusiasts and home video users
  • 20. This is a technique that all amateur and home video artists can use in the creation of video footage that gives a professional look and feel.  Since it is fully automatic, the user need not worry about his aesthetic capabilities.
  • 21. http://www-mrim.imag.fr/publications/2003/PM001/v_final.pdf  http://ieeexplore.ieee.org  www.edutalks.org  www.scribd.com