SlideShare a Scribd company logo
Large-Scale Capture of Producer-Defined Musical
Semantics
Ryan Stables
School of Digital Media Technology
Birmingham City University
Problem...
Problem Definition
Producer:
Audio effects parameters
usually refer to low-level
attributes.
Professionally produced audio
often requires extensive
training.
Researcher:
Lack of semantically annotated
music production datasets.
How can we map low-level
descriptors to perceived
muscial timbre?
Problem Definition
Descriptors need to represent
the views of music producers.
These may change with genre,
musical instruments, etc...
Various terms may be used to
define similar things (colour,
texture etc...)
Project Aims
1. Gather large amounts of semantics data during the music
creation/production process.
Develop a series of DAW plug-ins.
Extract information and anonymously upload it to a server.
2. Identify correlation and patterns in the semantics data.
3. Use the data to improve/aid music production tasks.
Model...
Project overview
Server
Descriptor name...
Save...Load...
Save...
Semantic Descriptor
Parameter Space
Feature Set
Pre/Post Gain
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
(1)
(2)(3)
Figure : Schematic Overview of the Semantic Audio Feature Extraction Project.
(1) Plug-in interface
Parameters can be set
experimentally.
Semantic descriptors to be
stored in text field.
Descriptors can be loaded
through same interface.
Parameters are stored and/or
set.
Figure : Semantic Audio plug-in: Multi-band distortion
(2) Feature Extraction
Features are extracted from the
selected region.
The parameter space is stored.
Semantic descriptors are sent
as targets.
Additional metadata is sent, if
available.
Server
Descriptor name...
Save...Load...
Save...
Semantic Descriptor
Parameter Space
Feature Set
Pre/Post Gain
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
Figure : Stored attributes.
(3) Mapping
NLP Algorithms to identify
semantic correlation.
Dimensionality reduction to
find correlation in
features/parameters.
Additional data partitions
based on metadata (Genre,
instrument, etc...)
Results sent back to user
plug-in.
Server
Descriptor name...
Save...Load...
Sav
Semantic
Paramete
Feature Se
Pre/
Analysis...
Natural Language
Processing
Dimensionality
Reduction
Etc...
Figure : Results processing
Design Constraints...
Architecture
Requirements:
Maximisation of user-base.
Transparency: Access to the processing chain.
Design decisions:
Stand-alone plug-ins.
MultiFX.
Plug-ins within a plug-in.
Analysis-only.
Other:
Free field vs. fixed word.
Before and after.
Metadata pane.
Analysis framework
LibXtract.
Hard-coded, C library.
Around 400 combined
audio features*.
[Bullock, 2007]
Vamp.
Plug-in within a plug-in.
Hosts LibXtract features,
amongst others.
[Cannam et al., 2006]
Mini-Project...
Mini-Project: Aims
Analyse the production requirements of musicians.
Birmingham Conservatoire
The Music Producers Guild
The Birmingham Music Network
Build a series of prototype systems for the collection of
musical semantics data.
Use these systems to collect data from a small group of
musicians during the production process.
Evaluate the results in order to identify a suitable system for
future research.
Demonstrate the feasibility of a wider research project in this
area.
Mini Project: Schematic
Plug-in
development
Interface design
Algorithm
Development
Server, network,
data distribution
User Testing Data
Aquisition
Results
Analysis
Figure : Schematic Overview of the Mini-Project.
Positions and Timescale
2 x PhD Students: 1 x C4DM (QMUL) & 1 x DMT (BCU).
3 x Advisory roles.
Timescale: 6-months from September 2013.
Future: collaborative grant application.
Thanks!
ryan.stables@bcu.ac.uk
References
Bullock, J. (2007).
Libxtract: A lightweight library for audio feature extraction.
In Proceedings of the International Computer Music
Conference, volume 43.
Cannam, C., Landone, C., Sandler, M. B., and Bello, J. P.
(2006).
The sonic visualiser: A visualisation platform for semantic
descriptors from musical signals.
In ISMIR, pages 324–327.

More Related Content

PPT
News collections at the British Library - Luke McKernan (Semantic Media @ The...
sebastianewert
 
PPT
British library radio collections
sebastianewert
 
PPTX
Semantic Media Project Introduction - Mark Sandler (Barbican Arts Centre, Oct...
sebastianewert
 
PPTX
BBC R&D Datasets - Jana Eggink (Semantic Media @ BBC, Feb 2013)
sebastianewert
 
PPTX
Exploring the British Library's audio collections - Richard Ranft (Semantic M...
sebastianewert
 
PPTX
Mood-based Classification of TV Programmes - Jana Eggink, Sam Davies, Denise...
sebastianewert
 
PDF
C4DM Seminar 2016-07-12: Brecht De Man
sebastianewert
 
PDF
Semantic Linking of Information, Content and Metadata for Early Music (SLICKM...
sebastianewert
 
News collections at the British Library - Luke McKernan (Semantic Media @ The...
sebastianewert
 
British library radio collections
sebastianewert
 
Semantic Media Project Introduction - Mark Sandler (Barbican Arts Centre, Oct...
sebastianewert
 
BBC R&D Datasets - Jana Eggink (Semantic Media @ BBC, Feb 2013)
sebastianewert
 
Exploring the British Library's audio collections - Richard Ranft (Semantic M...
sebastianewert
 
Mood-based Classification of TV Programmes - Jana Eggink, Sam Davies, Denise...
sebastianewert
 
C4DM Seminar 2016-07-12: Brecht De Man
sebastianewert
 
Semantic Linking of Information, Content and Metadata for Early Music (SLICKM...
sebastianewert
 

Similar to Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Semantic Media @ The British Library, 23 September 2013) (16)

PPTX
Music Objects to Social Machines
David De Roure
 
PDF
MediaEval 2020: Emotion and Theme Recognition in Music Using Jamendo
multimediaeval
 
PDF
Introduction to Music Information Retrieval
Andrea Gazzarini
 
PDF
Introduction to Music Information Retrieval
Sease
 
PDF
Applsci 08-00606-v3
IsraelEbonko
 
PDF
Computational Approaches for Melodic Description in Indian Art Music Corpora
Sankalp Gulati
 
PPTX
Computational models of symphonic music
Emilia Gómez
 
PDF
AMT overview
WarNik Chow
 
PDF
FORECASTING MUSIC GENRE (RNN - LSTM)
IRJET Journal
 
PDF
Music similarity: what for?
Emilia Gómez
 
PDF
machine learning x music
Yi-Hsuan Yang
 
PDF
20211026 taicca 1 intro to mir
Yi-Hsuan Yang
 
PPT
Towards User-friendly Audio Creation
Jean Vanderdonckt
 
PDF
New Forms of Representation to Listen, Analyze, and Create Electroacoustic Music
Pierre Couprie
 
PDF
Trends in Music Recommendations 2018
Karthik Murugesan
 
PDF
Music Recommendation 2018
Fabien Gouyon
 
Music Objects to Social Machines
David De Roure
 
MediaEval 2020: Emotion and Theme Recognition in Music Using Jamendo
multimediaeval
 
Introduction to Music Information Retrieval
Andrea Gazzarini
 
Introduction to Music Information Retrieval
Sease
 
Applsci 08-00606-v3
IsraelEbonko
 
Computational Approaches for Melodic Description in Indian Art Music Corpora
Sankalp Gulati
 
Computational models of symphonic music
Emilia Gómez
 
AMT overview
WarNik Chow
 
FORECASTING MUSIC GENRE (RNN - LSTM)
IRJET Journal
 
Music similarity: what for?
Emilia Gómez
 
machine learning x music
Yi-Hsuan Yang
 
20211026 taicca 1 intro to mir
Yi-Hsuan Yang
 
Towards User-friendly Audio Creation
Jean Vanderdonckt
 
New Forms of Representation to Listen, Analyze, and Create Electroacoustic Music
Pierre Couprie
 
Trends in Music Recommendations 2018
Karthik Murugesan
 
Music Recommendation 2018
Fabien Gouyon
 
Ad

Recently uploaded (20)

PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PDF
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
Economic Impact of Data Centres to the Malaysian Economy
flintglobalapac
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Precisely
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Ad

Large-Scale Capture of Producer-Defined Musical Semantics - Ryan Stables (Semantic Media @ The British Library, 23 September 2013)

  • 1. Large-Scale Capture of Producer-Defined Musical Semantics Ryan Stables School of Digital Media Technology Birmingham City University
  • 3. Problem Definition Producer: Audio effects parameters usually refer to low-level attributes. Professionally produced audio often requires extensive training. Researcher: Lack of semantically annotated music production datasets. How can we map low-level descriptors to perceived muscial timbre?
  • 4. Problem Definition Descriptors need to represent the views of music producers. These may change with genre, musical instruments, etc... Various terms may be used to define similar things (colour, texture etc...)
  • 5. Project Aims 1. Gather large amounts of semantics data during the music creation/production process. Develop a series of DAW plug-ins. Extract information and anonymously upload it to a server. 2. Identify correlation and patterns in the semantics data. 3. Use the data to improve/aid music production tasks.
  • 7. Project overview Server Descriptor name... Save...Load... Save... Semantic Descriptor Parameter Space Feature Set Pre/Post Gain Analysis... Natural Language Processing Dimensionality Reduction Etc... (1) (2)(3) Figure : Schematic Overview of the Semantic Audio Feature Extraction Project.
  • 8. (1) Plug-in interface Parameters can be set experimentally. Semantic descriptors to be stored in text field. Descriptors can be loaded through same interface. Parameters are stored and/or set. Figure : Semantic Audio plug-in: Multi-band distortion
  • 9. (2) Feature Extraction Features are extracted from the selected region. The parameter space is stored. Semantic descriptors are sent as targets. Additional metadata is sent, if available. Server Descriptor name... Save...Load... Save... Semantic Descriptor Parameter Space Feature Set Pre/Post Gain Analysis... Natural Language Processing Dimensionality Reduction Etc... Figure : Stored attributes.
  • 10. (3) Mapping NLP Algorithms to identify semantic correlation. Dimensionality reduction to find correlation in features/parameters. Additional data partitions based on metadata (Genre, instrument, etc...) Results sent back to user plug-in. Server Descriptor name... Save...Load... Sav Semantic Paramete Feature Se Pre/ Analysis... Natural Language Processing Dimensionality Reduction Etc... Figure : Results processing
  • 12. Architecture Requirements: Maximisation of user-base. Transparency: Access to the processing chain. Design decisions: Stand-alone plug-ins. MultiFX. Plug-ins within a plug-in. Analysis-only. Other: Free field vs. fixed word. Before and after. Metadata pane.
  • 13. Analysis framework LibXtract. Hard-coded, C library. Around 400 combined audio features*. [Bullock, 2007] Vamp. Plug-in within a plug-in. Hosts LibXtract features, amongst others. [Cannam et al., 2006]
  • 15. Mini-Project: Aims Analyse the production requirements of musicians. Birmingham Conservatoire The Music Producers Guild The Birmingham Music Network Build a series of prototype systems for the collection of musical semantics data. Use these systems to collect data from a small group of musicians during the production process. Evaluate the results in order to identify a suitable system for future research. Demonstrate the feasibility of a wider research project in this area.
  • 16. Mini Project: Schematic Plug-in development Interface design Algorithm Development Server, network, data distribution User Testing Data Aquisition Results Analysis Figure : Schematic Overview of the Mini-Project.
  • 17. Positions and Timescale 2 x PhD Students: 1 x C4DM (QMUL) & 1 x DMT (BCU). 3 x Advisory roles. Timescale: 6-months from September 2013. Future: collaborative grant application. Thanks! [email protected]
  • 18. References Bullock, J. (2007). Libxtract: A lightweight library for audio feature extraction. In Proceedings of the International Computer Music Conference, volume 43. Cannam, C., Landone, C., Sandler, M. B., and Bello, J. P. (2006). The sonic visualiser: A visualisation platform for semantic descriptors from musical signals. In ISMIR, pages 324–327.