Gender Recognition by Voice
TEAM 6
Team Members:
1. Dithya Prasanna
2. Shamanth H S
3. Priyadarshini B
4. Sarthak Sharma
5. Meghana K
6. Prasun Sarkar
Introduction
• Voice is one of the most common means of human
communication. Listeners can usually identify a
speaker's gender from the voice alone.
• A voice carries many acoustic features, which are
often treated as a voiceprint for recognizing the
gender of the speaker.
Problem Statement
Building reliable models to identify a voice as male or
female, based upon acoustic properties of the voice and
speech.
The goal is to compare outputs of different models and
suggest the best model that can be used for gender
recognition by voice for real-world inputs.
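One way to make such a comparison concrete is to score every candidate on the same held-out samples. The sketch below uses only the Python standard library. The two toy "models" (a majority-class baseline and a single threshold on meanfun), the 0.14 kHz threshold, and the sample rows are illustrative assumptions, not the models or data used in this project.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple

Sample = Dict[str, float]
Model = Callable[[Sample], str]
Labeled = List[Tuple[Sample, str]]


def accuracy(model: Model, data: Labeled) -> float:
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for features, label in data if model(features) == label)
    return correct / len(data)


def make_majority_baseline(train: Labeled) -> Model:
    """Always predict the most frequent training label."""
    most_common = Counter(label for _, label in train).most_common(1)[0][0]
    return lambda features: most_common


def make_threshold_model(feature: str, threshold: float) -> Model:
    """Predict 'male' when the feature is below the threshold, else 'female'."""
    return lambda features: "male" if features[feature] < threshold else "female"


def compare(models: Dict[str, Model], test: Labeled) -> List[Tuple[str, float]]:
    """Score every model on the same test set and sort best first."""
    scores = [(name, accuracy(model, test)) for name, model in models.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)


# Made-up rows for illustration only (meanfun in kHz).
train = [({"meanfun": 0.11}, "male"), ({"meanfun": 0.12}, "male"),
         ({"meanfun": 0.18}, "female")]
test = [({"meanfun": 0.10}, "male"), ({"meanfun": 0.19}, "female"),
        ({"meanfun": 0.17}, "female"), ({"meanfun": 0.13}, "male")]

models = {
    "majority_baseline": make_majority_baseline(train),
    "meanfun_threshold": make_threshold_model("meanfun", 0.14),
}
ranking = compare(models, test)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

Keeping a trivial baseline in the ranking is a deliberate choice: it shows how much of a model's score comes from the features rather than from class balance.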
About the dataset
• The dataset consists of 3,168 recorded voice samples,
collected from male and female speakers. The voice
samples are pre-processed by acoustic analysis in R
using the seewave and tuneR packages, with an
analyzed frequency range of 0 Hz to 280 Hz (human vocal
range). Each sample is described by 20 acoustic features
plus a male/female label.
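A minimal sketch of loading a table with this layout and checking it before modelling, using only the standard library. The column names are taken from the data description in this deck. The file name `voice.csv` mentioned afterwards and the all-zero placeholder rows are assumptions made so the example runs without the real data.

```python
import csv
import io
from collections import Counter

FEATURES = [
    "meanfreq", "sd", "median", "Q25", "Q75", "IQR", "skew", "kurt",
    "sp.ent", "sfm", "mode", "centroid", "meanfun", "minfun", "maxfun",
    "meandom", "mindom", "maxdom", "dfrange", "modindx",
]
TARGET = "label"


def load_rows(handle):
    """Read rows, verify the header, and convert feature values to float."""
    reader = csv.DictReader(handle)
    present = reader.fieldnames or []
    missing = [name for name in FEATURES + [TARGET] if name not in present]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    rows = []
    for raw in reader:
        row = {name: float(raw[name]) for name in FEATURES}
        row[TARGET] = raw[TARGET].strip().lower()
        rows.append(row)
    return rows


# Placeholder data: every feature is 0.0, so this only exercises the schema check.
header = ",".join(FEATURES + [TARGET])
zeros = ",".join(["0.0"] * len(FEATURES))
demo = io.StringIO(f"{header}\n{zeros},male\n{zeros},female\n")

rows = load_rows(demo)
label_counts = Counter(row[TARGET] for row in rows)
print(len(FEATURES), "features;", dict(label_counts))
```

With the real data, the same function could be called on an opened file, for example `open("voice.csv", newline="")`, assuming that is where the table is stored.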
About the dataset – Data Description
• meanfreq: mean frequency (in kHz)
• sd: standard deviation of frequency
• median: median frequency (in kHz)
• Q25: first quartile (in kHz)
• Q75: third quartile (in kHz)
• IQR: interquartile range (in kHz)
• skew: skewness (see note in specprop description)
• kurt: kurtosis (see note in specprop description)
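The statistics on this slide can be illustrated by treating a normalised amplitude spectrum as a probability distribution over frequency. The sketch below uses textbook moment and quantile definitions. That is an assumption: the seewave `specprop` function referred to above may define some of these differently, skewness and kurtosis in particular. The four-bin spectrum is made up.

```python
import math
from typing import Dict, List


def spectral_summary(freqs_khz: List[float], amplitudes: List[float]) -> Dict[str, float]:
    """Summarise a spectrum, using normalised amplitude as probability mass.

    Assumes at least two bins carry energy, so the standard deviation is non-zero.
    """
    total = sum(amplitudes)
    probs = [a / total for a in amplitudes]
    pairs = list(zip(freqs_khz, probs))

    mean = sum(f * p for f, p in pairs)
    sd = math.sqrt(sum(p * (f - mean) ** 2 for f, p in pairs))

    def quantile(q: float) -> float:
        """First frequency bin at which the cumulative mass reaches q."""
        cumulative = 0.0
        for f, p in pairs:
            cumulative += p
            if cumulative >= q - 1e-12:  # small tolerance for float rounding
                return f
        return freqs_khz[-1]

    q25, median, q75 = quantile(0.25), quantile(0.50), quantile(0.75)
    return {
        "meanfreq": mean,
        "sd": sd,
        "median": median,
        "Q25": q25,
        "Q75": q75,
        "IQR": q75 - q25,
        "skew": sum(p * (f - mean) ** 3 for f, p in pairs) / sd ** 3,
        "kurt": sum(p * (f - mean) ** 4 for f, p in pairs) / sd ** 4,
    }


# Made-up flat spectrum over four bins, for illustration only.
summary = spectral_summary([0.1, 0.2, 0.3, 0.4], [1.0, 1.0, 1.0, 1.0])
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```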
Data Description (contd.)
• sp.ent: spectral entropy
• sfm: spectral flatness
• mode: mode frequency
• centroid: frequency centroid (see specprop)
• meanfun: average of fundamental frequency measured
across acoustic signal
• minfun: minimum fundamental frequency measured
across acoustic signal
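For sp.ent and sfm, a short sketch under commonly used definitions: spectral entropy as the Shannon entropy of the normalised spectrum scaled by log(N), and spectral flatness as the ratio of geometric mean to arithmetic mean. Whether the dataset's preprocessing used exactly these formulas is an assumption, and both example spectra are made up.

```python
import math
from typing import List


def spectral_entropy(amplitudes: List[float]) -> float:
    """Shannon entropy of the normalised spectrum, scaled to [0, 1] by log(N)."""
    total = sum(amplitudes)
    probs = [a / total for a in amplitudes if a > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return entropy / math.log(len(amplitudes))


def spectral_flatness(amplitudes: List[float]) -> float:
    """Geometric mean divided by arithmetic mean; needs strictly positive values."""
    n = len(amplitudes)
    log_mean = sum(math.log(a) for a in amplitudes) / n
    return math.exp(log_mean) / (sum(amplitudes) / n)


flat = [1.0, 1.0, 1.0, 1.0]          # noise-like: energy spread evenly
peaked = [0.01, 0.01, 3.96, 0.02]    # tone-like: energy in one bin

for name, spectrum in [("flat", flat), ("peaked", peaked)]:
    print(name, round(spectral_entropy(spectrum), 3), round(spectral_flatness(spectrum), 3))
```

Both measures are close to 1 for a noise-like spectrum and close to 0 when the energy sits in a single bin.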
Data Description (contd.)
• maxfun: maximum fundamental frequency measured
across acoustic signal
• meandom: average of dominant frequency measured
across acoustic signal
• mindom: minimum of dominant frequency measured
across acoustic signal
• maxdom: maximum of dominant frequency measured
across acoustic signal
Data Description (contd.)
• modindx: modulation index. Calculated as the
accumulated absolute difference between adjacent
measurements of fundamental frequencies divided by
the frequency range
• dfrange: range of dominant frequency measured across
acoustic signal
• label: male or female (Target/Dependent Variable)
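The modindx and dfrange definitions above can be sketched directly. Two assumptions are made here: "the frequency range" is taken as the maximum minus the minimum of the same measurements, and a constant contour is given a modulation index of 0. The example contours are made up.

```python
from typing import List


def modulation_index(fundamental_khz: List[float]) -> float:
    """Sum of |f[i+1] - f[i]| over adjacent measurements, divided by max - min."""
    freq_range = max(fundamental_khz) - min(fundamental_khz)
    if freq_range == 0:
        return 0.0  # assumption: a constant contour has no modulation
    total_change = sum(
        abs(later - earlier)
        for earlier, later in zip(fundamental_khz, fundamental_khz[1:])
    )
    return total_change / freq_range


def dominant_range(dominant_khz: List[float]) -> float:
    """dfrange: maximum minus minimum dominant frequency."""
    return max(dominant_khz) - min(dominant_khz)


# Made-up contour: changes 0.04 + 0.02 + 0.06 = 0.12 over a range of 0.08.
contour = [0.10, 0.14, 0.12, 0.18]
print(round(modulation_index(contour), 3))
print(dominant_range([0.5, 2.0, 1.0]))
```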
Does this model work when deployed?
Let’s find out!
https://gender-recognition-by-voice.herokuapp.com/
Predicted: Male
Conclusion
• Of all the models built, the Gradient Boosting
Classifier achieved the best accuracy score, 0.9887.
• The features that matter most for identifying
gender by voice are meanfun, sd, Q25,
sfm, sp.ent and meanfreq.
• Gender recognition by voice can be used in various
applications, such as detecting feelings, differentiating
between audio and video using tags, etc.
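The deck does not say how feature importance was measured. One common, model-agnostic way is permutation importance: shuffle one feature across samples and record how much accuracy drops. The sketch below is an illustration under that assumption. The threshold "model" on meanfun and the six rows are made up.

```python
import random
from typing import Callable, Dict, List, Tuple

Sample = Dict[str, float]
Labeled = List[Tuple[Sample, str]]


def accuracy(predict: Callable[[Sample], str], data: Labeled) -> float:
    """Fraction of samples predicted correctly."""
    return sum(predict(x) == y for x, y in data) / len(data)


def permutation_importance(predict, data: Labeled, feature: str,
                           repeats: int = 20, seed: int = 0) -> float:
    """Average drop in accuracy after shuffling one feature across samples."""
    rng = random.Random(seed)
    baseline = accuracy(predict, data)
    drops = []
    for _ in range(repeats):
        values = [x[feature] for x, _ in data]
        rng.shuffle(values)
        shuffled = [({**x, feature: v}, y) for (x, y), v in zip(data, values)]
        drops.append(baseline - accuracy(predict, shuffled))
    return sum(drops) / repeats


def predict(sample: Sample) -> str:
    """Toy model (assumption): uses meanfun only and ignores modindx."""
    return "male" if sample["meanfun"] < 0.14 else "female"


# Made-up rows for illustration only.
data = [
    ({"meanfun": 0.10, "modindx": 0.2}, "male"),
    ({"meanfun": 0.11, "modindx": 0.5}, "male"),
    ({"meanfun": 0.12, "modindx": 0.1}, "male"),
    ({"meanfun": 0.17, "modindx": 0.3}, "female"),
    ({"meanfun": 0.18, "modindx": 0.4}, "female"),
    ({"meanfun": 0.19, "modindx": 0.6}, "female"),
]

importance = {name: permutation_importance(predict, data, name)
              for name in ("meanfun", "modindx")}
print(importance)
```

A feature the model never reads gets an importance of exactly zero, while shuffling a feature the model depends on lowers accuracy.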
THANK YOU!
