Recognition of Unistroke
   Gesture Sequences
              Presented by
              Justin Permar
                   &
       Arvind Krishnaa Jagannathan
Problem Statement
         To develop a gesture recognition system, which
         will segment the input sequence consisting of
         multiple gestures, drawn in one stroke, into the
         constituent.
         Our primary objective in solving this problem is to
         have a minimal set of training data in order to
         quickly build a prototype system.
Related Work

• Recognition of individual gestures in unistroke and multistroke sequences of
  digits – Yang et al: Training HMMs for continuous sequences
• Individual gesture recognition : Extracting several features from an input and
  constructing Hidden Markov Models for each gesture – Tanguay
• Recognizing an individual unistroke gesture sequence, using a distance
  measure from existing templates - $1 Recognizer - Wobbrock et al
Our Approach – Salient Features
• Extend $1 recognizer to identify multiple gestures in a gesture sequence.
• Hardest problem is that of segmentation – Classified in literature as “Inverse
  Perception Problem”.
• Based on a “Visual Affinity” approach – a type of geometric match.
• Employs Dynamic Time Warping to decide the distance from a given template.
• No restriction on gestures in the input sequence –
    • Arbitrarily long
    • Can contain any pre-defined gesture. New gestures can also be added to the system.
Approach – Training mode
• In training mode, the user draws a template of
   either an existing or new gesture.
• Only individual gestures are drawn – no need to
   train the system with gesture sequences.
• If misclassification occurs during training, user
   can indicate the intended gesture.
• Score next to the shape (“star”) is based upon a
   distance measure between the input and the
   indicated gesture.
Approach – Match using Dynamic Time
                    Warping
• Euclidean distance used as the distance metric between the neighboring points of input
  sequence and each template.
                               d = (𝑥 𝑘 −𝑥 𝑖 )2 + (𝑦 𝑘 −𝑦 𝑖 )2
      where (𝑥 𝑘 , 𝑦 𝑘 ) correspond to the points on the template under consideration
               and (𝑥 𝑖 , 𝑦 𝑖 ) correspond to the points on the input sequence.
• For a given set of input points, if there is a template which gives a score lower than a
  fixed threshold, then that gesture is recorded as being part of the input.
• The identified portion of the input is “spliced off ” and the matching procedure is
  repeated for the remaining input.
Approach – Segmentation and Recognition



                                   Candidate input points   Slide the input set of points Remaining input after splicing
                                   No Match                 Match with “v” (First dot) off the matched template


Input Gesture Sequence
Blue Dot – Start of the Sequence
Red Dots – Points at which
sequence is segmented
                                           Next set of candidate points       Sliding the set of points
                                           from remaining input – No Match    Match with “Arrow” (Final dot)
Results and Interpretation
                                           Sequence Length   Segmentation
Sequence Length       Accuracy Rate (%)                                            Sequence Length       Relaxed Accuracy (%)
                                                             Accuracy(%)
                  1                 57.5                 1                  71.5                     2                  56.19

                  2                32.16                 2                  71.5                     3                  56.37

                  3                25.72                 3              52.27               Table:3 Relaxed Accuracy

Table:1 Overall Accuracy – Exact Match        Table:2 Segmentation Accuracy
                                                                          𝑅𝑒𝑝𝑜𝑟𝑡𝑒𝑑 𝐺𝑒𝑠𝑡𝑢𝑟𝑒𝑠 𝑖𝑛 𝑖𝑛𝑝𝑢𝑡
                                                    𝑅𝑒𝑙𝑎𝑥𝑒𝑑 𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 =
                                                                       𝑇𝑜𝑡𝑎𝑙 𝐺𝑒𝑠𝑡𝑢𝑟𝑒𝑠 𝑖𝑛 𝑖𝑛𝑝𝑢𝑡 𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒
                                                 Training Size Vs. Accuracy Rate graph, indicates the following:
                                                 1. Even for relatively low training samples (𝑛 = 3) fairly good
                                                     accuracy rates are reported.
                                                 2. An optimum recognition rate is achieved at (𝑛 = 5)
                                                 3. Too many templates, cause incorrect recognition due to
                                                     “confusion” caused by many variations of an individual gesture.
Discussion

• Training only on individual gestures  lose features indicating a transition
  from one gesture to another (such as a pause)
• Construct HMMs for each individual gesture in the “gesture library” and
  then chain HMMs to construct arbitrarily long sequences of gestures.
• Compromise training time and computation for improved accuracy.
References
[1] Lyddane, Donald. United states attorneys’ bulletin. Technical report, United States Department
of Justice Executive Office for United States Attorneys, May 2006.
[2] J. Yang, Y. Xu, and C. S. Chen. Gesture interface: Modeling and learning. In Robotics and
Automation, 1994. Proceedings., 1994 IEEE International Conference on, pages 1747–1752, 1994.
[3] D. O. Tanguay Jr. Hidden markov models for gesture recognition. Master’s thesis, Massachusetts
Institute of Technology, 1995.
[4] J.O. Wobbrock, A.D. Wilson, and Y. Li. Gestures without libraries, toolkits or training: a $1
recognizer for user interface prototypes. In Proceedings of the 20th annual ACM symposium on
User interface software and technology, pages 159–168. ACM, 2007.
[5] Zygmunt Pizlo. Perception viewed as an inverse problem. Vision Research, 41(24):3145–3161,
November 2001.

More Related Content

PDF
Understanding computer vision with Deep Learning
PDF
Machine Learning Lecture 2 Basics
PDF
2013 - Andrei Zmievski: Machine learning para datos
PDF
L05 language model_part2
PPTX
Ensemble Learning and Random Forests
PDF
Ap31289293
PDF
Lecture 8: Machine Learning in Practice (1)
PDF
Machine Learning Lecture 3 Decision Trees
Understanding computer vision with Deep Learning
Machine Learning Lecture 2 Basics
2013 - Andrei Zmievski: Machine learning para datos
L05 language model_part2
Ensemble Learning and Random Forests
Ap31289293
Lecture 8: Machine Learning in Practice (1)
Machine Learning Lecture 3 Decision Trees

What's hot (17)

PPTX
Improper integral of second kind
PDF
Introduction to machine learning
PPTX
Bi model face recognition framework
PPTX
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
PPTX
Measures of dispersion
PDF
Foundations: Artificial Neural Networks
PDF
Training Deep Neural Nets
PPTX
Catching co occurrence information using word2vec-inspired matrix factorization
PDF
Unit8: Uncertainty in AI
PPTX
Neural collaborative filtering-발표
DOCX
Smu bscit sem 2 summer 2015 assignments
PDF
Recursion in Java
PPTX
Joint contrastive learning with infinite possibilities
PPT
Mc call tutorial
PDF
A simple study on computer algorithms by S. M. Risalat Hasan Chowdhury
PDF
Chapter3 hundred page machine learning
PPTX
One R (1R) Algorithm
Improper integral of second kind
Introduction to machine learning
Bi model face recognition framework
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Measures of dispersion
Foundations: Artificial Neural Networks
Training Deep Neural Nets
Catching co occurrence information using word2vec-inspired matrix factorization
Unit8: Uncertainty in AI
Neural collaborative filtering-발표
Smu bscit sem 2 summer 2015 assignments
Recursion in Java
Joint contrastive learning with infinite possibilities
Mc call tutorial
A simple study on computer algorithms by S. M. Risalat Hasan Chowdhury
Chapter3 hundred page machine learning
One R (1R) Algorithm
Ad

Viewers also liked (20)

PPSX
Unit 0
PPT
Digiaika - Mikä Muuttuu Markkinoinnissa
PPT
Cheryl Physician Presentation
PDF
Green Energy Incentives Workbook
PPTX
Hyvinvointituotteen markkinointi - miten onnistut aina!
PPT
Ota sosiaalinen media tehokäyttöön
PDF
The HFA pMDI Patent Landscape: Minefield or Goldmine
PDF
Canvas Based Presentation tool - First Review
PDF
CambridgeIP Chevening Lecture: The Economics of Climate Change - Taking the ...
PPT
Pennsylvania\'s Energy Landscape
PPT
Ch25
KEY
Picture Essay
PDF
Economics of Green Growth & National Innovation Strategies
PPSX
P I Infosoft Is Different
PDF
Participants List For Jsm
PDF
CambridgeIP: Case Studies Of Recent Client Engagements
PPSX
P I Infosoft Is Different
PPT
Verkko Myyntikanavana
PDF
Resume M Pitcher 2010 08 27
PDF
Maine Association of Broadcasters - How to Make Money with Social Media
Unit 0
Digiaika - Mikä Muuttuu Markkinoinnissa
Cheryl Physician Presentation
Green Energy Incentives Workbook
Hyvinvointituotteen markkinointi - miten onnistut aina!
Ota sosiaalinen media tehokäyttöön
The HFA pMDI Patent Landscape: Minefield or Goldmine
Canvas Based Presentation tool - First Review
CambridgeIP Chevening Lecture: The Economics of Climate Change - Taking the ...
Pennsylvania\'s Energy Landscape
Ch25
Picture Essay
Economics of Green Growth & National Innovation Strategies
P I Infosoft Is Different
Participants List For Jsm
CambridgeIP: Case Studies Of Recent Client Engagements
P I Infosoft Is Different
Verkko Myyntikanavana
Resume M Pitcher 2010 08 27
Maine Association of Broadcasters - How to Make Money with Social Media
Ad

Similar to Recognition of unistroke gesture sequences (20)

PDF
Declarative Gesture Spotting Using Inferred and Refined Control Points
PPTX
Vector-based, Structure Preserving Stroke Gesture Recognition
PDF
HAND GESTURE RECOGNITION USING STATISTICAL AND ARTIFICIAL GEOMETRIC METHODS: ...
PDF
Hand Gesture Recognition Using Statistical and Artificial Geometric Methods :...
PDF
HSV Brightness Factor Matching for Gesture Recognition System
PPTX
EXPLORATORY PROJECT
PDF
3AMIGAS - Paper5: Feifei Huo
PDF
Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game
PDF
Ijarcet vol-2-issue-3-947-950
PPTX
µV: An Articulation, Rotation, Scaling, and Translation Invariant (ARST) Mult...
PDF
Gesture Recognition Based Video Game Controller
PPTX
Elsevier PPT
PPTX
Hand Gesture Recognition Applications
PPTX
Kccsi 2012 a real-time robust object tracking-v2
PPTX
!FTL, an Articulation-Invariant Stroke Gesture Recognizer with Controllable P...
PDF
Hand gesture recognition using support vector machine
DOC
PPTX
hand gestuer12233322233333333666666.pptx
PDF
Continuous hand gesture segmentation and acknowledgement of hand gesture path...
PDF
IRJET-Real Time Hand Gesture Recognition using Finger Tips
Declarative Gesture Spotting Using Inferred and Refined Control Points
Vector-based, Structure Preserving Stroke Gesture Recognition
HAND GESTURE RECOGNITION USING STATISTICAL AND ARTIFICIAL GEOMETRIC METHODS: ...
Hand Gesture Recognition Using Statistical and Artificial Geometric Methods :...
HSV Brightness Factor Matching for Gesture Recognition System
EXPLORATORY PROJECT
3AMIGAS - Paper5: Feifei Huo
Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Ijarcet vol-2-issue-3-947-950
µV: An Articulation, Rotation, Scaling, and Translation Invariant (ARST) Mult...
Gesture Recognition Based Video Game Controller
Elsevier PPT
Hand Gesture Recognition Applications
Kccsi 2012 a real-time robust object tracking-v2
!FTL, an Articulation-Invariant Stroke Gesture Recognizer with Controllable P...
Hand gesture recognition using support vector machine
hand gestuer12233322233333333666666.pptx
Continuous hand gesture segmentation and acknowledgement of hand gesture path...
IRJET-Real Time Hand Gesture Recognition using Finger Tips

More from Arvind Krishnaa (17)

PPTX
Twitter Agreement Analysis
PPTX
Analogical thinking
PPTX
Human Altruism and Cooperation
PDF
Chowka bhara
PDF
Canscape
PDF
Final review presentation
PDF
Third review presentation
PDF
Second review presentation
PDF
First review presentation
PDF
Zeroth review presentation - eBay Turmeric / SMC
PPTX
Canvas Based Presentation - Zeroth Review
PDF
Data Binding and Data Grid View Classes
PPT
Smart camera monitoring system
ODP
Marine Pollution
PPTX
Unix Shell and System Boot Process
ODP
Multithreading Concepts
PPTX
Design and Analysis of Algorithms
Twitter Agreement Analysis
Analogical thinking
Human Altruism and Cooperation
Chowka bhara
Canscape
Final review presentation
Third review presentation
Second review presentation
First review presentation
Zeroth review presentation - eBay Turmeric / SMC
Canvas Based Presentation - Zeroth Review
Data Binding and Data Grid View Classes
Smart camera monitoring system
Marine Pollution
Unix Shell and System Boot Process
Multithreading Concepts
Design and Analysis of Algorithms

Recently uploaded (20)

PPTX
PLASMA AND ITS CONSTITUENTS 123.pptx
PPTX
2025 High Blood Pressure Guideline Slide Set.pptx
PDF
Lecture on Viruses: Structure, Classification, Replication, Effects on Cells,...
PDF
Farming Based Livelihood Systems English Notes
PDF
PUBH1000 - Module 6: Global Health Tute Slides
PPTX
Reproductive system-Human anatomy and physiology
PDF
M.Tech in Aerospace Engineering | BIT Mesra
PDF
Laparoscopic Colorectal Surgery at WLH Hospital
PDF
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
PDF
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
PDF
Literature_Review_methods_ BRACU_MKT426 course material
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PDF
African Communication Research: A review
PPTX
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
PDF
anganwadi services for the b.sc nursing and GNM
PPTX
BSCE 2 NIGHT (CHAPTER 2) just cases.pptx
PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PDF
1.Salivary gland disease.pdf 3.Bleeding and Clotting Disorders.pdf important
PDF
Mucosal Drug Delivery system_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
PDF
Disorder of Endocrine system (1).pdfyyhyyyy
PLASMA AND ITS CONSTITUENTS 123.pptx
2025 High Blood Pressure Guideline Slide Set.pptx
Lecture on Viruses: Structure, Classification, Replication, Effects on Cells,...
Farming Based Livelihood Systems English Notes
PUBH1000 - Module 6: Global Health Tute Slides
Reproductive system-Human anatomy and physiology
M.Tech in Aerospace Engineering | BIT Mesra
Laparoscopic Colorectal Surgery at WLH Hospital
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
Literature_Review_methods_ BRACU_MKT426 course material
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
African Communication Research: A review
pharmaceutics-1unit-1-221214121936-550b56aa.pptx
anganwadi services for the b.sc nursing and GNM
BSCE 2 NIGHT (CHAPTER 2) just cases.pptx
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
1.Salivary gland disease.pdf 3.Bleeding and Clotting Disorders.pdf important
Mucosal Drug Delivery system_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
Disorder of Endocrine system (1).pdfyyhyyyy

Recognition of unistroke gesture sequences

  • 1. Recognition of Unistroke Gesture Sequences Presented by Justin Permar & Arvind Krishnaa Jagannathan
  • 2. Problem Statement To develop a gesture recognition system, which will segment the input sequence consisting of multiple gestures, drawn in one stroke, into the constituent. Our primary objective in solving this problem is to have a minimal set of training data in order to quickly build a prototype system.
  • 3. Related Work • Recognition of individual gestures in unistroke and multistroke sequences of digits – Yang et al: Training HMMs for continuous sequences • Individual gesture recognition : Extracting several features from an input and constructing Hidden Markov Models for each gesture – Tanguay • Recognizing an individual unistroke gesture sequence, using a distance measure from existing templates - $1 Recognizer - Wobbrock et al
  • 4. Our Approach – Salient Features • Extend $1 recognizer to identify multiple gestures in a gesture sequence. • Hardest problem is that of segmentation – Classified in literature as “Inverse Perception Problem”. • Based on a “Visual Affinity” approach – a type of geometric match. • Employs Dynamic Time Warping to decide the distance from a given template. • No restriction on gestures in the input sequence – • Arbitrarily long • Can contain any pre-defined gesture. New gestures can also be added to the system.
  • 5. Approach – Training mode • In training mode, the user draws a template of either an existing or new gesture. • Only individual gestures are drawn – no need to train the system with gesture sequences. • If misclassification occurs during training, user can indicate the intended gesture. • Score next to the shape (“star”) is based upon a distance measure between the input and the indicated gesture.
  • 6. Approach – Match using Dynamic Time Warping • Euclidean distance used as the distance metric between the neighboring points of input sequence and each template. d = (𝑥 𝑘 −𝑥 𝑖 )2 + (𝑦 𝑘 −𝑦 𝑖 )2 where (𝑥 𝑘 , 𝑦 𝑘 ) correspond to the points on the template under consideration and (𝑥 𝑖 , 𝑦 𝑖 ) correspond to the points on the input sequence. • For a given set of input points, if there is a template which gives a score lower than a fixed threshold, then that gesture is recorded as being part of the input. • The identified portion of the input is “spliced off ” and the matching procedure is repeated for the remaining input.
  • 7. Approach – Segmentation and Recognition Candidate input points Slide the input set of points Remaining input after splicing No Match Match with “v” (First dot) off the matched template Input Gesture Sequence Blue Dot – Start of the Sequence Red Dots – Points at which sequence is segmented Next set of candidate points Sliding the set of points from remaining input – No Match Match with “Arrow” (Final dot)
  • 8. Results and Interpretation Sequence Length Segmentation Sequence Length Accuracy Rate (%) Sequence Length Relaxed Accuracy (%) Accuracy(%) 1 57.5 1 71.5 2 56.19 2 32.16 2 71.5 3 56.37 3 25.72 3 52.27 Table:3 Relaxed Accuracy Table:1 Overall Accuracy – Exact Match Table:2 Segmentation Accuracy 𝑅𝑒𝑝𝑜𝑟𝑡𝑒𝑑 𝐺𝑒𝑠𝑡𝑢𝑟𝑒𝑠 𝑖𝑛 𝑖𝑛𝑝𝑢𝑡 𝑅𝑒𝑙𝑎𝑥𝑒𝑑 𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = 𝑇𝑜𝑡𝑎𝑙 𝐺𝑒𝑠𝑡𝑢𝑟𝑒𝑠 𝑖𝑛 𝑖𝑛𝑝𝑢𝑡 𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒 Training Size Vs. Accuracy Rate graph, indicates the following: 1. Even for relatively low training samples (𝑛 = 3) fairly good accuracy rates are reported. 2. An optimum recognition rate is achieved at (𝑛 = 5) 3. Too many templates, cause incorrect recognition due to “confusion” caused by many variations of an individual gesture.
  • 9. Discussion • Training only on individual gestures  lose features indicating a transition from one gesture to another (such as a pause) • Construct HMMs for each individual gesture in the “gesture library” and then chain HMMs to construct arbitrarily long sequences of gestures. • Compromise training time and computation for improved accuracy.
  • 10. References [1] Lyddane, Donald. United states attorneys’ bulletin. Technical report, United States Department of Justice Executive Office for United States Attorneys, May 2006. [2] J. Yang, Y. Xu, and C. S. Chen. Gesture interface: Modeling and learning. In Robotics and Automation, 1994. Proceedings., 1994 IEEE International Conference on, pages 1747–1752, 1994. [3] D. O. Tanguay Jr. Hidden markov models for gesture recognition. Master’s thesis, Massachusetts Institute of Technology, 1995. [4] J.O. Wobbrock, A.D. Wilson, and Y. Li. Gestures without libraries, toolkits or training: a $1 recognizer for user interface prototypes. In Proceedings of the 20th annual ACM symposium on User interface software and technology, pages 159–168. ACM, 2007. [5] Zygmunt Pizlo. Perception viewed as an inverse problem. Vision Research, 41(24):3145–3161, November 2001.