SPARSITY NORMALIZATION:
STABILIZING THE EXPECTED
OUTPUTS OF DEEP NETWORKS
2019. 06. 07.
Joonyoung Yi
joonyoung.yi@kaist.ac.kr
VARIABLE SPARSITY PROBLEM
• Many benchmark datasets differ in sparsity between data instances.
• Variable sparsity problem: the expected value of the output layer depends on the sparsity of the input data instance, which makes training difficult.
• Outputs vary for data instances with similar characteristics but different sparsity.
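The problem above can be illustrated with a small numeric sketch (not from the slides): when missing entries are imputed as zeros, the magnitude of a linear unit's pre-activation grows with the number of observed entries, even though the observed values come from the same distribution.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 100                        # input dimension (illustrative)
w = rng.normal(0.0, 1.0, d)    # a fixed dense weight vector

def expected_preactivation(n_observed, trials=5000):
    """Mean |pre-activation| of a linear unit when only n_observed of the
    d input entries are observed (the rest are imputed as zeros)."""
    total = 0.0
    for _ in range(trials):
        x = np.zeros(d)
        idx = rng.choice(d, n_observed, replace=False)
        x[idx] = rng.normal(1.0, 0.5, n_observed)  # observed values
        total += abs(w @ x)
    return total / trials

# The output scale grows with the number of observed entries
# (roughly like sqrt(n_observed) for random weights).
print(expected_preactivation(10))
print(expected_preactivation(90))
```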
SPARSITY NORMALIZATION
• Divide each input data instance by its ℓ0 norm (the number of observed, non-zero entries), so that the outputs do not depend on sparsity (can be applied to CNNs similarly).
• Sparsity Normalization solves the variable sparsity problem (theoretically and experimentally).
• Sparsity in a hidden layer is more stable after applying Sparsity Normalization.
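The normalization described above can be sketched as follows; the `scale` constant is an illustrative rescaling factor, not taken from the slides.

```python
import numpy as np

def sparsity_normalize(x, scale=1.0):
    """Sparsity Normalization sketch: divide each input instance by its
    l0 norm (number of non-zero / observed entries).
    `scale` is an illustrative constant, not from the slides."""
    x = np.asarray(x, dtype=float)
    l0 = np.count_nonzero(x, axis=-1, keepdims=True)
    l0 = np.maximum(l0, 1)          # guard against all-zero instances
    return scale * x / l0

# Two instances with the same observed value but different sparsity:
sparse = np.array([2.0, 0.0, 0.0, 0.0])   # 1 of 4 entries observed
dense  = np.array([2.0, 2.0, 2.0, 2.0])   # all 4 entries observed

# After normalization, a unit-weight linear unit produces the same
# output for both, independent of sparsity.
print(sparsity_normalize(sparse).sum())   # 2.0
print(sparsity_normalize(dense).sum())    # 2.0
```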
EXPERIMENTAL RESULTS
• Collaborative filtering datasets: achieved state-of-the-art performance on MovieLens 100K & 1M by simply applying Sparsity Normalization to a non-state-of-the-art model.
• Electronic health records (EHR) dataset: better AUC; orthogonal to Dropout.
• Vision datasets: better accuracy with less capacity; orthogonal to Batch Normalization.
• 6 UCI datasets: better performance even compared to other missing-value handling techniques.
