End-to-end convolutional
network for saliency prediction
Junting Pan Xavier Giró-i-Nieto
Slides online
@DocXavi
Large-scale Scene
Understanding (LSUN)
Challenge 2015
http://bit.ly/juntingnet
2
Financial support
Technical support
Albert Gil Josep Pujal
ACKNOWLEDGMENTS
3
LSUN SALIENCY CHALLENGE: A Déjà vu ?
John Markoff, “Scientists See Promise in Deep-Learning Programs”, The New York Times (Nov 2012).
Photo: Keith Penner
4
LSUN SALIENCY CHALLENGE: A Déjà vu ?
[Mohedano’14]
5
LSUN SALIENCY CHALLENGE: A Déjà vu ?
6
RELATED WORK: Deep Saliency
Kümmerer, Matthias, Lucas Theis, and Matthias Bethge. "Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet." arXiv preprint arXiv:1411.1045 (2014).
7
RELATED WORK: Deep Saliency
Vig, Eleonora, Michael Dorr, and David Cox. "Large-scale optimization of hierarchical features for saliency prediction in natural images." Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. IEEE, 2014.
8
RELATED WORK: Fully convolutional
Long, Jonathan, Evan Shelhamer, and Trevor Darrell. "Fully convolutional networks for semantic segmentation." Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on. IEEE, 2015.
9
RELATED WORK: Image Classification
CaffeNet
ARCHITECTURE
[Krizhevsky’12]
DATA
[Deng’09]
FRAMEWORK
[Jia’14]
10
SALIENCY PREDICTION: JuntingNet
JuntingNet
11
SALIENCY PREDICTION: JuntingNet
JuntingNet
DATA
iSUN [Xu’15]
SALICON [Jiang’15]
12
SALIENCY PREDICTION: Data
Dataset              TRAIN    VALIDATION  TEST
SALICON [Jiang’15]   10,000   5,000       5,000
iSUN [Xu’15]         6,000    926         2,000
CAT2000 [Borji’15]   2,000    -           2,000
MIT300 [Judd’12]     300      -           -
Large scale
13
SALIENCY PREDICTION: JuntingNet
JuntingNet
ARCHITECTURE
[Pan’15]
DATA
iSUN [Xu’15]
SALICON [Jiang’15]
14
SALIENCY PREDICTION: Architecture
15
SALIENCY PREDICTION: Architecture
End to end + regression = JuntingNet
16
SALIENCY PREDICTION: Architecture
Resize 96x96
Upsample + filter
4608 = 48x48 2D map
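The labels on this slide suggest that the network output is reshaped into a 48x48 2D map, which is then upsampled and filtered. A minimal sketch of that post-processing in plain Python follows. It assumes a flat vector of 48 x 48 = 2304 values; the nearest-neighbour upsampling, the 3x3 mean filter, and every function name are illustrative assumptions, not the authors' implementation.

```python
def reshape_to_map(flat, side=48):
    """Reshape a flat list of side*side values into a side x side 2D map."""
    if len(flat) != side * side:
        raise ValueError("expected %d values, got %d" % (side * side, len(flat)))
    return [flat[r * side:(r + 1) * side] for r in range(side)]


def upsample_nearest(sal_map, factor):
    """Nearest-neighbour upsampling: repeat each pixel `factor` times per axis."""
    out = []
    for row in sal_map:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def box_filter(sal_map):
    """Smooth with a 3x3 mean filter; border pixels average their valid neighbours."""
    h, w = len(sal_map), len(sal_map[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [sal_map[j][i]
                    for j in range(max(0, y - 1), min(h, y + 2))
                    for i in range(max(0, x - 1), min(w, x + 2))]
            out[y][x] = sum(vals) / len(vals)
    return out


# Hypothetical network output: one salient pixel at row 24, column 24.
flat = [0.0] * (48 * 48)
flat[24 * 48 + 24] = 1.0

# 48x48 map -> 96x96 map (factor 2 is an assumption) -> smoothed map.
saliency = box_filter(upsample_nearest(reshape_to_map(flat), factor=2))
```

In practice the upsampling factor would be chosen to match the original image size, and a Gaussian kernel is a common alternative to the mean filter used here.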
17
SALIENCY PREDICTION: JuntingNet
JuntingNet
ARCHITECTURE
[Pan’15] (soon)
DATA
iSUN [Xu’15]
SALICON [Jiang’15]
FRAMEWORK
[Bergstra’10]
[Bastien’12]
18
SALIENCY PREDICTION: Framework
Tutorial by Daniel Nouri (*) on regression of facial keypoints for Kaggle, built on Lasagne.
(*) Daniel Nouri, “Using convolution networks to detect facial points” (Dec 2014).
19
SALIENCY PREDICTION: Training
Data augmentation with horizontal mirroring.
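Because the target is a saliency map rather than a class label, a mirrored image needs its ground-truth map mirrored as well. The sketch below illustrates this with nested lists; the function names and the 0.5 flip probability are assumptions, not details given on the slide.

```python
import random


def mirror_horizontal(grid):
    """Return a left-right mirrored copy of a 2D grid (list of rows)."""
    return [row[::-1] for row in grid]


def augment_pair(image, saliency, p=0.5, rng=random):
    """With probability p, mirror the image AND its saliency map together."""
    if rng.random() < p:
        return mirror_horizontal(image), mirror_horizontal(saliency)
    return image, saliency


# Toy 2x3 example: the salient pixel moves with the image content.
image = [[1, 2, 3],
         [4, 5, 6]]
saliency = [[0, 0, 1],
            [0, 1, 0]]
flipped_img, flipped_sal = augment_pair(image, saliency, p=1.0)
# flipped_img == [[3, 2, 1], [6, 5, 4]]
# flipped_sal == [[1, 0, 0], [0, 1, 0]]
```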
20
SALIENCY PREDICTION: Training
Loss function: Mean Square Error (MSE)
Weight initialization: Gaussian distribution
Learning rate: 0.03 to 0.0001
Mini-batch size: 128
Training time: 7h (SALICON) / 3h (iSUN)
Acceleration: Sigmoid + Nesterov momentum 0.9
Regularisation: Maxout norm
GPU: NVIDIA GTX 980
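To make three of these settings concrete, here is a small sketch of an MSE loss, a learning-rate schedule running from 0.03 to 0.0001, and a Nesterov momentum update with momentum 0.9. The slide gives only the endpoints of the learning rate, so the linear decay, the 200 epochs, the toy one-parameter problem, and all function names are assumptions for illustration.

```python
def mse(pred, target):
    """Mean squared error between two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def linear_schedule(start, stop, num_epochs):
    """One learning rate per epoch, decaying linearly from start to stop."""
    if num_epochs == 1:
        return [start]
    step = (stop - start) / (num_epochs - 1)
    return [start + step * e for e in range(num_epochs)]


def nesterov_step(w, v, grad_fn, lr, momentum=0.9):
    """One Nesterov update: evaluate the gradient at the look-ahead point."""
    lookahead = w + momentum * v
    v_new = momentum * v - lr * grad_fn(lookahead)
    return w + v_new, v_new


# Toy problem: fit a single weight w so that the loss (w - 3)^2 is minimal.
def grad(w):
    return 2.0 * (w - 3.0)


schedule = linear_schedule(0.03, 0.0001, num_epochs=200)  # epoch count assumed
w, v = 0.0, 0.0
for lr in schedule:
    w, v = nesterov_step(w, v, grad, lr)
```

After the loop, `w` should sit close to the minimiser at 3. In a real network the same update is applied to every weight, with the gradient of the MSE computed over each mini-batch.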
21
RESULTS: Qualitative (iSUN)
JuntingNet, Ground Truth, Pixels
22
RESULTS: Qualitative (iSUN)
JuntingNet, Ground Truth, Pixels
23
RESULTS: Quantitative (iSUN)
24
RESULTS: Qualitative (SALICON)
JuntingNet, Ground Truth, Pixels
25
RESULTS: Qualitative (SALICON)
JuntingNet, Ground Truth, Pixels
26
RESULTS: Quantitative (SALICON)
27
RESULTS: Publications by end of June
http://bit.ly/juntingnet
28
Thank you LSUN ! Thank you Boston !
Slides online: http://bit.ly/juntingnet @DocXavi
