Cv mini project (1)

Object Classification/
Recognition

BASIC IDEA
2
Classifier
model
CAR
INPUT IMAGE

Two ways
1. Using
tensorflow API
2. Basic Classifier
from scratch
3

PLAN
01 Dataset
02 Import
Libraries
03
Build the CNN
Model
04
Train and
validate
4

About Dataset
Dataset: Airplane and car
images
Source : Kaggle.com
Number of images: 1000
6

Libraries, Tools and Technologies used.
8
● Cv2
● Spyder
● Anaconda
● NumPy
● Sklearn
● Matplotlib
● Keras

● Split the data into two parts using sklearn
- Training data and validation data
● Stored the resized images in array X and the label for it in array y.
- Resize to uniform dimension 150 by 150 for height and width and 3
channels(i.e., for RGB)
- Array Y will store label 0 for CAR and 1 for Airplane.
- X is now an array of image pixel values
- Y is a list of corresponding labels.
9
Data preprocessing

CNN
▪ Convolutional neural network to train the model.
11
But, What is
convolutional
neural network?
● It’s deep, feed-forward artificial neural
network.(FFNN are also c/as Multilayer perceptrons).
● made up of neurons that have learnable weights and
biases.
● Each neuron receives some inputs, performs a dot
product.
● unlike a regular Neural Network, the layers of a
ConvNet have neurons arranged in 3 dimensions:
width, height, depth.

12
A visualization is worth a thousand words
Activation map or feature map

CNN
▪ Convolutional layer acts as a feature extractor.
▪ To extract features like edges, corners etc.,
13

Activation Function
ReLU - Rectified linear units(non-linear activation
function)
A(x) = max(0,x)
Sigmoid Function suffers from vanishing Gradient problem.
❏ Sparsity - efficient in computations,
❏ more intuitive
❏ independent
❏ No complicated math.
PS : Find notes and important links with click to add notes section of the ppt.15
BUT WHY ONLY ReLU?

Pooling layer
▪ Using all features in classification is computationally hard when the image
size is large.
▪ For example 224x224, 480x480 size images.
▪ Prone to overfitting.
▪ Pooling (also called subsampling or downsampling) reduces the
dimensionality of each feature map.
▪ But retains the most important information
▪ Max pooling
16

Flatten layer and
Dropout Layer
▪ Flatten Layer - Convert 2D array to single linear vector.
▪ Dropout layer -
▫ Dropout randomly drops some layers in a neural
networks and then learns with the reduced network.
▫ This way, the network learns to be independent and not
reliable on a single layer. Bottom-line is that it helps in
overfitting.
▫ 0.5 means to randomly drop half of the layers.
17

Model.summary()
Formula to get number
of parameters
=
(((kernel size *
stride)*channels)+1)
*filters)
= (((3*3*1)*3))+1)*32
=((9*3)+1)*32
=28*32
= 896

Visualization of layers
Use big image
19

Model Fitting using
Back Propagation
▪ Back propagate errors(error calculation with loss function)
▪ Loss optimization (optimizer-update weights)
▪ Calculate gradient.
▪ With Learning rate 0.001 update the weight.
20
Positive Gradient Decrease weight
Negative Gradient Increase weight

Optimizer and
entropy
Entropy - binary_crossentropy since there are two classes (Loss
function)
Optimizer - RMSprop with a learning rate 0.0001.
▪ Minimize the loss incurred using RMSprop optimizer.
21

23
Snapshot of epochs
Number of Epochs=10
Training set has 768
images with batch
size of 32.
Hence Number of
iterations to complete
one epoch = 768/32=
24

24
When Softmax was used for final fully connected layer

Using Tensorflow
API
▪ Install object detection API
▪ Download pre-trained object detection model(COCO)
▪ Load images to be classified.
▪ Apply object detection on the load images.
▪ Based on the pre-trained model COCO dataset it tries
to predict the object in the image
25

Future Scope
Transfer Learning
http://cs231n.stanford.edu/repor
ts/2017/pdfs/411.pdf
By - Yangyang Yu, Olivier Jin,
Daniel Hsu , Stanford University
TL (Transfer learning) is a
popular training technique
used in deep learning;
where models that have
been trained for a task are
reused as base/starting
point for another model.
26

References
▪ Presentation template by SlidesCarnival
▪ https://towardsdatascience.com/a-guide-to-an-efficient-way-to-build-neural-network-
architectures-part-ii-hyper-parameter-42efca01e5d7
▪ https://towardsdatascience.com/activation-functions-and-its-types-which-is-better-
a9a5310cc8f
▪ https://www.tensorflow.org/tutorials/keras/basic_classification
▪ https://www.google.com/search?q=convolved+gif&rlz=1C1RLNS_enIN787IN787&tbm=isch&s
ource=iu&ictx=1&fir=1VHtR2R2SFEVMM%253A%252CXjlYoIPax9NDkM%252C_&vet=1&us
g=AI4_-kQpnIDbwPbGc-
UEQVXxHxuXIZ_CRA&sa=X&ved=2ahUKEwilyfryn73hAhXkmuYKHT22A38Q9QEwB3oECA
YQEg#imgrc=1VHtR2R2SFEVMM:
27

THANK YOU.
28
KADAMBINI INDURKAR
(BT15CSE035)

Cv mini project (1)

More Related Content

What's hot (20)

Similar to Cv mini project (1) (20)

Recently uploaded (20)

Cv mini project (1)

Editor's Notes