SlideShare a Scribd company logo
2
Most read
9
Most read
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
MODELING AND PREDICTING CYBER
HACKING BREACHES
ABSTRACT:
Analyzing cyber incident data sets is an important method for
deepening our understanding of the evolution of the threat situation. This
is a relatively new research topic, and many studies remain to be done.
In this paper, we report a statistical analysis of a breach incident data set
corresponding to 12 years (2005–2017) of cyber hacking activities that
include malware attacks. We show that, in contrast to the findings
reported in the literature, both hacking breach incident inter-arrival times
and breach sizes should be modeled by stochastic processes, rather than
by distributions because they exhibit autocorrelations. Then, we propose
particular stochastic process models to, respectively, fit the inter-arrival
times and the breach sizes. We also show that these models can predict
the inter-arrival times and the breach sizes. In order to get deeper
insights into the evolution of hacking breach incidents, we conduct both
qualitative and quantitative trend analyses on the data set. We draw a set
of cybersecurity insights, including that the threat of cyber hacks is
indeed getting worse in terms of their frequency, but not in terms of the
magnitude of their damage.
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
ARCHITECTURE:
EXISTING SYSTEM:
The present study is motivated by several questions that have not been
investigated until now, such as: Are data breaches caused by cyber-
attacks increasing, decreasing, or stabilizing? A principled answer to this
question will give us a clear insight into the overall situation of cyber
threats. This question was not answered by previous studies.
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
Specifically, the dataset analyzed in [7] only covered the time span from
2000 to 2008 and does not necessarily contain the breach incidents that
are caused by cyber-attacks; the dataset analyzed in [9] is more recent,
but contains two kinds of incidents: negligent breaches (i.e., incidents
caused by lost, discarded, stolen devices and other reasons) and
malicious breaching. Since negligent breaches represent more human
errors than cyber-attacks, we do not consider them in the present study.
Because the malicious breaches studied in [9] contain four sub-
categories: hacking (including malware), insider, payment card fraud,
and unknown, this study will focus on the hacking sub-category (called
hacking breach dataset thereafter), while noting that the other three sub-
categories are interesting on their own and should be analyzed
separately.Recently, researchers started modeling data breach incidents.
Maillart and Sornettestudied the statistical properties of the personal
identity losses in the United States between year 2000 and 2008. They
found that the number of breach incidents dramatically increases from
2000 to July 2006 but remains stable thereafter. Edwards et al. analyzed
a dataset containing 2,253 breach incidents that span over a decade
(2005 to 2015). They found that neither the size nor the frequency of
data breaches has increased over the years. Wheatley et al., analyzed a
dataset that is combined from corresponds to organizational breach
incidents between year 2000 and 2015. They found that the frequency of
large breach incidents (i.e., the ones that breach more than 50,000
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
records) occurring to US firms is independent of time, but the frequency
of large breach incidents occurring to non-US firms exhibits an
increasing trend.
PROPOSEDSYSTEM:
In this paper, we make the following three contributions. First, we
show that both the hacking breach incident interarrival times (reflecting
incident frequency) and breach sizes should be modeled by stochastic
processes, rather than by distributions. We find that a particular point
process can adequately describe the evolution of the hacking breach
incidents inter-arrival times and that a particular ARMA-GARCH model
can adequately describe the evolution of the hacking breach sizes, where
ARMA is acronym for “AutoRegressive and Moving Average” and
GARCH is acronym for “Generalized AutoRegressive Conditional
Heteroskedasticity.”We show that these stochastic process models can
predict the inter-arrival times and the breach sizes. To the best of our
knowledge, this is the first paper showing that stochastic processes,
rather than distributions, should be used to model these cyber threat
factors. Second, we discover a positive dependence between the
incidents inter-arrival times and the breach sizes, and show that this
dependence can be adequately described by a particular copula. We also
show that when predicting inter-arrival times and breach sizes, it is
necessary to consider the dependence; otherwise, the prediction results
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
are not accurate. To the best of our knowledge, this is the first work
showing the existence of this dependence and the consequence of
ignoring it. Third, we conduct both qualitative and quantitative trend
analyses of the cyber hacking breach incidents. We find that the
situation is indeed getting worse in terms of the incidents inter-arrival
time because hacking breach incidents become more and more frequent,
but the situation is stabilizing in terms of the incident breach size,
indicating that the damage of individual hacking breach incidents will
not get much worse. We hope the present study will inspire more
investigations, which can offer deep insights into alternate risk
mitigation approaches. Such insights are useful to insurance companies,
government agencies, and regulators because they need to deeply
understand the nature of data breach risks.
ALGORITHM:
SUPPORT VECTOR MACHINE
“Support Vector Machine” (SVM) is a supervised machine learning
algorithm which can be used for both classification and regression
challenges. However, it is mostly used in classification problems. In this
algorithm, we plot each data item as a point in n-dimensional space
(where n is number of features you have) with the value of each feature
being the value of a particular coordinate. Then, we perform
classification by finding the hyper-plane that differentiate the two
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
classes very well (look at the below snapshot).Support Vectors are
simply the co-ordinates of individual observation. Support Vector
Machine is a frontier which best segregates the two classes (hyper-plane/
line).More formally, a support vector machine constructs a hyper plane
or set of hyper planes in a high- or infinite-dimensional space, which can
be used for classification, regression, or other tasks like outliers
detection. Intuitively, a good separation is achieved by the hyper plane
that has the largest distance to the nearest training-data point of any class
(so-called functional margin), since in general the larger the margin the
lower the generalization error of the classifier.Whereas the original
problem may be stated in a finite dimensional space, it often happens
that the sets to discriminate are not linearly separable in that space. For
this reason, it was proposed that the original finite-dimensional space be
mapped into a much higher-dimensional space, presumably making the
separation easier in that space.
MODULES:
1. UPLOAD DATA
The data resource to database can be uploaded by both
administrator and authorized user. The data can be uploaded with
key in order to maintain the secrecy of the data that is not released
without knowledge of user. The users are authorized based on their
details that are shared to admin and admin can authorize each user.
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
Only Authorized users are allowed to access the system and upload
or request for files.
2. ACCESS DETAILS
The access of data from the database can be given by
administrators. Uploaded data are managed by admin and admin is
the only person to provide the rights to process the accessing details
and approve or unapproved users based on their details.
3. USER PERMISSIONS
The data from any resources are allowed to access the data with
only permission from administrator. Prior to access data, users are
allowed by admin to share their data and verify the details which are
provided by user. If user is access the data with wrong attempts then,
users are blocked accordingly. If user is requested to unblock them,
based on the requests and previous activities admin is unblock users.
4. DATA ANALYSIS
Data analyses are done with the help of graph. The collected
data are applied to graph in order to get the best analysis and
prediction of dataset and given data policies. The dataset can be
analyzed through this pictorial representation in order to better
understand of the data details.
FUTUREWORK:
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
There are many open problems that are left for future research. For
example, it is both interesting and challenging to investigate how to
predict the extremely large values and how to deal with missing data
(i.e., breach incidents that are not reported). It is also worthwhile to
estimate the exact occurring times of breach incidents. Finally, more
research needs to be conducted towards understanding the predictability
of breach incidents (i.e., the upper bound of prediction accuracy).
REQUIREMENT ANALYSIS
The project involved analyzing the design of few applications so as
to make the application more users friendly. To do so, it was really
important to keep the navigations from one screen to the other well
ordered and at the same time reducing the amount of typing the user
needs to do. In order to make the application more accessible, the
browser version had to be chosen so that it is compatible with most of
the Browsers.
REQUIREMENT SPECIFICATION
Functional Requirements
 Graphical User interface with the User.
Software Requirements
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
For developing the application the following are the Software
Requirements:
1. Python
2. Django
3. Mysql
4. Wampserver
Operating Systems supported
1. Windows 7
2. Windows XP
3. Windows 8
Technologies and Languages used to Develop
1. Python
Debugger and Emulator
 Any Browser (Particularly Chrome)
Hardware Requirements
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
For developing the application the following are the Hardware
Requirements:
 Processor: Pentium IV or higher
 RAM: 256 MB
 Space on Hard Disk: minimum 512MB
CONCLUSION:
We analyzed a hacking breach dataset from the points of view of
the incidents inter-arrival time and the breach size, and showed that they
both should be modeled by stochastic processes rather than distributions.
The statistical models developed in this paper show satisfactory fitting
and prediction accuracies. In particular, we propose using a copula-
based approach to predict the joint probability that an incident with a
certain magnitude of breach size will occur during a future period of
time. Statistical tests show that the methodologies proposed in this paper
are better than those which are presented in the literature, because the
latter ignored both the temporal correlations and the dependence
between the incidents inter-arrival times and the breach sizes. We
conducted qualitative and quantitative analyses to draw further insights.
We drew a set of cybersecurity insights, including that the threat of
cyber hacking breach incidents is indeed getting worse in terms of their
frequency, but not the magnitude of their damage. The methodology
Venkat Java Projects
Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com
Email:venkatjavaprojects@gmail.com
presented in this paper can be adopted or adapted to analyze datasets of a
similar nature.

More Related Content

What's hot (20)

PPTX
Technical seminar on virtual smart phone
Akshitha Chutke
 
PPTX
Detection of phishing websites
m srikanth
 
PPTX
Face Recognition based Lecture Attendance System
Karmesh Maheshwari
 
DOC
My Project Report Documentation with Abstract & Snapshots
Usman Sait
 
PPTX
project ppt.pptx
GYamini22
 
PPTX
Detection of cyber-bullying
Ziar Khan
 
PPT
Ppt presentation
vishal4799
 
PPT
Safety app for woman
SMNajrulHowlader
 
PPT
Web Security
Bharath Manoharan
 
PPTX
Face detection and recognition using surveillance camera2 edited
Santu Chall
 
PPTX
Attendance Management System using Face Recognition
NanditaDutta4
 
PPTX
Computer Vision - Real Time Face Recognition using Open CV and Python
Akash Satamkar
 
PPT
Virtual Mouse
Vivek Khutale
 
PPTX
Detection of Fake reviews
27DuddeSai
 
PPT
Cloud analytics
gaurav jain
 
PPTX
Graphical Password Authentication
Abhijit Akotkar
 
PPTX
Email spam detection
PratisthaSingh5
 
PPTX
Face detection presentation slide
Sanjoy Dutta
 
PDF
Malware detection-using-machine-learning
Security Bootcamp
 
DOCX
Computer science seminar topics
123seminarsonly
 
Technical seminar on virtual smart phone
Akshitha Chutke
 
Detection of phishing websites
m srikanth
 
Face Recognition based Lecture Attendance System
Karmesh Maheshwari
 
My Project Report Documentation with Abstract & Snapshots
Usman Sait
 
project ppt.pptx
GYamini22
 
Detection of cyber-bullying
Ziar Khan
 
Ppt presentation
vishal4799
 
Safety app for woman
SMNajrulHowlader
 
Web Security
Bharath Manoharan
 
Face detection and recognition using surveillance camera2 edited
Santu Chall
 
Attendance Management System using Face Recognition
NanditaDutta4
 
Computer Vision - Real Time Face Recognition using Open CV and Python
Akash Satamkar
 
Virtual Mouse
Vivek Khutale
 
Detection of Fake reviews
27DuddeSai
 
Cloud analytics
gaurav jain
 
Graphical Password Authentication
Abhijit Akotkar
 
Email spam detection
PratisthaSingh5
 
Face detection presentation slide
Sanjoy Dutta
 
Malware detection-using-machine-learning
Security Bootcamp
 
Computer science seminar topics
123seminarsonly
 

Similar to modeling and predicting cyber hacking breaches (20)

DOCX
user centric machine learning framework for cyber security operations center
Venkat Projects
 
PDF
COMPARATIVE ANALYSIS OF ANOMALY BASED WEB ATTACK DETECTION METHODS
IJCI JOURNAL
 
PDF
Progress of Machine Learning in the Field of Intrusion Detection Systems
ijcisjournal
 
PDF
PROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYSTEMS
ijcisjournal
 
PDF
PROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYSTEMS
ijcisjournal
 
PDF
11421ijcPROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYST...
ijcisjournal
 
PDF
Progress of Machine Learning in the Field of Intrusion Detection Systems
ijcisjournal
 
PDF
Information Security
Madushan Sandaruwan
 
PPTX
BITCOIN HEIST RANSOMEWARE ATTACK PREDICTION
ssuser2b426d1
 
PDF
NDSS_submission_240
Sakshi Jain
 
PDF
Securing And Protecting Information
Laura Martin
 
DOCX
Include at least 250 words in your posting and at least 250 words in
maribethy2y
 
PDF
FLOODING ATTACKS DETECTION OF MOBILE AGENTS IN IP NETWORKS
csandit
 
PDF
Malware Risk Analysis on the Campus Network with Bayesian Belief Network
IJNSA Journal
 
PDF
Malware Risk Analysis on the Campus Network with Bayesian Belief Network
IJNSA Journal
 
DOC
Only Abstract
guesta67d4a
 
PDF
A predictive framework for cyber security analytics using attack graphs
IJCNCJournal
 
PDF
Vulnerability Management System
IRJET Journal
 
PPTX
A data analyistic approach to cybercrime underground economy.pptx
streamwaytechnologie
 
PDF
Detection of Structured Query Language Injection Attacks Using Machine Learni...
AIRCC Publishing Corporation
 
user centric machine learning framework for cyber security operations center
Venkat Projects
 
COMPARATIVE ANALYSIS OF ANOMALY BASED WEB ATTACK DETECTION METHODS
IJCI JOURNAL
 
Progress of Machine Learning in the Field of Intrusion Detection Systems
ijcisjournal
 
PROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYSTEMS
ijcisjournal
 
PROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYSTEMS
ijcisjournal
 
11421ijcPROGRESS OF MACHINE LEARNING IN THE FIELD OF INTRUSION DETECTION SYST...
ijcisjournal
 
Progress of Machine Learning in the Field of Intrusion Detection Systems
ijcisjournal
 
Information Security
Madushan Sandaruwan
 
BITCOIN HEIST RANSOMEWARE ATTACK PREDICTION
ssuser2b426d1
 
NDSS_submission_240
Sakshi Jain
 
Securing And Protecting Information
Laura Martin
 
Include at least 250 words in your posting and at least 250 words in
maribethy2y
 
FLOODING ATTACKS DETECTION OF MOBILE AGENTS IN IP NETWORKS
csandit
 
Malware Risk Analysis on the Campus Network with Bayesian Belief Network
IJNSA Journal
 
Malware Risk Analysis on the Campus Network with Bayesian Belief Network
IJNSA Journal
 
Only Abstract
guesta67d4a
 
A predictive framework for cyber security analytics using attack graphs
IJCNCJournal
 
Vulnerability Management System
IRJET Journal
 
A data analyistic approach to cybercrime underground economy.pptx
streamwaytechnologie
 
Detection of Structured Query Language Injection Attacks Using Machine Learni...
AIRCC Publishing Corporation
 

More from Venkat Projects (20)

DOCX
1.AUTOMATIC DETECTION OF DIABETIC RETINOPATHY USING CNN.docx
Venkat Projects
 
DOCX
12.BLOCKCHAIN BASED MILK DELIVERY PLATFORM FOR STALLHOLDER DAIRY FARMERS IN K...
Venkat Projects
 
DOCX
10.ATTENDANCE CAPTURE SYSTEM USING FACE RECOGNITION.docx
Venkat Projects
 
DOCX
9.IMPLEMENTATION OF BLOCKCHAIN IN FINANCIAL SECTOR TO IMPROVE SCALABILITY.docx
Venkat Projects
 
DOCX
8.Geo Tracking Of Waste And Triggering Alerts And Mapping Areas With High Was...
Venkat Projects
 
DOCX
Image Forgery Detection Based on Fusion of Lightweight Deep Learning Models.docx
Venkat Projects
 
DOCX
6.A FOREST FIRE IDENTIFICATION METHOD FOR UNMANNED AERIAL VEHICLE MONITORING ...
Venkat Projects
 
DOCX
WATERMARKING IMAGES
Venkat Projects
 
DOCX
4.LOCAL DYNAMIC NEIGHBORHOOD BASED OUTLIER DETECTION APPROACH AND ITS FRAMEWO...
Venkat Projects
 
DOCX
Application and evaluation of a K-Medoidsbased shape clustering method for an...
Venkat Projects
 
DOCX
OPTIMISED STACKED ENSEMBLE TECHNIQUES IN THE PREDICTION OF CERVICAL CANCER US...
Venkat Projects
 
DOCX
1.AUTOMATIC DETECTION OF DIABETIC RETINOPATHY USING CNN.docx
Venkat Projects
 
DOCX
2022 PYTHON MAJOR PROJECTS LIST.docx
Venkat Projects
 
DOCX
2022 PYTHON PROJECTS LIST.docx
Venkat Projects
 
DOCX
2021 PYTHON PROJECTS LIST.docx
Venkat Projects
 
DOCX
2021 python projects list
Venkat Projects
 
DOCX
10.sentiment analysis of customer product reviews using machine learni
Venkat Projects
 
DOCX
9.data analysis for understanding the impact of covid–19 vaccinations on the ...
Venkat Projects
 
DOCX
6.iris recognition using machine learning technique
Venkat Projects
 
DOCX
5.local community detection algorithm based on minimal cluster
Venkat Projects
 
1.AUTOMATIC DETECTION OF DIABETIC RETINOPATHY USING CNN.docx
Venkat Projects
 
12.BLOCKCHAIN BASED MILK DELIVERY PLATFORM FOR STALLHOLDER DAIRY FARMERS IN K...
Venkat Projects
 
10.ATTENDANCE CAPTURE SYSTEM USING FACE RECOGNITION.docx
Venkat Projects
 
9.IMPLEMENTATION OF BLOCKCHAIN IN FINANCIAL SECTOR TO IMPROVE SCALABILITY.docx
Venkat Projects
 
8.Geo Tracking Of Waste And Triggering Alerts And Mapping Areas With High Was...
Venkat Projects
 
Image Forgery Detection Based on Fusion of Lightweight Deep Learning Models.docx
Venkat Projects
 
6.A FOREST FIRE IDENTIFICATION METHOD FOR UNMANNED AERIAL VEHICLE MONITORING ...
Venkat Projects
 
WATERMARKING IMAGES
Venkat Projects
 
4.LOCAL DYNAMIC NEIGHBORHOOD BASED OUTLIER DETECTION APPROACH AND ITS FRAMEWO...
Venkat Projects
 
Application and evaluation of a K-Medoidsbased shape clustering method for an...
Venkat Projects
 
OPTIMISED STACKED ENSEMBLE TECHNIQUES IN THE PREDICTION OF CERVICAL CANCER US...
Venkat Projects
 
1.AUTOMATIC DETECTION OF DIABETIC RETINOPATHY USING CNN.docx
Venkat Projects
 
2022 PYTHON MAJOR PROJECTS LIST.docx
Venkat Projects
 
2022 PYTHON PROJECTS LIST.docx
Venkat Projects
 
2021 PYTHON PROJECTS LIST.docx
Venkat Projects
 
2021 python projects list
Venkat Projects
 
10.sentiment analysis of customer product reviews using machine learni
Venkat Projects
 
9.data analysis for understanding the impact of covid–19 vaccinations on the ...
Venkat Projects
 
6.iris recognition using machine learning technique
Venkat Projects
 
5.local community detection algorithm based on minimal cluster
Venkat Projects
 

Recently uploaded (20)

DOCX
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
PPTX
Day2 B2 Best.pptx
helenjenefa1
 
DOCX
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
PPTX
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
PPTX
Green Building & Energy Conservation ppt
Sagar Sarangi
 
PPTX
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PDF
ARC--BUILDING-UTILITIES-2-PART-2 (1).pdf
IzzyBaniquedBusto
 
PPTX
ISO/IEC JTC 1/WG 9 (MAR) Convenor Report
Kurata Takeshi
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PPTX
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
PPTX
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
PDF
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
PDF
monopile foundation seminar topic for civil engineering students
Ahina5
 
PPTX
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PPTX
MPMC_Module-2 xxxxxxxxxxxxxxxxxxxxx.pptx
ShivanshVaidya5
 
PPTX
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 
8th International Conference on Electrical Engineering (ELEN 2025)
elelijjournal653
 
Day2 B2 Best.pptx
helenjenefa1
 
CS-802 (A) BDH Lab manual IPS Academy Indore
thegodhimself05
 
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
Green Building & Energy Conservation ppt
Sagar Sarangi
 
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
ARC--BUILDING-UTILITIES-2-PART-2 (1).pdf
IzzyBaniquedBusto
 
ISO/IEC JTC 1/WG 9 (MAR) Convenor Report
Kurata Takeshi
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
Book.pdf01_Intro.ppt algorithm for preperation stu used
archu26
 
monopile foundation seminar topic for civil engineering students
Ahina5
 
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
MRRS Strength and Durability of Concrete
CivilMythili
 
Hashing Introduction , hash functions and techniques
sailajam21
 
MPMC_Module-2 xxxxxxxxxxxxxxxxxxxxx.pptx
ShivanshVaidya5
 
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 

modeling and predicting cyber hacking breaches

  • 1. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] MODELING AND PREDICTING CYBER HACKING BREACHES ABSTRACT: Analyzing cyber incident data sets is an important method for deepening our understanding of the evolution of the threat situation. This is a relatively new research topic, and many studies remain to be done. In this paper, we report a statistical analysis of a breach incident data set corresponding to 12 years (2005–2017) of cyber hacking activities that include malware attacks. We show that, in contrast to the findings reported in the literature, both hacking breach incident inter-arrival times and breach sizes should be modeled by stochastic processes, rather than by distributions because they exhibit autocorrelations. Then, we propose particular stochastic process models to, respectively, fit the inter-arrival times and the breach sizes. We also show that these models can predict the inter-arrival times and the breach sizes. In order to get deeper insights into the evolution of hacking breach incidents, we conduct both qualitative and quantitative trend analyses on the data set. We draw a set of cybersecurity insights, including that the threat of cyber hacks is indeed getting worse in terms of their frequency, but not in terms of the magnitude of their damage.
  • 2. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] ARCHITECTURE: EXISTING SYSTEM: The present study is motivated by several questions that have not been investigated until now, such as: Are data breaches caused by cyber- attacks increasing, decreasing, or stabilizing? A principled answer to this question will give us a clear insight into the overall situation of cyber threats. This question was not answered by previous studies.
  • 3. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] Specifically, the dataset analyzed in [7] only covered the time span from 2000 to 2008 and does not necessarily contain the breach incidents that are caused by cyber-attacks; the dataset analyzed in [9] is more recent, but contains two kinds of incidents: negligent breaches (i.e., incidents caused by lost, discarded, stolen devices and other reasons) and malicious breaching. Since negligent breaches represent more human errors than cyber-attacks, we do not consider them in the present study. Because the malicious breaches studied in [9] contain four sub- categories: hacking (including malware), insider, payment card fraud, and unknown, this study will focus on the hacking sub-category (called hacking breach dataset thereafter), while noting that the other three sub- categories are interesting on their own and should be analyzed separately.Recently, researchers started modeling data breach incidents. Maillart and Sornettestudied the statistical properties of the personal identity losses in the United States between year 2000 and 2008. They found that the number of breach incidents dramatically increases from 2000 to July 2006 but remains stable thereafter. Edwards et al. analyzed a dataset containing 2,253 breach incidents that span over a decade (2005 to 2015). They found that neither the size nor the frequency of data breaches has increased over the years. Wheatley et al., analyzed a dataset that is combined from corresponds to organizational breach incidents between year 2000 and 2015. They found that the frequency of large breach incidents (i.e., the ones that breach more than 50,000
  • 4. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] records) occurring to US firms is independent of time, but the frequency of large breach incidents occurring to non-US firms exhibits an increasing trend. PROPOSEDSYSTEM: In this paper, we make the following three contributions. First, we show that both the hacking breach incident interarrival times (reflecting incident frequency) and breach sizes should be modeled by stochastic processes, rather than by distributions. We find that a particular point process can adequately describe the evolution of the hacking breach incidents inter-arrival times and that a particular ARMA-GARCH model can adequately describe the evolution of the hacking breach sizes, where ARMA is acronym for “AutoRegressive and Moving Average” and GARCH is acronym for “Generalized AutoRegressive Conditional Heteroskedasticity.”We show that these stochastic process models can predict the inter-arrival times and the breach sizes. To the best of our knowledge, this is the first paper showing that stochastic processes, rather than distributions, should be used to model these cyber threat factors. Second, we discover a positive dependence between the incidents inter-arrival times and the breach sizes, and show that this dependence can be adequately described by a particular copula. We also show that when predicting inter-arrival times and breach sizes, it is necessary to consider the dependence; otherwise, the prediction results
  • 5. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] are not accurate. To the best of our knowledge, this is the first work showing the existence of this dependence and the consequence of ignoring it. Third, we conduct both qualitative and quantitative trend analyses of the cyber hacking breach incidents. We find that the situation is indeed getting worse in terms of the incidents inter-arrival time because hacking breach incidents become more and more frequent, but the situation is stabilizing in terms of the incident breach size, indicating that the damage of individual hacking breach incidents will not get much worse. We hope the present study will inspire more investigations, which can offer deep insights into alternate risk mitigation approaches. Such insights are useful to insurance companies, government agencies, and regulators because they need to deeply understand the nature of data breach risks. ALGORITHM: SUPPORT VECTOR MACHINE “Support Vector Machine” (SVM) is a supervised machine learning algorithm which can be used for both classification and regression challenges. However, it is mostly used in classification problems. In this algorithm, we plot each data item as a point in n-dimensional space (where n is number of features you have) with the value of each feature being the value of a particular coordinate. Then, we perform classification by finding the hyper-plane that differentiate the two
  • 6. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] classes very well (look at the below snapshot).Support Vectors are simply the co-ordinates of individual observation. Support Vector Machine is a frontier which best segregates the two classes (hyper-plane/ line).More formally, a support vector machine constructs a hyper plane or set of hyper planes in a high- or infinite-dimensional space, which can be used for classification, regression, or other tasks like outliers detection. Intuitively, a good separation is achieved by the hyper plane that has the largest distance to the nearest training-data point of any class (so-called functional margin), since in general the larger the margin the lower the generalization error of the classifier.Whereas the original problem may be stated in a finite dimensional space, it often happens that the sets to discriminate are not linearly separable in that space. For this reason, it was proposed that the original finite-dimensional space be mapped into a much higher-dimensional space, presumably making the separation easier in that space. MODULES: 1. UPLOAD DATA The data resource to database can be uploaded by both administrator and authorized user. The data can be uploaded with key in order to maintain the secrecy of the data that is not released without knowledge of user. The users are authorized based on their details that are shared to admin and admin can authorize each user.
  • 7. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] Only Authorized users are allowed to access the system and upload or request for files. 2. ACCESS DETAILS The access of data from the database can be given by administrators. Uploaded data are managed by admin and admin is the only person to provide the rights to process the accessing details and approve or unapproved users based on their details. 3. USER PERMISSIONS The data from any resources are allowed to access the data with only permission from administrator. Prior to access data, users are allowed by admin to share their data and verify the details which are provided by user. If user is access the data with wrong attempts then, users are blocked accordingly. If user is requested to unblock them, based on the requests and previous activities admin is unblock users. 4. DATA ANALYSIS Data analyses are done with the help of graph. The collected data are applied to graph in order to get the best analysis and prediction of dataset and given data policies. The dataset can be analyzed through this pictorial representation in order to better understand of the data details. FUTUREWORK:
  • 8. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] There are many open problems that are left for future research. For example, it is both interesting and challenging to investigate how to predict the extremely large values and how to deal with missing data (i.e., breach incidents that are not reported). It is also worthwhile to estimate the exact occurring times of breach incidents. Finally, more research needs to be conducted towards understanding the predictability of breach incidents (i.e., the upper bound of prediction accuracy). REQUIREMENT ANALYSIS The project involved analyzing the design of few applications so as to make the application more users friendly. To do so, it was really important to keep the navigations from one screen to the other well ordered and at the same time reducing the amount of typing the user needs to do. In order to make the application more accessible, the browser version had to be chosen so that it is compatible with most of the Browsers. REQUIREMENT SPECIFICATION Functional Requirements  Graphical User interface with the User. Software Requirements
  • 9. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] For developing the application the following are the Software Requirements: 1. Python 2. Django 3. Mysql 4. Wampserver Operating Systems supported 1. Windows 7 2. Windows XP 3. Windows 8 Technologies and Languages used to Develop 1. Python Debugger and Emulator  Any Browser (Particularly Chrome) Hardware Requirements
  • 10. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] For developing the application the following are the Hardware Requirements:  Processor: Pentium IV or higher  RAM: 256 MB  Space on Hard Disk: minimum 512MB CONCLUSION: We analyzed a hacking breach dataset from the points of view of the incidents inter-arrival time and the breach size, and showed that they both should be modeled by stochastic processes rather than distributions. The statistical models developed in this paper show satisfactory fitting and prediction accuracies. In particular, we propose using a copula- based approach to predict the joint probability that an incident with a certain magnitude of breach size will occur during a future period of time. Statistical tests show that the methodologies proposed in this paper are better than those which are presented in the literature, because the latter ignored both the temporal correlations and the dependence between the incidents inter-arrival times and the breach sizes. We conducted qualitative and quantitative analyses to draw further insights. We drew a set of cybersecurity insights, including that the threat of cyber hacking breach incidents is indeed getting worse in terms of their frequency, but not the magnitude of their damage. The methodology
  • 11. Venkat Java Projects Mobile:+91 9966499110 Visit:www.venkatjavaprojects.com Email:[email protected] presented in this paper can be adopted or adapted to analyze datasets of a similar nature.