SlideShare a Scribd company logo
Microsoft Naive Bayes Algorithm
overviewNaive Bayes AlgorithmDMX QueriesExploring a Naive Bayes ModelNaive Bayes PrinciplesNaive Bayes Parameters
Naive Bayes AlgorithmThe Microsoft Naive Bayes algorithm is a classification algorithm provided by Microsoft SQL Server Analysis Services for use in predictive modeling. The name Naive Bayes derives from the fact that the algorithm uses Bayes theorem but does not take into account dependencies that may exist, and therefore its assumptions are said to be naive.
How to use the Naive Bayes algorithm in SQL server?This algorithm is less computationally intense than other Microsoft algorithmsIt is therefore is useful for quickly generating mining models to discover relationships between input columns and predictable columns. The algorithm considers each pair of input attribute values and output attribute values.Exploring a Naive Bayes model will tell you how your attributes are related to each other.
DMX When you create a query against a data mining modelyou can create either a content query, which provides details about the patterns discovered in analysis, or you can create a prediction query, which uses the patterns in the model to make predictions for new data You can also retrieve metadata about the model by using a query against the data mining schema rowset.
DMX QueriesSELECT MODEL_CATALOG, MODEL_NAME, DATE_CREATED, LAST_PROCESSED, SERVICE_NAME, PREDICTION_ENTITY, FILTER FROM $system.DMSCHEMA_MINING_MODELS WHERE MODEL_NAME = 'TM_NaiveBayes_Filtered‘Getting Model Metadata by Using DMXyou can find metadata for the model, by querying the data mining schema rowset.This might include when the model was created, when the model was last processed, the name of the mining structure that the model is based on, and the name of the columns used as the predictable attribute.
DMX QueriesRetrieving a Summary of Training DataQuery to retrieve the data from the node specified.Because the statistics are stored in a nested table, the FLATTENED keyword is used to make the results easier to view.     SELECT FLATTENED MODEL_NAME, (SELECT ATTRIBUTE_NAME, ATTRIBUTE_VALUE, [SUPPORT], [PROBABILITY], VALUETYPE FROM NODE_DISTRIBUTION) AS t FROM TM_NaiveBayes.CONTENT WHERE NODE_TYPE = 26
DMX QueriesFinding More Information about AttributesExample to show how to return information from the model about a particular attribute( here ”Region”) The Result of this query is shown in the next slide.SELECT NODE_TYPE, NODE_CAPTION, MSOLAP_NODE_SCORE FROM TM_NaiveBayes.CONTENT WHERE ATTRIBUTE_NAME = 'Region'
DMX QueriesSample Resultto showinginformation from the model about a particular  attribute  ”Region”
DMX QueriesSELECT NODE_CAPTION, MSOLAP_NODE_SCORE FROM TM_NaiveBayes.CONTENT WHERE NODE_TYPE = 10 ORDER BY MSOLAP_NODE_SCORE DESCQuery returns theimportance scores ofall attributes in theModel. The Result of this query is shown in the next slide.
DMX Queriesquery returns theimportance scores ofall attributes in theModel.
Exploring a Naive Bayes ModelThe convenient way to start analyzing a new data set is to create a Naive Bayes model and mark all the non-key columns as both input and predictive.The content of each model is presented as a series of nodes. A node is an object within a mining model that contains metadata and information about a portion of the model. Nodes are arranged in a hierarchy. 
Naive Bayes Model Content
Exploring a Naive Bayes ModelThe Naive Bayes viewer is accessed through either the BI Development Studio or SQL Management Studio by right-clicking on the model and selecting Browse.SQL Server Data Mining provides four different views on Naive Bayes models :Dependency Network: Provides a quick display of how all of the attributes in your model are related. Each node in the graph represents an attribute, whereas each edge represents a relationship. outgoing edge (it is predictive of the attribute in the node at the end of the edge)Incoming edge( it is predicted by the other node)
Exploring a Naive Bayes ModelAttribute Profiles:provides you with an exhaustive report of how each input attribute corresponds to each output attribute, one attribute at a time. At the top of the Attribute Profiles view, you select which output you want to look at, and the rest of the view shows how all of the input attributes are correlated to the states of the selected output attribute.
Exploring a Naive Bayes ModelAttribute Characteristics:This tab allows you to select an output attribute and value and shows you a description of the cases where that attribute and value occur.Attribute Discrimination: Provides the answers to the most interesting question: What is the difference between X and Y? With this viewer, you choose the attribute you are interested in, and select the states you want to compare.
Naive Bayes PrinciplesBayes mathematical methods use a combination of conditional and unconditional probabilities.The Naive part of Naive Bayes tells you to treat all of your input attributes as independent of each other with respect to the target variable. This may be a faulty assumption, but it allows you to multiply your probabilities to determine the likelihood of each state.
Naive Bayes PrinciplesThe Bayes rule states that if you have a hypothesis Hand evidence about that hypothesis E, then the probability of H is calculated using the following formula:P(H | E) =   P(E | H) × P(H)                                   P(E)This simply states that the probability of your hypothesis given the evidence is equal to the probability of the evidence given the hypothesis multiplied by the probability of the hypothesis, and then normalized.
Naive Bayes ParametersMAXIMUM _INPUT _ATTRIBUTES determines the number of attributes that will be considered as inputs for training. If there is more than this number of inputs, the algorithm will select the most important inputs and ignore the rest. Setting this parameter to 0 causes the algorithm to consider all attributes.The default value is 255.MAXIMUM _OUTPUT _ATTRIBUTES determines the number of attributes that will be considered as outputs for training. If there is more than this number of outputs, the algorithm will select the most important outputs and ignore the rest. Setting this parameter to 0 causes the algorithm to consider all attributes.The default value is 255.
Naive Bayes ParametersMAXIMUM _STATES controls how many states of an attribute are considered. If an attribute has more than this number of states, only the most popular states will be used. States that are not selected will be considered to be missing data. This parameter is useful when an attribute has a high cardinality
SummaryNaive Bayes AlgorithmDMX QueriesNaive Bayes Model ContentExploring a Naive Bayes ModelNaive Bayes PrinciplesNaive Bayes Parameters
Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net

More Related Content

What's hot (17)

PPT
data mining with weka application
Rezapourabbas
 
PDF
Machine Learning with WEKA
butest
 
PPT
An Introduction To Weka
DataminingTools Inc
 
PPT
WEKA Tutorial
butest
 
PDF
Data Science - Part V - Decision Trees & Random Forests
Derek Kane
 
PPTX
Machine learning module 2
Gokulks007
 
PPTX
MS SQL SERVER:Microsoft neural network and logistic regression
DataminingTools Inc
 
PPTX
A simple introduction to weka
Pamoda Vajiramali
 
PPTX
Introduction to Machine learning ppt
shubhamshirke12
 
DOCX
Feature extraction for classifying students based on theirac ademic performance
Venkat Projects
 
PPTX
Terminology Machine Learning
DataminingTools Inc
 
PDF
Oracle ML Cheat Sheet
IBM India Pvt Ltd
 
PDF
Weka
Antony Alex
 
PDF
Weka
Shuang Wu
 
PDF
Weka_Manual_Sagar
Sagar Kumar
 
PPT
Weka toolkit introduction
butest
 
PPTX
Machine Learning Unit 4 Semester 3 MSc IT Part 2 Mumbai University
Madhav Mishra
 
data mining with weka application
Rezapourabbas
 
Machine Learning with WEKA
butest
 
An Introduction To Weka
DataminingTools Inc
 
WEKA Tutorial
butest
 
Data Science - Part V - Decision Trees & Random Forests
Derek Kane
 
Machine learning module 2
Gokulks007
 
MS SQL SERVER:Microsoft neural network and logistic regression
DataminingTools Inc
 
A simple introduction to weka
Pamoda Vajiramali
 
Introduction to Machine learning ppt
shubhamshirke12
 
Feature extraction for classifying students based on theirac ademic performance
Venkat Projects
 
Terminology Machine Learning
DataminingTools Inc
 
Oracle ML Cheat Sheet
IBM India Pvt Ltd
 
Weka
Shuang Wu
 
Weka_Manual_Sagar
Sagar Kumar
 
Weka toolkit introduction
butest
 
Machine Learning Unit 4 Semester 3 MSc IT Part 2 Mumbai University
Madhav Mishra
 

Viewers also liked (20)

PPTX
LISP: Type specifiers in lisp
DataminingTools Inc
 
PPTX
XL-MINER:Prediction
DataminingTools Inc
 
PPTX
Data Applied:Forecast
DataminingTools Inc
 
PPTX
LISP:Predicates in lisp
DataminingTools Inc
 
PDF
Norihicodanch
Filip Yang
 
PPTX
WEKA: Introduction To Weka
DataminingTools Inc
 
PPTX
WEKA: Credibility Evaluating Whats Been Learned
DataminingTools Inc
 
PPTX
Control Statements in Matlab
DataminingTools Inc
 
PPTX
Communicating simply
Mustansir Husain
 
PPTX
Txomin Hartz Txikia
irantzugoitia86
 
PPT
HistoriografíA Latina LatíN Ii
lara
 
PPTX
RapidMiner: Advanced Processes And Operators
DataminingTools Inc
 
PPTX
Data Applied: Clustering
DataminingTools Inc
 
PPTX
Drc 2010 D.J.Pawlik
slrommel
 
PPTX
Oracle: DML
DataminingTools Inc
 
PPT
Association Rules
DataminingTools Inc
 
PPTX
MS SQL SERVER: Programming sql server data mining
DataminingTools Inc
 
PPT
Powerpoint paragraaf 5.3/5.4
guestaa9e6a
 
PPTX
SQL Server: BI
DataminingTools Inc
 
PDF
Direct-services portfolio
vlastakolaja
 
LISP: Type specifiers in lisp
DataminingTools Inc
 
XL-MINER:Prediction
DataminingTools Inc
 
Data Applied:Forecast
DataminingTools Inc
 
LISP:Predicates in lisp
DataminingTools Inc
 
Norihicodanch
Filip Yang
 
WEKA: Introduction To Weka
DataminingTools Inc
 
WEKA: Credibility Evaluating Whats Been Learned
DataminingTools Inc
 
Control Statements in Matlab
DataminingTools Inc
 
Communicating simply
Mustansir Husain
 
Txomin Hartz Txikia
irantzugoitia86
 
HistoriografíA Latina LatíN Ii
lara
 
RapidMiner: Advanced Processes And Operators
DataminingTools Inc
 
Data Applied: Clustering
DataminingTools Inc
 
Drc 2010 D.J.Pawlik
slrommel
 
Oracle: DML
DataminingTools Inc
 
Association Rules
DataminingTools Inc
 
MS SQL SERVER: Programming sql server data mining
DataminingTools Inc
 
Powerpoint paragraaf 5.3/5.4
guestaa9e6a
 
SQL Server: BI
DataminingTools Inc
 
Direct-services portfolio
vlastakolaja
 
Ad

Similar to MS SQL SERVER: Microsoft naive bayes algorithm (20)

PPTX
MS SQL SERVER: Microsoft sequence clustering and association rules
sqlserver content
 
PDF
Machine learning Algorithms
Walaa Hamdy Assy
 
PDF
Introduction to Machine Learning with SciKit-Learn
Benjamin Bengfort
 
PPT
Lect_04b_PhpMysqlKEY PERFORMANCE INDICATOR FOR ICT-UNIT (new).ppt
SenzotaSemakuwa
 
DOCX
Task A. [20 marks] Data Choice. Name the chosen data set(s) .docx
josies1
 
PPT
Excel Datamining Addin Beginner
excel content
 
PPT
Excel Datamining Addin Beginner
DataminingTools Inc
 
PPTX
Machine Learning - Simple Linear Regression
Siddharth Shrivastava
 
PPTX
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Boston Institute of Analytics
 
PPT
Php and MySQL Web Development
w3ondemand
 
PPTX
8606BICA2.pptx
KhushalChoudhary14
 
PPTX
Applications of Classification Algorithm.pptx
nagarajan740445
 
DOCX
CPSC 50900 Database Systems ProjectAll your efforts this semeste
CruzIbarra161
 
PPTX
Generating test data for Statistical and ML models
Vladimir Ulogov
 
PPT
Excel Datamining Addin Intermediate
DataminingTools Inc
 
PPT
Excel Datamining Addin Intermediate
excel content
 
PPTX
Predicting Employee Attrition
Shruti Mohan
 
PPT
Excel Datamining Addin Advanced
DataminingTools Inc
 
PPT
Excel Datamining Addin Advanced
excel content
 
DOCX
Obiee interview questions and answers faq
maheshboggula
 
MS SQL SERVER: Microsoft sequence clustering and association rules
sqlserver content
 
Machine learning Algorithms
Walaa Hamdy Assy
 
Introduction to Machine Learning with SciKit-Learn
Benjamin Bengfort
 
Lect_04b_PhpMysqlKEY PERFORMANCE INDICATOR FOR ICT-UNIT (new).ppt
SenzotaSemakuwa
 
Task A. [20 marks] Data Choice. Name the chosen data set(s) .docx
josies1
 
Excel Datamining Addin Beginner
excel content
 
Excel Datamining Addin Beginner
DataminingTools Inc
 
Machine Learning - Simple Linear Regression
Siddharth Shrivastava
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Boston Institute of Analytics
 
Php and MySQL Web Development
w3ondemand
 
8606BICA2.pptx
KhushalChoudhary14
 
Applications of Classification Algorithm.pptx
nagarajan740445
 
CPSC 50900 Database Systems ProjectAll your efforts this semeste
CruzIbarra161
 
Generating test data for Statistical and ML models
Vladimir Ulogov
 
Excel Datamining Addin Intermediate
DataminingTools Inc
 
Excel Datamining Addin Intermediate
excel content
 
Predicting Employee Attrition
Shruti Mohan
 
Excel Datamining Addin Advanced
DataminingTools Inc
 
Excel Datamining Addin Advanced
excel content
 
Obiee interview questions and answers faq
maheshboggula
 
Ad

More from DataminingTools Inc (20)

PPTX
Techniques Machine Learning
DataminingTools Inc
 
PPTX
Machine learning Introduction
DataminingTools Inc
 
PPTX
Areas of machine leanring
DataminingTools Inc
 
PPTX
AI: Planning and AI
DataminingTools Inc
 
PPTX
AI: Logic in AI 2
DataminingTools Inc
 
PPTX
AI: Logic in AI
DataminingTools Inc
 
PPTX
AI: Learning in AI 2
DataminingTools Inc
 
PPTX
AI: Learning in AI
DataminingTools Inc
 
PPTX
AI: Introduction to artificial intelligence
DataminingTools Inc
 
PPTX
AI: Belief Networks
DataminingTools Inc
 
PPTX
AI: AI & Searching
DataminingTools Inc
 
PPTX
AI: AI & Problem Solving
DataminingTools Inc
 
PPTX
Data Mining: Text and web mining
DataminingTools Inc
 
PPTX
Data Mining: Outlier analysis
DataminingTools Inc
 
PPTX
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
PPTX
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
PPTX
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
PPTX
Data warehouse and olap technology
DataminingTools Inc
 
PPTX
Data Mining: Data processing
DataminingTools Inc
 
PPTX
Data Mining: clustering and analysis
DataminingTools Inc
 
Techniques Machine Learning
DataminingTools Inc
 
Machine learning Introduction
DataminingTools Inc
 
Areas of machine leanring
DataminingTools Inc
 
AI: Planning and AI
DataminingTools Inc
 
AI: Logic in AI 2
DataminingTools Inc
 
AI: Logic in AI
DataminingTools Inc
 
AI: Learning in AI 2
DataminingTools Inc
 
AI: Learning in AI
DataminingTools Inc
 
AI: Introduction to artificial intelligence
DataminingTools Inc
 
AI: Belief Networks
DataminingTools Inc
 
AI: AI & Searching
DataminingTools Inc
 
AI: AI & Problem Solving
DataminingTools Inc
 
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Outlier analysis
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: Data processing
DataminingTools Inc
 
Data Mining: clustering and analysis
DataminingTools Inc
 

Recently uploaded (20)

PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Digital Circuits, important subject in CS
contactparinay1
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 

MS SQL SERVER: Microsoft naive bayes algorithm

  • 2. overviewNaive Bayes AlgorithmDMX QueriesExploring a Naive Bayes ModelNaive Bayes PrinciplesNaive Bayes Parameters
  • 3. Naive Bayes AlgorithmThe Microsoft Naive Bayes algorithm is a classification algorithm provided by Microsoft SQL Server Analysis Services for use in predictive modeling. The name Naive Bayes derives from the fact that the algorithm uses Bayes theorem but does not take into account dependencies that may exist, and therefore its assumptions are said to be naive.
  • 4. How to use the Naive Bayes algorithm in SQL server?This algorithm is less computationally intense than other Microsoft algorithmsIt is therefore is useful for quickly generating mining models to discover relationships between input columns and predictable columns. The algorithm considers each pair of input attribute values and output attribute values.Exploring a Naive Bayes model will tell you how your attributes are related to each other.
  • 5. DMX When you create a query against a data mining modelyou can create either a content query, which provides details about the patterns discovered in analysis, or you can create a prediction query, which uses the patterns in the model to make predictions for new data You can also retrieve metadata about the model by using a query against the data mining schema rowset.
  • 6. DMX QueriesSELECT MODEL_CATALOG, MODEL_NAME, DATE_CREATED, LAST_PROCESSED, SERVICE_NAME, PREDICTION_ENTITY, FILTER FROM $system.DMSCHEMA_MINING_MODELS WHERE MODEL_NAME = 'TM_NaiveBayes_Filtered‘Getting Model Metadata by Using DMXyou can find metadata for the model, by querying the data mining schema rowset.This might include when the model was created, when the model was last processed, the name of the mining structure that the model is based on, and the name of the columns used as the predictable attribute.
  • 7. DMX QueriesRetrieving a Summary of Training DataQuery to retrieve the data from the node specified.Because the statistics are stored in a nested table, the FLATTENED keyword is used to make the results easier to view. SELECT FLATTENED MODEL_NAME, (SELECT ATTRIBUTE_NAME, ATTRIBUTE_VALUE, [SUPPORT], [PROBABILITY], VALUETYPE FROM NODE_DISTRIBUTION) AS t FROM TM_NaiveBayes.CONTENT WHERE NODE_TYPE = 26
  • 8. DMX QueriesFinding More Information about AttributesExample to show how to return information from the model about a particular attribute( here ”Region”) The Result of this query is shown in the next slide.SELECT NODE_TYPE, NODE_CAPTION, MSOLAP_NODE_SCORE FROM TM_NaiveBayes.CONTENT WHERE ATTRIBUTE_NAME = 'Region'
  • 9. DMX QueriesSample Resultto showinginformation from the model about a particular attribute  ”Region”
  • 10. DMX QueriesSELECT NODE_CAPTION, MSOLAP_NODE_SCORE FROM TM_NaiveBayes.CONTENT WHERE NODE_TYPE = 10 ORDER BY MSOLAP_NODE_SCORE DESCQuery returns theimportance scores ofall attributes in theModel. The Result of this query is shown in the next slide.
  • 11. DMX Queriesquery returns theimportance scores ofall attributes in theModel.
  • 12. Exploring a Naive Bayes ModelThe convenient way to start analyzing a new data set is to create a Naive Bayes model and mark all the non-key columns as both input and predictive.The content of each model is presented as a series of nodes. A node is an object within a mining model that contains metadata and information about a portion of the model. Nodes are arranged in a hierarchy. 
  • 14. Exploring a Naive Bayes ModelThe Naive Bayes viewer is accessed through either the BI Development Studio or SQL Management Studio by right-clicking on the model and selecting Browse.SQL Server Data Mining provides four different views on Naive Bayes models :Dependency Network: Provides a quick display of how all of the attributes in your model are related. Each node in the graph represents an attribute, whereas each edge represents a relationship. outgoing edge (it is predictive of the attribute in the node at the end of the edge)Incoming edge( it is predicted by the other node)
  • 15. Exploring a Naive Bayes ModelAttribute Profiles:provides you with an exhaustive report of how each input attribute corresponds to each output attribute, one attribute at a time. At the top of the Attribute Profiles view, you select which output you want to look at, and the rest of the view shows how all of the input attributes are correlated to the states of the selected output attribute.
  • 16. Exploring a Naive Bayes ModelAttribute Characteristics:This tab allows you to select an output attribute and value and shows you a description of the cases where that attribute and value occur.Attribute Discrimination: Provides the answers to the most interesting question: What is the difference between X and Y? With this viewer, you choose the attribute you are interested in, and select the states you want to compare.
  • 17. Naive Bayes PrinciplesBayes mathematical methods use a combination of conditional and unconditional probabilities.The Naive part of Naive Bayes tells you to treat all of your input attributes as independent of each other with respect to the target variable. This may be a faulty assumption, but it allows you to multiply your probabilities to determine the likelihood of each state.
  • 18. Naive Bayes PrinciplesThe Bayes rule states that if you have a hypothesis Hand evidence about that hypothesis E, then the probability of H is calculated using the following formula:P(H | E) = P(E | H) × P(H) P(E)This simply states that the probability of your hypothesis given the evidence is equal to the probability of the evidence given the hypothesis multiplied by the probability of the hypothesis, and then normalized.
  • 19. Naive Bayes ParametersMAXIMUM _INPUT _ATTRIBUTES determines the number of attributes that will be considered as inputs for training. If there is more than this number of inputs, the algorithm will select the most important inputs and ignore the rest. Setting this parameter to 0 causes the algorithm to consider all attributes.The default value is 255.MAXIMUM _OUTPUT _ATTRIBUTES determines the number of attributes that will be considered as outputs for training. If there is more than this number of outputs, the algorithm will select the most important outputs and ignore the rest. Setting this parameter to 0 causes the algorithm to consider all attributes.The default value is 255.
  • 20. Naive Bayes ParametersMAXIMUM _STATES controls how many states of an attribute are considered. If an attribute has more than this number of states, only the most popular states will be used. States that are not selected will be considered to be missing data. This parameter is useful when an attribute has a high cardinality
  • 21. SummaryNaive Bayes AlgorithmDMX QueriesNaive Bayes Model ContentExploring a Naive Bayes ModelNaive Bayes PrinciplesNaive Bayes Parameters
  • 22. Visit more self help tutorialsPick a tutorial of your choice and browse through it at your own pace.The tutorials section is free, self-guiding and will not involve any additional support.Visit us at www.dataminingtools.net