SlideShare a Scribd company logo
100

SQL SERVER: Data Mining
Types of analysis
• Ad-hoc query/Reporting/Analysis
  – What is the purpose?
     • Simple reports
     • Key Performance Indicators
     • OLAP cubes – Slice & Dice
  – In Real time - What happens now?
     • Events/Triggers

• Data Mining
  – How do we do it?
  – What happens?
What does Data Mining Do?
 Explores
Your Data

             Finds
            Patterns

                        Performs
                       Predictions
Data Mining Algorithms
•   Classification
•   Regression
•   Segmentation
•   Association
•   Forecasting
•   Text Analysis
•   Advanced Data Exploration
Mining Process
Training data                    Data to be
                  Mining Model   predicted




    DM Engine




Mining Model
                                      With
                                      predictions
Data Mining Process
                                                                     SSAS
                                                                     (OLAP)
                  Business                          Data
                                                                     DSV
                Understanding                   Understanding



                                                                              SSIS
                                                                              SSAS
                                                                   Data
                                Data                                          (OLAP)
                                                                Preparation

SSIS
SSAS(OLAP)
SSRS             Deployment
Flexible APIs                                                                   SSAS
                                                                 Modeling      (Data
                                                                              Mining)

                                   Evaluation

                                                                  www.crisp-dm.org
Data Mining in SQL Server 2008
• New algorithms developed in conjunction
  with Microsoft Research
• Data mining is made accessible and easy to
  use through integrated user interface, cross-
  product integration and familiar, standard APIs
• Complete framework for building and
  deploying intelligent applications on the fly
• Integration into the cloud.
Top New Features in SQL Server 2008

• Test multiple data mining models simultaneously with statistical
  scores of error and accuracy and confirm their stability with cross
  validation
• Build multiple, incompatible mining models within a single
  structure; apply model analysis over filtered data; query against
  structure data to present complete information, all enabled by
  enhanced mining structures
• Combine the best of both worlds by blending optimized near-term
  predictions (ARTXP) and stable long-term predictions (ARIMA) with
  Better Time Series Support
• Discover the relationship between items that are frequently
  purchased together by using Shopping Basket Analysis; generate
  interactive forms for scoring new cases with Predictive Calculator,
  delivered with Microsoft SQL Server 2008 Data Mining Add-ins for
  Office 2007
Rich and Innovative Algorithms
•   Benefit from many rich and innovative data mining algorithms, most developed by Microsoft Research to
    support common business problems promptly and accurately.
•   Market Basket Analysis - Discover which items tend to be bought together to create recommendations on-
    the-fly and to determine how product placement can directly contribute to your bottom line
•   Churn Analysis - Anticipate customers who may be considering canceling their service and identify benefits
    that will keep them from leaving
•   Market Analysis - Define market segments by automatically grouping similar customers together. Use
    these segments to seek profitable customers
•   Forecasting - Predict sales and inventory amounts and learn how they are interrelated to foresee
    bottlenecks and improve performance
•   Data Exploration - Analyze profitability across customers, or compare customers who prefer different
    brands of the same product to discover new opportunities
•   Unsupervised Learning - Identify previously unknown relationships between various elements of your
    business to better inform your decisions
•   Web Site Analysis - Understand how people use your Web site and group similar usage patterns to offer a
    better experience
•   Campaign Analysis - Spend marketing dollars more effectively by targeting the customers most likely to
    respond to a promotion
•   Information Quality - Identify and handle anomalies during data entry or data loading to improve the
    quality of information
•   Text Analysis - Analyze feedback to find common themes and trends that concern your customers or
    employees, informing decisions with unstructured input
Value of Data Mining
                           Business Knowledge

                                                            SQL Server 2008
Business value




                                                                           Data Mining


                                                          OLAP



                                     Reports (Adhoc)

                           Reports (static)

                  Simple                                         Complex
                                              Usability
Data Mining User Interface
• SQL Server BI Development Studio
  – Environment for creation and data exploration
  – Data Mining projects in Visual Studio solutions, tightly
    integrated
  – Source Control Integration
• SQL Server Management Studio
  – One tool for all administrative tasks
  – Manage, view and query mining models
BI Integration
• Integration Services
  – Data Mining processing and results integrate
    directly in IS pipeline
• OLAP
  – Processing of mining models directly from
    cubes
  – Use of mining results as dimensions
• Reporting Services
  – Embed Data Mining results directly in
    Reporting Services Reports
Applied Data Mining
• Make Decisions without Coding
   – Learn business rules directly from data
• Client Customization
   – Learn logic customized for each client
• Automatic Update
   – Data mining application logic updated by model re-
     processing
   – Applications do not need to be rewritten, recompiled, re-
     deployed
Server Mining Architecture
      BI Dev        Your Application
      Studio
      (Visual
      Studio)        OLE DB/ ADOMD/ XMLA
                                            App
Deploy                                      Data


Analysis Services   Mining Model
Server
                    Data Mining Algorithm           Data
                                                   Source
Data Mining EXtensions
• OLE DB for Data Mining specification
   – Now part of XML/A specification
   – See www.xmla.org for XML/A details
• Connect to Analysis Server
   – OLEDB, ADO, ADO.Net, ADOMD.Net, XMLA
   Dim cmd as ADOMD.Command
   Dim reader as ADOMD.DataReader
   Cmd.Connection = conn
   Set reader =
     Cmd.ExecuteReader(“Select
     Predict(Gender)…”)
Typical DM Process Using DMX
Define a model:
CREATE MINING MODEL ….

                                    Data Mining
Train a model:                   Management System
INSERT INTO dmm ….                   (DMMS)
         Training Data




Prediction using a model:          Mining Model
SELECT …
FROM dmm PREDICTION JOIN …
         Prediction Input Data
DMX Commands
• Definition (DDL)
   –   CREATE – Make new model
   –   SELECT INTO – Create model by copying existing
   –   EXPORT – Save model as .abf file
   –   IMPORT – Retrieve model from .abf file
• Manipulation (DML)
   –   INSERT INTO – Train model
   –   UPDATE – Change content of model
   –   DELETE – Clear content
   –   SELECT – Browse model
DMX SELECT Elements
•   SELECT [FLATTENED] [TOP] <columns>
•   FROM <model>
•   PREDICTION JOIN <table>
•   ON <mapping>
•   WHERE <filter>
•   ORDER BY <sort expression>
    – Use query builder to create SELECT statement
Training a DM Model: Simple
INSERT INTO CollegePlanModel
  (StudentID, Gender, ParentIncome,
   Encouragement, CollegePlans)
OPENROWSET(‘<provider>’, ‘<connection>’,
      ‘SELECT    StudentID,
                 Gender,
                 ParentIncome,
                 Encouragement,
                 CollegePlans
       FROM CollegePlansTrainData’)
Prediction Using a DM Model
• PREDICTION JOIN
  SELECT t.ID, CPModel.Plan
  FROM CPModel PREDICTION JOIN
      OPENQUERY(…,„SELECT * FROM NewStudents‟) AS t
  ON CPModel.Gender = t.Gender AND
     CPModel.IQ = t.IQ
Visit more self help tutorials

• Pick a tutorial of your choice and browse
  through it at your own pace.
• The tutorials section is free, self-guiding and
  will not involve any additional support.
• Visit us at www.dataminingtools.net

More Related Content

What's hot (20)

Xml parsers
Xml parsersXml parsers
Xml parsers
Manav Prasad
 
Azure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data FlowsAzure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data Flows
Thomas Sykes
 
Basics of the Web Platform
Basics of the Web PlatformBasics of the Web Platform
Basics of the Web Platform
Sanjeev Verma, PhD
 
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
Wes McKinney
 
Apache pig
Apache pigApache pig
Apache pig
Sadiq Basha
 
DBMS Multiple Choice Questions
DBMS Multiple Choice QuestionsDBMS Multiple Choice Questions
DBMS Multiple Choice Questions
Shusil Baral
 
Azure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de KreukAzure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de Kreuk
Erwin de Kreuk
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
uncleRhyme
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
chennakesava44
 
Mongo indexes
Mongo indexesMongo indexes
Mongo indexes
paradokslabs
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
Ravi Teja
 
Complete Lecture on Css presentation
Complete Lecture on Css presentation Complete Lecture on Css presentation
Complete Lecture on Css presentation
Salman Memon
 
Spark, ou comment traiter des données à la vitesse de l'éclair
Spark, ou comment traiter des données à la vitesse de l'éclairSpark, ou comment traiter des données à la vitesse de l'éclair
Spark, ou comment traiter des données à la vitesse de l'éclair
Alexis Seigneurin
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Marin Dimitrov
 
Data mining techniques
Data mining techniquesData mining techniques
Data mining techniques
KarthigaGunasekaran1
 
Hive
HiveHive
Hive
Manas Nayak
 
Apache Spark overview
Apache Spark overviewApache Spark overview
Apache Spark overview
DataArt
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introduction
Pooyan Mehrparvar
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
Vadim Y. Bichutskiy
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
Knoldus Inc.
 
Azure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data FlowsAzure Data Factory V2; The Data Flows
Azure Data Factory V2; The Data Flows
Thomas Sykes
 
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
PyCon Colombia 2020 Python for Data Analysis: Past, Present, and Future
Wes McKinney
 
DBMS Multiple Choice Questions
DBMS Multiple Choice QuestionsDBMS Multiple Choice Questions
DBMS Multiple Choice Questions
Shusil Baral
 
Azure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de KreukAzure Purview Data Toboggan Erwin de Kreuk
Azure Purview Data Toboggan Erwin de Kreuk
Erwin de Kreuk
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
uncleRhyme
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
chennakesava44
 
Introduction to MongoDB
Introduction to MongoDBIntroduction to MongoDB
Introduction to MongoDB
Ravi Teja
 
Complete Lecture on Css presentation
Complete Lecture on Css presentation Complete Lecture on Css presentation
Complete Lecture on Css presentation
Salman Memon
 
Spark, ou comment traiter des données à la vitesse de l'éclair
Spark, ou comment traiter des données à la vitesse de l'éclairSpark, ou comment traiter des données à la vitesse de l'éclair
Spark, ou comment traiter des données à la vitesse de l'éclair
Alexis Seigneurin
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Marin Dimitrov
 
Apache Spark overview
Apache Spark overviewApache Spark overview
Apache Spark overview
DataArt
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introduction
Pooyan Mehrparvar
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
Knoldus Inc.
 

Viewers also liked (20)

SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next Level
Mark Ginnebaugh
 
AI: Planning and AI
AI: Planning and AIAI: Planning and AI
AI: Planning and AI
DataminingTools Inc
 
Lf conditionals
Lf conditionalsLf conditionals
Lf conditionals
Olaf Du Pont
 
Lecture no 15
Lecture no 15Lecture no 15
Lecture no 15
zaidshaidzaid
 
Microsoft Data Mining 2012
Microsoft Data Mining 2012Microsoft Data Mining 2012
Microsoft Data Mining 2012
Mark Ginnebaugh
 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Salah Amean
 
38475471 qa-and-software-testing-interview-questions-and-answers
38475471 qa-and-software-testing-interview-questions-and-answers38475471 qa-and-software-testing-interview-questions-and-answers
38475471 qa-and-software-testing-interview-questions-and-answers
Maria FutureThoughts
 
Microsoft azure service 소개자료
Microsoft azure service 소개자료Microsoft azure service 소개자료
Microsoft azure service 소개자료
Alvin You
 
Interview Questions for Mobile application Testing
Interview Questions for Mobile application TestingInterview Questions for Mobile application Testing
Interview Questions for Mobile application Testing
Rahul S Singh
 
Preparing your QA team for mobile testing
Preparing your QA team for mobile testingPreparing your QA team for mobile testing
Preparing your QA team for mobile testing
Geoffrey Goetz
 
Webservices(or)SoapUI Interview Questions
Webservices(or)SoapUI Interview QuestionsWebservices(or)SoapUI Interview Questions
Webservices(or)SoapUI Interview Questions
H2kInfosys
 
Portavocía en redes sociales
Portavocía en redes socialesPortavocía en redes sociales
Portavocía en redes sociales
Muévete en bici por Madrid
 
Quick Look At Clustering
Quick Look At ClusteringQuick Look At Clustering
Quick Look At Clustering
DataminingTools Inc
 
LISP: Errors In Lisp
LISP: Errors In LispLISP: Errors In Lisp
LISP: Errors In Lisp
DataminingTools Inc
 
LISP: Scope and extent in lisp
LISP: Scope and extent in lispLISP: Scope and extent in lisp
LISP: Scope and extent in lisp
DataminingTools Inc
 
Matlab: Saving And Publishing
Matlab: Saving And PublishingMatlab: Saving And Publishing
Matlab: Saving And Publishing
DataminingTools Inc
 
LISP:Predicates in lisp
LISP:Predicates in lispLISP:Predicates in lisp
LISP:Predicates in lisp
DataminingTools Inc
 
Matlab: Discrete Linear Systems
Matlab: Discrete Linear SystemsMatlab: Discrete Linear Systems
Matlab: Discrete Linear Systems
DataminingTools Inc
 
Data-Applied: Technology Insights
Data-Applied: Technology InsightsData-Applied: Technology Insights
Data-Applied: Technology Insights
DataminingTools Inc
 
SQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next LevelSQL Server Data Mining - Taking your Application Design to the Next Level
SQL Server Data Mining - Taking your Application Design to the Next Level
Mark Ginnebaugh
 
Microsoft Data Mining 2012
Microsoft Data Mining 2012Microsoft Data Mining 2012
Microsoft Data Mining 2012
Mark Ginnebaugh
 
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...Data Mining:  Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Data Mining: Concepts and Techniques_ Chapter 6: Mining Frequent Patterns, ...
Salah Amean
 
38475471 qa-and-software-testing-interview-questions-and-answers
38475471 qa-and-software-testing-interview-questions-and-answers38475471 qa-and-software-testing-interview-questions-and-answers
38475471 qa-and-software-testing-interview-questions-and-answers
Maria FutureThoughts
 
Microsoft azure service 소개자료
Microsoft azure service 소개자료Microsoft azure service 소개자료
Microsoft azure service 소개자료
Alvin You
 
Interview Questions for Mobile application Testing
Interview Questions for Mobile application TestingInterview Questions for Mobile application Testing
Interview Questions for Mobile application Testing
Rahul S Singh
 
Preparing your QA team for mobile testing
Preparing your QA team for mobile testingPreparing your QA team for mobile testing
Preparing your QA team for mobile testing
Geoffrey Goetz
 
Webservices(or)SoapUI Interview Questions
Webservices(or)SoapUI Interview QuestionsWebservices(or)SoapUI Interview Questions
Webservices(or)SoapUI Interview Questions
H2kInfosys
 
Data-Applied: Technology Insights
Data-Applied: Technology InsightsData-Applied: Technology Insights
Data-Applied: Technology Insights
DataminingTools Inc
 
Ad

Similar to SQL Server: Data Mining (20)

BI 2008 Simple
BI 2008 SimpleBI 2008 Simple
BI 2008 Simple
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
Data mining applications
Data mining applicationsData mining applications
Data mining applications
Dr. C.V. Suresh Babu
 
Data Mining 2008
Data Mining 2008Data Mining 2008
Data Mining 2008
llangit
 
Data Mining for Developers
Data Mining for DevelopersData Mining for Developers
Data Mining for Developers
llangit
 
Decision support system
Decision support systemDecision support system
Decision support system
Bhuwneshwar Pandaya
 
Mine craft:
Mine craft: Mine craft:
Mine craft:
Mark Tabladillo
 
Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Microsoft SQL Server_2012_predictive_analytics
Microsoft SQL Server_2012_predictive_analyticsMicrosoft SQL Server_2012_predictive_analytics
Microsoft SQL Server_2012_predictive_analytics
David J Rosenthal
 
Sql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheetSql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheet
Klaudiia Jacome
 
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL ServerSQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
SSAS Design &amp; Incremental Processing - PASSMN May 2010
SSAS Design &amp; Incremental Processing - PASSMN May 2010SSAS Design &amp; Incremental Processing - PASSMN May 2010
SSAS Design &amp; Incremental Processing - PASSMN May 2010
Dan English
 
Data mining
Data miningData mining
Data mining
Ahmed Moussa
 
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
SQL Saturday 109 -- Enterprise Data Mining with SQL ServerSQL Saturday 109 -- Enterprise Data Mining with SQL Server
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
SQL Saturday 86 -- Enterprise Data Mining with SQL ServerSQL Saturday 86 -- Enterprise Data Mining with SQL Server
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
Introduction To Sql Server Data Mining
Introduction To Sql Server Data MiningIntroduction To Sql Server Data Mining
Introduction To Sql Server Data Mining
Hugo Olivera Alonso
 
Lecture2 (1).ppt
Lecture2 (1).pptLecture2 (1).ppt
Lecture2 (1).ppt
Minakshee Patil
 
Data mining
Data miningData mining
Data mining
RajThakuri
 
BI 2008 Simple
BI 2008 SimpleBI 2008 Simple
BI 2008 Simple
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
SQL Server 2008 Data Mining
SQL Server 2008 Data MiningSQL Server 2008 Data Mining
SQL Server 2008 Data Mining
llangit
 
Data Mining 2008
Data Mining 2008Data Mining 2008
Data Mining 2008
llangit
 
Data Mining for Developers
Data Mining for DevelopersData Mining for Developers
Data Mining for Developers
llangit
 
Microsoft SQL Server_2012_predictive_analytics
Microsoft SQL Server_2012_predictive_analyticsMicrosoft SQL Server_2012_predictive_analytics
Microsoft SQL Server_2012_predictive_analytics
David J Rosenthal
 
Sql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheetSql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheet
Klaudiia Jacome
 
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL ServerSQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
SSAS Design &amp; Incremental Processing - PASSMN May 2010
SSAS Design &amp; Incremental Processing - PASSMN May 2010SSAS Design &amp; Incremental Processing - PASSMN May 2010
SSAS Design &amp; Incremental Processing - PASSMN May 2010
Dan English
 
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
SQL Saturday 109 -- Enterprise Data Mining with SQL ServerSQL Saturday 109 -- Enterprise Data Mining with SQL Server
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
SQL Saturday 86 -- Enterprise Data Mining with SQL ServerSQL Saturday 86 -- Enterprise Data Mining with SQL Server
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
Mark Tabladillo
 
Introduction To Sql Server Data Mining
Introduction To Sql Server Data MiningIntroduction To Sql Server Data Mining
Introduction To Sql Server Data Mining
Hugo Olivera Alonso
 
Ad

More from DataminingTools Inc (20)

Terminology Machine Learning
Terminology Machine LearningTerminology Machine Learning
Terminology Machine Learning
DataminingTools Inc
 
Techniques Machine Learning
Techniques Machine LearningTechniques Machine Learning
Techniques Machine Learning
DataminingTools Inc
 
Machine learning Introduction
Machine learning IntroductionMachine learning Introduction
Machine learning Introduction
DataminingTools Inc
 
Areas of machine leanring
Areas of machine leanringAreas of machine leanring
Areas of machine leanring
DataminingTools Inc
 
AI: Logic in AI 2
AI: Logic in AI 2AI: Logic in AI 2
AI: Logic in AI 2
DataminingTools Inc
 
AI: Logic in AI
AI: Logic in AIAI: Logic in AI
AI: Logic in AI
DataminingTools Inc
 
AI: Learning in AI 2
AI: Learning in AI 2AI: Learning in AI 2
AI: Learning in AI 2
DataminingTools Inc
 
AI: Learning in AI
AI: Learning in AI AI: Learning in AI
AI: Learning in AI
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligenceAI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
AI: Belief Networks
AI: Belief NetworksAI: Belief Networks
AI: Belief Networks
DataminingTools Inc
 
AI: AI & Searching
AI: AI & SearchingAI: AI & Searching
AI: AI & Searching
DataminingTools Inc
 
AI: AI & Problem Solving
AI: AI & Problem SolvingAI: AI & Problem Solving
AI: AI & Problem Solving
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Outlier analysis
Data Mining: Outlier analysisData Mining: Outlier analysis
Data Mining: Outlier analysis
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence dataData Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: Data processing
Data Mining: Data processingData Mining: Data processing
Data Mining: Data processing
DataminingTools Inc
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysis
DataminingTools Inc
 
AI: Introduction to artificial intelligence
AI: Introduction to artificial intelligenceAI: Introduction to artificial intelligence
AI: Introduction to artificial intelligence
DataminingTools Inc
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web mining
DataminingTools Inc
 
Data Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence dataData Mining: Mining stream time series and sequence data
Data Mining: Mining stream time series and sequence data
DataminingTools Inc
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
DataminingTools Inc
 
Data Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysisData Mining: Graph mining and social network analysis
Data Mining: Graph mining and social network analysis
DataminingTools Inc
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
DataminingTools Inc
 
Data Mining: clustering and analysis
Data Mining: clustering and analysisData Mining: clustering and analysis
Data Mining: clustering and analysis
DataminingTools Inc
 

Recently uploaded (20)

Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
How to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptxHow to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptx
Version 1 Analytics
 
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
Edge AI and Vision Alliance
 
Jira Administration Training – Day 1 : Introduction
Jira Administration Training – Day 1 : IntroductionJira Administration Training – Day 1 : Introduction
Jira Administration Training – Day 1 : Introduction
Ravi Teja
 
Oracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI FoundationsOracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI Foundations
VICTOR MAESTRE RAMIREZ
 
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
Jasper Oosterveld
 
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdfcnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
AmirStern2
 
7 Salesforce Data Cloud Best Practices.pdf
7 Salesforce Data Cloud Best Practices.pdf7 Salesforce Data Cloud Best Practices.pdf
7 Salesforce Data Cloud Best Practices.pdf
Minuscule Technologies
 
FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0
RodrigoMori7
 
Improving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevExImproving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevEx
Justin Reock
 
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and ImplementationAI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
Christine Shepherd
 
Your startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean accountYour startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean account
angelo60207
 
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to KnowWhat is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
SMACT Works
 
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und AnwendungsfälleDomino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
panagenda
 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
 
Murdledescargadarkweb.pdfvolumen1 100 elementary
Murdledescargadarkweb.pdfvolumen1 100 elementaryMurdledescargadarkweb.pdfvolumen1 100 elementary
Murdledescargadarkweb.pdfvolumen1 100 elementary
JorgeSemperteguiMont
 
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdfvertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
 
Data Virtualization: Bringing the Power of FME to Any Application
Data Virtualization: Bringing the Power of FME to Any ApplicationData Virtualization: Bringing the Power of FME to Any Application
Data Virtualization: Bringing the Power of FME to Any Application
Safe Software
 
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptxISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
AyilurRamnath1
 
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Infrassist Technologies Pvt. Ltd.
 
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
How to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptxHow to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptx
Version 1 Analytics
 
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
“State-space Models vs. Transformers for Ultra-low-power Edge AI,” a Presenta...
Edge AI and Vision Alliance
 
Jira Administration Training – Day 1 : Introduction
Jira Administration Training – Day 1 : IntroductionJira Administration Training – Day 1 : Introduction
Jira Administration Training – Day 1 : Introduction
Ravi Teja
 
Oracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI FoundationsOracle Cloud Infrastructure AI Foundations
Oracle Cloud Infrastructure AI Foundations
VICTOR MAESTRE RAMIREZ
 
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....
Jasper Oosterveld
 
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdfcnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
AmirStern2
 
7 Salesforce Data Cloud Best Practices.pdf
7 Salesforce Data Cloud Best Practices.pdf7 Salesforce Data Cloud Best Practices.pdf
7 Salesforce Data Cloud Best Practices.pdf
Minuscule Technologies
 
FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0
RodrigoMori7
 
Improving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevExImproving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevEx
Justin Reock
 
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and ImplementationAI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
Christine Shepherd
 
Your startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean accountYour startup on AWS - How to architect and maintain a Lean and Mean account
Your startup on AWS - How to architect and maintain a Lean and Mean account
angelo60207
 
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to KnowWhat is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
What is Oracle EPM A Guide to Oracle EPM Cloud Everything You Need to Know
SMACT Works
 
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und AnwendungsfälleDomino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
panagenda
 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
 
Murdledescargadarkweb.pdfvolumen1 100 elementary
Murdledescargadarkweb.pdfvolumen1 100 elementaryMurdledescargadarkweb.pdfvolumen1 100 elementary
Murdledescargadarkweb.pdfvolumen1 100 elementary
JorgeSemperteguiMont
 
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdfvertical-cnc-processing-centers-drillteq-v-200-en.pdf
vertical-cnc-processing-centers-drillteq-v-200-en.pdf
AmirStern2
 
Data Virtualization: Bringing the Power of FME to Any Application
Data Virtualization: Bringing the Power of FME to Any ApplicationData Virtualization: Bringing the Power of FME to Any Application
Data Virtualization: Bringing the Power of FME to Any Application
Safe Software
 
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptxISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
AyilurRamnath1
 
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Infrassist Technologies Pvt. Ltd.
 

SQL Server: Data Mining

  • 2. Types of analysis • Ad-hoc query/Reporting/Analysis – What is the purpose? • Simple reports • Key Performance Indicators • OLAP cubes – Slice & Dice – In Real time - What happens now? • Events/Triggers • Data Mining – How do we do it? – What happens?
  • 3. What does Data Mining Do? Explores Your Data Finds Patterns Performs Predictions
  • 4. Data Mining Algorithms • Classification • Regression • Segmentation • Association • Forecasting • Text Analysis • Advanced Data Exploration
  • 5. Mining Process Training data Data to be Mining Model predicted DM Engine Mining Model With predictions
  • 6. Data Mining Process SSAS (OLAP) Business Data DSV Understanding Understanding SSIS SSAS Data Data (OLAP) Preparation SSIS SSAS(OLAP) SSRS Deployment Flexible APIs SSAS Modeling (Data Mining) Evaluation www.crisp-dm.org
  • 7. Data Mining in SQL Server 2008 • New algorithms developed in conjunction with Microsoft Research • Data mining is made accessible and easy to use through integrated user interface, cross- product integration and familiar, standard APIs • Complete framework for building and deploying intelligent applications on the fly • Integration into the cloud.
  • 8. Top New Features in SQL Server 2008 • Test multiple data mining models simultaneously with statistical scores of error and accuracy and confirm their stability with cross validation • Build multiple, incompatible mining models within a single structure; apply model analysis over filtered data; query against structure data to present complete information, all enabled by enhanced mining structures • Combine the best of both worlds by blending optimized near-term predictions (ARTXP) and stable long-term predictions (ARIMA) with Better Time Series Support • Discover the relationship between items that are frequently purchased together by using Shopping Basket Analysis; generate interactive forms for scoring new cases with Predictive Calculator, delivered with Microsoft SQL Server 2008 Data Mining Add-ins for Office 2007
  • 9. Rich and Innovative Algorithms • Benefit from many rich and innovative data mining algorithms, most developed by Microsoft Research to support common business problems promptly and accurately. • Market Basket Analysis - Discover which items tend to be bought together to create recommendations on- the-fly and to determine how product placement can directly contribute to your bottom line • Churn Analysis - Anticipate customers who may be considering canceling their service and identify benefits that will keep them from leaving • Market Analysis - Define market segments by automatically grouping similar customers together. Use these segments to seek profitable customers • Forecasting - Predict sales and inventory amounts and learn how they are interrelated to foresee bottlenecks and improve performance • Data Exploration - Analyze profitability across customers, or compare customers who prefer different brands of the same product to discover new opportunities • Unsupervised Learning - Identify previously unknown relationships between various elements of your business to better inform your decisions • Web Site Analysis - Understand how people use your Web site and group similar usage patterns to offer a better experience • Campaign Analysis - Spend marketing dollars more effectively by targeting the customers most likely to respond to a promotion • Information Quality - Identify and handle anomalies during data entry or data loading to improve the quality of information • Text Analysis - Analyze feedback to find common themes and trends that concern your customers or employees, informing decisions with unstructured input
  • 10. Value of Data Mining Business Knowledge SQL Server 2008 Business value Data Mining OLAP Reports (Adhoc) Reports (static) Simple Complex Usability
  • 11. Data Mining User Interface • SQL Server BI Development Studio – Environment for creation and data exploration – Data Mining projects in Visual Studio solutions, tightly integrated – Source Control Integration • SQL Server Management Studio – One tool for all administrative tasks – Manage, view and query mining models
  • 12. BI Integration • Integration Services – Data Mining processing and results integrate directly in IS pipeline • OLAP – Processing of mining models directly from cubes – Use of mining results as dimensions • Reporting Services – Embed Data Mining results directly in Reporting Services Reports
  • 13. Applied Data Mining • Make Decisions without Coding – Learn business rules directly from data • Client Customization – Learn logic customized for each client • Automatic Update – Data mining application logic updated by model re- processing – Applications do not need to be rewritten, recompiled, re- deployed
  • 14. Server Mining Architecture BI Dev Your Application Studio (Visual Studio) OLE DB/ ADOMD/ XMLA App Deploy Data Analysis Services Mining Model Server Data Mining Algorithm Data Source
  • 15. Data Mining EXtensions • OLE DB for Data Mining specification – Now part of XML/A specification – See www.xmla.org for XML/A details • Connect to Analysis Server – OLEDB, ADO, ADO.Net, ADOMD.Net, XMLA Dim cmd as ADOMD.Command Dim reader as ADOMD.DataReader Cmd.Connection = conn Set reader = Cmd.ExecuteReader(“Select Predict(Gender)…”)
  • 16. Typical DM Process Using DMX Define a model: CREATE MINING MODEL …. Data Mining Train a model: Management System INSERT INTO dmm …. (DMMS) Training Data Prediction using a model: Mining Model SELECT … FROM dmm PREDICTION JOIN … Prediction Input Data
  • 17. DMX Commands • Definition (DDL) – CREATE – Make new model – SELECT INTO – Create model by copying existing – EXPORT – Save model as .abf file – IMPORT – Retrieve model from .abf file • Manipulation (DML) – INSERT INTO – Train model – UPDATE – Change content of model – DELETE – Clear content – SELECT – Browse model
  • 18. DMX SELECT Elements • SELECT [FLATTENED] [TOP] <columns> • FROM <model> • PREDICTION JOIN <table> • ON <mapping> • WHERE <filter> • ORDER BY <sort expression> – Use query builder to create SELECT statement
  • 19. Training a DM Model: Simple INSERT INTO CollegePlanModel (StudentID, Gender, ParentIncome, Encouragement, CollegePlans) OPENROWSET(‘<provider>’, ‘<connection>’, ‘SELECT StudentID, Gender, ParentIncome, Encouragement, CollegePlans FROM CollegePlansTrainData’)
  • 20. Prediction Using a DM Model • PREDICTION JOIN SELECT t.ID, CPModel.Plan FROM CPModel PREDICTION JOIN OPENQUERY(…,„SELECT * FROM NewStudents‟) AS t ON CPModel.Gender = t.Gender AND CPModel.IQ = t.IQ
  • 21. Visit more self help tutorials • Pick a tutorial of your choice and browse through it at your own pace. • The tutorials section is free, self-guiding and will not involve any additional support. • Visit us at www.dataminingtools.net