DWDM Unit 3, 4 and 5 Links

The document outlines the syllabus for the Data Mining and Data Warehousing course at Raghu Engineering College, focusing on classification, association analysis, and clustering techniques. It includes detailed topics such as decision tree classifiers, evaluation methods, and algorithms like Apriori and DBSCAN, along with associated YouTube links for further learning. The document serves as a comprehensive guide for third-year Computer Science students to understand key concepts and methodologies in data mining.

RAGHU ENGINEERING COLLEGE, DAKAMARRI
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING (DS)

Branch: CSE    Year: III Year - I Sem

Subject Name: Data Mining and Data Warehousing
Faculty Name: D. Hima Bindu

UNIT III
Classification: Basic Concepts, General Approach to Solving a Classification Problem. Decision Tree Classifier: A
Basic Algorithm to Build a Decision Tree, Methods for Expressing Attribute Test Conditions, Measures for
Selecting an Attribute Test Condition, Algorithm for Decision Tree Induction, Characteristics of Decision Tree
Classifiers. Model Overfitting: Reasons for Model Overfitting. Evaluating the Performance of a Classifier: Holdout
Method, Cross-Validation, Random Subsampling.
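As a quick illustration of the "measures for selecting an attribute test condition" listed above, the following is a minimal sketch of entropy, Gini index, and the impurity reduction of a candidate split. The function names and the class counts are assumptions made up for this example; they do not come from the course material or the linked videos.

```python
import math

def entropy(counts):
    """Entropy (in bits) of a node, given the number of records per class."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def gini(counts):
    """Gini index of a node, given the number of records per class."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def weighted_impurity(children, measure):
    """Impurity of a split: child impurities weighted by child size."""
    n = sum(sum(child) for child in children)
    return sum(sum(child) / n * measure(child) for child in children)

# Hypothetical node with 5 records of each class, split into two children.
parent = [5, 5]
children = [[4, 1], [1, 4]]

# Information gain = parent entropy minus weighted child entropy.
info_gain = entropy(parent) - weighted_impurity(children, entropy)
gini_after_split = weighted_impurity(children, gini)

print(f"parent entropy = {entropy(parent):.3f}, parent gini = {gini(parent):.3f}")
print(f"information gain = {info_gain:.3f}, gini after split = {gini_after_split:.3f}")
```

A split with a larger information gain (or a lower weighted Gini index) produces purer child nodes, which is why these measures are used to choose the test condition at each node.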

S.No | Topic | YouTube Link | Duration
1. | General approach to solving a classification problem | https://youtu.be/_9CXhMYflrc?si=EwCZhDL7h0-wtbif | 20.14-26.54 (6.40 mins)
2. | Basic algorithm used to build a decision tree | https://youtu.be/PQbz0j56T3A?si=n-asaZTcW9efJYyN | 17.42 mins
3. | Common methods for expressing attribute test conditions in decision trees | https://youtu.be/zLXRLxXT5Is?si=vhLl1415LKsMB2DF | 5.00 mins
4. | Decision tree induction | https://youtu.be/mb3N4_pYtqE?si=is6rQkoyPbXBe_A5 | 6.43 mins
5. | Confusion matrix | https://youtu.be/_9CXhMYflrc?si=S4pFEUWTSZZ0jD1V | 27.28-34.28 (7.00 mins)
6. | Measures for selecting an attribute test condition | https://youtu.be/lDapB3F8Lw8?si=9KeH4PDMhNelB4xz | 8.59 mins
7. | Algorithm for decision tree induction | https://youtu.be/Bjy81x3Efvk?si=9qHsV9u9W3hu3As1 | 21.40-31.59 (10.19 mins)
8. | Hunt’s algorithm | https://youtu.be/MhtAHhJBsMo?si=iQ4pc8R5tCKRbZpG | 23.19 mins
9. | Characteristics of decision tree | https://youtu.be/EF9fyVjb1TM?si=PB2JC-ztbd4Grp5z | 22.46 mins
10. | Measures for node impurity | https://youtu.be/Bjy81x3Efvk?si=9qHsV9u9W3hu3As1 | 9.33-21.10 (12.23 mins)
11. | Holdout method to evaluate the performance of a classifier | https://youtu.be/fvfHTGqpR-M?si=pbBMYRmG4xtM6L9n | 3.45 mins
12. | Random subsampling to evaluate the performance of a classifier | https://youtu.be/QtBxBJQsI2A?si=62l2k9lNydWpJkDe | 5.25-7.48 (2.23 mins)
13. | k-fold cross-validation | https://youtu.be/QtBxBJQsI2A?si=62l2k9lNydWpJkDe | 7.55-19.25 (12.30 mins)
14. | Creating a decision tree, example | https://youtu.be/zNYdkpAcP-g?si=QOTMOlumCE0HZBW0 | 14.17 mins
15. | Explain overfitting with an example | https://youtu.be/cSUEmTX9qSU?si=apQQQRpBuUzRaWxd | 10.16 mins
16. | Explain reasons for model overfitting in decision tree induction | https://youtu.be/EF9fyVjb1TM?si=zeFJMQEKrlFKEkvv | 22.40 mins
17. | Explain the difference between entropy and Gini index | https://youtu.be/mzW66DB48oM?si=nX26pPJ45QR-krU4 | 4.30 mins
Total Time mins
UNIT IV
Association Analysis: Problem Definition. Frequent Itemset Generation: The Apriori Principle,
Frequent Itemset Generation in the Apriori Algorithm, Candidate Generation and Pruning, Support
Counting. Rule Generation, Compact Representation of Frequent Itemsets, FP-Growth
Algorithm.
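The Apriori steps named above (candidate generation, pruning, and support counting) can be sketched as follows. This is a minimal, unoptimized illustration; the function name `apriori`, the market-basket transactions, and the minimum support count of 3 are all assumptions chosen for the example, not material from the course.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {itemset: support count} for every frequent itemset."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items and keep the frequent ones.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        prev = set(frequent)
        # Candidate generation: merge frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Pruning (Apriori principle): every (k-1)-subset must be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in prev for s in combinations(c, k - 1))}
        # Support counting: scan the transactions for each surviving candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result

# Hypothetical market-basket data.
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diapers", "Beer", "Eggs"},
    {"Milk", "Diapers", "Beer", "Cola"},
    {"Bread", "Milk", "Diapers", "Beer"},
    {"Bread", "Milk", "Diapers", "Cola"},
]
frequent_itemsets = apriori(transactions, min_support=3)
for itemset, support in sorted(frequent_itemsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

The pruning step is what keeps the search tractable: a candidate is discarded without scanning the data as soon as one of its subsets is known to be infrequent.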
S.No | Topic | YouTube Link
1 | Association rule mining | https://youtu.be/IpBJ-veH-g0?si=A_rmx8y6UUD4om8w
2 | Association rule mining example | https://youtu.be/RDQplhHYUr0?si=ZfAFZEnbZsKhmaz4
3 | FP-Growth algorithm | https://youtu.be/7oGz4PCp9jI?si=dZLs4KyeSi71oPhv
4 | Compact representation of frequent itemsets (or) maximal and closed frequent itemsets | https://youtu.be/uox4Z9yh63o?si=lZTMRD-7k3-7nY9e
5 | Different types of candidate generation methods | https://youtu.be/B5Yszom_Tvk?si=Abq4qu7PgGy5AvTr
UNIT V
Cluster Analysis: Overview. K-means: Basic K-means Algorithm, Additional Issues, Bisecting K-means,
Strengths and Weaknesses. Agglomerative Hierarchical Clustering: Basic Agglomerative
Hierarchical Clustering Algorithm, Outliers, Strengths and Weaknesses. DBSCAN: Traditional
Density, DBSCAN Algorithm, Strengths and Weaknesses.
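To make the basic K-means algorithm listed above concrete, here is a minimal sketch for 2-D points that alternates the assignment and centroid-update steps until the centroids stop changing. The function name `kmeans`, the sample points, and the two starting centroids are assumptions invented for this example; real implementations usually choose initial centroids randomly or with a seeding heuristic.

```python
def kmeans(points, centroids, max_iter=100):
    """Basic K-means on 2-D points, starting from the given centroids."""
    centroids = [tuple(c) for c in centroids]
    clusters = []
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid
        # (squared Euclidean distance; ties go to the lower-index centroid).
        clusters = [[] for _ in centroids]
        for p in points:
            dists = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[dists.index(min(dists))].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            (sum(x for x, _ in cl) / len(cl), sum(y for _, y in cl) / len(cl)) if cl else c
            for cl, c in zip(clusters, centroids)
        ]
        if new_centroids == centroids:  # converged: assignments are stable
            break
        centroids = new_centroids
    return centroids, clusters

# Hypothetical data and starting centroids.
points = [(1, 1), (1.5, 2), (3, 4), (5, 7), (3.5, 5), (4.5, 5), (3.5, 4.5)]
final_centroids, final_clusters = kmeans(points, centroids=[(1, 1), (5, 7)])
print(final_centroids)
print(final_clusters)
```

Because the result depends on the starting centroids, K-means is typically run several times with different initializations, which relates to the "additional issues" topic in this unit.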
S.No | Topic | YouTube Link
1 | K-means clustering algorithm with example for numeric data | https://youtu.be/aR4yt5fBc_g?si=idnPwq97ZPef5yCk
2 | K-means clustering algorithm with example for point data | https://youtu.be/KzJORp8bgqs?si=Qv5lt--H-c2L3-l1
3 | Agglomerative hierarchical algorithm | https://youtu.be/YH0r47m0kFM?si=FSJ5cVY_pTrzL-2a
4 | |
5 | DBSCAN algorithm example | https://youtu.be/-p354tQsKrs?si=-2ZKY0ucsqN9oEp4
6 | DBSCAN algorithm example in Telugu | https://youtu.be/PZcssHN5PYQ?si=6BdlMkP55lwwG96p
7 | Clustering | https://youtu.be/dUm3ptTQr0Q?si=BuvANO2oGeJtNu4z
8 | All mining techniques in simple form | https://youtu.be/dUm3ptTQr0Q?si=BuvANO2oGeJtNu4z
