100% found this document useful (2 votes)
526 views3 pages

Data Sciences Class 10 Notes

Uploaded by

aadyaag2009
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
100% found this document useful (2 votes)
526 views3 pages

Data Sciences Class 10 Notes

Uploaded by

aadyaag2009
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Data Sciences: Study Notes for CBSE Class 10

What is Data Science?

 Definition: Data Science is an interdisciplinary field that combines statistics, data


analysis, machine learning, and related techniques to extract insights from data.
 It uses methods from Mathematics, Statistics, Computer Science, and Information
Science to analyze phenomena.

Applications of Data Science

1. Fraud and Risk Detection:


o Identifies unusual patterns to prevent fraud (e.g., in banking or insurance).
2. Internet Search:
o Processes petabytes of data for search results.
3. Targeted Advertising:
o Analyzes user behavior to display relevant ads.
4. Website Recommendations:
o Suggests content like movies, music, or products based on preferences (e.g.,
Netflix, Amazon).
5. Airline Route Planning:
o Predicts delays, optimizes routes, and plans efficient fleet management.

Data Science in AI Projects

1. Scenario Example:
o Restaurants preparing food in bulk face wastage due to incorrect demand
predictions.
o A regression model can predict food preparation quantities based on past data.
2. Steps in the AI Project Cycle:
o Problem Scoping: Define the problem using the 4Ws canvas.
o Data Collection: Collect data (e.g., dish name, quantity prepared,
unconsumed amount).
o Modeling: Train and test a regression model using collected data.
o Evaluation: Check model predictions for accuracy and refine it as needed.

Types of Data and Formats

1. Types of Data:
o Numerical Data: Quantitative (e.g., age, marks).
o Categorical Data: Labels or categories (e.g., gender, country).
2. Formats:
o CSV: Comma-separated values for tabular data.
o Spreadsheet: Data organized in rows and columns (e.g., Excel).
o SQL: Structured Query Language for database management.

Python Packages for Data Access

1. NumPy:
o Fundamental for numerical operations.
o Works with arrays (homogeneous data collections) and performs efficient
mathematical computations.
2. Advantages of Python Packages:
o Pre-defined functions reduce coding effort and errors.
o Examples: NumPy simplifies array manipulations.

Data Issues and Cleaning

1. Erroneous Data:
o Incorrect Values: Mismatched data types (e.g., a decimal in a phone number
column).
o Null Values: Missing values (e.g., NaN in datasets).
2. Missing Data:
o Empty cells in datasets, not always due to errors.
3. Outliers:
o Values that deviate significantly from the rest (e.g., 0 marks for an absent
student affecting averages).

Important Exam Questions and Answers

1. Define Data Science.


o Answer: Data Science is the study of extracting insights and patterns from
data using methods from Mathematics, Statistics, and Computer Science.
2. What is the role of regression in AI projects?
o Answer: Regression is a supervised learning model that predicts continuous
data values based on historical data.
3. List three applications of Data Science.
o Answer: Fraud detection, targeted advertising, and website recommendations.
4. Differentiate between CSV and SQL.
o Answer:
 CSV: Stores tabular data as plain text with values separated by
commas.
 SQL: A programming language for managing structured data in
databases.
5. What are outliers in data? How are they handled?
o Answer: Outliers are extreme values that differ significantly from others.
They are excluded during analysis to prevent skewed results.
6. What are the common formats used for storing tabular data?
o Answer: CSV files, spreadsheets (Excel), and SQL databases.
7. Explain the types of errors found in datasets.
o Answer:
 Incorrect Values: Data in the wrong format or type.
 Null Values: Missing or empty data cells.
8. Describe the NumPy package.
o Answer: NumPy is a Python library used for numerical computations,
supporting arrays and various mathematical operations.
9. What is the importance of cleaning data?
o Answer: Cleaning ensures data accuracy by removing errors, filling missing
values, and eliminating outliers, leading to reliable analysis.
10. MCQ Example: Which of the following is a file format used in Data Science?
o Answer: CSV.
11. Describe one real-life application of Data Science in airlines.
o Answer: Predicting flight delays and optimizing routes for efficiency.

You might also like