Data Sciences Class 10 Notes
Data Sciences Class 10 Notes
1. Scenario Example:
o Restaurants preparing food in bulk face wastage due to incorrect demand
predictions.
o A regression model can predict food preparation quantities based on past data.
2. Steps in the AI Project Cycle:
o Problem Scoping: Define the problem using the 4Ws canvas.
o Data Collection: Collect data (e.g., dish name, quantity prepared,
unconsumed amount).
o Modeling: Train and test a regression model using collected data.
o Evaluation: Check model predictions for accuracy and refine it as needed.
1. Types of Data:
o Numerical Data: Quantitative (e.g., age, marks).
o Categorical Data: Labels or categories (e.g., gender, country).
2. Formats:
o CSV: Comma-separated values for tabular data.
o Spreadsheet: Data organized in rows and columns (e.g., Excel).
o SQL: Structured Query Language for database management.
1. NumPy:
o Fundamental for numerical operations.
o Works with arrays (homogeneous data collections) and performs efficient
mathematical computations.
2. Advantages of Python Packages:
o Pre-defined functions reduce coding effort and errors.
o Examples: NumPy simplifies array manipulations.
1. Erroneous Data:
o Incorrect Values: Mismatched data types (e.g., a decimal in a phone number
column).
o Null Values: Missing values (e.g., NaN in datasets).
2. Missing Data:
o Empty cells in datasets, not always due to errors.
3. Outliers:
o Values that deviate significantly from the rest (e.g., 0 marks for an absent
student affecting averages).