Understanding Data Assignment 2
A. A newspaper article
B. An image on a website
D. A video file
A. Multimedia content
A. Median
B. Mode
C. Mean
D. Standard Deviation
Answer: C. Mean
C. A row of values
D. A description of metadata
A. Mean
B. Median
C. Mode
D. Range
Answer: D. Range
A. Audio processing
B. Metadata generation
C. Statistical techniques
D. Data hiding
11. What is the mode of the dataset [34, 34, 27, 28, 27, 34, 34]?
A. 27
B. 34
C. 28
D. 33
Answer: B. 34
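The answer to question 11 can be checked with a short Python sketch. It uses `collections.Counter` from the standard library; the variable names are illustrative and not taken from the chapter.

```python
from collections import Counter

# Dataset from question 11
data = [34, 34, 27, 28, 27, 34, 34]

# Count how often each value occurs
counts = Counter(data)

# most_common(1) returns a one-element list holding the
# (value, frequency) pair with the highest frequency
mode_value, frequency = counts.most_common(1)[0]

print(mode_value, frequency)  # 34 occurs 4 times
```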
A. Pen Drive
B. CD/DVD
C. HDD
A. Audio files
A. Java
B. MySQL
C. Python
D. HTML
Answer: C. Python
A. Mode
B. Median
C. Mean
D. Standard Deviation
19. What is the formula for standard deviation (σ) as per the chapter?
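A. $\sigma = \text{Mean of values}$
B. $\sigma = (\text{Maximum} - \text{Minimum})$
C. $\sigma = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}$
D. $\sigma = \frac{1}{n}\sum_{i=1}^{n}(x_i + \bar{x})$
Answer: C. $\sigma = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}$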
1. Data which is organised and can be recorded in a well-defined format is called ________.
Answer: Structured Data
2. Data which do not follow any fixed structure or format are called ________.
Answer: Unstructured Data
3. The singular form of the word ‘data’ is ________.
Answer: Datum
4. The process of collecting, storing, and analysing data for decision making is known as
________.
Answer: Data Processing
5. ________ is a measure of central tendency that represents the average of a set of values.
Answer: Mean
6. The ________ is the middle value in a sorted list of data values.
Answer: Median
7. The value that occurs most frequently in a data set is called the ________.
Answer: Mode
8. The difference between the maximum and minimum values in a data set is called the
________.
Answer: Range
9. The standard deviation is represented by the Greek letter ________.
Answer: Sigma (𝜎)
10. The data describing other data is referred to as ________.
Answer: Metadata
11. Examples of digital storage devices include HDD, SSD, CD/DVD, Pen Drive, and ________.
Answer: Memory Card
12. Statistical techniques used to summarise data include mean, median, mode, range, and
________.
Answer: Standard Deviation
13. The process of obtaining data from reliable sources before processing is called ________.
Answer: Data Collection
14. ICT revolution has led to the generation of ________ volume of data at a very fast pace.
Answer: Large
15. The structured data is generally stored in a ________ format in computers.
Answer: Tabular
2 MARKS QUESTIONS
Answer:
Data helps identify trends, draw conclusions, and support decisions in areas such as
business, education, healthcare, and governance.
Answer:
Mean is the average of numeric values. Formula: $\text{Mean} = \frac{x_1 + x_2 + \dots + x_n}{n}$
Answer:
Median is the middle value in an ordered list. For an even number of values, it is the
average of the two middle values.
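A minimal Python sketch of how the median can be found by hand, assuming the usual convention that an even number of values is handled by averaging the two middle values. The function name `median` and the four-value list are illustrative only.

```python
def median(values):
    """Return the middle value of the sorted list.

    If the number of values is even, return the average
    of the two middle values.
    """
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single middle value exists
        return ordered[mid]
    # Even count: average the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2


odd_result = median([85, 90, 100, 110, 115])  # middle value is 100
even_result = median([85, 90, 100, 110])      # (90 + 100) / 2 = 95.0
print(odd_result, even_result)
```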
Answer:
Mode is the value that appears most frequently in a dataset. Example: In [34, 34, 28, 27, 34],
mode is 34.
Answer:
Range is the difference between the maximum and minimum values in a dataset. Formula:
Range = Maximum − Minimum
Answer:
Standard deviation measures the spread or dispersion of values around the mean. It
considers all data points in the dataset.
Answer:
14. Mention any two statistical techniques used for data summarisation.
Answer:
1. Measures of Central Tendency (Mean, Median, Mode)
Answer:-
Standard Deviation: Measures the average spread of all values from the mean.
Answer: Python provides libraries that allow efficient data processing, statistical analysis,
and visualisation of large data sets.
Answer:
1. Data Collection
2. Data Preparation
3. Data Entry
6. Generation of Reports/Results
3 MARKS QUESTIONS
1. Explain the three commonly used measures of central tendency with examples.
Answer:
- Mean is the average of all values.
Example: Mean of [90, 100, 110] = (90 + 100 + 110)/3 = 100
- Median is the middle value in a sorted list.
Example: Median of [85, 90, 100, 110, 115] = 100
- Mode is the value that occurs most frequently.
Example: Mode of [90, 110, 110, 110, 100] = 110
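The three examples above can be reproduced with Python's built-in `statistics` module. This is only one possible way to check the answers; the variable names are illustrative.

```python
import statistics

# Same datasets as in the worked examples
mean_value = statistics.mean([90, 100, 110])
median_value = statistics.median([85, 90, 100, 110, 115])
mode_value = statistics.mode([90, 110, 110, 110, 100])

print(mean_value, median_value, mode_value)  # 100 100 110
```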
2. Differentiate between structured data and unstructured data with examples.
Answer:
- Structured Data: Organised in rows and columns; easy to store and analyse. Example:
a table of student records with Roll No, Name, and Marks.
- Unstructured Data: Lacks a predefined format; difficult to analyse. Example: social
media posts with images and text.
3. Define standard deviation. Write its formula and explain its significance.
Answer:
Standard deviation (σ) measures the spread of data around the mean.
Formula: $\sigma = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}$
It gives insight into data variability. A smaller σ means values are
closer to the mean; a larger σ indicates more spread.
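The formula can be translated into Python step by step. The sketch below divides by n (the population form used in this answer); the helper name `population_std_dev` and the dataset [99, 100, 101] are illustrative assumptions. Python's `statistics.pstdev` uses the same definition, whereas `statistics.stdev` divides by n − 1.

```python
import math
import statistics


def population_std_dev(values):
    """Standard deviation using the divide-by-n formula."""
    n = len(values)
    mean = sum(values) / n
    # Squared distance of every value from the mean
    squared_deviations = [(x - mean) ** 2 for x in values]
    return math.sqrt(sum(squared_deviations) / n)


tight = population_std_dev([99, 100, 101])   # values close to the mean
spread = population_std_dev([90, 100, 110])  # values further from the mean

# The more spread-out dataset has the larger standard deviation
print(tight < spread)  # True

# Cross-check against the standard library
print(math.isclose(spread, statistics.pstdev([90, 100, 110])))  # True
```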
4. What are metadata? Give three examples from different digital files.
Answer:
Metadata are data about data.
They describe content and structure.
Examples:
- In an image file: resolution, format (JPEG/PNG)
- In an email: subject, recipient, date sent
- In a document: author name, word count, creation date
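As an illustration of metadata in practice, the following Python sketch reads a few file-system properties (name, size, last-modified time) of a small text file. It creates a temporary file so that it is self-contained; the file contents and dictionary keys are made up for the example.

```python
import os
import tempfile
from datetime import datetime

# Create a small temporary text file so the example is self-contained
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as handle:
    handle.write("Hello, data!")
    path = handle.name

# os.stat returns data *about* the file, not the file's contents
info = os.stat(path)
metadata = {
    "file_name": os.path.basename(path),
    "size_in_bytes": info.st_size,
    "last_modified": datetime.fromtimestamp(info.st_mtime).isoformat(),
}
print(metadata)

os.remove(path)  # clean up the temporary file
```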
5. Describe the role of data in business decision-making with any two examples.
Answer:
Businesses use data to understand market trends and improve performance.
Examples: 1. Analysing customer feedback to improve products.
6. Explain the data processing cycle with the help of a diagram or steps.
Answer:
4. Storage and Retrieval – Saving for future use.
These steps convert raw data into useful information.
7. Distinguish between range and standard deviation with formula and example.
Answer:
Range = 115 − 85 = 30
Formula: $\sigma = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}$
Example: For [90, 100, 110], σ is calculated using all data points.
8. Give three different scenarios of data collection and describe the method to convert them
into digital format.
Answer:
1. Manual Record (e.g., shopkeeper’s diary): Enter data into spreadsheet manually.
2. Digital File (e.g., CSV): Directly use data for analysis using software tools.
3. No prior data: Develop software (e.g., in Python or MySQL) to store and manage
sales digitally.
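For the second scenario, a CSV file can be read with Python's standard `csv` module. The sketch below is a rough illustration only: the column names and sales figures are hypothetical, and the file is simulated with an in-memory string so the example runs on its own.

```python
import csv
import io

# Hypothetical contents of a shopkeeper's sales file
csv_text = """item,quantity,price
Pen,10,5
Notebook,4,40
Eraser,12,3
"""

# DictReader turns each row into a dictionary keyed by the header row
reader = csv.DictReader(io.StringIO(csv_text))
rows = list(reader)

# Values are read as text, so convert them before doing arithmetic
total_sales = sum(int(row["quantity"]) * int(row["price"]) for row in rows)

print(len(rows), total_sales)  # 3 rows, total sales of 246
```

With a real file, `io.StringIO(csv_text)` would be replaced by a file opened with `open(...)`.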
9. What are the limitations of file processing and how does DBMS help overcome them?
Answer:
DBMS Benefits:
- Centralised management
10. A teacher wants to compare students’ test results from five months. Which statistical
technique is suitable and why?
Answer:
Mean is the suitable technique to compare average performance over five months. It
provides a quick understanding of how the class performed each month and highlights
trends in overall class performance.
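A short Python sketch of this comparison, using made-up marks for five students in each of five monthly tests. The data and names are purely hypothetical and serve only to show how monthly means can be compared.

```python
import statistics

# Hypothetical marks of five students in each monthly test
monthly_marks = {
    "Month 1": [62, 70, 58, 81, 74],
    "Month 2": [65, 72, 60, 83, 75],
    "Month 3": [64, 75, 63, 85, 78],
    "Month 4": [70, 74, 66, 88, 77],
    "Month 5": [72, 78, 69, 90, 81],
}

# Mean mark of the class for every month
monthly_means = {
    month: statistics.mean(marks) for month, marks in monthly_marks.items()
}

# Month with the highest average performance
best_month = max(monthly_means, key=monthly_means.get)

for month, average in monthly_means.items():
    print(month, average)
print("Best month:", best_month)
```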
5 MARKS QUESTIONS
1. What are the different types of data? Explain structured and unstructured data with
examples.
Answer:
Data can be broadly categorised into:
1. Structured Data – Organised in a defined format like tables (rows and columns). Each
column represents an attribute and each row represents an observation.
Examples:
- School records (RollNo, Name, Marks)
- ATM withdrawal data (AccountNo, Date, Amount)
2. Unstructured Data – Data not arranged in a predefined format; it lacks structure.
Examples:
- Social media posts
- Email content
- News articles with images, videos, and text
2. Explain the role of data in various real-life sectors. Give at least five examples.
Answer: Data plays a crucial role in decision-making across various domains.
Examples:
5. Business: Sales data is analysed for discounts, inventory planning, and marketing
decisions.
Answer:
Difference:
Answer:
Standard deviation measures the dispersion or spread of data around the mean.
Formula:
Steps:
- Mean ≈ 101.33
- σ ≈ √(938/9) ≈ 10.2
5. Differentiate between Range and Standard Deviation. Explain with examples and
formula.
Answer:
Example:
Data: [85, 90, 90, 100, 102, 110, 110, 110, 115]
- Range = 115 − 85 = 30
- σ ≈ 10.2 (calculated using the mean and all values)
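Both values in this example can be verified in Python. The sketch uses `statistics.pstdev`, which divides by n; the variable names are illustrative.

```python
import statistics

data = [85, 90, 90, 100, 102, 110, 110, 110, 115]

# Range uses only the two extreme values
data_range = max(data) - min(data)

# Standard deviation uses the mean and every value in the dataset
mean = statistics.mean(data)
sigma = statistics.pstdev(data)

print(data_range)       # 30
print(round(mean, 2))   # 101.33
print(round(sigma, 1))  # 10.2
```

The range depends only on 85 and 115, while σ would change if any single value in the list changed.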
6. Explain the data processing cycle with a real-life example.
Answer:
The data processing cycle includes:
1. Data Collection – Gather raw data.
2. Data Preparation – Organise, clean, and validate data.
3. Data Entry – Input data into the system.
- Check eligibility
Answer:
1. Manual to Digital:
- A new system is developed (e.g., using Python/MySQL) to record and store sales or
transactions digitally.
8. Explain metadata with three examples. How is it useful in processing unstructured data?
Answer:
Examples:
Usefulness:
Metadata helps organise, search, and process unstructured content like emails, images, and
documents.
9. List and explain any five real-life applications of statistical techniques in data processing.
Answer:
10. Compare the use of Mean, Median, and Mode with suitable scenarios.
Answer: