DSML Practical

This document contains information about an assignment to perform data science operations like summary statistics, data visualization, cleaning, transformation, and modeling on a dataset. It describes the data science process and applying statistical methods to pre-process and extract meaning from data. It also outlines different types of algorithms commonly used in data analysis and machine learning like descriptive, data mining, machine learning, natural language processing, reinforcement learning, and genetic algorithms.

Uploaded by

focusedbanda117

Name: Shreyas Satish Jagadale

Roll No. 322030


Batch B2
PRN: 22110649

Assignment 2
AIM: Perform the following operations using R/Python on the data sets:
· Compute and display summary statistics for each feature available in the dataset (e.g.
minimum value, maximum value, mean, range, standard deviation, variance and percentiles).
· Data visualization: create a histogram for each feature in the dataset to illustrate the
feature distributions.
· Data cleaning
· Data integration
· Data transformation
· Data model building (e.g. classification)
OBJECTIVE: Describe the data science process and explore how its components interact.
Apply statistical methods to the application dataset for pre-processing and for extracting
meaning from the data.
ALGORITHM:
An algorithm is a step-by-step procedure or formula for solving a problem or completing a
task. In the context of data analysis and machine learning, algorithms are used to process
and analyze data, make predictions, or learn patterns from data. There are various types of
algorithms, each designed for specific tasks. Here are some common types of algorithms used
in data analysis and machine learning:
1. Descriptive Algorithms:
- Summary Statistics: Algorithms for calculating basic statistics like mean, median, standard
deviation, etc., to summarize data.
- Frequency Counting: Algorithms to count the occurrences of items in a dataset.
- Sorting Algorithms: Algorithms for arranging items in a specific order.
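
The three descriptive operations above can be sketched with Python's standard library. This is only an illustration: the sample values and labels are made up and are not taken from the assignment's dataset.

```python
import statistics
from collections import Counter

# Hypothetical values for one numeric feature (not from the assignment's dataset).
values = [4, 8, 15, 16, 23, 42]

# Summary statistics for the feature.
summary = {
    "min": min(values),
    "max": max(values),
    "range": max(values) - min(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "variance": statistics.variance(values),         # sample variance (n - 1 denominator)
    "std_dev": statistics.stdev(values),             # sample standard deviation
    "quartiles": statistics.quantiles(values, n=4),  # three cut points: Q1, Q2, Q3
}

# Frequency counting for a hypothetical categorical feature.
labels = ["a", "b", "a", "c", "a", "b"]
counts = Counter(labels)

# Sorting the numeric feature in descending order.
ordered = sorted(values, reverse=True)

for name, value in summary.items():
    print(name, value)
print(counts.most_common())
print(ordered)
```

For a full dataset, the same calculations would be repeated for each feature (column).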
2. Data Mining Algorithms:
- Apriori Algorithm: Used for association rule mining in large datasets.
- K-Means Clustering: Unsupervised learning algorithm for clustering similar data points
together.
- Decision Trees: Supervised learning algorithm for classification and regression tasks.
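
As an illustration of the clustering idea, here is a minimal sketch of K-Means on one-dimensional data. The function name `k_means_1d`, the sample points, and the starting centroids are assumptions chosen for the example; practical work would normally use a library implementation on multi-dimensional features.

```python
import statistics

def k_means_1d(points, centroids, iterations=10):
    """K-Means on 1-D data: assign each point to its nearest centroid,
    then move each centroid to the mean of its assigned points."""
    centroids = list(centroids)
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Keep the previous centroid if a cluster ends up empty.
        centroids = [
            statistics.mean(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical data with two visible groups.
points = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
centroids, clusters = k_means_1d(points, centroids=[0.0, 5.0])
print(centroids)
print(clusters)
```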
3. Machine Learning Algorithms:
- Linear Regression: Supervised learning algorithm for predicting a continuous
outcome variable based on one or more input features.
- Logistic Regression: Supervised learning algorithm for binary classification problems.
- Random Forest: Ensemble learning method that combines multiple decision trees for more
accurate predictions.
- Support Vector Machines (SVM): Supervised learning algorithm for classification or
regression tasks, effective in high-dimensional spaces.
- Neural Networks: Models inspired by the structure of the human brain; deep (multi-layer)
networks are used for complex tasks like image recognition and natural language processing.
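
To make the first item concrete, the following is a small sketch of simple linear regression (one input feature) fitted by ordinary least squares. The helper `fit_line` and the sample data are hypothetical; the data were chosen to lie exactly on the line y = 2x + 1.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the common 1/n factor cancels).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training data lying on y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)
prediction = slope * 6 + intercept  # predict the outcome for a new input x = 6
print(slope, intercept, prediction)
```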
4. Natural Language Processing (NLP) Algorithms:
- Tokenization: Breaking down text into words or phrases (tokens) for analysis.
- Sentiment Analysis: Determining the sentiment (positive, negative, neutral) of a piece of
text.
- Named Entity Recognition (NER): Identifying named entities (such as names of people,
organizations, locations) in text data.
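
Tokenization and a very rough form of sentiment analysis can be sketched as follows. The word lists `POSITIVE` and `NEGATIVE` are a toy lexicon invented for this example; real sentiment analysis relies on much larger lexicons or trained models.

```python
import re

# Hypothetical toy lexicon, for illustration only.
POSITIVE = {"good", "great", "excellent"}
NEGATIVE = {"bad", "poor", "terrible"}

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())

def sentiment(text):
    """Label text by counting positive and negative lexicon words."""
    tokens = tokenize(text)
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(tokenize("Data science is fun!"))
print(sentiment("A great and excellent result"))
```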
5. Reinforcement Learning Algorithms:
- Q-Learning: Used in reinforcement learning problems where an agent learns to make
decisions by maximizing expected rewards.
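
A minimal sketch of tabular Q-Learning is shown below. The environment is an assumption made for the example: a chain of four states where the agent moves left or right and receives a reward of 1 on reaching the last state. The learning rate, discount factor, and episode count are likewise illustrative choices.

```python
import random

N_STATES = 4           # states 0..3; state 3 is terminal (hypothetical toy environment)
ACTIONS = (0, 1)       # 0 = move left, 1 = move right
ALPHA, GAMMA = 0.5, 0.9  # learning rate and discount factor (illustrative values)

def step(state, action):
    """Return (next_state, reward) for the chain environment."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward

def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        while state != N_STATES - 1:
            action = rng.choice(ACTIONS)  # purely exploratory behaviour policy
            next_state, reward = step(state, action)
            # Q-Learning update: move Q(s, a) toward reward + discounted best next value.
            target = reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q

q_table = train()
# Greedy policy: pick the action with the highest learned value in each non-terminal state.
policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(policy)
```

Because Q-Learning is off-policy, the agent can explore at random during training and still learn values from which a greedy policy is read off afterwards.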
6. Genetic Algorithms:
- Genetic Algorithms: Optimization algorithms inspired by the process of natural selection.
They are used to find approximate solutions to complex optimization and search problems.
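
The selection, crossover, and mutation cycle can be sketched on a toy problem: maximizing the number of 1s in a bit string. The fitness function, population size, mutation rate, and the use of tournament selection with elitism are all assumptions chosen for this illustration.

```python
import random

def fitness(genome):
    """Toy objective: the number of 1 bits in the genome."""
    return sum(genome)

def evolve(genome_len=20, pop_size=30, generations=40, mutation_rate=0.02, seed=0):
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(genome_len)] for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        next_gen = [population[0][:]]  # elitism: keep the best individual unchanged
        while len(next_gen) < pop_size:
            # Tournament selection: the fittest of three random individuals becomes a parent.
            p1 = max(rng.sample(population, 3), key=fitness)
            p2 = max(rng.sample(population, 3), key=fitness)
            # Single-point crossover.
            cut = rng.randint(1, genome_len - 1)
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            next_gen.append(child)
        population = next_gen
    best = max(population, key=fitness)
    history.append(fitness(best))
    return best, history

best, history = evolve()
print(fitness(best), history)
```

With elitism, the best fitness never decreases from one generation to the next, which is why `history` is non-decreasing.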

Code:
