Data Analysis With Pandas - Aggregates in Pandas Cheatsheet - Codecademy

Pandas can calculate aggregate statistics on DataFrame columns using methods such as mean(), std(), median(), max(), min(), count(), and nunique(). These return summary information about a column: its average, standard deviation, median, maximum, minimum, number of values, and number of unique values. Aggregate functions can also be applied per group with groupby(), which groups the rows that share a value in one column and computes a statistic such as mean() over another column for each group.

Uploaded by

Utsav Soi


5/6/2020 Data Analysis with Pandas: Aggregates in Pandas Cheatsheet | Codecademy

Aggregates in Pandas
Pandas DataFrame Aggregate Function
Pandas’ aggregate statistics functions calculate summary statistics on a column of
a DataFrame. For example, df.columnName.mean() computes the mean of the column
columnName of the DataFrame df. The code block shows how to calculate statistics on the
column columnName of df using these functions.

df.columnName.mean()    # Average of all values in column
df.columnName.std()     # Standard deviation of column
df.columnName.median()  # Median value of column
df.columnName.max()     # Maximum value in column
df.columnName.min()     # Minimum value in column
df.columnName.count()   # Number of non-missing values in column
df.columnName.nunique() # Number of unique values in column
df.columnName.unique()  # Array of the unique values in column (not a single statistic)
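
As a minimal sketch of these calls on concrete data, the example below uses a small made-up DataFrame. The column name score and its values are assumptions chosen for illustration, not part of the cheatsheet. One value is deliberately missing to show that the statistics skip missing values by default.

```python
import pandas as pd

# Hypothetical data for illustration; the last value is missing (NaN).
df = pd.DataFrame({"score": [80, 90, 90, 100, None]})

mean_score = df.score.mean()        # 90.0, the missing value is skipped
median_score = df.score.median()    # 90.0
std_score = df.score.std()          # sample standard deviation (ddof=1)
max_score = df.score.max()          # 100.0
min_score = df.score.min()          # 80.0
n_values = df.score.count()         # 4, counts only non-missing values
n_unique = df.score.nunique()       # 3, missing values are not counted
unique_values = df.score.unique()   # NumPy array that also includes NaN

print(mean_score, median_score, n_values, n_unique)
```

Note that count() can be smaller than len(df) when a column contains missing values, as it is here (4 versus 5).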

Pandas’ Groupby
In a pandas DataFrame, aggregate statistic functions can be applied to groups of rows
by using the groupby() function. In the example, the code groups all rows that have the
same value in Name and computes the mean of Grade within each group.
Instead of mean(), any aggregate statistics function, like median() or max(), can be used.
Note that this pattern involves two columns: one to group by (here Name) and one to
aggregate (here Grade).

import pandas as pd

df = pd.DataFrame([
    ["Amy", "Assignment 1", 75],
    ["Amy", "Assignment 2", 35],
    ["Bob", "Assignment 1", 99],
    ["Bob", "Assignment 2", 35]
], columns=["Name", "Assignment", "Grade"])

# Mean Grade for each distinct Name
df.groupby('Name').Grade.mean()

# output of the groupby command

|Name | Grade|
| - | - |
|Amy | 55|
|Bob | 67|
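
The grouped result above is a Series indexed by Name. As a hedged sketch, the example below reuses the cheatsheet's data to show two common follow-ups: calling reset_index() to get an ordinary DataFrame with Name and Grade columns, and swapping mean() for another aggregate such as max(). The variable names are illustrative assumptions.

```python
import pandas as pd

df = pd.DataFrame([
    ["Amy", "Assignment 1", 75],
    ["Amy", "Assignment 2", 35],
    ["Bob", "Assignment 1", 99],
    ["Bob", "Assignment 2", 35],
], columns=["Name", "Assignment", "Grade"])

# Mean Grade per Name: a Series whose index holds the group keys.
mean_by_name = df.groupby("Name").Grade.mean()

# reset_index() turns the group keys back into a regular column.
mean_table = mean_by_name.reset_index()

# Any other aggregate can replace mean(); here, the highest Grade per Name.
max_by_name = df.groupby("Name").Grade.max()

print(mean_table)
print(max_by_name)
```

Because mean() returns floating-point values, pandas displays the group means as 55.0 and 67.0.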

https://www.codecademy.com/learn/paths/data-science/tracks/data-processing-pandas/modules/dspath-agg-pandas/cheatsheet 2/2
