0% found this document useful (0 votes)

55 views1 page

Pandaspythonfordatascience

The document provides an overview of pandas, a Python library used for data analysis. It summarizes that pandas provides easy-to-use data structures like Series and DataFrames. It then demonstrates how to create and manipulate pandas Series and DataFrames, including selecting data, boolean indexing, sorting, handling missing data, reading/writing files, and applying functions. Basic operations like summing, sorting, and joining data are also covered.

Uploaded by

api-248437787

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views1 page

Pandaspythonfordatascience

Uploaded by

api-248437787

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Python For Data Science Cheat Sheet

Pandas Basics

Learn Python for Data Science Interactively at www.DataCamp.com

Asking For Help

Selection

Also see NumPy Arrays

Getting
>>> s['b']

Get one element

>>> df[1:]

Get subset of a DataFrame

-5

Pandas
The Pandas library is built on NumPy and provides easy-to-use
data structures and data analysis tools for the Python
programming language.

Dropping

>>> help(pd.Series.loc)

1
2

Country
India
Brazil

Capital
New Delhi
Braslia

Population
1303171035
207847528

By Position

>>> import pandas as pd

>>> df.iloc([0],[0])
'Belgium'

Pandas Data Structures

B -5

Index

>>> s = pd.Series([3, -5, 7, 4], index=['a', 'b', 'c', 'd'])

DataFrame
Columns

Index

Select single value by row &

column

'Belgium'

A one-dimensional labeled array

capable of holding any data type

Country
1

Belgium

India

Brazil

Capital
Brussels

Population
11190846

New Delhi 1303171035

Braslia

A two-dimensional labeled
data structure with columns
of potentially different types

207847528

>>> data = {'Country': ['Belgium', 'India', 'Brazil'],

'Capital': ['Brussels', 'New Delhi', 'Braslia'],

'Population': [11190846, 1303171035, 207847528]}

>>> df = pd.DataFrame(data,

columns=['Country', 'Capital', 'Population'])

'Belgium'

Select single value by row &

column labels

>>> df.at([0], ['Country'])

'Belgium'

By Label/Position
>>> df.ix[2]

Select single row of

subset of rows

>>> df.ix[:,'Capital']

Select a single column of

subset of columns

>>> df.ix[1,'Capital']

Select rows and columns

Country
Brazil
Capital
Braslia
Population 207847528

0
1
2

Brussels
New Delhi
Braslia

Boolean Indexing

Setting

Set index a of Series s to 6

Read and Write to Excel

>>> pd.read_excel('file.xlsx')
>>> pd.to_excel('dir/myDataFrame.xlsx', sheet_name='Sheet1')

Read multiple sheets from the same file

>>> xlsx = pd.ExcelFile('file.xls')

>>> df = pd.read_excel(xlsx, 'Sheet1')

df.shape
df.index
df.columns
df.info()
df.count()

(rows,columns)
Describe index
Describe DataFrame columns
Info on DataFrame
Number of non-NA values

>>>
>>>
>>>
>>>
>>>
>>>
>>>

df.sum()
df.cumsum()
df.min()/df.max()
df.idmin()/df.idmax()
df.describe()
df.mean()
df.median()

Sum of values
Cummulative sum of values
Minimum/maximum values
Minimum/Maximum index value
Summary statistics
Mean of values
Median of values

Applying Functions
>>> f = lambda x: x*2
>>> df.apply(f)
>>> df.applymap(f)

Apply function
Apply function element-wise

Internal Data Alignment

>>> s3 = pd.Series([7, -2, 3], index=['a', 'c', 'd'])
>>> s + s3
a

10.0

5.0

b
d

NaN

7.0

Arithmetic Operations with Fill Methods

I/O
>>> pd.read_csv('file.csv', header=None, nrows=5)
>>> pd.to_csv('myDataFrame.csv')

>>>
>>>
>>>
>>>
>>>

NA values are introduced in the indices that dont overlap:

>>> s[~(s > 1)]

Series s where value is not >1
>>> s[(s < -1) | (s > 2)]
s where value is <-1 or >2
>>> df[df['Population']>1200000000] Use filter to adjust DataFrame

Read and Write to CSV

Sort by row or column index

Sort a series by its values
Assign ranks to entries

Data Alignment

'New Delhi'

>>> s['a'] = 6

>>> df.sort_index(by='Country')
>>> s.order()
>>> df.rank()

Summary

By Label
>>> df.loc([0], ['Country'])

Sort & Rank

Basic Information

>>> df.iat([0],[0])

Series

Drop values from rows (axis=0)

>>> df.drop('Country', axis=1) Drop values from columns(axis=1)

Retrieving Series/DataFrame Information

Selecting, Boolean Indexing & Setting

Use the following import convention:

>>> s.drop(['a', 'c'])

Read and Write to SQL Query or Database Table

>>>
>>>
>>>
>>>
>>>

from sqlalchemy import create_engine

engine = create_engine('sqlite:///:memory:')
pd.read_sql("SELECT * FROM my_table;", engine)
pd.read_sql_table('my_table', engine)
pd.read_sql_query("SELECT * FROM my_table;", engine)

read_sql()is a convenience wrapper around read_sql_table() and

read_sql_query()
>>> pd.to_sql('myDf', engine)

You can also do the internal data alignment yourself with

the help of the fill methods:
>>> s.add(s3, fill_value=0)
a
b
c
d

10.0
-5.0
5.0
7.0

>>> s.sub(s3, fill_value=2)

>>> s.div(s3, fill_value=4)
>>> s.mul(s3, fill_value=3)

DataCamp

Learn Python for Data Science Interactively

Isl 25129 RGB Sensor Tutorial
No ratings yet
Isl 25129 RGB Sensor Tutorial
8 pages
Geiger Counter Tutotial
No ratings yet
Geiger Counter Tutotial
3 pages
Pandas Python For Data Science
No ratings yet
Pandas Python For Data Science
1 page
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Python Cheatsy
No ratings yet
Python Cheatsy
1 page
Cheat Python
No ratings yet
Cheat Python
8 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information
No ratings yet
Pandas Basics Cheat Sheet Python For Data Science: Retrieving Series/Dataframe Information
1 page
Pandas Data Structures: Sections
No ratings yet
Pandas Data Structures: Sections
13 pages
PandasGUIA PYTHON-04
No ratings yet
PandasGUIA PYTHON-04
1 page
Pandas_Cheat_Sheet (1)_240511_113437
No ratings yet
Pandas_Cheat_Sheet (1)_240511_113437
1 page
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
1 page
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Pandas - Cheat - Sheet
No ratings yet
Pandas - Cheat - Sheet
6 pages
pandas-cheet-sheet
No ratings yet
pandas-cheet-sheet
1 page
Pandas
No ratings yet
Pandas
26 pages
Pandas
No ratings yet
Pandas
5 pages
Pandas
No ratings yet
Pandas
13 pages
Python Lab
No ratings yet
Python Lab
8 pages
PANDAS Python
No ratings yet
PANDAS Python
2 pages
05Getting Started With Pandas
No ratings yet
05Getting Started With Pandas
44 pages
Lecture 2 - data wrangling_update (2)
No ratings yet
Lecture 2 - data wrangling_update (2)
114 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Pandas
No ratings yet
Pandas
9 pages
Pandas Functions (1)
No ratings yet
Pandas Functions (1)
3 pages
IP-LAB-FILE-PYTHON
No ratings yet
IP-LAB-FILE-PYTHON
9 pages
python unit 3 4
No ratings yet
python unit 3 4
92 pages
Class 12 Practical File
No ratings yet
Class 12 Practical File
29 pages
LIst of practicals 2024 - 25 class xii
No ratings yet
LIst of practicals 2024 - 25 class xii
10 pages
Line By Line 12 IP
No ratings yet
Line By Line 12 IP
21 pages
Data Science Notes Unit-1 Part -2
No ratings yet
Data Science Notes Unit-1 Part -2
22 pages
Pandas
No ratings yet
Pandas
44 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Unit 2
No ratings yet
Unit 2
81 pages
Lec 02 - DS100 Fa23 - Pandas 1
No ratings yet
Lec 02 - DS100 Fa23 - Pandas 1
61 pages
Pandas_Tutorial
No ratings yet
Pandas_Tutorial
7 pages
ip study
No ratings yet
ip study
18 pages
Pandas
No ratings yet
Pandas
36 pages
Data Handling Using Pandas-I-ORG
No ratings yet
Data Handling Using Pandas-I-ORG
44 pages
Data Frame
No ratings yet
Data Frame
17 pages
Pandas: Import
100% (1)
Pandas: Import
13 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Lab-3 Pandas Library
No ratings yet
Lab-3 Pandas Library
14 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
Lecture 3 - Pandas
No ratings yet
Lecture 3 - Pandas
37 pages
2.2 Data Indexing and Selection
No ratings yet
2.2 Data Indexing and Selection
8 pages
Python Cheat Sheets
97% (33)
Python Cheat Sheets
11 pages
Python Cheat Sheet For Excel Users
100% (2)
Python Cheat Sheet For Excel Users
5 pages
03 DataFrames
No ratings yet
03 DataFrames
9 pages
1501992967_1496666168_Pandas
No ratings yet
1501992967_1496666168_Pandas
63 pages
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
Pandas Data Wrangling Cheatsheet Datacamp PDF
No ratings yet
Pandas Data Wrangling Cheatsheet Datacamp PDF
1 page
Pandas Cheat Sheet
100% (2)
Pandas Cheat Sheet
6 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
R Fast Track Guide - 86 Key Points Every Programmer from Other Languages Should Master
From Everand
R Fast Track Guide - 86 Key Points Every Programmer from Other Languages Should Master
Ginno
No ratings yet
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
Data Science with R: Beginner to Expert
From Everand
Data Science with R: Beginner to Expert
Narayana Nemani
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
IBM Cognos 8 Planning
From Everand
IBM Cognos 8 Planning
Jason Edwards
No ratings yet
bmp180 Pressure Sensor Tutorial
100% (1)
bmp180 Pressure Sensor Tutorial
14 pages
Arduinoprojectdraft
No ratings yet
Arduinoprojectdraft
6 pages
bmp180 Pressure Sensor Tutorial and Project
100% (1)
bmp180 Pressure Sensor Tutorial and Project
29 pages
Sharp Final
No ratings yet
Sharp Final
9 pages
Arduino-Adafruit Gps
No ratings yet
Arduino-Adafruit Gps
5 pages
Uv Final 2
No ratings yet
Uv Final 2
10 pages
Sensor Test
No ratings yet
Sensor Test
3 pages
Uit Ultimate Gps Breakout v3 Tutorial For The Raspberry Pi 3
No ratings yet
Uit Ultimate Gps Breakout v3 Tutorial For The Raspberry Pi 3
12 pages
9 Dof
No ratings yet
9 Dof
8 pages
Chad
No ratings yet
Chad
10 pages
Presentation Dec4
No ratings yet
Presentation Dec4
9 pages
hmc5883l Compass Tutorial
No ratings yet
hmc5883l Compass Tutorial
7 pages
Hab 12-4 Pres - Mae V 1
No ratings yet
Hab 12-4 Pres - Mae V 1
11 pages

Pandaspythonfordatascience

Uploaded by

Pandaspythonfordatascience

Uploaded by

Python For Data Science Cheat Sheet

Learn Python for Data Science Interactively at www.DataCamp.com

Asking For Help

Also see NumPy Arrays

Get one element

Get subset of a DataFrame

>>> import pandas as pd

Pandas Data Structures

>>> s = pd.Series([3, -5, 7, 4], index=['a', 'b', 'c', 'd'])

Select single value by row &

A one-dimensional labeled array

New Delhi 1303171035

>>> data = {'Country': ['Belgium', 'India', 'Brazil'],

'Capital': ['Brussels', 'New Delhi', 'Braslia'],

'Population': [11190846, 1303171035, 207847528]}

columns=['Country', 'Capital', 'Population'])

Select single value by row &

>>> df.at([0], ['Country'])

Select single row of

Select a single column of

Select rows and columns

Set index a of Series s to 6

Read and Write to Excel

Read multiple sheets from the same file

>>> xlsx = pd.ExcelFile('file.xls')

Internal Data Alignment

Arithmetic Operations with Fill Methods

NA values are introduced in the indices that dont overlap:

>>> s[~(s > 1)]

Read and Write to CSV

Sort by row or column index

Sort & Rank

Drop values from rows (axis=0)

>>> df.drop('Country', axis=1) Drop values from columns(axis=1)

Retrieving Series/DataFrame Information

Selecting, Boolean Indexing & Setting

>>> s.drop(['a', 'c'])

Read and Write to SQL Query or Database Table

from sqlalchemy import create_engine

read_sql()is a convenience wrapper around read_sql_table() and

You can also do the internal data alignment yourself with

>>> s.sub(s3, fill_value=2)

Learn Python for Data Science Interactively

You might also like