GPU_LAB.ipynb - Colaboratory

This notebook compares the performance of matrix operations using NumPy on the CPU against CuPy on the GPU. It generates random matrices A, B, and C of varying sizes and times the expression np.dot(A, A+B) + C, a matrix product plus elementwise additions. For small 50x50 matrices the GPU implementation is about 1.9x faster. For 500x500 matrices the GPU is roughly 350x faster. Finally, for large 2000x2000 matrices the GPU version finishes in 14.7 milliseconds versus 12.9 seconds on the CPU, a speedup of around 880x. This shows that GPUs provide significant performance benefits over CPUs for linear algebra and matrix operations, especially at large problem sizes.


import cupy as cp
import numpy as np
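Before benchmarking, it is worth confirming that the Colab runtime actually has a GPU attached (Runtime > Change runtime type), since the CuPy cells below fail without one. A minimal sanity check, not part of the original notebook:

# Sanity check (not in the original notebook): count the CUDA devices
# CuPy can see. This raises a CUDA runtime error if no GPU is attached.
print("CUDA devices visible:", cp.cuda.runtime.getDeviceCount())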

n = 50
A = np.random.randint(0, 255, size=(n, n))
B = np.random.randint(0, 255, size=(n, n))
C = np.random.randint(0, 255, size=(n, n))

A_gpu = cp.random.randint(0, 255, size=(n, n))
B_gpu = cp.random.randint(0, 255, size=(n, n))
C_gpu = cp.random.randint(0, 255, size=(n, n))
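Note that the CPU and GPU cells draw independent random matrices, so the two timings run on the same shapes but not the same values. For a strictly like-for-like comparison one could instead copy the NumPy arrays to the device; a minimal sketch using cp.asarray (an assumed variant, not what this notebook does):

# Assumed alternative: reuse the host matrices on the GPU so both
# benchmarks operate on identical data. cp.asarray copies host memory
# to device memory.
A_gpu = cp.asarray(A)
B_gpu = cp.asarray(B)
C_gpu = cp.asarray(C)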

%%timeit
A_dash = np.dot(A, A+B) + C

112 µs ± 22.1 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

%%timeit
A_dash_gpu = cp.dot(A_gpu, A_gpu + B_gpu) + C_gpu

59.4 µs ± 13 µs per loop (mean ± std. dev. of 7 runs, 1 loop each)
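One caveat when timing CuPy: GPU kernel launches are asynchronous, so a Python-side timer can stop before the GPU has actually finished the work. The figures in this notebook are reported as-is; a more conservative variant (an assumption, not in the original) forces completion before the timer stops:

%%timeit
# Assumed variant: synchronize so the measurement includes actual GPU
# execution time, not just the kernel launch overhead.
A_dash_gpu = cp.dot(A_gpu, A_gpu + B_gpu) + C_gpu
cp.cuda.Device().synchronize()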

n = 500
A = np.random.randint(0, 255, size=(n, n))
B = np.random.randint(0, 255, size=(n, n))
C = np.random.randint(0, 255, size=(n, n))

A_gpu = cp.random.randint(0, 255, size=(n, n))
B_gpu = cp.random.randint(0, 255, size=(n, n))
C_gpu = cp.random.randint(0, 255, size=(n, n))

%%timeit
A_dash = np.dot(A, A+B) + C

167 ms ± 25.8 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

%%timeit
A_dash_gpu = cp.dot(A_gpu, A_gpu + B_gpu) + C_gpu

479 µs ± 461 ns per loop (mean ± std. dev. of 7 runs, 1000 loops each)
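For reference, 167 ms / 479 µs ≈ 349x, consistent with the roughly 350x figure in the summary above.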

n = 2000
A = np.random.randint(0, 255, size=(n, n))
B = np.random.randint(0, 255, size=(n, n))
C = np.random.randint(0, 255, size=(n, n))

A_gpu = cp.random.randint(0, 255, size=(n, n))
B_gpu = cp.random.randint(0, 255, size=(n, n))
C_gpu = cp.random.randint(0, 255, size=(n, n))

%%timeit
A_dash = np.dot(A, A+B) + C

12.9 s ± 640 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%%timeit
A_dash_gpu = cp.dot(A_gpu, A_gpu + B_gpu) + C_gpu

14.7 ms ± 322 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
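To close the loop, a hypothetical wrap-up cell (not in the original notebook) that compares the CPU and GPU results and turns the timings reported above into speedup factors:

# Hypothetical wrap-up: verify agreement and compute speedups from the
# measurements reported above (all times in seconds).
A_dash = np.dot(A, A + B) + C
A_dash_gpu = cp.dot(A_gpu, A_gpu + B_gpu) + C_gpu
# cp.asnumpy copies the result back to the host. This only prints True
# if the CPU and GPU matrices hold identical values (see the cp.asarray
# note earlier); with independently drawn random matrices it will not.
print(np.allclose(A_dash, cp.asnumpy(A_dash_gpu)))

cpu_times = {50: 112e-6, 500: 167e-3, 2000: 12.9}
gpu_times = {50: 59.4e-6, 500: 479e-6, 2000: 14.7e-3}
for size in (50, 500, 2000):
    print(f"{size}x{size}: {cpu_times[size] / gpu_times[size]:.0f}x speedup")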
