0% found this document useful (0 votes)

49 views121 pages

Advanced Data Layouts in Taichi

Uploaded by

qdyuan4619

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views121 pages

Advanced Data Layouts in Taichi

Uploaded by

qdyuan4619

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 121

太极图形课

第03讲 Advanced Data Layouts

太极图形课
第03讲 Advanced Data Layouts
Recap
• Metaprogramming
• Object-oriented programming

Reusability Extensibility Maintainability

3
N-body systems

2/3D N-Body Dynamics Solar System

@Rabmelon @0xzhang
4
ODOP examples:

Diffraction @Y-jx007 Bezier @Zydiii Moxi (墨戏) @Vineyo

Ant Colony Maxwell's Demon game Marching Squares

@theAfish @507C @AlbertLiDesign 5
Other HW assignments are welcome as well!

Mandelbulb MPM88 + March Squares N-body with black hole(s) N-body with black hole(s)
@rockeyshao @wangfeng70117 @szl2 @logic-three-body

6
Gifts for the gifted
• Check your Github issues ☺

7
Outline Today
• Advanced dense data layouts
• Sparse data layouts

8
Outline Today
• Advanced dense data layouts
• Sparse data layouts

Performance Performance Performance

9
Advanced dense data layouts
Taichi
• ti.field()

• @ti.kernel
• Optimized for ti.field()

• OOP
• @data_oriented

11
Taichi: A data-oriented programming language
• ti.field()

• @ti.kernel
• Optimized for ti.field()

• OOP
• @data_oriented

12
import taichi as ti

ti.init(ti.gpu)

# gravitational constant 6.67408e-11, using 1 for simplicity

Init
G = 1
PI = 3.141592653

# number of planets
N = 300
# unit mass
m = 5
# galaxy size
galaxy_size = 0.4

Data
# planet radius (for rendering)
planet_radius = 2
# init vel
init_vel = 120

# time-step size
h = 1e-5
# substepping
substepping = 10

# pos, vel and force of the planets

# Nx2 vectors
pos = ti.Vector.field(2, ti.f32, N)
vel = ti.Vector.field(2, ti.f32, N)
force = ti.Vector.field(2, ti.f32, N)

@ti.kernel
def initialize():
center = ti.Vector([0.5, 0.5])
for i in range(N):
theta = ti.random() * 4 * PI
r = (ti.sqrt(ti.random()) * 0.7 + 0.3) * galaxy_size
offset = r * ti.Vector([ti.cos(theta), ti.sin(theta)])
pos[i] = center+offset
vel[i] = [-offset.y, offset.x]
vel[i] *= init_vel

@ti.kernel
def compute_force():
# clear force
for i in range(N):
force[i] = ti.Vector([0.0, 0.0])

Computation
# compute gravitational force
for i in range(N):
p = pos[i]
for j in range(N):
if i != j: # double the computation for a better memory footprint and load balance
diff = p-pos[j]
r = diff.norm(1e-5)

# gravitational force -(GMm / r^2) * (diff/r) for i

f = -G * m * m * (1.0/r)**3 * diff

# assign to each particle

force[i] += f

@ti.kernel
def update():
dt = h/substepping
for i in range(N):
#symplectic euler
vel[i] += dt*force[i]/m
pos[i] += dt*vel[i]

gui = ti.GUI('N-body problem', (512, 512))

initialize()
while gui.running:

Visualization
for i in range(substepping):
compute_force()
update()

...

gui.clear(0x112F41)
gui.circles(pos.to_numpy(), color=0xffffff, radius=planet_radius)
gui.show()

13
Performance @CPU…
WALL CLOCK TIME
Computation Data Access

20% less computation!

80%

14
Performance @GPU…
WALL CLOCK TIME
Computation Data Access

20% better memory access!

80%

15
搬砖 Example （a slide from @禹鹏）

...
...

...
. .
.. ..
...

16
Before we go: packed mode
• Initialized in ti.init()
• Decides whether to pad the data to the power of two
• Default choice: packed=False, will do the padding
• We assume packed=True in this class for simplicity