Towards a high-performance AI compiler with upstream MLIR

Golin, Renato; Chelini, Lorenzo; Siemieniuk, Adam; Madhu, Kavitha; Hasabnis, Niranjan; Pabst, Hans; Georganas, Evangelos; Heinecke, Alexander

Computer Science > Programming Languages

arXiv:2404.15204 (cs)

[Submitted on 15 Apr 2024]

Title:Towards a high-performance AI compiler with upstream MLIR

Authors:Renato Golin, Lorenzo Chelini, Adam Siemieniuk, Kavitha Madhu, Niranjan Hasabnis, Hans Pabst, Evangelos Georganas, Alexander Heinecke

View PDF HTML (experimental)

Abstract:This work proposes a compilation flow using open-source compiler passes to build a framework to achieve ninja performance from a generic linear algebra high-level abstraction. We demonstrate this flow with a proof-of-concept MLIR project that uses input IR in Linalg-on-Tensor from TensorFlow and PyTorch, performs cache-level optimizations and lowering to micro-kernels for efficient vectorization, achieving over 90% of the performance of ninja-written equivalent programs. The contributions of this work include: (1) Packing primitives on the tensor dialect and passes for cache-aware distribution of tensors (single and multi-core) and type-aware instructions (VNNI, BFDOT, BFMMLA), including propagation of shapes across the entire function; (2) A linear algebra pipeline, including tile, fuse and bufferization strategies to get model-level IR into hardware friendly tile calls; (3) A mechanism for micro-kernel lowering to an open source library that supports various CPUs.

Comments:	13 pages, 8 figures, presented at CGO C4ML 2024 & MLIR Workshop EuroLLVM 2024
Subjects:	Programming Languages (cs.PL); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
Cite as:	arXiv:2404.15204 [cs.PL]
	(or arXiv:2404.15204v1 [cs.PL] for this version)
	https://doi.org/10.48550/arXiv.2404.15204

Submission history

From: Renato Golin [view email]
[v1] Mon, 15 Apr 2024 10:35:50 UTC (1,281 KB)

Computer Science > Programming Languages

Title:Towards a high-performance AI compiler with upstream MLIR

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Programming Languages

Title:Towards a high-performance AI compiler with upstream MLIR

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators