[Draft PR] Graphcore backend support. #1659

Sameeranjoshi · 2024-09-16T16:16:25Z

This PR is a draft PR with lots of learnings from fpga.py and cuda.py backends.
The goal of the PR is to add graphcore backend to DaCE and map from DaCE graphs onto GraphCore graphs.
Helpful comments on parts I might be doing wrong would be super useful for next set of patches.

… from the documentation online to check what library is present

…page, this commit didn't work. They also have something similar - adding Tensor core backend from Nvidia as an external codegen

…ddition

…ion of the pass

…t commit

… the next commit" This reverts commit 429737d.

…f experimental changes which try to understand the SDFGIR and the changes to make IPUCodeGen registry into the frame_targets.

The fix was to call 'self._frame.generate_state' recursively from ipu.generate_state. This goes into framecode.py and calls the recursive function which traverses substates and calls the codegen respectively. Based on this learning a point to note is to remember to call the recursive functions inside generate_*() functions. example is 'self._dispatcher.dispatch_subgraph' in 'generate_scope'. 2. Fix ipu/ipu 2 folders were created recursively. Fix - Remove 'target_type='ipu' from CodeObject.

…ok right now into GPU/MPI

…ers and the IPUDevice, this goes in __init__/exit part and not in the SDFG

…ested might be buggy

Code snippet below checks if the file is frame(.cpp vs .cu) if so matches target and dumps the headers. ``` if backend == 'frame': #on cpu.cpp file + for target in self.targets: + if target.target_name == 'ipu': ``` Some cosmetic changes removed the prints.

… add more handcrafted tests, remove the header generation from framecode.py and add into ipu.py

…f how cpu codegen works for a tasklet

… one generates the golden code for poplar.

2. In the Host/ side the pipeline + functions are present. 3. The pipeline is not yet set, next patch might set it. 4. Lots of bugs still 5. Cerete Host/ Device/ files along with cpu/

…t the inputs to the library and the inputs to SDFG, seems like we need to dig into some other ways of allocation of variables and such details." Reverting this as this is making it hard to understand a node as of now. This reverts commit 63fad92.

…ave IPUCodegen + library codegen together.

Sameeranjoshi · 2024-09-16T16:24:50Z

The golden base code I plan to generate for IPU is here.

The current test I plan to support is (AccessNode + Library Node) - tests/library/poplar/poplar_matmul.py

…m fpga and cuda. This creates a codegen which dumps both library + nodes

This reverts commit 2ffc0c9.

ThrudPrimrose · 2024-09-25T15:13:41Z

dace/dtypes.py

Using more descriptive names of the memory locations will be more beneficial for future users.
I do not know the memory hierarchy of Graphcore, but for now, let us assume that Graphcore has only Global memory and an on-chip L1 memory location then I would go with the names IPU_L1 and IPU_Global.

Using the naming can also be good if IPUs change their memory hierarchy across generations. Let's say they add a level called L2 -> Then it will be IPU_Memory and IPU_L2 (and in my opinion, this will be quite confusing)

Same for the schedule, IPU_Multicore can be better (this again depends on what GraphCore calls them).
For example:

StorageType.IPU_L1: ScheduleType.IPU_Multicore, StorageType.IPU_Global: ScheduleType.IPU_Device

ThrudPrimrose · 2024-09-25T15:23:48Z

dace/codegen/targets/ipu_files/ipu_utils.py

There are, type maps such as _CTYPES (C/C++) or _OCL_TYPES (OpenCL) in dtypes.py
I think having this map there with other type maps would be consistent with the previous implementations.

ThrudPrimrose · 2024-09-25T15:37:03Z

dace/libraries/poplar/environments/poplar.py

Init code is for the part of the code the use library requires in initialization (MPI_Init) and finalize then the code needed for the finalization (MPI_finalize)

From what I see, engine.run(0) is similar to a Cuda kernel launch. Wouldn't it be better to generate this in a target/ipu.py?

If implemented like this, wouldn't there be problems if you need to call .run more than once?

Due to how graphcore requires you to use the poplar library, using it as a library is unavoidable, I want to point that you can still put as much as you can in ipu code-gen and ipu code-gen makes this library mandatory.

Good luck and success in your DaCe backend implementation!

…lude, debugging issue on real IPU machine, pushing incomplete changes"

…ted on another testcase

… for transients and scope must be inside state

…snode is not IPU_Memory

…s no schedule= in code

…_memory() triggers

Sameeranjoshi added 30 commits July 18, 2024 16:38

cpu, gpu basic tests

9ea89e7

add cpu array test

46a3c07

add optimization, helper file-check_external_library_used.py, this is…

809048c

… from the documentation online to check what library is present

understood where is the source generated from, read codegen.py

5c68cce

make more verbose comment

9a210f4

Tried using a custom codegen following the tutorial guide on dace web…

8763d54

…page, this commit didn't work. They also have something similar - adding Tensor core backend from Nvidia as an external codegen

make cpu, gpu, fpga tests to the most smallest and all doing vector a…

e5ae4ee

…ddition

add debug comments to understand the SDFG

a727cb9

basic structure is dumped, using node as of now, build fails as well

a575ef8

IPUTransformSDFG commented in python code, probably missing registrat…

5934537

…ion of the pass

MPI basic test

01c8bf5

Implement the LoopyLoop custom codegen on Map, will revert in the nex…

429737d

…t commit

Revert "Implement the LoopyLoop custom codegen on Map, will revert in…

77a7388

… the next commit" This reverts commit 429737d.

Debug: Find what are different types of nodes and how they are organized

fa78938

Debug: make output more verbose from last commit

d1af971

print states(if-else, for)

c3171fb

convert from vector add to saclar add, name might be confusing

ed6f63e

some debug comments, found control_flow_tree code, ipu.py has a lot o…

c30f1f2

…f experimental changes which try to understand the SDFGIR and the changes to make IPUCodeGen registry into the frame_targets.

mpi_scalar.py, some debug comments, now move on to cpu only, don't lo…

af38f47

…ok right now into GPU/MPI

partial code works, read cpu.py and generate_{node, state}

908a0f9

add mapping GC program to dace

73fc0bb

[WIP] Register array, copy, and add some code for generating the head…

c997147

…ers and the IPUDevice, this goes in __init__/exit part and not in the SDFG

use dace.DeviceType.IPU to check and emit headers in framecode, not t…

01d0658

…ested might be buggy

learn sdfg by using the APIs and writing tests

afbbdd1

Add new test case, simple codes to understand writing SDFG by hand

cff27f4

Add a new library, poplar

3715101

Comment the IPU type doesn't work as needs frontend support probably,…

30178f0

… add more handcrafted tests, remove the header generation from framecode.py and add into ipu.py

Copied codegen from cpu.py, tweaked it and understood the structure o…

82d193d

…f how cpu codegen works for a tasklet

Sameeranjoshi added 7 commits September 13, 2024 11:13

changes to test, ipu_test is now the new base, added state dump, next…

15d8023

… one generates the golden code for poplar.

1. Insert all the golden file code from Poplar example.

5207d2d

2. In the Host/ side the pipeline + functions are present. 3. The pipeline is not yet set, next patch might set it. 4. Lots of bugs still 5. Cerete Host/ Device/ files along with cpu/

fix bug where dace_init_target_ was missing

f6ac62f

Add library node, register it, modify test for the same, goal is to h…

940f6bc

…ave IPUCodegen + library codegen together.

Supress the building process

2ffc0c9

Attempt to add Node dispatcher

ad13cfc

Sameeranjoshi added 2 commits September 18, 2024 22:49

Turn off the node dispatcher and generate a state using some code fro…

6d5189f

…m fpga and cuda. This creates a codegen which dumps both library + nodes

Revert "Supress the building process"

e3193c4

This reverts commit 2ffc0c9.

acalotoiu requested review from acalotoiu and ThrudPrimrose September 25, 2024 14:38

ThrudPrimrose reviewed Sep 25, 2024

View reviewed changes

Sameeranjoshi and others added 15 commits September 30, 2024 12:47

Move the headers to a common runtime include/ folder dace/runtime/inc…

6ff38a6

…lude, debugging issue on real IPU machine, pushing incomplete changes"

some temporary changes

ba32abe

Resolve errors in compilation when using includes from runtime libraries

eeaa7d1

Fix bug - wasn't generating proper kernel names, was not generic, tes…

1a21f8e

…ted on another testcase

Support addVariables() and mapLinearlyOnTiles(), currently works only…

9290acd

… for transients and scope must be inside state

cosmetic changes, remove Dead code, iondent

d1971bd

Fix bug in is_ipu_kernel, was failing for tests where the first acces…

835bd81

…snode is not IPU_Memory

Fix mapping and variable allocation, remove Dead code

0e43bb6

Try adding generate_node() - fails as the predicate fails, as there i…

1164d69

…s no schedule= in code

Add vector add test for dace and poplar

530e298

Add scalar code using vector of size 1

1334015

Remove prints

7e54d70

new tests 1.copy a -> b on both IPU and dace test

3b1f4c7

Add IPU_Memory to accessNode

7537466

Most of the codegen is correct, generate_node() doesn't trigger, copy…

6db5588

…_memory() triggers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft PR] Graphcore backend support. #1659

[Draft PR] Graphcore backend support. #1659

Sameeranjoshi commented Sep 16, 2024

Sameeranjoshi commented Sep 16, 2024

ThrudPrimrose Sep 25, 2024 •

edited

Loading

ThrudPrimrose Sep 25, 2024

ThrudPrimrose Sep 25, 2024

ThrudPrimrose Sep 25, 2024

[Draft PR] Graphcore backend support. #1659

Are you sure you want to change the base?

[Draft PR] Graphcore backend support. #1659

Conversation

Sameeranjoshi commented Sep 16, 2024

Sameeranjoshi commented Sep 16, 2024

ThrudPrimrose Sep 25, 2024 • edited Loading

Choose a reason for hiding this comment

ThrudPrimrose Sep 25, 2024

Choose a reason for hiding this comment

ThrudPrimrose Sep 25, 2024

Choose a reason for hiding this comment

ThrudPrimrose Sep 25, 2024

Choose a reason for hiding this comment

ThrudPrimrose Sep 25, 2024 •

edited

Loading