Add EGraph Visualizations #147

saulshanabrook · 2023-05-22T17:31:27Z

Adds the ability to visualize the state of an EGraph using Graphviz (addressing #144).

Goals:

Help educate new users on what e-graphs are and how egglog implements them
Help with debugging the state of an e-graph after some transitions
Serve as a launching point for interactive visualizations down the road

Features:

Colors each cluster based on its sort
Caps size of graph to reduce memory/time blowups with unreadable output
Shows container values as nodes by adding inner_values method to Sorts
Creates an intermediate graph IR before encoding to Graphviz. This could be exposed later to allow other visualization frontends, for example from Python.
Adds visualizer to web view, with dynamic transitions, and ability to pan and zoom.
Adds CLI flags --output-dot and --output-svg to visualize any program
Adds a `make graphs` command to create SVG and .dot files of all examples
Tests graph creation in CI
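
As a rough illustration of the first two features (coloring clusters by sort and capping the graph size), here is a minimal Python sketch. It is not the code in this PR: the `EClass` and `ENode` types, the palette, and the `max_nodes` parameter are all hypothetical names chosen for the example, and edges are left out to keep it short.

```python
from dataclasses import dataclass, field


@dataclass
class ENode:
    node_id: str  # hypothetical unique id for the e-node
    label: str    # text shown inside the Graphviz node


@dataclass
class EClass:
    class_id: str
    sort: str
    nodes: list = field(default_factory=list)


# Arbitrary example colors; not the styles used by the PR.
PALETTE = ["#8dd3c7", "#ffffb3", "#bebada", "#fb8072"]


def to_dot(classes, max_nodes=100):
    """Emit one filled Graphviz cluster per e-class, colored by sort.

    Stops before any e-class that would push the node count past max_nodes.
    """
    sorts = sorted({c.sort for c in classes})
    color = {s: PALETTE[i % len(PALETTE)] for i, s in enumerate(sorts)}
    lines = ["digraph egraph {", "  compound=true;"]
    emitted = 0
    for c in classes:
        if emitted + len(c.nodes) > max_nodes:
            break  # cap reached: drop the remaining e-classes
        lines.append(f'  subgraph "cluster_{c.class_id}" {{')
        lines.append(f'    style=filled; fillcolor="{color[c.sort]}"; label="{c.sort}";')
        for n in c.nodes:
            lines.append(f'    "{n.node_id}" [label="{n.label}"];')
        lines.append("  }")
        emitted += len(c.nodes)
    lines.append("}")
    return "\n".join(lines)
```

Graphviz only treats a subgraph as a cluster when its name starts with `cluster`, which is why the sketch prefixes every e-class id that way.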

Possible next steps/follow-up issues:

Graphviz does not allow edges to point to a cluster directly. The best we can do is point to a node in the cluster and then set the lhead to clip at the cluster edge. Currently we are doing a round-robin to point to nodes in each e-cluster.
Currently, e-classes that have nodes that point to themselves aren't rendered that well. The arrow won't stop at the cluster edge, but instead point directly to one of the nodes in the e-class. This is a known issue with graphviz edges in clusters. There are some workarounds in that linked issue we could explore.
If we create a subgraph with push or simplify, then that graph will not be included in the resulting Graphviz output.
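
The round-robin workaround in the first item above could look roughly like the following Python sketch. The function name and the shape of its inputs are assumptions made for illustration, not the PR's implementation. It relies on two real Graphviz attributes: `lhead` on the edge, and `compound=true` on the graph, without which `lhead` has no effect.

```python
from itertools import cycle


def edges_to_clusters(edges, members):
    """Build DOT edge lines that visually end at an e-class cluster.

    edges:   list of (source_node_id, target_class_id) pairs (hypothetical input shape)
    members: dict mapping class_id -> list of node ids inside that cluster
    """
    # One independent round-robin iterator per target e-class.
    targets = {cid: cycle(nodes) for cid, nodes in members.items()}
    lines = []
    for src, cid in edges:
        dst = next(targets[cid])  # real head: some node inside the cluster
        # lhead clips the drawn edge at the cluster boundary.
        lines.append(f'  "{src}" -> "{dst}" [lhead="cluster_{cid}"];')
    return lines
```

Spreading the real heads across the nodes of a cluster keeps all incoming edges from piling onto one node, which is presumably why a round-robin was chosen over always picking the first node.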

Examples

`eqsat-basic` in web viewer

It will also transition between graphs smoothly:

Recording.mov

`eqsat-basic`

`fibonacci`

`fibonacci-demand`

`map`

`rw-analysis`

`proofs`

eqsat-basic in Python

I've also been working on some Python examples, though they're not part of this PR. I wanted to show them here to just give a sense of how that could work eventually:

Note that these screenshots were taken at an earlier @recursecenter presentation and don't reflect the current Graphviz styles.

In this notebook we can see how the graph changes before and after running:

In this other notebook we use d3-graphviz to animate the transitions between different graphs, doing one run in between each snapshot:

Untitled.3.mov

TODO

oflatt · 2023-06-16T16:55:14Z

Thanks! Worked for me


oflatt · 2023-06-29T03:26:39Z

What's the status on this? Does @mwillsey have time to review?
Saul and I discussed it some. I think the conclusion was to somewhat simplify the intermediate representation Saul made. Eventually we want only one format that extraction, visualization, and dump to text all use.

saulshanabrook · 2023-07-03T15:10:42Z

> Saul and I discussed it some. I think the conclusion was to somewhat simplify the intermediate representation Saul made. Eventually we want only one format that extraction, visualization, and dump to text all use.

@oflatt I saw that you opened your PR that also has its own IR for the graph. I would be happy to commit to merging our two IRs once both PRs are merged, if that's easier as well. In that merge, it might also be good to think about having a format we could expose in Python to allow other forms of extraction easily, like the work @philzook58 was experimenting with.

oflatt · 2023-07-03T16:51:43Z

Yep, I'm in favor of merging this and fixing it later then.
We should also coordinate later with @mwillsey's new format for the extraction gym:
https://github.com/egraphs-good/extraction-gym

mwillsey · 2023-07-06T23:39:38Z

After a conversation with @oflatt, I really think we can use a common format for both this and the extraction gym. So I'm going to close this PR for now, as I do not intend to merge it in its current state. Details about the format are coming soon!

saulshanabrook · 2023-07-08T23:44:03Z

@mwillsey So would you recommend implementing this on top of the new common format and then re-opening?

FWIW the Python bindings are already published with this fork and I will continue including this code in my fork and keeping it up to date, because I have been finding the graphviz helpful for education.

mwillsey · 2023-07-09T00:28:32Z

Yes! I think the idea is that you’d use the egraph serialize library as a dependency. Egglog (and egg) will hopefully soon have support for exporting to the in-memory representation of that library, thereby supporting not only serialization but also hopefully visualization. So the Python bindings could use mainline and still have visualization.

Add sorts/types/names to classes

* Add sorts/types/names to classes. This adds a mapping of e-class id to class type to the format. One use case for this was in the visualizer in egglog (egraphs-good/egglog#147) to display the sort on each e-class.
* Use local test files (#2). This changes the tests to use the local files instead of those in the extraction gym repo. I made this change so I could test the addition of classes. Feel free to disregard if you like.
* Make class_data a separate object

Co-authored-by: Max Willsey

saulshanabrook · 2023-07-21T15:46:34Z

My plan for following up with this work is as follows:

Add serialization support to egglog (Add serialization support #171)
Add support for converting the serialized format to Graphviz in the https://github.com/egraphs-good/egraph-serialize/ repo (under a feature flag, so the Graphviz requirements are optional), with methods to produce the Graphviz string and to save it as a .dot or .svg file
Add support for splitting out all nodes of certain sorts into their own e-classes, as a method on Egraph in egraph-serialize (so that functions which return or take primitives do not have to share them in the visualization)
Update this branch to use the graphviz support from serialize to save svgs to disk and in the web UI.

Let me know what you think!
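
To make the third step of the plan above concrete, here is a small Python sketch of what "splitting out nodes of certain sorts" could mean. It works on plain dictionaries rather than the egraph-serialize types; the field names (`op`, `eclass`, `children`), the `-split-N` naming, and the function itself are hypothetical. It also assumes the split sorts are leaf primitives, and it drops split nodes that no parent references.

```python
def split_sorts(nodes, class_sort, sorts_to_split):
    """Give every use of a node whose sort is in sorts_to_split its own copy and e-class.

    nodes:      {node_id: {"op": str, "eclass": str, "children": [node_id, ...]}}
    class_sort: {eclass_id: sort_name}
    Returns (new_nodes, new_class_sort).
    """
    new_nodes, new_class_sort = {}, {}
    counter = 0
    for nid, node in nodes.items():
        sort = class_sort[node["eclass"]]
        if sort in sorts_to_split:
            continue  # re-created once per use, below
        new_class_sort[node["eclass"]] = sort
        children = []
        for child_id in node["children"]:
            child = nodes[child_id]
            child_sort = class_sort[child["eclass"]]
            if child_sort in sorts_to_split:
                counter += 1
                fresh_id = f"{child_id}-split-{counter}"
                fresh_class = f"{child['eclass']}-split-{counter}"
                # Assumes split nodes are leaves, so children is an empty list.
                new_nodes[fresh_id] = {"op": child["op"], "eclass": fresh_class, "children": []}
                new_class_sort[fresh_class] = child_sort
                children.append(fresh_id)
            else:
                children.append(child_id)
        new_nodes[nid] = {"op": node["op"], "eclass": node["eclass"], "children": children}
    return new_nodes, new_class_sort
```

With this transformation, two functions that both take the primitive `1` each point at their own copy, so the rendered graph has no long edges converging on a single shared primitive.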

mwillsey · 2023-07-21T15:57:41Z

Yes, that sounds good! I would even go further and say that any "singleton" e-class (an e-class with just a single node, like all primitives) could be inlined directly into the parent for easier visualization.

saulshanabrook · 2023-07-21T17:03:00Z

Yeah, I think that might work too... let me take a look at some examples from this branch to see what would make sense....

Here are a couple of examples from three consecutive commits in the history of this branch:

git checkout <hash>
cargo run tests/<name>.egg --save-svg
# Screenshot to convert to cropped PNG due to bug at these commits with size

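| | `77e80d4` | `9da623c` | `7ddabb8` |
| --- | --- | --- | --- |
| | All equal primitives in shared node. Current behavior of exporter. | Unit primitives in their own nodes. | All primitives in their own nodes. Current behavior of this branch. |
| `fibonacci` | (screenshot) | Same as ← | (screenshot) |
| `fibonacci-demand` | (screenshot) | Same as ← | (screenshot) |
| `path` | (screenshot) | (screenshot) | (screenshot) |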

Does anyone have thoughts on which are preferable? Happy to add other examples too.

I ended up settling on the last method because I thought it was closest to the current semantics of egglog.

mwillsey · 2023-07-21T17:42:05Z

For things of type unit, what makes the most sense to me is to group by function name. So basically approach 1, but split up by function name; so all the path tuples in one box, edge tuples in another, and so on. I think we can just elide the actual () node.

For other primitives (i64, etc), I'd like to see what the inlining approach looks like for when the primitives are used as inputs to functions. A quick example:

Node
|  |
v  v
1  foo

Could become:

Node(1, ·)
|
v
foo

in the situation where 1 is in a singleton e-class (or maybe is a primitive or something) but foo is not.
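
The inlining idea in the diagrams above can be sketched in a few lines of Python. This is only an illustration of the proposal under discussion, not code from egglog or egraph-serialize; the function names and input shapes are made up for the example.

```python
def singleton_classes(members):
    """Return the ids of e-classes that contain exactly one node.

    members: dict mapping class_id -> list of node ids (hypothetical input shape)
    """
    return {cid for cid, nodes in members.items() if len(nodes) == 1}


def inline_label(op, children, inlinable):
    """Compute a node label with inlinable children folded in.

    children:  list of (child_label, child_class_id) pairs, in argument order
    inlinable: set of class ids whose single node may be inlined
    Returns (label, remaining_edge_targets).
    """
    args, edges = [], []
    for label, cid in children:
        if cid in inlinable:
            args.append(label)   # show the value inside the parent label
        else:
            args.append("·")     # placeholder; this argument keeps its edge
            edges.append(cid)
    return f"{op}({', '.join(args)})", edges
```

For the example above, `inline_label("Node", [("1", "c1"), ("foo", "c2")], {"c1"})` gives the label `Node(1, ·)` and leaves a single outgoing edge to the e-class of `foo`.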

saulshanabrook · 2023-07-21T19:56:23Z

> For other primitives (i64, etc), I'd like to see what the inlining approach looks like for when the primitives are used as inputs to functions. A quick example:

That's a cool idea! It would definitely cut down on the number of nodes...

We would still put collection primitives as nodes b/c they can point to e-classes...?

For functions that return primitives, we could do Node(1, ·) -> 2 too?

mwillsey · 2023-07-21T20:48:07Z

Sure! We'll have to try it to see

saulshanabrook · 2023-07-26T19:59:02Z

I have added these examples, along with the existing examples, to the e-graph serialize PR, so that we can see how inlining compares! https://github.com/saulshanabrook/egraph-serialize/tree/viz/tests-viz

saulshanabrook · 2023-08-08T19:20:52Z

@mwillsey I have updated this PR to use the e-graph serialize graphviz implementation (egraphs-good/egraph-serialize#4 and #171).

The changes seem out of date in the GitHub diff, even though I pushed them to the branch. Maybe if you re-open it they will be refreshed?

EDIT: I opened a new PR since maybe that's simpler, with the same branch: #186

saulshanabrook added 30 commits May 11, 2023 12:44

First go at graphviz export

799988f

Skip generated names

1157020

Use right hashmap

2bebd51

Expose graph

8018a0a

Make graph crate private

7123f22

Clippy fixes

3b58630

Tidy up string generation

bc4ddd2

Add CLI options to output .dot and .svg

df337d8

Ignore .DS_Store

70dd9b7

Add ability to run on all tests

d26646e

Don't run failing tests

e6816a1

Fix non primitive builtin sorts

f4e4b14

Fix extracting Sets with eq sort values

f163daf

Apply fix for graph

2a335d0

Fix extracting Sets with eq sort values

f3a439e

Require arcsort in extract

078375d

Make arcsort required in find_best

5b67147

Merge fix-set-eq-sorts into visualizer

e66508d

Make makefile iterative

4130b4e

Add readme description for new CLI commands

042f591

Move graph to its own folder

9067df7

clean up module statements

141a131

Remove dead inputs, which removes some duplicates

9279cb0

Make style closer to e-graph website

ffaf490

fmt fixes

05e3238

Fix wasm build

7fd7f15

Order all e-class nodes in same rank

7d87e76

Fix docstrings

3400b8f

Add first working web demo

5241a5b

fmt

19f0a69

saulshanabrook added 6 commits June 17, 2023 18:12

Reduce rank sep slightly

aa7d7ac

Add another cluster nesting to increase margin

40c27da

Reduce nodesep

4380306

Revert "Reduce nodesep"

5c6aaf9

4380306

Add outer cluster label to remove warning

7e1b75f

Add support for tooltips in graphviz

daa97af

Add the e-class ids to the tooltip in graphviz, to help with debugging output

mwillsey closed this Jul 6, 2023

saulshanabrook mentioned this pull request Jul 11, 2023

Add sorts/types/names to classes egraphs-good/egraph-serialize#1

Merged

This was referenced Jul 18, 2023

Add support for negation to f64 egraphs-good/egglog-python#34

Merged

Add serialization support #171

Merged

saulshanabrook mentioned this pull request Jul 26, 2023

Add support for exporting with Graphviz egraphs-good/egraph-serialize#4

Merged

saulshanabrook mentioned this pull request Aug 11, 2023

Exposes visualizations in CLI and in the web #186

Merged

saulshanabrook mentioned this pull request Oct 6, 2023

Omit default values when serializing primitive outputs #248

Closed
