Base expression node types on dataclasses #125

Merged (11 commits), Sep 30, 2024

Conversation


@inducer commented Jan 13, 2023

Closes #46.


inducer commented Jan 13, 2023

Surprisingly, this seems to more or less work (the current CI status notwithstanding). @kaushikcfd @alexfikl @isuruf What do you think?


inducer commented Jan 13, 2023

For context, this happened because I was working on inducer/pytato#393 and needed the persistent hash key builder to just "understand" expression values. Loopy has some absurd hack job to make this possible, but I wasn't itching to replicate it in pytato. This seemed nicer.
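To illustrate the appeal, here is a minimal sketch (not pymbolic's or loopy's actual key builder): once expression nodes are dataclasses, a persistent hash key builder can discover their constituent values generically via dataclasses.fields(), with no per-class boilerplate such as __getinitargs__ or init_arg_names.

import hashlib
from dataclasses import dataclass, fields, is_dataclass


def persistent_key(obj, h=None) -> str:
    # Hypothetical, simplified key builder: feed the class name and every
    # dataclass field value into a hash, recursing into nested nodes.
    h = h if h is not None else hashlib.sha256()
    if is_dataclass(obj):
        h.update(type(obj).__qualname__.encode())
        for f in fields(obj):
            persistent_key(getattr(obj, f.name), h)
    elif isinstance(obj, tuple):
        for item in obj:
            persistent_key(item, h)
    else:
        h.update(repr(obj).encode())
    return h.hexdigest()


@dataclass(frozen=True)
class Variable:
    name: str


@dataclass(frozen=True)
class Sum:
    children: tuple


print(persistent_key(Sum((Variable("x"), Variable("y")))))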

@inducer changed the title from "Base expressions on attrs classes" to "Base expression node types on attrs classes" (Jan 13, 2023)
@inducer changed the title from "Base expression node types on attrs classes" to "Base expression node types on attrs" (Jan 13, 2023)
@alexfikl (Collaborator)

Generally seems like a good idea to me!

One thing that doesn't quite spark joy is having to remember, in yet more places, which classes are dataclasses and which are attrs :( But it's hard to argue with the hash caching. I've seen the hash show up in quite a few profiles, so making that fast would be a priority for me too.
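For reference, the hash-caching idea in a nutshell (a minimal sketch, not pymbolic's actual implementation): compute the structural hash of a node once, stash it on the instance, and reuse it, so repeatedly hashing deep expression trees stops being a hot spot.

from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    # Hypothetical expression node, for illustration only.
    name: str
    children: tuple = ()

    def __hash__(self) -> int:
        # Compute the (potentially expensive) structural hash once and cache
        # it on the instance; object.__setattr__ sidesteps the frozen check.
        try:
            return self._cached_hash
        except AttributeError:
            h = hash((type(self), self.name, self.children))
            object.__setattr__(self, "_cached_hash", h)
            return h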


inducer commented Jan 13, 2023

I'm guessing I may revert this to dataclasses, for exactly this reason. Not sure.

@inducer changed the title from "Base expression node types on attrs" to "Base expression node types on dataclasses" (Jun 2, 2023)

inducer commented Jun 2, 2023

I've changed this to dataclasses, and the tests in pymbolic itself seem to be passing. We'll see how the downstreams do.


inducer commented Jun 2, 2023

Most everything seems to be working, except pytential has a puzzling failure.


@alexfikl left a comment


Looked a bit through the code to see if anything jumps out that could cause the pytential errors (in Beltrami stuff again 😢) and left some nitpicks!

The only thing (?) that the Beltrami operators do a bit differently is that they have nested IntG expressions, but I couldn't find any reason this would change how those are handled.

EDIT: Found a simpler case to run (uses the imports from test_beltrami.py)

def test_nested_potential(actx_factory):
    logging.basicConfig(level=logging.INFO)
    actx = actx_factory()

    case = eid.CircleTestCase(
            target_order=5,
            qbx_order=5,
            source_ovsmp=4,
            resolutions=[32, 64, 96, 128],
            # FIXME: FMM should not be slower!
            fmm_order=False, fmm_backend=None,
            radius=1.0
            )

    from pytential import GeometryCollection
    qbx = case.get_layer_potential(actx, 32, case.target_order)
    places = GeometryCollection(qbx, auto_where=case.name)

    from sumpy.kernel import LaplaceKernel
    kernel = LaplaceKernel(2)

    sym_sigma = sym.var("sigma")
    sym_op = sym.D(kernel,
                   sym.D(kernel, sym_sigma, qbx_forced_limit="avg"),
                   qbx_forced_limit="avg")

    density_discr = places.get_discretization(case.name)
    sigma = actx.thaw(density_discr.nodes()[0])
    r = bind(places, sym_op)(actx, sigma=sigma)

Tried S(Spp + Dp) and S(kappa * S) and S(W(S)) (the other terms in the Laplace-Beltrami operator) and the D(D) one was the only one that failed.


inducer commented Jun 4, 2023

Thanks for finding the smaller reproducer! That helped quite a bit. The issue was the missing rewriting from None to cse_scope.EVALUATION in CommonSubexpression. It also helped expose inducer/pytential#209; as usual, the one issue you thought you had is actually two. 🙂
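For readers following along, the kind of rewriting referred to looks roughly like this (a minimal sketch with assumed field names and a stand-in scope value, not the actual pymbolic class):

from dataclasses import dataclass
from typing import Any, Optional


class cse_scope:
    # Simplified stand-in for pymbolic.primitives.cse_scope; the value is a placeholder.
    EVALUATION = "evaluation"


@dataclass(frozen=True)
class CommonSubexpression:
    child: Any
    prefix: Optional[str] = None
    scope: Optional[str] = None

    def __post_init__(self):
        # Normalize scope=None to the default scope, as the pre-dataclass
        # __init__ did; object.__setattr__ is needed because the class is frozen.
        if self.scope is None:
            object.__setattr__(self, "scope", cse_scope.EVALUATION)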

@inducer enabled auto-merge (rebase) on June 4, 2023 01:13

inducer commented Jun 4, 2023

Super weird. I wonder why the pytential run is getting canceled. Out of memory for some reason? If so, why would this PR make a difference?


inducer commented Jun 5, 2023

Tried to investigate more. Here's the time profile of the pytential tests, first with this patch:

============================================================================================================ slowest 10 durations =============================================================================================================
1357.39s call     test/test_linalg_skeletonization.py::test_skeletonize_by_proxy_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case3]
1355.53s call     test/test_beltrami.py::test_beltrami_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-operator7-solution7]
914.25s call     test/test_stokes.py::test_exterior_stokes[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-3]
746.09s call     test/test_beltrami.py::test_beltrami_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-operator6-solution6]
724.49s call     test/test_scalar_int_eq.py::test_integral_equation[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case9]
685.49s call     test/test_layer_pot_identity.py::test_identity_convergence_slow[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case0]
640.87s call     test/test_maxwell.py::test_pec_mfie_extinction[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case0]
615.13s call     test/test_linalg_skeletonization.py::test_skeletonize_by_proxy_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case0]
499.75s call     test/test_layer_pot_identity.py::test_identity_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case1]
264.22s call     test/test_layer_pot_eigenvalues.py::test_ellipse_eigenvalues[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-1-5-3-False]
=========================================================================================================== short test summary info ===========================================================================================================
SKIPPED [1] test_linalg_proxy.py:200: 3d partitioning requires a tree
========================================================================================= 237 passed, 1 skipped, 16198 warnings in 4015.96s (1:06:55) =========================================================================================

and then without:

==================================================================================================================================================================================================================================== slowest 10 durations =====================================================================================================================================================================================================================================
2185.13s call     test/test_stokes.py::test_exterior_stokes[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-3]
2146.56s call     test/test_linalg_skeletonization.py::test_skeletonize_by_proxy_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case3]
1560.74s call     test/test_beltrami.py::test_beltrami_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-operator7-solution7]
912.97s call     test/test_beltrami.py::test_beltrami_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-operator6-solution6]
786.50s call     test/test_linalg_skeletonization.py::test_skeletonize_by_proxy_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case0]
631.47s call     test/test_maxwell.py::test_pec_mfie_extinction[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case0]
421.05s call     test/test_scalar_int_eq.py::test_integral_equation[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case9]
265.36s call     test/test_beltrami.py::test_beltrami_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-operator5-solution5]
231.81s call     test/test_linalg_skeletonization.py::test_skeletonize_by_proxy_convergence[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case1]
188.92s call     test/test_scalar_int_eq.py::test_integral_equation[<PyOpenCLArrayContext for <pyopencl.Device 'cpu-Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz' on 'Portable Computing Language'>>-case10]
=================================================================================================================================================================================================================================== short test summary info ===================================================================================================================================================================================================================================
SKIPPED [1] test_linalg_proxy.py:200: 3d partitioning requires a tree
================================================================================================================================================================================================================= 237 passed, 1 skipped, 1916 warnings in 3817.94s (1:03:37) =

(run on tripel, back to back with inducer/pytential@488b958 and inducer/sumpy@8960662)

The differences are... interesting. I'm not sure I understand them.


alexfikl commented Jun 8, 2023

Tried to investigate more.

Ran this on koelsch as well and didn't see as much of a toss-up between the timings of the various tests. The total runtime was pretty similar to your run, though. I also ran (in pytential)

mprof run python -m pytest -v -s -k 'not slowtest' --durations=25

to get the memory profile, and it looks pretty similar as well (blue is main and black is the attrs branch).

[memory profile plot: pytential test suite, main (blue) vs. attrs branch (black)]

Sooo... really not sure why the GitHub CI is crashing.

EDIT: Not sure how accurate this is, but GitHub reports its hosted runners as having 7 GB of RAM (and above we're using about 12 GB):
https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners#supported-runners-and-hardware-resources


inducer commented Jul 19, 2023

In a way I think it's not a surprise that the runners get canceled. They have about 7G of memory available, plus 4G swap, so the 12G peak allocation is definitely flirting with disaster, and I think which one gets cancelled is probably just up to coincidence. Perhaps the more useful question here might be why the curve is monotonically increasing, when in reality all tests should be independent, and so there shouldn't be an increase.

I propose we use inducer/sumpy#178 to have this discussion, because that looks eerily related.

@inducer force-pushed the attrs branch 8 times, most recently from 83feb59 to 2340251, on September 30, 2024 14:58

inducer commented Sep 30, 2024

Some interesting (informal) timing data, using experiments/traversal-benchmark.py:

  • This branch, as it is currently: 1.4864592552185059s
  • main: 1.3213305473327637s
  • This branch, with frozen=False passed to dataclass: 1.2415354251861572s

All cases run with python -O (Py3.12 from Debian).

I'm thinking I will set frozen=__debug__. Anyone see any downsides?
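For concreteness, a minimal sketch of what frozen=__debug__ buys (hypothetical class, not pymbolic's actual decorator): under python -O, __debug__ is False, so the frozen checks and the slower object.__setattr__-based __init__ go away, while normal (assertion-enabled) runs keep enforcing immutability.

from dataclasses import dataclass


@dataclass(frozen=__debug__)
class DemoExpr:  # hypothetical node, for illustration only
    name: str


e = DemoExpr("x")
# In a normal run, the next line raises dataclasses.FrozenInstanceError;
# under "python -O" (where __debug__ is False) the assignment succeeds.
# e.name = "y"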


matthiasdiener commented Sep 30, 2024

Some interesting (informal) timing data, using experiments/traversal-benchmark.py:

  • This branch, as it is currently: 1.4864592552185059s
  • main: 1.3213305473327637s
  • This branch, with frozen=False passed to dataclass: 1.2415354251861572s

All cases run with python -O (Py3.12 from Debian).

I'm thinking I will set frozen=__debug__. Anyone see any downsides?

No objection, I see similar performance numbers on my M1. My only concern is that setting frozen based on __debug__ may be confusing to some users?

Could a pattern such as this be helpful here?

from dataclasses import dataclass, FrozenInstanceError
from typing import Any

@dataclass(frozen=False)
class Foo:
    x: int
    y: int

    def __setattr__(self, name: str, value: Any) -> None:
        if hasattr(self, "_frozen"):
            raise FrozenInstanceError
        object.__setattr__(self, name, value)

    def __post_init__(self):
        self._frozen = True

f = Foo(1, 2)

f.x = 42  # raises dataclasses.FrozenInstanceError

f.__setattr__("x", 42)  # raises dataclasses.FrozenInstanceError

object.__setattr__(f, "x", 42)  # works

Perhaps better: https://rednafi.com/python/statically_enforcing_frozen_dataclasses


inducer commented Sep 30, 2024

Could a pattern such as this be helpful here?

Have you tried the traversal benchmark with this? (I would expect it would tank it way more than leaving frozen enabled.)

My only concern is that setting frozen based on __debug__ may be confusing to some users?

The only scenario leading to confusion IMO is someone who only tests code during development with python -O and ignores advice on immutability. FWIW, expression nodes were only "morally" (not factually) immutable, and I haven't seen any problems from it.

Perhaps better: https://rednafi.com/python/statically_enforcing_frozen_dataclasses

That's effectively already what's going on:

@dataclass_transform(frozen_default=True)
def expr_dataclass(init: bool = True) -> Callable[[type[_T]], type[_T]]:
    ...
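As background, here is a stripped-down, hypothetical re-creation of that decorator (not pymbolic's actual expr_dataclass, which does more), showing the effect of frozen_default=True: static type checkers such as mypy and pyright treat classes produced by the decorator as frozen, regardless of the runtime frozen setting. It needs Python 3.12+ (or typing_extensions) for the frozen_default parameter.

from dataclasses import dataclass
from typing import Callable, TypeVar, dataclass_transform

_T = TypeVar("_T")


@dataclass_transform(frozen_default=True)
def expr_dataclass(init: bool = True) -> Callable[[type[_T]], type[_T]]:
    # Hypothetical body: wrap the class in a frozen dataclass.
    def map_cls(cls: type[_T]) -> type[_T]:
        return dataclass(init=init, frozen=True)(cls)
    return map_cls


@expr_dataclass()
class Demo:
    name: str


d = Demo("x")
# d.name = "y"  # flagged by the type checker via frozen_default=True,
#               # even if runtime enforcement were disabled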

@matthiasdiener (Contributor)

Could a pattern such as this be helpful here?

Have you tried the traversal benchmark with this? (I would expect it would tank it way more than leaving frozen enabled.)

You're right, it is slower than setting frozen.

Perhaps better: https://rednafi.com/python/statically_enforcing_frozen_dataclasses

That's effectively already what's going on:

@dataclass_transform(frozen_default=True)
def expr_dataclass(init: bool = True) -> Callable[[type[_T]], type[_T]]:
    ...

I see, thanks! I was not aware of this feature.


@matthiasdiener left a comment


Apart from the minor typo, LGTM!

Development

Successfully merging this pull request may close these issues.

Method for removing the __getinitargs__ and init_arg_names boilerplate.