MAINT:ENH:sparse.linalg: Rewrite iterative solvers in Python, remove FORTRAN code #18391

ilayn · 2023-04-30T15:06:22Z

This PR has moved to #18488

As discussed in #18367 for linalg.interpolative, the iterative solvers are in the same state: it is almost impossible to implement any enhancements, such as returning iteration information. This PR reimplements all the code in pure Python and, with a bit of care for in-place operations thanks to NumPy, no performance is lost.

Reference issue

Closes a few PRs; will populate once done with tests
Related #15738

Memory Lane for past discussions

#1466
#7623
#7624
#8400
#10341
#16050

What does this implement/fix?

Rewrote all code in Python instead of using very old Fortran templates
The performance is comparable, if not slightly faster, with Python
Removes all dependencies on the _isolve/iterative Fortran code.
The tol/atol handling is just a mess and has been waiting to be overhauled for at least 5 years. This PR tackles both issues together.
Most tests pass with the drop-in replacement. A few tests are now failing because these solvers were not meant to enter certain branches, but now they do because they succeed more often. Those will be fixed. Several segfaults that had been sporadically annoying the CI for a very long time are avoided in the meantime.
Not my favorite coding style, but I went through the Fortran files line by line and tried to be as faithful as possible
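
To make the in-place point above concrete, here is a minimal sketch of a conjugate gradient loop in NumPy that preallocates its work array and updates vectors in place. It is not the code from this PR: the function name `cg_inplace`, its signature, the return convention, and the dense-array assumption are illustrative only.

```python
import numpy as np


def cg_inplace(A, b, rtol=1e-5, atol=0.0, maxiter=None):
    """Plain conjugate gradient for a dense symmetric positive definite A.

    Illustrative sketch only; the name, defaults and return convention are
    assumptions and do not reproduce the implementation in this PR.
    """
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = b.shape[0]
    maxiter = 10 * n if maxiter is None else maxiter

    x = np.zeros(n)
    r = b.copy()             # residual b - A @ x for the initial guess x = 0
    p = r.copy()             # search direction
    q = np.empty(n)          # work array, reused in every iteration
    rho = r @ r
    threshold = max(atol, rtol * np.linalg.norm(b))

    for _ in range(maxiter):
        if np.sqrt(rho) <= threshold:
            return x, 0      # 0 signals convergence
        np.matmul(A, p, out=q)   # q = A @ p written into the existing buffer
        alpha = rho / (p @ q)
        x += alpha * p       # x is updated in place (alpha * p is a temporary)
        r -= alpha * q
        rho_new = r @ r
        p *= rho_new / rho   # p = r + beta * p, done in two in-place steps
        p += r
        rho = rho_new

    return x, (0 if np.sqrt(rho) <= threshold else maxiter)
```

The only per-iteration allocations left are the small temporaries from `alpha * p` and `alpha * q`; those could be removed with a second work buffer if profiling showed it mattered.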

ev-br · 2023-04-30T17:13:23Z

Wow just wow.

ilayn · 2023-05-13T10:33:57Z

I think this is close to the finished product. I'll stabilize the tests and the tolerance adjustments, which are mostly due to switching from random.seed to rng and hence unearthing some unnecessarily tight tolerances that work only for those particular problems.
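
As a rough illustration of why switching from random.seed to rng can disturb tests, the sketch below builds a random matrix both ways with the same seed. The seed value and matrix shape are arbitrary choices for the example, not taken from the test suite.

```python
import numpy as np

# Legacy style: seeds the process-wide global state, so any other code that
# draws from np.random changes what a later test sees.
np.random.seed(1234)
A_legacy = np.random.rand(4, 4)

# Generator style: a local, explicitly seeded stream owned by the test.
rng = np.random.default_rng(1234)
A_new = rng.random((4, 4))

# The two APIs use different bit generators, so the same seed yields
# different matrices; a tolerance tuned to A_legacy need not hold for A_new.
same_data = np.array_equal(A_legacy, A_new)
```

Because the test problems themselves change, a tolerance that happened to pass for the old data may have to be loosened for the new one.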

There is a GMRES issue about a premature "breakdown" but once that's fixed there are no showstoppers.

However, there are many, many issues that have piled up over time without being addressed and that we have to address, hopefully with more actual users of these tools. These are mostly tolerance handling, callback strategy unification, and solver performance in general, such as missing stagnation checks and proper display of progress during iterations.

I am going to just make them good enough, so that we don't have new features (except deprecations), and leave it at that in this PR. My hope is that now that they are in Python, more eyes can see the suboptimal parts and potentially fix them.

I would really appreciate all feedback.

scipy/sparse/linalg/_isolve/iterative.py

h-vetinari

I haven't checked the implementation, but I think the tests should be a good validation for now.

I have four concerns/desires:

clarify the situation w.r.t. tol= vs. rtol= also in the tests file; i.e. don't split up the calls everywhere but filter out the DeprecationWarning and add a todo to change everything from tol= to rtol=.
all stylistic changes (~black, rng clean-up, etc.) in the test file should be done in a separate PR IMO - that way the diff here becomes much less noisy and we could say "see, only the implementation changed"
I don't like the duplication of the solver list all over the place, would prefer to turn it into a fixture that's auto-injected as soon as a test has solver in the signature. I've left some suggestions to implement this
Also, I would consider leaving assert_, not least because PEP 679 might someday land in CPython, and we could avoid some churn. If you don't buy this argument, then I feel that assert (...) is strictly worse than assert ... because the whitespace in between now has semantic significance -- so if you want to change it, then please also remove the parentheses.

I think the last three could be tackled in preparatory PRs. If you want, I can help with the fixture one.

h-vetinari · 2023-05-15T00:56:22Z

scipy/sparse/linalg/_isolve/tests/test_iterative.py

+    if solver in [cg, cgs, bicg, bicgstab, gmres, qmr]:
+        x, info = solver(A, b, x0=x0, rtol=tol, maxiter=1, callback=callback)
+    else:
+        x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback)


So IIUC, the solvers that are not being touched by this PR would now need the same tol-vs-rtol treatment to ensure we have a homogeneous API here?

I would be tempted to write this as follows, so that it's more self-evident what still needs to be done (and to switch the test for 1.13)

Suggested change

-    if solver in [cg, cgs, bicg, bicgstab, gmres, qmr]:
-        x, info = solver(A, b, x0=x0, rtol=tol, maxiter=1, callback=callback)
-    else:
-        x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback)
+    with suppress_warnings() as sup:
+        sup.filter(DeprecationWarning, ".*keyword argument 'tol' is deprecated.*")
+        # TODO: ensure all solvers support rtol keyword
+        x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback)

h-vetinari · 2023-05-15T01:11:14Z

scipy/sparse/linalg/_isolve/tests/test_iterative.py

+    if solver in [cg, cgs, bicg, bicgstab, gmres, qmr]:
+        x, info = solver(A, b, M=precond, x0=x0, rtol=tol)
+    else:
+        x, info = solver(A, b, M=precond, x0=x0, tol=tol)


Same here as before: prefer to use tol= consistently with warning filter & todo

h-vetinari · 2023-05-15T01:16:33Z

scipy/sparse/linalg/_isolve/tests/test_iterative.py

+            if solver in [cg, cgs, bicg, bicgstab, gmres]:
+                tol_kwarg = {'rtol': tol}
+            else:
+                tol_kwarg = {'tol': tol}
+            x, info = solver(A, b, **tol_kwarg)


Same here. This is already in suppress_warnings() block, so we just need to add one more filter on top

ilayn · 2023-05-15T03:57:52Z

Thanks a lot @h-vetinari. There is indeed quite some work to do. I hope to address them this evening. In general I agree with almost everything except the commit hygiene, which is exceptionally difficult given the scope.

clarify the situation w.r.t. tol= vs. rtol= also in the tests file; i.e. don't split up the calls everywhere but filter out the DeprecationWarning and add a todo to change everything from tol= to rtol=.

This one is tricky because if I start touching all solvers, it would definitely be impossible to review. The tricky part is that we are currently carrying 3 (or 4, depending on how you count) different legacy ways and deprecating 2 of them simultaneously. I can touch the others, but it won't make it easier to review. So at some point this has to be a mess.

all stylistic changes (~black, rng clean-up, etc.) in the test file should be done in a separate PR IMO - that way the diff here becomes much less noisy and we could say "see, only the implementation changed"

Maybe, but it won't have less clutter because there is unfortunately no such separation. When you switch to rng, for example, many tests start to fail. It's almost as if the test precision was tailored manually. Hence it is a bit tricky.

I find it easier to handle in the split view with the hide-whitespace option enabled. But keeping track of the test results and commit separation simultaneously is a bit too much for me. I'm barely holding the logic in my head. And frankly, commit history hygiene is a bit of a cargo cult when you are changing hundreds of lines.

I don't like the duplication of the solver list all over the place, would prefer to turn it into a fixture that's auto-injected as soon as a test has solver in the signature. I've left some suggestions to implement this

Fixtures would also make things very cluttered, mind you, but I agree. Apparently there were way too many springs in which we skipped the cleaning for this module.

Also, I would consider leaving assert_, not least because PEP 679 might someday land in CPython, and we could avoid some churn. If you don't buy this argument, then I feel that assert (...) is strictly worse than assert ... because the whitespace in between now has semantic significance -- so if you want to change it, then please also remove the parentheses.

I have no opinion about this anymore. For some years we tried to get rid of assert_ through pytest adoption; now the wind is probably turning again. Both assert and assert_ are terrible in my opinion, and assert_ is extra ugly with the underscore signifying the homemade wonkiness. I didn't go around and touch all those lines, since it really becomes messy then. I could just rewrite all the tests.

I think the last three could be tackled in preparatory PRs. If you want, I can help with the fixture one.

Probably I can separate those PRs only after we are done with everything. But like I mentioned, there is no organic division that would still make the tests pass. Certain things need to be touched simultaneously. I'll first get them passing to an acceptable level, and then let's reconvene and assess what we can do to modernize the whole thing.

h-vetinari · 2023-05-15T04:12:03Z

clarify the situation w.r.t. tol= vs. rtol= also in the tests file; i.e. don't split up the calls everywhere but filter out the DeprecationWarning and add a todo to change everything from tol= to rtol=.

This one is tricky because if I start touching all solvers then it would be definitely impossible to review.

My proposal was to currently leave the tests with passing tol= and filtering the deprecation warning. That way you don't have to do everything at once, but it's still clear what needs to be done (even from reading the tests).

h-vetinari · 2023-05-15T05:05:07Z

all stylistic changes (~black, rng clean-up, etc.) in the test file should be done in a separate PR IMO - that way the diff here becomes much less noisy and we could say "see, only the implementation changed"

Maybe but it won't have less clutter because there is unfortunately no such separation. Because when you switch to rng, for example, many tests start to fail. It's almost like the test precision is tailored manually. Hence it is a bit tricky.

OK, if the rng-changes are not just stylistic but functionally relevant, then let's exclude those. For everything else, there's #18462 ;-)

ilayn · 2023-05-15T05:23:10Z

I see that the changes in isolve are affecting optimize tests too (scipy/optimize/tests/test_nonlin.py::TestNonlin), and they also have mixed solvers. I guess I do need to do something about this.

h-vetinari · 2023-05-16T07:19:41Z

I think the last three could be tackled in preparatory PRs. If you want, I can help with the fixture one.

Alright, this has now happened with #18462 & #18463. Should be good to rebase (or merge) here. I'd still prefer

    with suppress_warnings() as sup:
        sup.filter(DeprecationWarning, ".*keyword argument 'tol' is deprecated.*")
        # TODO: ensure all solvers support rtol keyword
        x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback)

instead of

    if solver in [cg, cgs, bicg, bicgstab, gmres, qmr]:
        x, info = solver(A, b, x0=x0, rtol=tol, maxiter=1, callback=callback)
    else:
        x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback)

as noted above, for all those occurrences.
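
For readers unfamiliar with the pattern, the sketch below shows the same idea using only the standard library's `warnings` module instead of `suppress_warnings`. Both `toy_solver` and `call_with_legacy_tol` are hypothetical stand-ins written for this example; they are not SciPy functions.

```python
import warnings


def toy_solver(b, tol=None, rtol=1e-5):
    """Hypothetical stand-in for a solver whose `tol` keyword is deprecated."""
    if tol is not None:
        warnings.warn(
            "'toy_solver' keyword argument 'tol' is deprecated in favor of "
            "'rtol'",
            DeprecationWarning,
            stacklevel=2,
        )
        rtol = float(tol)
    return rtol


def call_with_legacy_tol(solver, b, tol):
    """Call any solver with tol= and silence only the matching warning."""
    with warnings.catch_warnings():
        warnings.filterwarnings(
            "ignore",
            message=".*keyword argument 'tol' is deprecated.*",
            category=DeprecationWarning,
        )
        # TODO (as in the suggestion): switch to rtol= once every solver
        # accepts it.
        return solver(b, tol=tol)
```

With this shape the call site is identical for solvers that already deprecate `tol=` and for those that do not, and the filter is narrow enough that unrelated deprecation warnings still surface.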

ilayn · 2023-05-16T08:46:29Z

I'd prefer that too but it means going through all solvers which is not that trivial and gets a bit too much out of the scope of this PR.

h-vetinari · 2023-05-16T08:50:19Z

I'd prefer that too but it means going through all solvers which is not that trivial and gets a bit too much out of the scope of this PR.

I must be doing quite a terrible job of explaining myself... Currently all solvers support tol=, which is why the test module can contain lines like x, info = solver(A, b, x0=x0, tol=tol, maxiter=1, callback=callback) without branches. In this PR, you're introducing an rtol= keyword for some solvers that supersedes & deprecates tol=.

What I'm saying is to leave the tests using tol=, and only filter out the warning, until we have universal support for rtol=. In other words, it reduces the scope of this PR (less test changes & the result is self-documenting), rather than increasing it.

ilayn · 2023-05-16T09:02:09Z

Yes, I think I have the same issue. Currently we are also changing the default values in the signature. So not providing those keywords leads to different behavior between the solvers I touched and the ones I didn't.

So tol=1e-5, atol=None, rtol=0. is not the same as tol=None, atol=0., rtol=1e-5. The legacy behavior is way too wonky, which is why I touched it.

The plan is in one of the issues here, but I couldn't find it; it is also on the mailing list. It is not that trivial to bunch them up with a fixture. I have to say, the idea is to get rid of the Fortran solvers, not to unify the test file.

j-bowhay

Sorry these are arriving in quite a trickle, I only have time for a sporadic skim now and then

j-bowhay · 2023-05-16T09:28:20Z

scipy/sparse/linalg/_isolve/iterative.py

+    tol : float, optional, deprecated
+
+        .. deprecated 1.11.0
+           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and


Suggested change

-           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and
+           `bicg` keyword argument `tol` is deprecated in favor of `rtol` and

j-bowhay · 2023-05-16T09:29:04Z

scipy/sparse/linalg/_isolve/iterative.py

+    tol : float, optional, deprecated
+
+        .. deprecated 1.11.0
+           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and


Suggested change

-           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and
+           `bicgstab` keyword argument `tol` is deprecated in favor of `rtol` and

j-bowhay · 2023-05-16T09:29:24Z

scipy/sparse/linalg/_isolve/iterative.py

+    tol : float, optional, deprecated
+
+        .. deprecated 1.11.0
+           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and


Suggested change

-           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and
+           `cg` keyword argument `tol` is deprecated in favor of `rtol` and

j-bowhay · 2023-05-16T09:29:42Z

scipy/sparse/linalg/_isolve/iterative.py

+    tol : float, optional, deprecated
+
+        .. deprecated 1.11.0
+           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and


Suggested change

-           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and
+           `cgs` keyword argument `tol` is deprecated in favor of `rtol` and

j-bowhay · 2023-05-16T09:30:19Z

scipy/sparse/linalg/_isolve/iterative.py

+    tol : float, optional, deprecated
+
+        .. deprecated 1.11.0
+           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and


Suggested change

-           `gmres` keyword argument `tol` is deprecated in favor of `rtol` and
+           `qmr` keyword argument `tol` is deprecated in favor of `rtol` and

h-vetinari · 2023-05-16T09:42:35Z

So tol=1e-5, atol=None, rtol=0. is not the same as tol=None, atol=0., rtol=1e-5. The legacy behavior is behaving way too wonky which is way I touched it.

OK, I understand. But does tol=1e-5, atol=None, rtol=None before this PR still mean the same as tol=1e-5, atol=None, rtol=None after this PR? That's all that's required to keep a line like x, info = solver(A, b, x0=x0, tol=tol, maxiter=1) unchanged both in terms of code & outcome.

The plan is in one of the issues here but I couldn't find it.

I think you mean this.

It is not that trivial to bunch them up with a fixture.

The fixturization is already done (but that's not a substantial change from the current state of this PR, which already had tests templated over solver).

I have to say, the idea is to get rid of Fortran solvers not unifying test file.

Sure, and it's a worthwhile goal. I'm trying to avoid unnecessary churn in the tests (and where that's not possible, to ensure the state they're left in is self-explanatory without doing git archeology).

ilayn · 2023-05-17T09:31:09Z

But does tol=1e-5, atol=None, rtol=None before this PR still mean the same as tol=1e-5, atol=None, rtol=None after this PR?

No because the default of tol changed to None and fires up a different warning to not to use tol anymore. And requires another warning filter.

h-vetinari · 2023-05-17T09:55:27Z

No because the default of tol changed to None and fires up a different warning to not to use tol anymore. And requires another warning filter.

The warning can be filtered (as my example contains), and tol=1e-5, atol=None (the default before this PR) got transformed through _get_atol to do the equivalent of the new default tol=1e-5, atol=0., namely: both before & after, tol=x (without atol) defaults to a relative accuracy of 1e-5.

Looking closer at the code to actually verify that the default behaviour (on a meta-level) did not change, I noticed also that rather than repeating

    if tol is not None:
        msg = ("'scipy.sparse.linalg.cg' keyword argument 'tol' is "
               "deprecated in favor of 'rtol' and will be removed in SciPy "
               "v.1.13.0. Until then, if set, will override 'rtol'.")
        warnings.warn(msg, category=DeprecationWarning, stacklevel=3)
        rtol = float(tol)

    if isinstance(atol, str):
        warnings.warn("scipy.sparse.linalg.cg called with `atol` set to "
                      "string, possibly with value 'legacy'. This behavior "
                      "is deprecated and atol parameter only excepts floats."
                      " In SciPy 1.13, this will result with an error.",
                      category=DeprecationWarning, stacklevel=3)

    atol = max(float(atol), rtol * float(np.linalg.norm(b)))

in each function, it'd be better to keep _get_atol and move all the logic there, rather than copying this chunk into all solvers.

ilayn · 2023-05-17T10:20:02Z

I think we are again getting into the preference land. Repeated or not they are going to be removed in two versions. So DRY principle is not applicable here. Explicit removal is much better in these cases than implicit, because then you know what is touched and what isn't.

This is already a very large PR, and I really cannot keep track of which changes are yours and which are mine to resolve the conflicts; after the rebase all tests are failing again, so I don't see how this is helping.

Like I said, if we are going to filter new warnings, that's pretty much an if/else-block type of intervention, so if you don't mind I'm going to finish this PR with what I have, and then we can retouch it if there is any need.

h-vetinari · 2023-05-17T11:12:34Z

Repeated or not they are going to be removed in two versions. So DRY principle is not applicable here.

Any later changes to that logic, or even the warning itself, must now be duplicated X times (who knows what happens in the next year that causes this to be touched) - how is that helpful? It's IMO a trivial request - keep the function that was already there for that purpose.

I think we are again getting into the preference land.

What's a review if not a collection of preferences (that are held at varying degrees of intensity)? But I see that we are not progressing, so I'll leave the review of this to someone else.

ilayn · 2023-05-17T11:28:11Z

I am not really following anymore. We do things in separate PRs for a lot less in many other Pull Requests. It was PEP8 fine let's do it. It was the tests fine let's do it. Now it is the code with very very delicate state and I say maybe we leave afterwards, and I'm getting this comment? I mean come on.

Let me close this and handle the rebase on the fresh copy as it is less work.

h-vetinari · 2023-05-17T11:42:39Z

Now it is the code with very very delicate state and I say maybe we leave afterwards, and I'm getting this comment? I mean come on.

No offense intended. From my POV: I asked for things that are IMO reasonable and relevant, you tell me repeatedly that they're not possible or not useful. Rather than block your progress with opinions that you either don't share or don't want to take into account, I'm taking myself out of the picture.

ilayn · 2023-05-17T11:46:08Z

Fine, my bad then. I don't care about being right. I am already running out of patience with this code so let's be done with it in the next PR.

ilayn added the enhancement, scipy.sparse.linalg, Documentation, and maintenance labels Apr 30, 2023

ilayn marked this pull request as draft April 30, 2023 15:06

ilayn mentioned this pull request May 2, 2023

BUG: Missing symbol when used with reference LAPACK: cblas_cdotc_sub #18371

Closed

ilayn added 7 commits May 6, 2023 16:19

MAINT:sparse.linalg:cg: Replace Fortran code

1522381

MAINT:sparse.linalg:cgs: Replace Fortran code

6ed8d35

MAINT:sparse.linalg:bicg: Replace Fortran code

4714f8c

MAINT:sparse.linalg:bicgstab: Replace Fortran code

c51af13

MAINT:sparse.linalg:qmr: Replace Fortran code

30a005d

[skip ci]

TST:sparse.linalg:Remove Nonentrancy test for Python code

9502851

[skip ci]

MAINT:sparse.linalg:gmres: Replace Fortran code

5a1324c

[skip ci]

ilayn force-pushed the cg_no_fortran branch from 607668c to 5a1324c Compare May 6, 2023 14:19

ilayn added 5 commits May 6, 2023 16:22

TST:sparse.linalg: Remove reentrancy tests altogether

281c7cf

[skip ci]

DOC:sparse.linalg:Rewrite iterative.py docstrings

312e0bf

[skip ci]

MAINT:sparse.linalg:fix atol and rtol defaults

7fda7cd

[skip ci]

TST:sparse.linalg:First iteration of all tests and code fixes

ff276bd

[skip ci]

TST:sparse.linalg:All tests pass except for GMRES

cf6ff20

[skip ci]

ilayn mentioned this pull request May 9, 2023

Accept multiple matrices in scipy.linalg.expm #12838

Closed

MAINT:sparse.linalg:All solvers stably pythonized sans tolerance adju…

8a7bc10

…stments

ilayn marked this pull request as ready for review May 13, 2023 10:28

j-bowhay reviewed May 13, 2023

scipy/sparse/linalg/_isolve/iterative.py (outdated)

j-bowhay reviewed May 13, 2023

scipy/sparse/linalg/_isolve/iterative.py (outdated)

ilayn added 3 commits May 14, 2023 15:12

TST:sparse.linalg:Tolerance adjustments for Pythonized iterative solvers

3c56571

DOC:sparse.linalg:Fix tolerance related text and minor clean-up

34108d0

MAINT:sparse.linalg:Remove redundant atol checks

2627735

h-vetinari reviewed May 15, 2023

h-vetinari mentioned this pull request May 15, 2023

MAINT: Clean up scipy/sparse/linalg/_isolve/tests/test_iterative.py #18462

Merged

h-vetinari mentioned this pull request May 15, 2023

MAINT: parametrize scipy/sparse/linalg/_isolve/tests/test_iterative.py #18463

Merged

j-bowhay requested changes May 16, 2023

ilayn closed this May 17, 2023

ilayn mentioned this pull request May 18, 2023

MAINT:ENH:sparse.linalg: Rewrite iterative solvers in Python #18488

Merged

ilayn deleted the cg_no_fortran branch June 5, 2023 20:41
