New methods for sparse graphs backend to enumerate neighbors in linear time #38167

cyrilbouvier · 2024-06-07T15:07:11Z

As noted in issue #37642, enumerating the neighbors of a vertex for a graph that use the sparse backend (the default in Sage) is not linear on the number of neighbors (there is an extra log factor).
The goal of this PR is to fix this.

The problem is that to enumerate the neighbors of a vertex, the current code calls iteratively _next_neighbor_unsafe (or one of its out or in variant). But this method return an int corresponding to a neighbor, so to go to the next neighbor, it was first necessary to retrieve this vertex* in the structure storing all the neighbors. As it is stored in a sorted binary tree, the cost of retrieving is O(log(#neighbors)).

*(or a close one if it was deleted in the meantime)

To obtain a linear complexity, I wrote a new method that do a simple tree traversal which is linear in the number of nodes in the tree (i.e., the number of neighbors). As this new method does not have a similar interface as the previous _next_neighbor_unsafe, some more rewriting was necessary to use it in other parts of the code.

I wrote new methods for the SparseGraph class: out_neighbors_unsafe and in_neighbors_unsafe. They overwrite the ones from the base class and rely on the new methods _neighbors_unsafe and _neighbors_BTNode_unsafe.
The _neighbors_BTNode_unsafe is a low-level method enumerating the neighbors in linear time.
I rewrote the two methods out_neighbors_BTNode_unsafe and in_neighbors_BTNode_unsafe to use the new _neighbors_BTNode_unsafe.
I wrote a new method _iterator_edges method for SparseGraphBackend to overwrite the one from the base class
CGraphBackend, in order to expose the low-level code to the Graph class.

Question:
During the writing of this PR, I notice that the methods out_neighbors_BTNode_unsafe and in_neighbors_BTNode_unsafe are only defined for the sparse backend and are not used anywhere in the code.
Can I remove them ? Or should they be kept for compatibility ?

📝 Checklist

The title is concise and informative.
The description explains in detail what this PR is about.
I have linked a relevant issue or discussion.
[] I have created tests covering the changes.
[] I have updated the documentation and checked the documentation preview.

⌛ Dependencies

It calls the new method from previous commit with linear complexity

cyrilbouvier · 2024-06-07T15:08:53Z

This PR is necessary to fix #37642 but is not sufficient: PR #37662 is also necessary.

dcoudert

Question:
During the writing of this PR, I notice that the methods out_neighbors_BTNode_unsafe and in_neighbors_BTNode_unsafe are only defined for the sparse backend and are not used anywhere in the code.
Can I remove them ? Or should they be kept for compatibility ?

These are old functions that are apparently no longer used. I think you can remove them.

src/sage/graphs/base/sparse_graph.pyx

dcoudert · 2024-06-09T15:24:48Z

src/sage/graphs/base/sparse_graph.pyx

+    """
+    if modus == 0:
+        return False
+    if modus == 1 or modus == 2:


you could reorder the tests to check if modus == 3 and otherwise return True

Does 51933f0 correspond to what you had in mind ?

cyrilbouvier · 2024-06-10T09:15:53Z

These are old functions that are apparently no longer used. I think you can remove them.

done in f04678c

dcoudert · 2024-06-10T09:49:26Z

These are old functions that are apparently no longer used. I think you can remove them.

done in f04678c

yes.

dcoudert · 2024-06-10T09:51:53Z

something goes wrong with the CI. Before, this PR was inducing a segfault...

cyrilbouvier · 2024-06-10T12:58:04Z

something goes wrong with the CI. Before, this PR was inducing a segfault...

I fixed it: I wrote two line of code to compute the max degree of the graph but I used num_verts instead of the size of active_vertices (+ test if the bit is active in the bitset) to iterate over the vertices of the graph. It created out of bound memory access in the rest of the code. My bad. (see e5f4299)

There is still some failing tests with labels. I am working on it.

github-actions · 2024-06-10T13:58:14Z

Documentation preview for this PR (built with commit b7e644e; changes) is ready! 🎉
This preview will update shortly after each push to this PR.

src/sage/graphs/base/sparse_graph.pyx

dcoudert

LGTM.

dcoudert · 2024-06-11T09:47:17Z

This is a nice improvement. Thank you.

sagemathgh-38167: New methods for sparse graphs backend to enumerate neighbors in linear time       As noted in issue sagemath#37642, enumerating the neighbors of a vertex for a graph that use the sparse backend (the default in Sage) is **not** linear on the number of neighbors (there is an extra log factor). The goal of this PR is to fix this. The problem is that to enumerate the neighbors of a vertex, the current code calls iteratively _next_neighbor_unsafe (or one of its out or in variant). But this method return an int corresponding to a neighbor, so to go to the next neighbor, it was first necessary to retrieve this vertex* in the structure storing all the neighbors. As it is stored in a sorted binary tree, the cost of retrieving is O(log(#neighbors)). *(or a close one if it was deleted in the meantime) To obtain a linear complexity, I wrote a new method that do a simple tree traversal which is linear in the number of nodes in the tree (i.e., the number of neighbors). As this new method does not have a similar interface as the previous _next_neighbor_unsafe, some more rewriting was necessary to use it in other parts of the code. 1. I wrote new methods for the SparseGraph class: out_neighbors_unsafe and in_neighbors_unsafe. They overwrite the ones from the base class and rely on the new methods _neighbors_unsafe and _neighbors_BTNode_unsafe. The _neighbors_BTNode_unsafe is a low-level method enumerating the neighbors in linear time. 2. I rewrote the two methods out_neighbors_BTNode_unsafe and in_neighbors_BTNode_unsafe to use the new _neighbors_BTNode_unsafe. 3. I wrote a new method _iterator_edges method for SparseGraphBackend to overwrite the one from the base class CGraphBackend, in order to expose the low-level code to the Graph class. Question: During the writing of this PR, I notice that the methods out_neighbors_BTNode_unsafe and in_neighbors_BTNode_unsafe are only defined for the sparse backend and are not used anywhere in the code. Can I remove them ? Or should they be kept for compatibility ? ### 📝 Checklist  - [x] The title is concise and informative. - [x] The description explains in detail what this PR is about. - [x] I have linked a relevant issue or discussion. - [] I have created tests covering the changes. - [] I have updated the documentation and checked the documentation preview. ### ⌛ Dependencies    URL: sagemath#38167 Reported by: cyrilbouvier Reviewer(s): cyrilbouvier, David Coudert

cyrilbouvier added 3 commits June 7, 2024 16:41

New methods of sparse graph to enumerate neighbors in linear time

b246b63

New _iterator_edges method for SparseGraph

a40aa66

It calls the new method from previous commit with linear complexity

Small typo in documentation

db73217

dcoudert reviewed Jun 9, 2024

View reviewed changes

cyrilbouvier added 2 commits June 10, 2024 10:44

graphs: remove unused functions [in|out]_neighbors_BTNode_unsafe

f04678c

graphs: remove extra space before (

dc3fb05

github-actions bot added the v: minimal label Jun 10, 2024

cyrilbouvier added 2 commits June 10, 2024 11:17

graphs: reorganize test if utility function

51933f0

Merge branch 'develop' into fix-graphs-out-neighbors-for-sparse-backend

6a2b3b5

cyrilbouvier added 2 commits June 10, 2024 14:52

graphs: fix typo + rewrite _reorganize_edge in another file

cb95310

graphs: fix memory error

e5f4299

github-actions bot added v: large and removed v: minimal labels Jun 10, 2024

graphs: fix handling of labels

4ef6424

dcoudert suggested changes Jun 10, 2024

View reviewed changes

src/sage/graphs/base/sparse_graph.pyx Outdated Show resolved Hide resolved

graphs: use MemoryAllocator to free the memory in _iterator_edges

b7e644e

dcoudert approved these changes Jun 11, 2024

View reviewed changes

dcoudert assigned cyrilbouvier Jun 11, 2024

dcoudert added the s: positive review label Jun 11, 2024

vbraun merged commit cee5890 into sagemath:develop Jun 22, 2024

github-actions bot removed the s: positive review label Jun 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

New methods for sparse graphs backend to enumerate neighbors in linear time #38167

New methods for sparse graphs backend to enumerate neighbors in linear time #38167

Uh oh!

cyrilbouvier commented Jun 7, 2024

Uh oh!

cyrilbouvier commented Jun 7, 2024

Uh oh!

dcoudert left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dcoudert Jun 9, 2024

Uh oh!

cyrilbouvier Jun 10, 2024

Uh oh!

cyrilbouvier commented Jun 10, 2024

Uh oh!

dcoudert commented Jun 10, 2024

Uh oh!

dcoudert commented Jun 10, 2024

Uh oh!

cyrilbouvier commented Jun 10, 2024

Uh oh!

github-actions bot commented Jun 10, 2024 •

edited

Loading

Uh oh!

Uh oh!

dcoudert left a comment

Uh oh!

dcoudert commented Jun 11, 2024

Uh oh!

Uh oh!

Uh oh!

New methods for sparse graphs backend to enumerate neighbors in linear time #38167

New methods for sparse graphs backend to enumerate neighbors in linear time #38167

Uh oh!

Conversation

cyrilbouvier commented Jun 7, 2024

📝 Checklist

⌛ Dependencies

Uh oh!

cyrilbouvier commented Jun 7, 2024

Uh oh!

dcoudert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dcoudert Jun 9, 2024

Choose a reason for hiding this comment

Uh oh!

cyrilbouvier Jun 10, 2024

Choose a reason for hiding this comment

Uh oh!

cyrilbouvier commented Jun 10, 2024

Uh oh!

dcoudert commented Jun 10, 2024

Uh oh!

dcoudert commented Jun 10, 2024

Uh oh!

cyrilbouvier commented Jun 10, 2024

Uh oh!

github-actions bot commented Jun 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

dcoudert left a comment

Choose a reason for hiding this comment

Uh oh!

dcoudert commented Jun 11, 2024

Uh oh!

Uh oh!

github-actions bot commented Jun 10, 2024 •

edited

Loading