Skip to content

Conversation

rjurney
Copy link
Collaborator

@rjurney rjurney commented Feb 16, 2025

This is a documentation only change, the docs part of #473. The docs build and look okay when I serve them.

What changes were proposed in this pull request?

I added a network motif tutorial and updated / cleaned the docs generally. I added the new google group and the new logo to the images folder.

Why are the changes needed?

The docs were wrong in a few ways and there weren't even instructions in the README, which stopped new users from onboarding.

@rjurney rjurney requested a review from WeichenXu123 February 16, 2025 04:20
@codecov-commenter
Copy link

codecov-commenter commented Feb 16, 2025

⚠️ Please install the 'codecov app svg image' to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

All modified and coverable lines are covered by tests ✅

Please upload report for BASE (rjurney/motif-tutorial-code-min@0b566d0). Learn more about missing BASE report.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@                        Coverage Diff                         @@
##             rjurney/motif-tutorial-code-min     #511   +/-   ##
==================================================================
  Coverage                                   ?   90.73%           
==================================================================
  Files                                      ?       18           
  Lines                                      ?      896           
  Branches                                   ?      103           
==================================================================
  Hits                                       ?      813           
  Misses                                     ?       83           
  Partials                                   ?        0           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@rjurney rjurney changed the title Documentation cleanup and update. Added a Motif Finding tutorial. Documentation cleanup and update. Added a motif finding tutorial. Feb 16, 2025
@rjurney
Copy link
Collaborator Author

rjurney commented Feb 16, 2025

This project should be merged following #512 and #513, with #513.

…mes/graphframes into rjurney/motif-tutorial-docs
@rjurney rjurney changed the base branch from master to rjurney/motif-tutorial-code-min February 17, 2025 21:32
@rjurney
Copy link
Collaborator Author

rjurney commented Feb 17, 2025

Okay @bjornjorgensen @SauronShepherd @SemyonSinchenko I am done here. I would maybe suggest reviewing it by reading docs/README.md and running jekyll to view the documentation. Give it a once over. I went through and fixed anything that was old, broken, out of date version, etc. I added new community resources and the graphframes-py package I reserved that will go out with 0.8.5. It's a big PR but this is english not code, and it cross-refers to itself and depends on the changes in the other two PRs this is based on: #512 <-- #518 <-- #511

@rjurney rjurney changed the title Documentation cleanup and update. Added a motif finding tutorial. 3 of 3: Documentation cleanup and update. Added a motif finding tutorial. Feb 18, 2025
@rjurney rjurney changed the base branch from rjurney/motif-tutorial-code-min to master February 21, 2025 12:32
@rjurney
Copy link
Collaborator Author

rjurney commented Feb 21, 2025

@SemyonSinchenko this one is hard to review... lots of text changes. I have double checked it, but appreciate any review you can give :)

Copy link
Collaborator

@SemyonSinchenko SemyonSinchenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM overall, thank you! The only important comment is about licenses. Other comments are minor.

API for motif finding. The user also benefits from DataFrame performance optimizations
within the Spark SQL engine.
This is a package for DataFrame-based graphs on top of Apache Spark. Users can write highly expressive queries by leveraging the DataFrame API, combined with a new API for network motif finding. The user also benefits from DataFrame performance optimizations within the Spark SQL engine. GraphFrames works in Java, Scala, and Python.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's add a note about deprecation of GraphX in Spark 4.0? Something like "With deprecation of GraphX in Apache Spark Core GraphFrames is the recommended approach for the distributed graphs processing."?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@SemyonSinchenko for the moment - and we can get your own PR to do this if you want - I prefer to pretend that didn't happen. It isn't good for the project and we have a ways to go before merging with Spark.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where is this one taken? I can assume that it is from a scientific paper that may have a license like Creative Commons or a proprietary one. I think it is important to maintain IP clearance if we are going to move GraphFrames to the Spark core one day. Could you please add a source or even better to add a NOTICE.md file to the root of project with mentioning of the source of that picture and how is it licensed?

Copy link
Collaborator Author

@rjurney rjurney Feb 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@SemyonSinchenko It's from this paper, which the caption and image itself lead to if you click: https://www.computer.org/csdl/journal/tb/2017/06/07501518/13rRUxcsYKf

The thing is... there is no open license image of graphlets, let alone directed graphlets. What if I include it now and seek permission in the meanwhile? Then I can figure out some alternative and in the meanwhile people can still learn about graphlets? Worst case I can generate them all in networkx or something.

Copy link
Collaborator Author

@rjurney rjurney Feb 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I reached out to the authors, asking for permission. It would be best to secure their permission. In reading this guide to fair use, I believe this image may qualify: https://www.copyrightlaws.com/simple-guide-fair-use/

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I got permission to use the image :)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same

rjurney and others added 2 commits February 21, 2025 15:31
Co-authored-by: Sem <ssinchenko@apache.org>
Co-authored-by: Sem <ssinchenko@apache.org>
@rjurney rjurney merged commit 7c08186 into master Feb 21, 2025
6 checks passed
@rjurney rjurney deleted the rjurney/motif-tutorial-docs branch April 15, 2025 00:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants