Integrate existing Pixie cell clustering process with generic cell clustering process #885

alex-l-kong · 2023-01-19T19:24:53Z

What is the purpose of this PR?

Closes #850. Closes #881. The existing Pixie pipeline will need some modifications to support generic cell clustering.

How did you implement your changes

Main change is the addition of a preprocessing step outside of train_cell_som. This will help generate the cell clustering specific files (SOM input data based on pixel clusters and weighted cell channel expression) independent of the eventual notebook for generic cell clustering.

Additionally, it's better to pass the full list of expression columns to use for cell clustering as opposed to inferring them from a pixel cluster prefix. This will also make integration with generic cell clustering easier.

…cess

review-notebook-app · 2023-01-19T19:24:58Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

…ge cache

alex-l-kong · 2023-01-20T19:57:39Z

Tested that the updated pipeline works. The following changes have been made:

Preprocess cluster_counts_size_norm and weighted_cell_channel prior to train_cell_som
Update the parameters accordingly
Pass the full list of expression columns to use for SOM training (as opposed to inferring from pixel_cluster_col_prefix)

ngreenwald

Looks good to me, will wait for Candace to weigh in.

After she's happy with this, I think opening a separate branch off of this one makes sense to begin constructing the notebook 3b. It will probably be the case that as you put that notebook together you realize that additional things may need to change

templates/3_Pixie_Cluster_Cells.ipynb

…rocess_pixie_cell

cliu72

Looks good!

* First draft of generic cell clustering process * Ensure the None checks run properly for weighted cell channel validation * Attempt to add tests to 3b notebook * Ensure generic cell clustering integrated with notebooks * Add description of generalized cell inputs to data_types.md, and ensure generalized inputs run without cell_size col * Add generic cell clustering process to the README * Remove 3b notebook from README for now * Patch up remaining errors * Move cell cluster summary file generation to separate functions * Remove extraneous comments from notebook * Relocate segmentation variable settings to end of generic cell clustering notebook * Remove test for column mismatch in cluster_cells (needed to support generic cell clustering) * Massive renaming to avoid confusion of counts referring to generic cell clustering * PYCODESTYLE * Normalize new naming conventions; remove more counts refs * Fix parameter to test_generate_wc_avg_files * Update cell cluster pipeline to save Pixie results at the end, and not at intermediate steps * Remove comments from notebook 3 * Rename averaging functions to avoid ambiguity

alex-l-kong · 2023-02-07T20:19:31Z

We will definitely need to open a separate branch off of this to address the case where the user wants to re-run generic cell clustering with new columns specified. This is a similar issue that #903 addressed

alex-l-kong · 2023-02-08T18:05:38Z

Per @cliu72 and @ngreenwald I've renamed the notebook so it doesn't interfere with any reviewers. We'll merge this in and open a PR for any issues that do pop up in the future.

ngreenwald

@JLrumberger will you try out the new notebook with the network output from a subset of the TONIC data to see how it looks? And also see if any issues come up? You can set up a time to meet with @alex-l-kong to go over it and make sure everything looks good.

templates/3_Pixie_Cluster_Cells.ipynb

cliu72

Looks good to me. Just one small comment thing.

… preprocess_pixie_cell

alex-l-kong · 2023-02-17T00:01:25Z

@ngreenwald per our meeting today, we'll get this merged in for the time being, then have @JLrumberger open up any subsequent issues that come up with his runs.

src/ark/settings.py

… preprocess_pixie_cell

…rocess_pixie_cell

alex-l-kong added 2 commits January 18, 2023 16:28

Include explicit preprocessing steps for cell SOM input data

7f94c21

Add tests for helper cell cluster functions that changed with new pro…

a5905bd

…cess

alex-l-kong self-assigned this Jan 19, 2023

alex-l-kong changed the title ~~Preprocess pixie cell~~ Integrate existing Pixie cell clustering process with generic cell clustering process Jan 19, 2023

alex-l-kong and others added 3 commits January 19, 2023 13:42

Make a minor comment change to trigger a new build with updated packa…

03d85c6

…ge cache

Explicitly install charset-normalizer to fix dependency issues

f6f09a6

Merge branch 'main' into preprocess_pixie_cell

23c7ef1

alex-l-kong requested review from cliu72 and ngreenwald January 20, 2023 19:56

ngreenwald reviewed Jan 20, 2023

View reviewed changes

Merge branch 'main' into preprocess_pixie_cell

8f117f8

cliu72 reviewed Jan 24, 2023

View reviewed changes

templates/3_Pixie_Cluster_Cells.ipynb Show resolved Hide resolved

alex-l-kong added 2 commits January 24, 2023 10:42

Save unnormalized cluster counts for reference purposes

82a62d4

Merge remote-tracking branch 'origin/preprocess_pixie_cell' into prep…

3e89d2f

…rocess_pixie_cell

cliu72 approved these changes Jan 25, 2023

View reviewed changes

Merge branch 'main' into preprocess_pixie_cell

9c92e38

alex-l-kong mentioned this pull request Feb 4, 2023

Add notebook to run generic cell clustering #892

Merged

alex-l-kong added 2 commits February 6, 2023 11:39

Merge branch 'main' into preprocess_pixie_cell

8d85f03

Temporarily rename 3b_Generic_Cluster_Cells for Candace's paper

ca38b86

alex-l-kong requested a review from ngreenwald February 8, 2023 18:04

ngreenwald requested review from cliu72 and JLrumberger February 8, 2023 21:28

ngreenwald requested changes Feb 8, 2023

View reviewed changes

cliu72 reviewed Feb 14, 2023

View reviewed changes

templates/3_Pixie_Cluster_Cells.ipynb Show resolved Hide resolved

cliu72 reviewed Feb 14, 2023

View reviewed changes

alex-l-kong and others added 9 commits February 15, 2023 15:35

Remove unnecessary comment

58eb31e

Merge branch 'main' of https://github.com/angelolab/ark-analysis into…

3b8e7f7

… preprocess_pixie_cell

Address hidden merge conflicts errors

a6f50fa

Purge old HuggingFace references

4d07482

Merge branch 'main' into preprocess_pixie_cell

2371c59

Add seed param to cell_cluster_utils

4ada58b

Purge rmtree call

88ea83f

Change from tmi to alpineer

1246ab0

Merge branch 'main' into preprocess_pixie_cell

b5578d6

alex-l-kong requested a review from ngreenwald February 17, 2023 00:00

ngreenwald approved these changes Feb 17, 2023

View reviewed changes

src/ark/settings.py Outdated Show resolved Hide resolved

alex-l-kong added 3 commits February 16, 2023 21:58

Merge branch 'main' of https://github.com/angelolab/ark-analysis into…

3d69e1e

… preprocess_pixie_cell

Change example dataset version back to main

7c3f9a5

Merge remote-tracking branch 'origin/preprocess_pixie_cell' into prep…

73fbe95

…rocess_pixie_cell

alex-l-kong merged commit 3a24d44 into main Feb 17, 2023

alex-l-kong deleted the preprocess_pixie_cell branch February 17, 2023 06:27

srivarra added the enhancement New feature or request label Mar 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Integrate existing Pixie cell clustering process with generic cell clustering process #885

Integrate existing Pixie cell clustering process with generic cell clustering process #885

Uh oh!

alex-l-kong commented Jan 19, 2023 •

edited

Loading

Uh oh!

review-notebook-app bot commented Jan 19, 2023

Uh oh!

alex-l-kong commented Jan 20, 2023

Uh oh!

ngreenwald left a comment

Uh oh!

Uh oh!

cliu72 left a comment

Uh oh!

alex-l-kong commented Feb 7, 2023

Uh oh!

alex-l-kong commented Feb 8, 2023

Uh oh!

ngreenwald left a comment

Uh oh!

Uh oh!

cliu72 left a comment

Uh oh!

alex-l-kong commented Feb 17, 2023

Uh oh!

Uh oh!

Uh oh!

Integrate existing Pixie cell clustering process with generic cell clustering process #885

Integrate existing Pixie cell clustering process with generic cell clustering process #885

Uh oh!

Conversation

alex-l-kong commented Jan 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 19, 2023

Uh oh!

alex-l-kong commented Jan 20, 2023

Uh oh!

ngreenwald left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cliu72 left a comment

Choose a reason for hiding this comment

Uh oh!

alex-l-kong commented Feb 7, 2023

Uh oh!

alex-l-kong commented Feb 8, 2023

Uh oh!

ngreenwald left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cliu72 left a comment

Choose a reason for hiding this comment

Uh oh!

alex-l-kong commented Feb 17, 2023

Uh oh!

Uh oh!

Uh oh!

alex-l-kong commented Jan 19, 2023 •

edited

Loading