Skip to content

Conversation

drroe
Copy link
Contributor

@drroe drroe commented Jan 31, 2022

Version 6.2.3.

This PR contains the following improvements.

  • Improve the speed of hierarchical agglomerative clustering by an order of magnitude.
  • Create data sets for Davies-Bouldin index, pseudo-F, and SSR/SST ratio clustering metrics.
  • Check for and fix a potential integer overflow when setting the random number generator seed from the wall clock.

@drroe
Copy link
Contributor Author

drroe commented Jan 31, 2022

Pytraj failure is because I added 3 new data sets during clustering, causing the offset used by pytraj to access cluster data to be invalid:

https://jenkins.jasonswails.com/blue/organizations/jenkins/amber-github%2Fpytraj/detail/pytraj/414/pipeline#step-22-log-663

@hainm is there any reason why we're using data[-2] and not data[0] or something? The former (two from the last I think?) means that every time a new data set is added to cluster the index becomes invalid. Can we make them [0]? I would test myself but I'm having issues getting pytraj compiled on my platform (unrelated glibc issues).

@hainm
Copy link
Contributor

hainm commented Jan 31, 2022

Can we make them [0]? I would test myself but I'm having issues getting pytraj compiled on my platform (unrelated glibc issues).

For the record: Amber-MD/pytraj#1598 (comment)

@drroe drroe merged commit caf123a into Amber-MD:master Jan 31, 2022
@drroe drroe deleted the cluster.work branch January 31, 2022 23:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants