In the early days of working for SHARCNET, my colleague and I decided to standardize how cluster metrics were computed across our internal data frames. As mentioned in a previous post, part of the solution was pandas.
The second part was figuring out how to deploy the package for others to contribute to, as well as install on their own specific HPC clusters. Some quick searching revealed that PyPI and pip were the way to go.
To make a long story short, here are a few references that made it approachable:
The package is still in use today inside SHARCNET and has also received development support from WestGrid, Calcul Québec, and MILA.
ViewClust can be found on GitHub. Its cousin package, ViewClust-Vis, implements several summary figures.