Installing SPIDAL Software

Jan 18, 2018  

common

SPIDAL libraries depend on the common project in DSC-SPIDAL github. We need to build it first.

 git clone https://github.com/DSC-SPIDAL/common.git
 cd common
 keytool -import -file ./resources/ricecert/cs.rice.edu.cer -keystore /tmp/riceKeyStore

You can change the name of the key store and the path to it if you prefer to. This command will first ask for a password, provide any password of your choosing with at least 6 characters then it will show the following

Trust this certificate? [no]: 

type “y” and then press enter. Now the SSL cert has been properly installed. Next use the following command to compile the code

mvn -Djavax.net.ssl.trustStore=/tmp/riceKeyStore clean install

DA-MDS

DA-MDS is the deterministic annealing implementation of Multidimensional Scaling algorithm. The project can be built from the source.

 git clone https://github.com/DSC-SPIDAL/damds.git
 cd damds
 mvn install

After building it will create a Jar file inside the target directory.

Usually people run MDS using a cluster manager such as slurm. We have provided a sample script in bin directory that can be used to run the program on a single machine.

Run example - local machine

DA-MDS is configured using a configuration file. A sample configuration file with two input files for distance and weight matrices can be found in examples/input folder.

Lets run this sample using the damds.sh file found in bin directory.

 cd bin
 ./damds.sh ../examples/input/config.properties 1 4 

The first argument to the script is the location of configuration file. Second argument is the number of nodes. In our case it is 1 as we run on local machine. The third argument is the number of processes. In this case we are using 4 processes.

After the program finished executing, the output will be in a file called damds-points.txt inside the bin directory.

You can upload this file to WebPlotViz using the following link to visualize the data. Note: you may need to register in WebPlotViz before you can upload the files.

https://spidal-gw.dsc.soic.indiana.edu/dashboard

Run example - slurm cluster

You can use the damds_slurm.sh found in bin directory to run the program in a slurm HPC cluster. In this example we assume that you have built damds in a shared file system where every node can access the data files with the same file path.

 cd bin
 ./damds_slurm.sh ../examples/input/config.properties 4 

The above command will run the example using 4 nodes.

DAPWC

Deterministic Annealing Pairwise Clustering (dapwc) is a scalable and parallel clustering program that operate on non vector space

 git clone https://github.com/DSC-SPIDAL/dapwc.git
 cd dapwc
 mvn install

After building it will create a Jar file inside the target directory.