Difference between revisions of "Installing Software"
(Created page with " <div class="noautonum"> == 1. Logging on == DeepSense has two login nodes, login1.deepsense.ca and login2.deepsense.ca . You can access these through SSH with your username...") |
|||
Line 18: | Line 18: | ||
For more info, see [[VPN Setup]]. | For more info, see [[VPN Setup]]. | ||
− | |||
== 2. Configure your environment == | == 2. Configure your environment == | ||
Line 46: | Line 45: | ||
To load the python environment run <code>conda activate</code> | To load the python environment run <code>conda activate</code> | ||
− | |||
You can add either line to your .bashrc file to automatically load the desired environment when you log in. | You can add either line to your .bashrc file to automatically load the desired environment when you log in. | ||
Line 52: | Line 50: | ||
==== Local python install (managed by individual user) ==== | ==== Local python install (managed by individual user) ==== | ||
− | + | '''Stop using systemwide anaconda''' | |
+ | |||
+ | If you added the system anaconda environment to your <code>.bashrc</code> file then remove the line: | ||
+ | . /opt/anaconda2/etc/profile.d/conda.sh | ||
+ | |||
+ | '''Installing Anaconda with a python3 base ''' | ||
+ | |||
+ | From your home directory run: | ||
+ | wget https://repo.continuum.io/archive/Anaconda3-5.2.0-Linux-ppc64le.sh | ||
+ | bash Anaconda3-5.2.0-Linux-ppc64le.sh | ||
+ | |||
+ | Note: please enter "yes" when asked if you want to add anaconda to your .bashrc file. If you do not then you will need to add the following command to your .bashrc file or run it each time before using anaconda: | ||
+ | . ~/anaconda3/etc/profile.d/conda.sh | ||
+ | |||
+ | After the installer ends you need to either close and restart your terminal or run: | ||
+ | source ~/.bashrc | ||
+ | |||
+ | '''Adding a python2 environment''' | ||
+ | |||
+ | The previous instruction creates a python3 base environment. To add a python2 environment: | ||
+ | conda create -n py27 python=2.7 | ||
+ | |||
+ | Activate this environment to use python3: | ||
+ | conda activate py27 | ||
+ | |||
+ | note: if you receive an error message then you may need to deactivate the base conda environment first: | ||
+ | conda deactivate | ||
+ | conda activate py27 | ||
+ | |||
+ | '''Adding a python3 environment''' | ||
+ | |||
+ | We recommend creating a separate python3 environment from the base environment. This makes it easier to install the specific packages required for IBM PowerAI. | ||
+ | conda create -n py36 python=3.6 | ||
+ | |||
+ | Activate this environment to use python3: | ||
+ | conda activate py36 | ||
+ | |||
+ | ==3. IBM-AI Deep Learning Anaconda Channel == | ||
+ | |||
+ | To use deep learning packages like Tensorflow on DeepSense you need to add the IBM-AI anaconda channel to your list of available software channels. | ||
+ | conda config --prepend channels https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/ | ||
+ | |||
+ | We suggest creating a new environment for each deep learning package you want to use. For example for Tensorflow: | ||
+ | conda create -n py36_tensorflow python=3.6 | ||
+ | conda activate py36_tensorflow | ||
+ | |||
+ | Then install the anaconda package for the software you need. Again, with Tensorflow as an example: | ||
+ | conda install tensorflow | ||
+ | |||
+ | You can then use tensorflow or other deep learning packages as needed by simply activating that anaconda environment. Unlike the old method, you do not need to specifically activate tensorflow or other deep learning methods. | ||
+ | |||
+ | You can directly visit the IBM-AI anaconda channel URL to see a list of available software (https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/) | ||
+ | |||
+ | |||
+ | === Install PyTorch 1.6.0 in a user's home directory on DeepSense === | ||
+ | |||
+ | A DeepSense user can install PyTorch by him/herself in his/her home directory using the already built packages in /sofware/PyTorch-1.6.0-Build. The current build only works with Python 3.6. So, a user needs to create a conda environment with Python 3.6. If a user would like to use higher versions of Python, they would need to ask DeepSense team to build PyTorch with those versions. </br> | ||
+ | Here are the steps that a normal DeepSense user install PyTorch 1.6.0 in his/her home directory.</br> | ||
+ | |||
+ | <b>1. Source the conda environment you would like to use. For example:</b> | ||
+ | |||
+ | <code>source anaconda3/etc/profile.d/conda.sh</code></br> | ||
+ | |||
+ | <b>2. Activate the environment you would use to install PyTorch. If the environment hasn't been created, a user can create it and install PyTorch in one command line. For example, if you would create an environment with name "my-environment" (This is just an example. Please choose a meaningful name for yourself.) and install PyTorch, you would run the following command:</b> | ||
+ | |||
+ | <code>conda create -y -n my-environment python=3.6 pytorch -c file:////software/PyTorch-1.6.0-Build/condabuild</code></br> | ||
+ | |||
+ | <b>3. If the environment has been created, say the name of the environment is "my-environment", you would need to activate the environment first and then install PyTorch. For example:</b> | ||
+ | |||
+ | <code>conda activate my-environment</br> | ||
+ | conda install pytorch -c file:////software/PyTorch-1.6.0-Build/condabuild</code></br> | ||
+ | |||
+ | This should take about 2 minutes to install.</br> | ||
+ | |||
+ | <b>4. To test if your install is successful, issue python from the environment where PyTorch is installed. Then run "import torch" to see if there are any errors. For example:</b></br> | ||
+ | <code>[luy@ds-lg-01 ~]$ conda activate my-environment</br> | ||
+ | (my-environment) [luy@ds-lg-01 ~]$ python</br> | ||
+ | Python 3.6.12 |Anaconda, Inc.| (default, Sep 9 2020, 00:40:10) </br> | ||
+ | [GCC 7.3.0] on linux</br> | ||
+ | Type "help", "copyright", "credits" or "license" for more information.</br> | ||
+ | >>> import torch</br> | ||
+ | >>> </code></br> | ||
+ | |||
+ | === Install Opencv 3.4.10 in a user's home directory on DeepSense === | ||
+ | A DeepSense user can install Opencv by him/herself in his/her home directory using the already built packages in /sofware/PyTorch-1.6.0-Build/opencv-feedstock. The current build only works with Python 3.6. So, a user needs to create a conda environment with Python 3.6. If a user would like to use higher versions of Python, they would need to ask DeepSense team to build Opencv with those versions. </br> | ||
+ | Here are the steps that a normal DeepSense user installs Opencv 3.4.10 in his/her home directory.</br> | ||
+ | <b>1. Source the conda environment you would like to use. For example:</b> | ||
+ | source ~/anaconda3/etc/profile.d/conda.sh | ||
+ | |||
+ | <b>2. Activate the environment you would use to install Opencv. If the environment hasn't been created, a user should create one. Assume a user created an environment "my-environment" and activated it. To install Opencv, you would run the following command:</b> | ||
+ | conda activate my-environment | ||
+ | conda install opencv -c file:////software/PyTorch-1.6.0-Build/opencv-feedstock/condabuild | ||
+ | This should take about 2 minutes to install.</br> | ||
+ | |||
+ | <b>3. To test if your install is successful, issue python from the environment where Opencv is installed. Then run "import cv2" to see if there are any errors. For example:</b> | ||
+ | [luy@ds-lg-01 ~]$ conda activate my-environment | ||
+ | (my-environment) [luy@ds-lg-01 ~]$ python | ||
+ | Python 3.6.12 |Anaconda, Inc.| (default, Sep 9 2020, 00:40:10) | ||
+ | [GCC 7.3.0] on linux</br> | ||
+ | Type "help", "copyright", "credits" or "license" for more information. | ||
+ | >>> import cv2 | ||
+ | >>> | ||
+ | |||
+ | === Install other dependencies === | ||
+ | |||
+ | If you need additional python libraries then you can install them in your python environment. | ||
+ | |||
+ | The base package comes with several python libraries but you may want a newer version or additional libraries. Also, when you create a new environment it does not automatically get all of the same libraries as the base environment. | ||
+ | |||
+ | For example, suppose you want to install the <code>scikit-learn</code> package in your python3 environment. | ||
+ | |||
+ | First you need to activate the environment: | ||
+ | conda activate py36 | ||
+ | |||
+ | Then you install the package | ||
+ | conda install scikit-learn | ||
+ | |||
+ | A list of recommended packages follows in the next section. | ||
+ | |||
+ | === Recommended packages === | ||
+ | |||
+ | ==== Jupyter Notebooks for deep learning ==== | ||
+ | conda install jupyter | ||
+ | |||
+ | === (Old Method) Testing Deep Learning packages on the login nodes or non-GPU nodes === | ||
+ | |||
+ | You may wish to run PowerAI software on the login nodes for testing on the CPU-only nodes for some workflows. | ||
+ | |||
+ | Only the GPU nodes have graphics cards and graphics drivers installed. If you attempt to run the deep learning software like Tensorflow on the login nodes or CPU-only nodes then you will see errors like the following: | ||
+ | ImportError: libcublas.so.9.2: cannot open shared object file: No such file or directory | ||
+ | |||
+ | You need to load the GPU drivers with the following command: | ||
+ | source /opt/DL/cudnn/bin/cudnn-activate | ||
+ | |||
+ | Then you can activate the deep learning package, e.g. for Tensorflow: | ||
+ | source /opt/DL/tensorflow/bin/tensorflow-activate | ||
+ | Note that some deep learning software may be much slower or refuse to run without GPU access. Tensorflow works but Caffe does not. | ||
− | + | Keep in mind you need to activate the GPU drivers and deep learning package in each browser shell before you are able to use the package in your code or LSF jobs. | |
− | DeepSense | + | == Compiling Software for DeepSense == |
− | + | DeepSense uses IBM Power8 systems running RedHat Enterprise Linux. Code must be compiled for <code>ppc64le</code> which is PowerPC 64 bit Little Endian. | |
− | + | Some software may not have binaries available for <code>ppc64le</code> even if it does for other systems. If this happens then you (or [[Contact_Information|DeepSense support]]) will need to compile the software to run on DeepSense. Visit the web page for the software and see if the source code is available (e.g. through github). If so then follow the compilation instructions to run the software. | |
− | + | You may encounter errors when attempting to compile software for <code>ppc64le</code>. Often this occurs because of differences between <code>ppc64le</code> and other common architectures such as x86 and x86_64. | |
− | + | For example, one DeepSense user attempted to compile the rdkit software package from https://www.rdkit.org/ . This compilation failed when it attempted to use the gcc x86 optimization <code>-mpopcnt</code>. After replacing the optimization with the <code>ppc64le</code> equivalent <code>-mpopcntb</code> the software compiled successfully. | |
− | == | + | == 4. Technical and research support == |
DeepSense has a dedicated support team of research scientists ready to help you with technical questions, installing software, or even research questions. | DeepSense has a dedicated support team of research scientists ready to help you with technical questions, installing software, or even research questions. |
Revision as of 19:58, 1 December 2020
Contents
- 1 1. Logging on
- 2 2. Configure your environment
- 3 3. IBM-AI Deep Learning Anaconda Channel
- 4 Compiling Software for DeepSense
- 5 4. Technical and research support
1. Logging on
DeepSense has two login nodes, login1.deepsense.ca and login2.deepsense.ca . You can access these through SSH with your username and password from any computer on campus.
For example, if your userid is user1
, you can connect to deepsense by typing ssh user1@login1.deepsense.ca
just like logging on to any other network computer.
Note: The login nodes are intended for testing and compiling code. Please don’t run long or intensive computation on these nodes. Keep reading for instructions on how to submit compute jobs to dedicated compute nodes.
1.1 VPN
To connect to the DeepSense platform from outside of the Dalhousie Campus, you'll need to use a VPN. If you are are student, staff or faculty, you can use the Dalhousie VPN (https://wireless.dal.ca/vpnsoftware.php).
If you are not a Dalhousie staff, student, or faculty but require offsite access and cannot use the Dalhousie VPN then contact your project leader or (support@deepsense.ca) to make different arrangements.
For more info, see VPN Setup.
2. Configure your environment
DeepSense compute and management nodes are IBM Power8 computers (ppc64le) running Redhat Enterprise Linux. See Resources for more details on the available nodes.
2.1 Loading a python environment
You have two options for using python on DeepSense. You can use the systemwide python install, managed by DeepSense administrators. This is recommended for users new to Linux. You will need to contact DeepSense support to have additional software packages installed in the systemwide python.
Alternatively, you can install an Anaconda python environment or other software in your home directory. This allows you to install or update packages or software without requesting and waiting for DeepSense staff.
Systemwide python (managed by DeepSense)
DeepSense has two Anaconda python environments are installed locally on each DeepSense compute node.
First one is anaconda2 installed in /opt/anaconda2 will provide you python 2.7.5. To use this systemwide python add a parameter to your .bashrc file in your home directory:
echo ". /opt/anaconda2/etc/profile.d/conda.sh" >> ~/.bashrc
Second is anaconda3 installed in /software/WMLA/anaconda3 will provide you python 3.7.4. To use this systemwide python add a parameter to your .bashrc file in your home directory:
echo ". /software/WMLA/anaconda3/etc/profile.d/conda.sh" >> ~/.bashrc
Then source your .bashrc file:
source ~/.bashrc
To load the python environment run conda activate
You can add either line to your .bashrc file to automatically load the desired environment when you log in.
Local python install (managed by individual user)
Stop using systemwide anaconda
If you added the system anaconda environment to your .bashrc
file then remove the line:
. /opt/anaconda2/etc/profile.d/conda.sh
Installing Anaconda with a python3 base
From your home directory run:
wget https://repo.continuum.io/archive/Anaconda3-5.2.0-Linux-ppc64le.sh bash Anaconda3-5.2.0-Linux-ppc64le.sh
Note: please enter "yes" when asked if you want to add anaconda to your .bashrc file. If you do not then you will need to add the following command to your .bashrc file or run it each time before using anaconda:
. ~/anaconda3/etc/profile.d/conda.sh
After the installer ends you need to either close and restart your terminal or run:
source ~/.bashrc
Adding a python2 environment
The previous instruction creates a python3 base environment. To add a python2 environment:
conda create -n py27 python=2.7
Activate this environment to use python3:
conda activate py27
note: if you receive an error message then you may need to deactivate the base conda environment first:
conda deactivate conda activate py27
Adding a python3 environment
We recommend creating a separate python3 environment from the base environment. This makes it easier to install the specific packages required for IBM PowerAI.
conda create -n py36 python=3.6
Activate this environment to use python3:
conda activate py36
3. IBM-AI Deep Learning Anaconda Channel
To use deep learning packages like Tensorflow on DeepSense you need to add the IBM-AI anaconda channel to your list of available software channels.
conda config --prepend channels https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/
We suggest creating a new environment for each deep learning package you want to use. For example for Tensorflow:
conda create -n py36_tensorflow python=3.6 conda activate py36_tensorflow
Then install the anaconda package for the software you need. Again, with Tensorflow as an example:
conda install tensorflow
You can then use tensorflow or other deep learning packages as needed by simply activating that anaconda environment. Unlike the old method, you do not need to specifically activate tensorflow or other deep learning methods.
You can directly visit the IBM-AI anaconda channel URL to see a list of available software (https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/)
Install PyTorch 1.6.0 in a user's home directory on DeepSense
A DeepSense user can install PyTorch by him/herself in his/her home directory using the already built packages in /sofware/PyTorch-1.6.0-Build. The current build only works with Python 3.6. So, a user needs to create a conda environment with Python 3.6. If a user would like to use higher versions of Python, they would need to ask DeepSense team to build PyTorch with those versions.
Here are the steps that a normal DeepSense user install PyTorch 1.6.0 in his/her home directory.
1. Source the conda environment you would like to use. For example:
source anaconda3/etc/profile.d/conda.sh
2. Activate the environment you would use to install PyTorch. If the environment hasn't been created, a user can create it and install PyTorch in one command line. For example, if you would create an environment with name "my-environment" (This is just an example. Please choose a meaningful name for yourself.) and install PyTorch, you would run the following command:
conda create -y -n my-environment python=3.6 pytorch -c file:////software/PyTorch-1.6.0-Build/condabuild
3. If the environment has been created, say the name of the environment is "my-environment", you would need to activate the environment first and then install PyTorch. For example:
conda activate my-environment
conda install pytorch -c file:////software/PyTorch-1.6.0-Build/condabuild
This should take about 2 minutes to install.
4. To test if your install is successful, issue python from the environment where PyTorch is installed. Then run "import torch" to see if there are any errors. For example:
[luy@ds-lg-01 ~]$ conda activate my-environment
(my-environment) [luy@ds-lg-01 ~]$ python
Python 3.6.12 |Anaconda, Inc.| (default, Sep 9 2020, 00:40:10)
[GCC 7.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>>
Install Opencv 3.4.10 in a user's home directory on DeepSense
A DeepSense user can install Opencv by him/herself in his/her home directory using the already built packages in /sofware/PyTorch-1.6.0-Build/opencv-feedstock. The current build only works with Python 3.6. So, a user needs to create a conda environment with Python 3.6. If a user would like to use higher versions of Python, they would need to ask DeepSense team to build Opencv with those versions.
Here are the steps that a normal DeepSense user installs Opencv 3.4.10 in his/her home directory.
1. Source the conda environment you would like to use. For example:
source ~/anaconda3/etc/profile.d/conda.sh
2. Activate the environment you would use to install Opencv. If the environment hasn't been created, a user should create one. Assume a user created an environment "my-environment" and activated it. To install Opencv, you would run the following command:
conda activate my-environment conda install opencv -c file:////software/PyTorch-1.6.0-Build/opencv-feedstock/condabuild
This should take about 2 minutes to install.
3. To test if your install is successful, issue python from the environment where Opencv is installed. Then run "import cv2" to see if there are any errors. For example:
[luy@ds-lg-01 ~]$ conda activate my-environment (my-environment) [luy@ds-lg-01 ~]$ python Python 3.6.12 |Anaconda, Inc.| (default, Sep 9 2020, 00:40:10) [GCC 7.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information. >>> import cv2 >>>
Install other dependencies
If you need additional python libraries then you can install them in your python environment.
The base package comes with several python libraries but you may want a newer version or additional libraries. Also, when you create a new environment it does not automatically get all of the same libraries as the base environment.
For example, suppose you want to install the scikit-learn
package in your python3 environment.
First you need to activate the environment:
conda activate py36
Then you install the package
conda install scikit-learn
A list of recommended packages follows in the next section.
Recommended packages
Jupyter Notebooks for deep learning
conda install jupyter
(Old Method) Testing Deep Learning packages on the login nodes or non-GPU nodes
You may wish to run PowerAI software on the login nodes for testing on the CPU-only nodes for some workflows.
Only the GPU nodes have graphics cards and graphics drivers installed. If you attempt to run the deep learning software like Tensorflow on the login nodes or CPU-only nodes then you will see errors like the following:
ImportError: libcublas.so.9.2: cannot open shared object file: No such file or directory
You need to load the GPU drivers with the following command:
source /opt/DL/cudnn/bin/cudnn-activate
Then you can activate the deep learning package, e.g. for Tensorflow:
source /opt/DL/tensorflow/bin/tensorflow-activate
Note that some deep learning software may be much slower or refuse to run without GPU access. Tensorflow works but Caffe does not.
Keep in mind you need to activate the GPU drivers and deep learning package in each browser shell before you are able to use the package in your code or LSF jobs.
Compiling Software for DeepSense
DeepSense uses IBM Power8 systems running RedHat Enterprise Linux. Code must be compiled for ppc64le
which is PowerPC 64 bit Little Endian.
Some software may not have binaries available for ppc64le
even if it does for other systems. If this happens then you (or DeepSense support) will need to compile the software to run on DeepSense. Visit the web page for the software and see if the source code is available (e.g. through github). If so then follow the compilation instructions to run the software.
You may encounter errors when attempting to compile software for ppc64le
. Often this occurs because of differences between ppc64le
and other common architectures such as x86 and x86_64.
For example, one DeepSense user attempted to compile the rdkit software package from https://www.rdkit.org/ . This compilation failed when it attempted to use the gcc x86 optimization -mpopcnt
. After replacing the optimization with the ppc64le
equivalent -mpopcntb
the software compiled successfully.
4. Technical and research support
DeepSense has a dedicated support team of research scientists ready to help you with technical questions, installing software, or even research questions.
If you can't find the answer to your question on this wiki or need more extensive help then send an email to support@deepsense.ca .
See Technical support for more information about the support available.