Difference between revisions of "DeepSense Documentation"

From DeepSense Docs
(Note)
(21 intermediate revisions by 3 users not shown)
Line 1: Line 1:
== Note ==
+
'''Welcome to the DeepSense technical documentation wiki'''. This is the primary source for users with questions on the DeepSense equipment and services. You'll now find all of our content on the sidebar.  Just below you can see the cluster status, and information about any planned outages we may have.
On June 26 we updated the GPU compute nodes to a new version of IBM Watson Machine Learning Accelerator. This changes the way you access deep learning packages like TensorFlow and PyTorch. Instead of "activating" these packages, you can now install new versions directly in your Anaconda environment.
 
  
We are actively updating the wiki documentation to explain the new method of accessing deep learning packages (see [[Getting started with Deep Learning]]). Please bear with us during these updates as some documentation may still refer to the old method of "activating" deep learning packages.
+
We routinely make changes and update the content.  If you see anything missing, or have any suggestions for content, we would appreciate hearing from you.  You can email us at [mailto:support@deepsense.ca support@deepsense.ca].
 +
 
 +
== Cluster Status ==
  
 
'''<span style="font-size:120%">Cluster status</span>'''
 
'''<span style="font-size:120%">Cluster status</span>'''
Line 12: Line 13:
 
|style="Color:green" | Online
 
|style="Color:green" | Online
 
|
 
|
|We will be upgrading our storage from 8:00am on Sep 8 to 4:30pm on Sep 9, 2020. The whole cluster will be closed to all users, and you won't be able to run or submit jobs during the maintenance. Please estimate how long your jobs will run before submitting them. Sorry for the inconvenience.
+
|
 
|}
 
|}
 
Legend:<br/>
 
Legend:<br/>
 
<span style="color:green">Online</span>: cluster is running normally<br/>
 
<span style="color:green">Online</span>: cluster is running normally<br/>
<span style="color:orange">Online</span>: cluster has some problems and is partially available<br/>
+
<span style="color:orange">Partially Online</span>: cluster has some problems and is partially available<br/>
 
<span style="color:red">Offline</span>: cluster is offline and users are not able to log in<br/>
 
<span style="color:red">Offline</span>: cluster is offline and users are not able to log in<br/>
 
== System Information ==
 
* [[Resources]]
 
* [[Available software]]
 
 
== Guides ==
 
* [[Requesting access]]
 
* [[Getting started]]
 
* [[Introduction to Linux]]
 
* [[Getting started with Deep Learning]]
 
* [[Deep Learning Tutorials]]
 
* [[Storage policies]]
 
* [[Transferring Data]]
 
* Running jobs
 
** [[LSF|LSF batch jobs]]
 
** [[CWS|CWS web interface]]
 
* [[Installing local software]]
 
* Writing Tips
 
** [[Mitacs Accelerate Proposals]]
 
* [[Known problems]]
 
* [[Contact information|Contacting DeepSense]]
 
 
== Documentation ==
 
* [[Media:DeepSense_Computing_Platform.pdf|DeepSense Computing Platform]]
 
 
== Links ==
 
* [https://deepsense.ca DeepSense home page]
 
* [https://dal.ca Dalhousie University]
 
* [https://www.dal.ca/faculty/computerscience.html Faculty of Computer Science]
 
* [https://oceanfrontierinstitute.com/ Ocean Frontier Institute]
 

Revision as of 13:34, 7 December 2020

Welcome to the DeepSense technical documentation wiki. This is the primary source for users with questions on the DeepSense equipment and services. You'll now find all of our content on the sidebar. Just below you can see the cluster status, and information about any planned outages we may have.

We routinely make changes and update the content. If you see anything missing, or have any suggestions for content, we would appreciate hearing from you. You can email us at support@deepsense.ca.

Cluster Status

Cluster status

Status Planned Outage Notes
Online

Legend:
Online: cluster is running normally
Partially Online: cluster has some problems and is partially available
Offline: cluster is offline and users are not able to log in