DeepSense Docs - User contributions [en-ca]

Deep Learning Tutorials

2020-07-06T13:21:28Z

Cwhidden:

<div class="noautonum">

This is a collection of deep learning tutorials that were explored as part of the training of the 2019 DeepSense fellowships. These were explored over multiple sessions with 1-3 notebooks explored per 1-1.5 hour session. Both Tensorflow and pytorch are explored.

== 1. Get started with DeepSense ==

Follow all the steps from [[Getting started]] and [[Getting started with Deep Learning]]. This tutorial assumes you can log on to the DeepSense compute platform and have a version of Anaconda python on your path with Tensorflow and Pytorch installed in an anaconda environment.

== 2. Tensorflow Preparation ==

=== Download the example notebooks ===

git clone https://github.com/aymericdamien/TensorFlow-Examples.git

=== request a gpu session ===

bsub -Is -q gpu bash

=== activate your anaconda environment ===

conda activate tensorflow

Note: this assumes you've followed the getting started instructions and have created a python environment called <code>tensorflow</code> with the <code>tensorflow-gpu</code> package installed from the IBM-AI repository. If not then please follow those instructions.

=== start a jupyter notebook ===
jupyter notebook --no-browser --ip=0.0.0.0

=== open an SSH tunnel to access the notebook ===
just as we did in the [[Getting started with Deep Learning]] tutorial, open an SSH tunnel in another window
ssh -l <user> login1.deepsense.ca -L <port>:ds-cmgpu-<num>:<port>

=== open the notebook in your browser ===
In a web browser navigate to the page listed in the jupyter notebook output. Remember to replace the node name with <code>localhost</code>

== 3. Tensorflow Example Notebooks ==

=== 1. helloworld.ipynb ===

TensorFlow-Examples/notebooks/1_Introduction/helloworld.ipynb

This is an introduction to a python jupyter notebook.

Select a box with code. Press the shift and enter keys together to run the code in that box. You'll notice a star appear beside running code and a number in brackets appear beside finished code to indicate the order in which code boxes were run.

Often in an example notebook you will see code that already has output cached. You still need to run all previous code boxes and may want to use the menu to clear all output. If there is an error then you can modify the code or fix the error (e.g. download a dependency) and try again.

If you are missing a required dependency in later notebooks then you can install that package into your anaconda environment in a terminal window and it will be immediately accessible from the notebook. You do not need to close and restart the notebook or SSH tunnel.

You can also run the entire notebook using the menu.

When you are finished with a notebook you should use the menu to halt the kernel before closing the notebook. This clears resources such as GPU memory.

=== 2. kmeans.ipynb ===

TensorFlow-Examples/notebooks/2_BasicModels/kmeans.ipynb

This is an introduction to a basic machine learning model, kmeans. In this tutorial the kmeans algorithm is used to classify handwritten digits.

kmeans works by clustering different training examples and comparing each new input to the mean of each cluster.

Run the notebook and learn more about kmeans.

Observe that machine learning methods often have parameters that must be chosen. You can use default parameters but optimizing these parameters can greatly improve the accuracy of a model.

For a simple example, try increasing the length of training by changing the <code>num_steps</code> variable from 50 to 100 and running it again.

What happens if you modify other parameters and run the notebook again? How do these parameters change the training time and accuracy of the model?

Note: depending on your version of tensorflow you may need to modify the notebook. In some versions of tensorflow the kmeans.training_graph() function returns a different number of variables than the notebook expects (such as the cluster_centers_vars variable which may need to be removed from the code).

=== 3. random_forest.ipynb ===

TensorFlow-Examples/notebooks/2_BasicModels/random_forest.ipynb

This notebook uses a different model, random forests, to classify the handwritten digits. A random forest is a set of decision trees, each of which are trained to learn part of the problem.

Random forests are one of the most commonly used machine learning models because they are quick to train and give good performance on many tasks.

Run the notebook and observe that the random forest provides better accuracy than the kmeans model you trained previously.

Try changing parameters such as increasing the number of decision trees or the training time.

If you increase the number of trees or training time by too much then you will see that you achieve worse performance on the test set than on the training set. This is called overfitting and means that your model is learning specific details from the training set that do not generalize to the test set. This is a common problem in machine learning and needs to be considered whenever you train a model. You may need to simplify your model, give it more training examples by collecting more data or use data augmentation.

=== 4. convolutional_network.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/convolutional_network.ipynb

This is our first example of a commonly applied kind of neural network called a convolutional neural network (CNN), also applied to handwriting classification.

The notebook provides a good description of this kind of network which gradually reduces the size of the input using convolutional layers. This forces the network to learn information about multiple input variables and provides good accuracy.

Run the notebook and observe that this simple CNN greatly outperforms kmeans and random forests for handwriting recognition.

=== 5. linear_regression.ipynb ===

notebooks/TensorFlow-Examples/notebooks/2_BasicModels/linear_regression.ipynb

This notebook is an example of a different problem, called regression. Regression attempts to predict a value, unlike classification which determines which class an input example belongs to. This is a simple example that attempts to fit a line to best match a set of data points with x and y values.

Note: you may need to add backets to the print statements depending on your version of python and the version of this example notebook

=== 5. autoencoder.ipynb ===

notebooks/TensorFlow-Examples/notebooks/3_NeuralNetworks/autoencoder.ipynb

An autoencoder trains two seperate networks, an encoder and a decoder. This has many applications using the intermediate "latent representation". This includes compression (by storing just the latent representation), translation (by training an encoder with multiple different languages and then training different decoders for each language), and modifying the style of the input (by similarly leveraging different encoder or decoder frameworks).

=== 6. gan.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/gan.ipynb

A generative adversarial network (GAN) can be used to generate new data that looks similar to training data. This kind of network is how so called "deepfakes" are made that alter teh style of images or video.

The basic idea is to traing two opposing networks. The generator tries to generate examples and the discriminator tries to determine if examples are real or fake. Both networks learn and improve together which greatly improves performance over using a single network for either purpose.

=== 7. recurrent_network.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/recurrent_network.ipynb

A recurrent neural network is often applied to text processing or other problems that consider a sequence of events or letters. This type of network retains information at each step and thus has a type of memory that learns both from training examples as well as from parts of a specific input instance that have already been processed.

== 4. Pytorch ==

There are a variety of example notebooks for the pytorch framework at https://pytorch.org/tutorials/ . For example, there is a series of three notebooks on text processing and translation using recurrent neural networks:

=== 1. classifying names by country of origin ===

https://pytorch.org/tutorials/intermediate/char_rnn_classification_tutorial.html

=== 2. generating names similar to those from those countries of origin ===

https://pytorch.org/tutorials/intermediate/char_rnn_generation_tutorial.html

=== 3. text translation using seq2seq latent encoding of text with different decoders ===

https://pytorch.org/tutorials/intermediate/seq2seq_translation_tutorial.html

== 5. Tensor Processing Units ==

Google has developed dedicated deep learning processors called tensor processing units (TPUs). The contest web site kaggle has an online notebook where you can try out these TPUs to classify images of flowers by species.

https://www.kaggle.com/c/flower-classification-with-tpus

Try out the notebook and compare the performance of CPU, GPU, and TPU computing. You will need to modify the parameters of the notebook heavily to see any results at all with CPUs, while you will see an obvious speed improvement from using the TPUs over the GPUs.

Deep Learning Tutorials

2020-07-02T22:34:12Z

Cwhidden:

<div class="noautonum">

== 1. Get started with DeepSense ==

Follow all the steps from [[Getting started]] and [[Getting started with Deep Learning]]. This tutorial assumes you can log on to the DeepSense compute platform and have a version of Anaconda python on your path with Tensorflow and Pytorch installed in an anaconda environment.

== 2. Tensorflow Preparation ==

=== Download the example notebooks ===

git clone https://github.com/aymericdamien/TensorFlow-Examples.git

=== request a gpu session ===

bsub -Is -q gpu bash

=== activate your anaconda environment ===

conda activate tensorflow

Note: this assumes you've followed the getting started instructions and have created a python environment called <code>tensorflow</code> with the <code>tensorflow-gpu</code> package installed from the IBM-AI repository. If not then please follow those instructions.

=== start a jupyter notebook ===
jupyter notebook --no-browser --ip=0.0.0.0

=== open an SSH tunnel to access the notebook ===
just as we did in the [[Getting started with Deep Learning]] tutorial, open an SSH tunnel in another window
ssh -l <user> login1.deepsense.ca -L <port>:ds-cmgpu-<num>:<port>

=== open the notebook in your browser ===
In a web browser navigate to the page listed in the jupyter notebook output. Remember to replace the node name with <code>localhost</code>

== 3. Tensorflow Example Notebooks ==

=== 1. helloworld.ipynb ===

TensorFlow-Examples/notebooks/1_Introduction/helloworld.ipynb

This is an introduction to a python jupyter notebook.

Select a box with code. Press the shift and enter keys together to run the code in that box. You'll notice a star appear beside running code and a number in brackets appear beside finished code to indicate the order in which code boxes were run.

Often in an example notebook you will see code that already has output cached. You still need to run all previous code boxes and may want to use the menu to clear all output. If there is an error then you can modify the code or fix the error (e.g. download a dependency) and try again.

If you are missing a required dependency in later notebooks then you can install that package into your anaconda environment in a terminal window and it will be immediately accessible from the notebook. You do not need to close and restart the notebook or SSH tunnel.

You can also run the entire notebook using the menu.

When you are finished with a notebook you should use the menu to halt the kernel before closing the notebook. This clears resources such as GPU memory.

=== 2. kmeans.ipynb ===

TensorFlow-Examples/notebooks/2_BasicModels/kmeans.ipynb

This is an introduction to a basic machine learning model, kmeans. In this tutorial the kmeans algorithm is used to classify handwritten digits.

kmeans works by clustering different training examples and comparing each new input to the mean of each cluster.

Run the notebook and learn more about kmeans.

Observe that machine learning methods often have parameters that must be chosen. You can use default parameters but optimizing these parameters can greatly improve the accuracy of a model.

For a simple example, try increasing the length of training by changing the <code>num_steps</code> variable from 50 to 100 and running it again.

What happens if you modify other parameters and run the notebook again? How do these parameters change the training time and accuracy of the model?

Note: depending on your version of tensorflow you may need to modify the notebook. In some versions of tensorflow the kmeans.training_graph() function returns a different number of variables than the notebook expects (such as the cluster_centers_vars variable which may need to be removed from the code).

=== 3. random_forest.ipynb ===

TensorFlow-Examples/notebooks/2_BasicModels/random_forest.ipynb

This notebook uses a different model, random forests, to classify the handwritten digits. A random forest is a set of decision trees, each of which are trained to learn part of the problem.

Random forests are one of the most commonly used machine learning models because they are quick to train and give good performance on many tasks.

Run the notebook and observe that the random forest provides better accuracy than the kmeans model you trained previously.

Try changing parameters such as increasing the number of decision trees or the training time.

If you increase the number of trees or training time by too much then you will see that you achieve worse performance on the test set than on the training set. This is called overfitting and means that your model is learning specific details from the training set that do not generalize to the test set. This is a common problem in machine learning and needs to be considered whenever you train a model. You may need to simplify your model, give it more training examples by collecting more data or use data augmentation.

=== 4. convolutional_network.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/convolutional_network.ipynb

This is our first example of a commonly applied kind of neural network called a convolutional neural network (CNN), also applied to handwriting classification.

The notebook provides a good description of this kind of network which gradually reduces the size of the input using convolutional layers. This forces the network to learn information about multiple input variables and provides good accuracy.

Run the notebook and observe that this simple CNN greatly outperforms kmeans and random forests for handwriting recognition.

=== 5. linear_regression.ipynb ===

notebooks/TensorFlow-Examples/notebooks/2_BasicModels/linear_regression.ipynb

This notebook is an example of a different problem, called regression. Regression attempts to predict a value, unlike classification which determines which class an input example belongs to. This is a simple example that attempts to fit a line to best match a set of data points with x and y values.

Note: you may need to add backets to the print statements depending on your version of python and the version of this example notebook

=== 5. autoencoder.ipynb ===

notebooks/TensorFlow-Examples/notebooks/3_NeuralNetworks/autoencoder.ipynb

An autoencoder trains two seperate networks, an encoder and a decoder. This has many applications using the intermediate "latent representation". This includes compression (by storing just the latent representation), translation (by training an encoder with multiple different languages and then training different decoders for each language), and modifying the style of the input (by similarly leveraging different encoder or decoder frameworks).

=== 6. gan.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/gan.ipynb

A generative adversarial network (GAN) can be used to generate new data that looks similar to training data. This kind of network is how so called "deepfakes" are made that alter teh style of images or video.

The basic idea is to traing two opposing networks. The generator tries to generate examples and the discriminator tries to determine if examples are real or fake. Both networks learn and improve together which greatly improves performance over using a single network for either purpose.

=== 7. recurrent_network.ipynb ===

TensorFlow-Examples/notebooks/3_NeuralNetworks/recurrent_network.ipynb

A recurrent neural network is often applied to text processing or other problems that consider a sequence of events or letters. This type of network retains information at each step and thus has a type of memory that learns both from training examples as well as from parts of a specific input instance that have already been processed.

== 4. Pytorch ==

There are a variety of example notebooks for the pytorch framework at https://pytorch.org/tutorials/ . For example, there is a series of three notebooks on text processing and translation using recurrent neural networks:

=== 1. classifying names by country of origin ===

https://pytorch.org/tutorials/intermediate/char_rnn_classification_tutorial.html

=== 2. generating names similar to those from those countries of origin ===

https://pytorch.org/tutorials/intermediate/char_rnn_generation_tutorial.html

=== 3. text translation using seq2seq latent encoding of text with different decoders ===

https://pytorch.org/tutorials/intermediate/seq2seq_translation_tutorial.html

== 5. Tensor Processing Units ==

Google has developed dedicated deep learning processors called tensor processing units (TPUs). The contest web site kaggle has an online notebook where you can try out these TPUs to classify images of flowers by species.

https://www.kaggle.com/c/flower-classification-with-tpus

Try out the notebook and compare the performance of CPU, GPU, and TPU computing. You will need to modify the parameters of the notebook heavily to see any results at all with CPUs, while you will see an obvious speed improvement from using the TPUs over the GPUs.

Deep Learning Tutorials

2020-07-02T22:26:03Z

Cwhidden:

Deep Learning Tutorials

2020-07-02T18:04:42Z

Cwhidden: Created page with "<div class="noautonum"> == 1. Get started with DeepSense == Follow all the steps from Getting started and Getting started with Deep Learning. This tutorial assumes y..."

DeepSense Documentation

2020-07-02T17:43:44Z

Cwhidden: /* Guides */

Mitacs Accelerate Proposals

2020-06-26T19:56:23Z

Cwhidden: Created page with "The Mitacs Accelerate program is a common funding method for DeepSense projects. This guide will help you through the process of writing a funding proposal for submission to M..."

The Mitacs Accelerate program is a common funding method for DeepSense projects. This guide will help you through the process of writing a funding proposal for submission to Mitacs.

Mitacs and a company each pay for half of the project. Mitacs funding is allocated in *units* of 4-6 months with each unit paying $15000 to the student.

See the [https://www.mitacs.ca/en/programs/accelerate/proposal|The Mitacs web site] for the proposal template and guide.

== Planning Meeting ==
As with any project, the first step is to have a planning meeting with the student, company, supervisor, and possibly with a DeepSense staff member. If you are reading this guide you probably already had at least one meeting with the company but there are a few items you need to be clear on. If necessary, have another meeting or discuss these items by email or phone with your supervisor, DeepSense staff, and/or the company:

# Number of students and duration of the project
# Overall goal of the project
# Deliverables
# A general idea of how to approach the project and belief it is feasible

== General advice ==

* There are a lot of sections in the proposal so write the proposal in stages instead of trying to do it all at once or in order.

* Be iterative: write an outline of what you intend to do, expand the outline to sentences and paragraphs, and then edit for clarity and content.

* Get feedback often. Involve your supervisor, other students on the project or related projects, and DeepSense staff. It will take several rounds of feedback and editing to write a strong proposal.

* Be mindful of both the science and the benefit to the company. A DeepSense project is a mix of both. There needs to be specified deliverables for the company and an explanation of how those deliverables solve a problem for the company. There also needs to be a research question to solve or novel advance that doesn't currently exist. This can range from applying a known state of the art technique to a novel dataset or situation all the way to developing a brand new solution.

* Use topic sentences and strong closing sentences. A topic sentence is a sentence that explains the purpose of a paragraph and is typically at the beginning of the paragraph. In most cases you should be able to read just the first sentence of each paragraph and still understand the main ideas. A strong closing sentence, especially at the end of each section, will encourage the reader

* Ask your supervisor or DeepSense staff if you can see an example of a successfully funded Mitacs proposal. This will help you understand the scope and detail required in each section.

== First Steps ==

The first steps are to do some research and write down the general idea of the project

=== 2.1 and 7.1: Title ===
Write down a draft title. You may want to change this later on so don't worry too much about writing a catchy title yet.

=== 2.2 Abstract ===
Approximately 200 words. The abstract summarizes the entire project. Write out the problem to be solved, why it is important, the current state of the art and why that is insufficient, your proposed solution, and finally the steps and deliverables of the project.

The abstract is important but does not need to be polished at this stage. Just write out each of these facets and if you can't then you need to do some research or discuss the project further with your supervisor, the company, and/or DeepSense staff.

=== 2.3 Background ===
Minimum 500 words. This is where you explain the current state of the problem in detail and cite relevant research.

First give a general overview of the problem and the broad research area. Cite some relevant survey papers or important guiding works.

Then summarize some of the more specific topics and research papers. Typically you will find papers that partially but do not completely solve your problem. Summarize their results as well as their pros and cons. It should be clear why you have selected these specific results and how this knowledge will contribute to completing the project.

Finally, explain how some of these results together can be used to solve your specific problem. For example, you may be using a deep learning model from one paper, adding a technique from a second paper, and then using a type of analysis from a third paper. Moreover, highlight any gaps that the current literature can't solve for your problem. It is important to explain how these methods fit together and show that your project is novel research.

=== 2.9 References ===

Put your references in this section. Mitacs does not specify a reference format so use any consistent format. We've had good success with the APA format.

This section is just as important as it is in a scientific paper so be sure to check your reference details and order them consistently.

== Project Detail ==

Now that you have a general idea of the project and have done some research it's time to get more specific with the deliverables and methods. Consider getting feedback on the sections you have already completed while you start on the next sections.

=== 2.4 General Objective ===

Write out the main objective of the project. Split this up into sub-objectives. Very briefly summarize any information needed to explain a sub-objective such as how it will be tested or what data will be used.

=== 2.5 Details of internships or subprojects ===

Although numbered like a regular section, each subsection of Details requires a significant explanation and should be treated as a full section in its own right. We'll complete some of these now but go back and do the other later.

For a very large project funded by a single mitacs submission you may need to break the project up into different subprojects. If so then the student(s) working on each subproject should fill out their subprojects information.

=== 2.5.a Name of Intern ===

Write out the name of the student or students working on a shared goal.

=== 2.5.b Specific Objectives ===

Write out the specific objectives that each student listed will work on. These should be matched with the general objective and sub-objectives as well as explaining what each individual student will do.

=== 2.5.c Methodologies ===

In this section you write out in detail how you will accomplish this project.

This is the meat of the proposal and must includes the datasets, tools, and methods you will use in enough detail for an expert external reviewer to determine if the project is feasible and will accomplish your objectives. Consider this like the methods section of a scientific paper.

Explain the analysis you will complete and how you will show that you have met each sub-objective.

=== 2.5.e Deliverables ===

Explain the deliverables of the project such as software, reports, publications, etc. What will the project create and how will it meet the sub-objectives?

=== 2.5.f Benefit to the intern ===

Explain what you will gain from this project. Examples include:
* exposure to an ocean company and/or novel data
* learning a new analysis method
* opportunity to contribute to an industry problem or a solution that will be used by the partner to do X
* opportunity to publish in peer-reviewed venues and/or write a thesis

== Timeline ==

Now that you have a good idea of the specifics of the project it's time to plan. Revise and edit the previous sections and seek feedback.

Then, take the steps your propose to do in your methodology and create a *gannt chart* showing how long each of those steps will take:

=== 2.5.d Timeline ===

First, list out the major steps of the project, goals, and deliverables such as:
* obtain data
* preprocess the data
* design the machine learning model
* train the model
* test the model
* improve the model
* train and test again
* analysis
* write a report

Then think about how long each step of the project will take in weeks, half months, or (for a long project) months. Break the project up into 4 month sections and create a gannt chart for each. You will probably want to do this in a spreadsheet program such as Excel for easy editing and then copy the chart into the mitacs proposal later.

== Basic Information ==

There are some informational sections you need to complete. You will need to obtain some information from your supervisor:

* 2.8 Relationship (if any) to past/other Mitacs projects:

* 3 Declarations

* 4.1 Lead academic supervisor in Canada

* 4.3 Interns

* 4.4 Interns to be determined (TBD)

== Company information ==

When your proposal is nearly ready you will need to send it to your partner company for their feedback. Do this in conjunction with your supervisor and/or DeepSense staff.

Be sure you make it clear to the company that *Section 7.2 Public project overview* will be publicly available on the Mitacs web site. They must review this section carefully.

Section of the proposal that should be completed by or with the help of the partner organization are:

* 2.5.g Partner interaction

* 2.6 Relevance to the partner organization and to Canada

* 4.2 Partner organization in Canada

* 7.2 Public project overview

== Remaining steps ==

There are a few other sections that must be completed

=== 5 Budget and invoicing ===

Your supervisor or DeepSense staff can help with the budget

=== 6 Suggested reviewers ===

You need to suggest 6 reviewers. They cannot be from your university and you cannot have published with them or have plans to collaborate with them in the near future.

Moreover, each reviewer must be from a different university or organization than the others.

We recommend selecting subject experts that are knowledgeable of the research area but also likely to be generally favourable of the research and provide constructive criticism and suggestions. Contact your supervisor and/or DeepSense staff if you need help selecting reviewers.

== Scientific Committee Review ==

The proposal must be reviewed by the DeepSense scientific committee.

== Signatures ==

After the scientific committee has approved the project and the company has agreed and signed the necessary sections, you and your supervisor will need to sign:

* 7.3 Signatures
* Appendix A - Intern consent form
* The separate mitacs IP agreement

== Review Process ==

Proposal review will take approximately 4-8 weeks so it is important to submit the proposal early before the start of a project. Most well-written proposals are funded but you may need to provide extra information to address reviewer comments.

DeepSense Documentation

2020-06-26T18:12:37Z

Cwhidden: /* Guides */ Added page for writing a mitacs

Known problems

2020-06-17T19:28:07Z