Week 1 — Neural Networks and Deep Learning
Notations (source) :
Week 1 — Introduction to Deep Learning
May 1, 2021
Welcome
- AI (Artificial Intelligence) is the new Electricity to bring about a change and a new era
- Parts of this course:
- Neural Networks and Deep Learning. → Recognizing cats
- Improving Deep Neural Networks: Hyperparmeter tuning, Regularization and Optimization
- Structuring your Machine Learning Project → Best Practices for project
- Convolutional Neural Networks
- Natural Language Processing: Building sequence models
Introduction to Deep Learning
What Deep Learning/ML is good for
-
Problems with long lists of rules: When the traditional approach fails, machine learning/deep leanring may help
-
Continually changing environments: Deep Learning can adapt ('lear') to new scenarios
-
Discovering insights withing large collections of data: Imaging trying to hand-craft rules for what 101 different kinds of food look like?
Where Deep Learning is (typically) not good?
- When you need explainability: the patterns learned by a deep learnign model are typically uninterpretable by human
- When the traditional approach is a better option: if you can accomplish what you need with a simple rule-based system.
- When errors are unacceptable: since the outputs of deep learning aren't always predictable. The outputs are probabilistic and not deterministic.
- When you don't have much data: deep learning models usually require a fairly large amount of data to produce great results.
What is neural networks?
Neural networks are created by structuring layers, with each layer consisting of "neurons" which get activated depending on its activation function and the input.
Supervised Learning with Neural Networks
Supervised learning refers to problems when we have inputs as well as labels (outputs) to predict using machine learning techniques. Then the models figure out the mapping from inputs to output.
The data can be:
- Structured Data (tables)
- Unstructured Data (images, audio, etc.)
Why is Deep Learning taking off?
Three reasons:
- Backpropagation algorithm
- Glorot and He initialization
- ReLU (Rectified Linear Unit) activation function
Other reasons:
- Data
- Computation power
- Algorithms
The scale at which we creating data is also important, as neural networks performance don't stagnant unlike traditional machine learning algorithms
In this course $m$ denotes no of training examples.
About this Course
- Week 1: Introduction
- Week 2: Basics of Neural Network programming
- Week 3: One hidden layer Neural Networks
- Week 4: Deep Neural Networks
Info
Any image used here for illustration if not mentioned, is attributed to Andrew Ng's lecture slides.