The breakthrough deep Q-network that beat humans at Atari games using only visual input, and the AlphaGo program that dethroned the world champion at the board game Go, are two prominent examples. As an example, given the stock prices of the past week as input, a deep learning algorithm will try to predict the stock price of the next day. Given a large dataset of input and output pairs, a deep learning algorithm will try to minimize the difference between its prediction and the expected output. The input and output layers are not counted as hidden layers. Week 1 Quiz - Introduction to deep learning. The earlier layers of a neural network typically compute simpler features of the input than the deeper layers. The number of hidden layers is 3. What does the analogy "AI is the new electricity" refer to? Consider the following 2 hidden layer neural network: Which of the following statements are True? Forward propagation propagates the input through the layers, although for shallow networks we may just write out all the lines explicitly. Correct, the "cache" records values from the forward propagation units and sends them to the backward propagation units because they are needed to compute the chain rule derivatives. The number of layers L is 3. Which of the following statements is true? We use the cache to pass variables computed during forward propagation to the corresponding backward propagation step. If you have 10,000,000 examples, how would you split the train/dev/test set? Deep learning is part of the broader family of machine learning methods. A biological neuron has dendrites, which are used to receive inputs. During backpropagation, the corresponding backward function also needs to know which activation function was used for layer l in the forward propagation, since the gradient depends on it and the correct derivative cannot be computed without it.
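The role of the cache and of the activation function in the backward step can be sketched for a single unit as follows. This is a minimal illustration in plain Python, not the course's reference implementation; the names layer_forward, layer_backward, and the tuple layout of the cache are assumptions made for this example.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def layer_forward(a_prev, weights, bias, activation):
    """Forward step for one unit. Returns the activation and a cache.

    The cache layout (a_prev, weights, z, activation) is a hypothetical
    choice for this sketch.
    """
    z = sum(w * a for w, a in zip(weights, a_prev)) + bias
    if activation == "sigmoid":
        a = sigmoid(z)
    elif activation == "relu":
        a = max(0.0, z)
    else:
        raise ValueError("unknown activation: " + activation)
    # Values saved during forward propagation for use in the backward step.
    cache = (a_prev, weights, z, activation)
    return a, cache


def layer_backward(da, cache):
    """Backward step for one unit, using the values stored in the cache."""
    a_prev, weights, z, activation = cache
    # The derivative depends on which activation was used in the forward pass.
    if activation == "sigmoid":
        s = sigmoid(z)
        dz = da * s * (1.0 - s)
    else:  # relu
        dz = da * (1.0 if z > 0 else 0.0)
    dw = [dz * a for a in a_prev]       # gradient w.r.t. each weight
    db = dz                             # gradient w.r.t. the bias
    da_prev = [dz * w for w in weights]  # gradient passed to the previous layer
    return da_prev, dw, db
```

Calling layer_forward([1.0, 2.0], [0.5, 0.25], 0.5, "relu") gives z = 1.5, and passing the returned cache to layer_backward reuses z and the activation name without recomputing the forward pass.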
Number of epochs: The number of times the entire training dataset is fed to the network during training is referred to as the number of epochs. Deep learning algorithms are loosely modeled on how the nervous system is structured, where neurons are connected to each other and pass information along. In other words, deep learning mirrors the functioning of our brains. The number of layers L is 5. The number of layers L is 4. Variables computed during backward propagation. As seen in lecture, the number of layers is counted as the number of hidden layers + 1. Momentum: A parameter that helps the optimizer escape local minima and smooths the jumps during gradient descent. Learning rate: The learning rate controls how quickly the network updates its parameters at each step.
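How the three hyperparameters above interact can be sketched with a toy example that fits a single weight w in y = w * x by gradient descent. This is a minimal sketch in plain Python using one common formulation of momentum (a velocity term that accumulates past gradients); other formulations exist, and the function name train and its default values are assumptions made for this illustration.

```python
def train(xs, ys, learning_rate=0.05, momentum=0.9, num_epochs=200):
    """Fit y = w * x with full-batch gradient descent plus momentum."""
    w = 0.0
    velocity = 0.0
    n = len(xs)
    # One epoch = one full pass over the training data.
    for _ in range(num_epochs):
        # Gradient of the mean squared error with respect to w.
        grad = sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n
        # Momentum keeps a fraction of the previous update, which smooths
        # the steps; the learning rate scales the new gradient.
        velocity = momentum * velocity - learning_rate * grad
        w += velocity
    return w


if __name__ == "__main__":
    # Toy data generated from y = 2 * x.
    print(train([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
```

Setting momentum to 0 reduces the update to plain gradient descent, which makes it easy to compare the two behaviours on the same data.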