Exercises: Introduction to Neural Networks

Exercise 1: Forward Pass Calculation

Objective: Understand the propagation of inputs through a neural network.

  1. Given:

    • Input (single example): $X \in \mathbb{R}^{1 \times 2}$, $X = [1, 0.5]$
    • Weights: $W \in \mathbb{R}^{2 \times 2}$, $W = \begin{bmatrix} 0.2 & 0.8 \\ 0.4 & 0.3 \end{bmatrix}$
    • Biases: $b \in \mathbb{R}^{1 \times 2}$, $b = [0.1, 0.1]$
  2. Tasks:

    • Calculate the weighted sum $Z = XW + b$.
    • Apply the ReLU activation function component-wise: $A = \mathrm{ReLU}(Z)$.
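
A minimal NumPy sketch for checking the hand calculation (the expected value in the comment is my own arithmetic, not part of the exercise statement):

```python
import numpy as np

# Input, weights, and biases from the exercise
X = np.array([[1.0, 0.5]])            # shape (1, 2)
W = np.array([[0.2, 0.8],
              [0.4, 0.3]])            # shape (2, 2)
b = np.array([[0.1, 0.1]])            # shape (1, 2)

Z = X @ W + b                         # weighted sum, expected [[0.5, 1.05]]
A = np.maximum(0.0, Z)                # component-wise ReLU

print("Z =", Z)
print("A =", A)
```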

Exercise 2: Backpropagation

Objective: Compute gradients for a simple neural network.

  1. Setup (two-layer network):

    • Input: $X \in \mathbb{R}^{1 \times 2}$, $X = [0.5, 0.2]$
    • First-layer weights: $W_1 \in \mathbb{R}^{2 \times 2}$, $W_1 = \begin{bmatrix} 0.1 & 0.3 \\ 0.2 & 0.4 \end{bmatrix}$
    • First-layer biases (for simplicity): $b_1 = [0, 0]$
    • Hidden representation: $Z_1 = X W_1 + b_1$, $H = \mathrm{ReLU}(Z_1)$
    • Output-layer weights (binary output): $W_2 \in \mathbb{R}^{2 \times 1}$, $W_2 = \begin{bmatrix} 0.2 \\ 0.5 \end{bmatrix}$
    • Output bias: $b_2 = 0$
    • Output pre-activation and prediction: $z_2 = H W_2 + b_2$, $\hat{y} = \sigma(z_2)$
    • True label: $y = 1$
  2. Tasks:

    • Perform the forward pass as defined above (ReLU for the hidden layer, Sigmoid for the output).
    • Calculate the binary cross-entropy loss: $L = -\left( y \log(\hat{y}) + (1 - y) \log(1 - \hat{y}) \right)$.
    • Derive the gradients $\frac{\partial L}{\partial W_1}$ and $\frac{\partial L}{\partial W_2}$ using backpropagation (apply the chain rule).
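
For self-checking, here is a short NumPy sketch of the forward pass and the standard sigmoid-plus-cross-entropy backward pass under the definitions above (an aid for verifying your derivation, not part of the exercise statement):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass
X  = np.array([[0.5, 0.2]])
W1 = np.array([[0.1, 0.3],
               [0.2, 0.4]])
b1 = np.zeros((1, 2))
W2 = np.array([[0.2],
               [0.5]])
b2 = 0.0
y  = 1.0

Z1 = X @ W1 + b1
H  = np.maximum(0.0, Z1)              # ReLU hidden layer
z2 = H @ W2 + b2
y_hat = sigmoid(z2)
loss = -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

# Backward pass (chain rule)
dz2 = y_hat - y                       # dL/dz2 for sigmoid + cross-entropy
dW2 = H.T @ dz2                       # dL/dW2
dH  = dz2 @ W2.T                      # dL/dH
dZ1 = dH * (Z1 > 0)                   # ReLU gate
dW1 = X.T @ dZ1                       # dL/dW1

print("loss =", loss)
print("dW1 =\n", dW1)
print("dW2 =\n", dW2)
```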

Exercise 3: Data Preprocessing

Objective: Explore the impact of scaling on neural networks.

  1. Given:

    • Dataset $X \in \mathbb{R}^{3 \times 3}$, with rows as samples and columns as features: $X = \begin{bmatrix} 5 & 20 & 10 \\ 15 & 5 & 25 \\ 10 & 30 & 15 \end{bmatrix}$.
  2. Tasks:

    • Apply Min–Max scaling per feature to rescale each column to the range $[0, 1]$: $x_{\text{new}} = \frac{x - x_{\min}}{x_{\max} - x_{\min}}$.
    • Standardize each feature to have zero mean and unit variance: $x_{\text{new}} = \frac{x - \mu}{\sigma}$ (see the sketch after this list).
    • Compare the two approaches and explain when each would be preferred.
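
A short NumPy sketch applying both rescalings column-wise (standardization uses the population standard deviation, NumPy's default):

```python
import numpy as np

# Dataset from the exercise: rows are samples, columns are features
X = np.array([[ 5.0, 20.0, 10.0],
              [15.0,  5.0, 25.0],
              [10.0, 30.0, 15.0]])

# Min-Max scaling per feature (column): each column is mapped to [0, 1]
X_minmax = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))

# Standardization per feature: zero mean and unit variance per column
X_standard = (X - X.mean(axis=0)) / X.std(axis=0)

print("Min-Max scaled:\n", X_minmax)
print("Standardized:\n", X_standard)
```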

Exercise 4: Activation Functions

Objective: Compare the behavior of activation functions.

  1. Given:

    • Input values $X = \{-2, -1, 0, 1, 2\}$.
  2. Tasks:

    • For each $x \in X$, compute the outputs of the functions below (a NumPy sketch for evaluating them follows this list):

      • ReLU: $f(x) = \max(0, x)$
      • Leaky ReLU ($\alpha = 0.01$): $f(x) = \max(0.01x, x)$
      • Sigmoid: $\sigma(x) = \frac{1}{1 + e^{-x}}$
      • Tanh: $\tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}$
    • Sketch the graphs of these functions.

    • Discuss the advantages and disadvantages of each function.
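
A quick NumPy sketch evaluating the four activation functions on the given inputs:

```python
import numpy as np

x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])

relu       = np.maximum(0.0, x)                # max(0, x)
leaky_relu = np.where(x > 0, x, 0.01 * x)      # alpha = 0.01
sigmoid    = 1.0 / (1.0 + np.exp(-x))          # 1 / (1 + e^{-x})
tanh       = np.tanh(x)

for name, vals in [("ReLU", relu), ("Leaky ReLU", leaky_relu),
                   ("Sigmoid", sigmoid), ("Tanh", tanh)]:
    print(f"{name:<11}", np.round(vals, 4))
```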

Exercise 5: Gradient Checking

Objective: Verify the correctness of computed gradients.

  1. Setup:

    • Prediction: $\hat{y} = wx + b$
    • Loss: $L = \frac{1}{2}(y - \hat{y})^2$
    • Parameters: $w = 0.5$, $x = 2$, $b = 0.1$, $y = 1$.
  2. Tasks:

    • Compute the analytical gradient $\frac{\partial L}{\partial w}$.
    • Use numerical approximation to compute $\frac{\partial L}{\partial w} \approx \frac{L(w + \varepsilon) - L(w - \varepsilon)}{2\varepsilon}$ with $\varepsilon = 10^{-4}$ (a sketch of this check follows the list below).
    • Compare the two results and explain any differences.
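
A small Python sketch comparing the analytical gradient with the central-difference approximation (the closed-form expression in the code follows from applying the chain rule to the squared-error loss):

```python
w, x, b, y = 0.5, 2.0, 0.1, 1.0
eps = 1e-4

def loss(w_):
    """Squared-error loss L = 0.5 * (y - y_hat)^2 with y_hat = w*x + b."""
    y_hat = w_ * x + b
    return 0.5 * (y - y_hat) ** 2

# Analytical gradient: dL/dw = (y_hat - y) * x
grad_analytical = (w * x + b - y) * x

# Central-difference numerical approximation
grad_numerical = (loss(w + eps) - loss(w - eps)) / (2 * eps)

print("analytical:", grad_analytical)
print("numerical :", grad_numerical)
print("difference:", abs(grad_analytical - grad_numerical))
```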

Exercise 6: Regularization

Objective: Understand the effect of L2 regularization on weight updates.

  1. Setup:

    • Weights (parameter vector): $W = [1, 2, 0.5] \in \mathbb{R}^3$.
    • L2 regularization with coefficient $\lambda = 0.01$.
  2. Tasks:

    • Compute the weight penalty term: $\lambda \sum_j W_j^2 = \lambda \lVert W \rVert_2^2$.
    • Let $g = \frac{\partial L}{\partial W}$ denote the gradient of the loss without regularization. Derive the gradient descent update rule for $W$ with learning rate $\eta = 0.1$ when L2 regularization is added, i.e. express $W_{\text{new}}$ in terms of $W$, $g$, $\lambda$, and $\eta$ (a sketch follows the list below).
    • Explain qualitatively how this regularization term affects model training (in particular, the magnitude of the weights).
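
A small NumPy sketch of the penalty and the regularized update. The exercise does not specify the unregularized gradient $g$, so the value below is a placeholder chosen purely to make the snippet runnable; with the penalty $\lambda \sum_j W_j^2$, its contribution to the gradient is $2\lambda W$:

```python
import numpy as np

W   = np.array([1.0, 2.0, 0.5])
lam = 0.01                          # L2 coefficient lambda
eta = 0.1                           # learning rate

# Penalty term: lambda * ||W||_2^2
penalty = lam * np.sum(W ** 2)

# Placeholder for dL/dW without regularization (NOT given in the exercise)
g = np.array([0.3, -0.1, 0.2])

# Gradient descent step with the L2 term included
W_new = W - eta * (g + 2 * lam * W)

print("penalty =", penalty)
print("W_new   =", W_new)
```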

Exercise 7: Neural Network Error Analysis

Objective: Analyze and identify potential issues in a neural network setup.

  1. Setup:

    • Input: $X \in \mathbb{R}^{1 \times 2}$, $X = [1, 2]$
    • First layer: $W_1 = \begin{bmatrix} 0.5 & 0.2 \\ 0.3 & 0.8 \end{bmatrix}$, $b_1 = [0.1, 0.1]$. Hidden pre-activation and activation: $Z_1 = X W_1 + b_1$, $H = \mathrm{ReLU}(Z_1)$
    • Output layer: $W_2 = \begin{bmatrix} 0.7 \\ -0.6 \end{bmatrix}$, $b_2 = 0.2$. Output pre-activation and prediction: $z_2 = H W_2 + b_2$, $\hat{y} = \sigma(z_2)$
    • True label: $y = 1$.
  2. Tasks:

    • Calculate the loss using binary cross-entropy: $L = -\left( y \log(\hat{y}) + (1 - y) \log(1 - \hat{y}) \right)$ (a forward-pass sketch follows the list below).
    • Based on the scale of the weights and the network depth, discuss whether this initialization is likely to cause vanishing or exploding gradients.
    • Propose changes to the network architecture, initialization scheme, or other hyperparameters to improve training stability and performance.
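
A brief NumPy sketch of the forward pass and the binary cross-entropy loss for this setup, useful for checking the numbers before discussing gradient behaviour and possible fixes:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

X  = np.array([[1.0, 2.0]])
W1 = np.array([[0.5, 0.2],
               [0.3, 0.8]])
b1 = np.array([[0.1, 0.1]])
W2 = np.array([[0.7],
               [-0.6]])
b2 = 0.2
y  = 1.0

Z1 = X @ W1 + b1
H  = np.maximum(0.0, Z1)              # ReLU hidden activation
z2 = H @ W2 + b2
y_hat = sigmoid(z2)

# Binary cross-entropy loss
loss = -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

print("y_hat =", y_hat)
print("loss  =", loss)
```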