## Machine Learning for Time Series Modeling

Time series data is ubiquitous in many fields, including ecology, finance, and climatology. Machine learning methods like **Recurrent Neural Networks (RNNs)**, **Long Short-Term Memory (LSTMs)**, and **Gated Recurrent Units (GRUs)** are well-suited for modeling temporal dependencies in time series data. These models are designed to capture both short-term and long-term dependencies, making them ideal for sequential data.
### 1. Recurrent Neural Networks (RNNs)

#### What is an RNN?

A **Recurrent Neural Network (RNN)** is a type of neural network designed for processing sequential data by maintaining a **hidden state** that captures information from previous time steps. This hidden state allows RNNs to "remember" previous inputs, making them powerful for time-dependent data.
#### How to Calculate

At each time step, the hidden state **$h_t$** is updated using the current input **$x_t$** and the previous hidden state **$h_{t-1}$**:

$$
h_t = \tanh(W_h \cdot h_{t-1} + W_x \cdot x_t + b_h)
$$
The output **$y_t$** at time step **$t$** is calculated using:

$$
y_t = W_y \cdot h_t + b_y
$$
Where:

- **$W_h$**, **$W_x$**, **$W_y$** are weight matrices.
- **$b_h$**, **$b_y$** are bias terms.
- **$\tanh$** is the activation function.
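
To make the update rule concrete, here is a minimal NumPy sketch of a single RNN step. The array names mirror the symbols above; the dimensions and random initialization are illustrative assumptions, not part of any particular library's API.

```python
import numpy as np

rng = np.random.default_rng(0)
input_dim, hidden_dim, output_dim = 3, 8, 1  # illustrative sizes

# Weight matrices and biases (in practice these are learned by backpropagation through time)
W_h = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
W_x = rng.normal(scale=0.1, size=(hidden_dim, input_dim))
W_y = rng.normal(scale=0.1, size=(output_dim, hidden_dim))
b_h = np.zeros(hidden_dim)
b_y = np.zeros(output_dim)

def rnn_step(x_t, h_prev):
    """One RNN time step: h_t = tanh(W_h·h_{t-1} + W_x·x_t + b_h), y_t = W_y·h_t + b_y."""
    h_t = np.tanh(W_h @ h_prev + W_x @ x_t + b_h)
    y_t = W_y @ h_t + b_y
    return h_t, y_t

# Thread the hidden state through a short random sequence
h = np.zeros(hidden_dim)
for x_t in rng.normal(size=(5, input_dim)):  # 5 time steps
    h, y = rnn_step(x_t, h)
```

In practice you would use a library layer such as `tf.keras.layers.SimpleRNN` or `torch.nn.RNN` rather than hand-rolled weights; the point of the sketch is only to show how the hidden state is carried from one step to the next.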
#### Common Uses

- **Time Series Forecasting**: Predicting future values of a time series, such as forecasting environmental variables like temperature or population dynamics; in practice the series is first cut into fixed-length input windows (see the sketch after this list).
- **Speech and Text Processing**: Tasks like language translation or speech recognition, where sequential dependencies are important.
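
As a sketch of that forecasting setup, the snippet below turns a univariate series into supervised (input window, next value) pairs; the window length and the synthetic series are illustrative assumptions.

```python
import numpy as np

def make_windows(series, window):
    """Slice a 1-D series into (samples, window, 1) inputs and next-step targets."""
    X, y = [], []
    for start in range(len(series) - window):
        X.append(series[start:start + window])
        y.append(series[start + window])
    return np.asarray(X)[..., np.newaxis], np.asarray(y)

# Example: 24 monthly values of a synthetic abundance index, 6-month input windows
series = 10 + np.sin(np.linspace(0, 4 * np.pi, 24))
X, y = make_windows(series, window=6)
print(X.shape, y.shape)  # (18, 6, 1) (18,)
```

The resulting `X` array has the (samples, time steps, features) layout expected by the recurrent layers discussed in this section.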
#### Issues

- **Vanishing/Exploding Gradients**: RNNs struggle with long-term dependencies because gradients can vanish or explode as they are propagated back through many time steps, making it difficult to learn from long sequences.

---
### 2. Long Short-Term Memory (LSTMs)

#### What is an LSTM?

An **LSTM** is a type of RNN designed to mitigate the vanishing gradient problem, allowing the model to retain information over longer time sequences. LSTMs use **gating mechanisms** to control the flow of information, which helps the network "decide" when to remember or forget information.
#### How to Calculate

The LSTM cell updates are governed by the following equations:

1. **Forget Gate**: Decides what information to forget:

$$
f_t = \sigma(W_f \cdot [h_{t-1}, x_t] + b_f)
$$
2. **Input Gate**: Decides which new information to store:

$$
i_t = \sigma(W_i \cdot [h_{t-1}, x_t] + b_i)
$$

$$
\tilde{C}_t = \tanh(W_C \cdot [h_{t-1}, x_t] + b_C)
$$
3. **Cell State Update**: Combines the retained old state with the new candidate values:

$$
C_t = f_t \cdot C_{t-1} + i_t \cdot \tilde{C}_t
$$
4. **Output Gate**: Decides the new hidden state:

$$
o_t = \sigma(W_o \cdot [h_{t-1}, x_t] + b_o)
$$

$$
h_t = o_t \cdot \tanh(C_t)
$$
Where:

- **$f_t$, $i_t$, $o_t$** are the forget, input, and output gates.
- **$C_t$** is the cell state and **$\tilde{C}_t$** is the candidate cell state.
- **$h_t$** is the hidden state.
- **$W_f$, $W_i$, $W_C$, $W_o$** are weight matrices and **$b_f$, $b_i$, $b_C$, $b_o$** are bias terms.
- **$\sigma$** is the sigmoid activation function.
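
Written as code, one LSTM step looks roughly like the NumPy sketch below. The dictionary-of-weights layout and the concatenation used for $[h_{t-1}, x_t]$ are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, C_prev, W, b):
    """One LSTM step; W and b are dicts keyed by gate name ('f', 'i', 'C', 'o')."""
    z = np.concatenate([h_prev, x_t])       # [h_{t-1}, x_t]
    f_t = sigmoid(W["f"] @ z + b["f"])      # forget gate
    i_t = sigmoid(W["i"] @ z + b["i"])      # input gate
    C_tilde = np.tanh(W["C"] @ z + b["C"])  # candidate cell state
    C_t = f_t * C_prev + i_t * C_tilde      # cell state update
    o_t = sigmoid(W["o"] @ z + b["o"])      # output gate
    h_t = o_t * np.tanh(C_t)                # new hidden state
    return h_t, C_t

# Illustrative sizes: 3 input features, 8 hidden units
rng = np.random.default_rng(1)
n_in, n_hid = 3, 8
W = {k: rng.normal(scale=0.1, size=(n_hid, n_hid + n_in)) for k in "fiCo"}
b = {k: np.zeros(n_hid) for k in "fiCo"}

h, C = np.zeros(n_hid), np.zeros(n_hid)
for x_t in rng.normal(size=(5, n_in)):  # 5 time steps
    h, C = lstm_step(x_t, h, C, W, b)
```

Because the gates act elementwise on the cell state, $C_t$ can carry information across many steps largely unchanged, which is what mitigates the vanishing gradient problem.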
#### Common Uses

- **Long-Term Dependencies**: LSTMs are ideal for modeling tasks that require retaining information over long periods, such as predicting long-term climate patterns or population dynamics.
- **Sequential Data**: LSTMs are widely used in text processing and speech recognition tasks that require context from long sequences.

#### Issues

- **Computational Cost**: LSTMs are computationally expensive compared to simpler models like standard RNNs.
- **Complexity**: Tuning the hyperparameters (number of layers, hidden units) can be challenging.

---
### 3. Gated Recurrent Units (GRUs)

#### What is a GRU?

A **Gated Recurrent Unit (GRU)** is a simpler alternative to the LSTM. It combines the forget and input gates into a single **update gate** and merges the cell state with the hidden state, giving it fewer parameters and lower complexity while maintaining the ability to capture long-term dependencies.
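
The parameter saving is easy to check empirically. The sketch below, which assumes TensorFlow/Keras is available and uses arbitrary layer sizes, compares the weight counts of an LSTM layer and a GRU layer of the same width.

```python
import tensorflow as tf

def count_params(layer_cls, units=32, features=1):
    """Build a one-layer recurrent model and return its parameter count."""
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(None, features)),  # variable-length sequences, one feature
        layer_cls(units),
    ])
    return model.count_params()

print("LSTM:", count_params(tf.keras.layers.LSTM))
print("GRU: ", count_params(tf.keras.layers.GRU))
```

The GRU layer reports roughly three quarters of the LSTM's weights, since it uses three gated transformations per step instead of four.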
#### How to Calculate

The GRU uses two gates to control the flow of information:

1. **Update Gate**: Decides how much past information to keep:

$$
z_t = \sigma(W_z \cdot [h_{t-1}, x_t] + b_z)
$$
2. **Reset Gate**: Decides how much of the previous hidden state to forget:

$$
r_t = \sigma(W_r \cdot [h_{t-1}, x_t] + b_r)
$$
The new hidden state is calculated as:

$$
h_t = (1 - z_t) \cdot h_{t-1} + z_t \cdot \tanh(W_h \cdot [r_t \cdot h_{t-1}, x_t] + b_h)
$$
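
As with the LSTM, the update can be written out directly. In the NumPy sketch below (weight shapes and the concatenation for $[\cdot, \cdot]$ are again assumptions), the update gate $z_t$ interpolates between the previous hidden state and a candidate state computed from the reset-gated history.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(x_t, h_prev, W, b):
    """One GRU step; W and b are dicts keyed by 'z' (update), 'r' (reset), 'h' (candidate)."""
    zx = np.concatenate([h_prev, x_t])
    z_t = sigmoid(W["z"] @ zx + b["z"])  # update gate
    r_t = sigmoid(W["r"] @ zx + b["r"])  # reset gate
    h_tilde = np.tanh(W["h"] @ np.concatenate([r_t * h_prev, x_t]) + b["h"])
    return (1 - z_t) * h_prev + z_t * h_tilde  # new hidden state
```

Everything else (windowed inputs, loss, and training loop) is set up exactly as for the RNN and LSTM above.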
#### Common Uses

GRUs are used for similar tasks as LSTMs, including:

- **Time Series Prediction**: Forecasting future trends or anomalies in environmental data.
- **Text and Speech Processing**: Handling long sequences while being computationally more efficient than LSTMs.

#### Issues

- **Limited Flexibility**: GRUs lack an explicit cell state, which can limit their flexibility in capturing very long-term dependencies compared to LSTMs.

---
### 4. Common Applications of Time Series Modeling in Ecology

1. **Population Dynamics**:
   - Forecasting species population trends over time using LSTMs or RNNs to capture the seasonal and temporal variations in species abundance (a minimal forecasting sketch follows this list).

2. **Environmental Monitoring**:
   - Time series models like GRUs can be used to monitor and predict environmental variables (e.g., temperature, humidity) and to detect long-term patterns or anomalies.

3. **Climate Modeling**:
   - RNNs and LSTMs are employed to predict long-term climate change, using past climate data to forecast future trends and aid conservation efforts.
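
As a sketch of the population-dynamics case, the snippet below fits a small Keras LSTM to a synthetic monthly abundance index. The series, the 12-month window, the layer sizes, and the training settings are all illustrative assumptions.

```python
import numpy as np
import tensorflow as tf

# Synthetic monthly abundance index: seasonal cycle plus noise (20 years of data)
rng = np.random.default_rng(42)
months = np.arange(240)
series = 50 + 10 * np.sin(2 * np.pi * months / 12) + rng.normal(scale=2, size=months.size)

# Frame as supervised learning: 12 past months -> next month
window = 12
X = np.stack([series[i:i + window] for i in range(len(series) - window)])[..., None]
y = series[window:]

model = tf.keras.Sequential([
    tf.keras.Input(shape=(window, 1)),
    tf.keras.layers.LSTM(32),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X, y, validation_split=0.2, epochs=20, batch_size=16, verbose=0)

# One-step-ahead forecast from the last observed year
next_month = model.predict(series[-window:].reshape(1, window, 1), verbose=0)
```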
---
### 5. Issues in Time Series Models

1. **Vanishing/Exploding Gradients**:
   - RNNs struggle with long-term dependencies due to the vanishing gradient problem. LSTMs and GRUs mitigate this issue with their gating mechanisms, and gradient clipping helps guard against exploding gradients.

2. **Overfitting**:
   - With complex time series data, models can overfit the training data. Techniques like dropout regularization and early stopping can help combat overfitting (see the sketch after this list).

3. **Computational Cost**:
   - LSTMs and GRUs can be computationally expensive, especially for long sequences. Consider reducing the sequence length or using a simpler RNN when possible.
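
The sketch below shows how these mitigations are typically combined in Keras; the dropout rate, clipping norm, early-stopping patience, and the placeholder training arrays are illustrative assumptions rather than recommendations.

```python
import numpy as np
import tensorflow as tf

# Placeholder windowed data (samples x time steps x features); substitute real windows in practice
X_train = np.random.default_rng(0).normal(size=(200, 12, 1))
y_train = np.random.default_rng(1).normal(size=(200,))

model = tf.keras.Sequential([
    tf.keras.Input(shape=(12, 1)),
    tf.keras.layers.LSTM(32, dropout=0.2),  # dropout regularization inside the recurrent layer
    tf.keras.layers.Dense(1),
])

# clipnorm caps the gradient norm, guarding against exploding gradients
model.compile(optimizer=tf.keras.optimizers.Adam(clipnorm=1.0), loss="mse")

# Early stopping halts training once the validation loss stops improving
early_stop = tf.keras.callbacks.EarlyStopping(patience=5, restore_best_weights=True)

model.fit(X_train, y_train, validation_split=0.2, epochs=100,
          callbacks=[early_stop], verbose=0)
```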
---
### How to Use Machine Learning for Time Series Effectively

- **Use RNNs for Short-Term Dependencies**: RNNs are useful for tasks that rely heavily on recent inputs.
- **Leverage LSTMs for Long-Term Dependencies**: For tasks requiring long-term memory, LSTMs are the preferred choice.
- **Optimize with GRUs**: GRUs provide a balance between complexity and performance, making them ideal for medium-length dependencies.