================================================
Tutorial 3 - A Feedforward Spiking Neural Network
================================================

Tutorial written by Jason K. Eshraghian (`www.ncg.ucsc.edu <https://www.ncg.ucsc.edu>`_)

.. image:: https://colab.research.google.com/assets/colab-badge.svg
        :alt: Open In Colab
        :target: https://colab.research.google.com/github/jeshraghian/snntorch/blob/master/examples/tutorial_3_feedforward_snn.ipynb

The snnTorch tutorial series is based on the following paper. If you find these resources or code useful in your work, please consider citing the following source:
   
    `Jason K. Eshraghian, Max Ward, Emre Neftci, Xinxin Wang, Gregor Lenz, Girish
    Dwivedi, Mohammed Bennamoun, Doo Seok Jeong, and Wei D. Lu. “Training
    Spiking Neural Networks Using Lessons From Deep Learning”. Proceedings of the IEEE, 111(9) September 2023. <https://ieeexplore.ieee.org/abstract/document/10242251>`_

.. note::
  This tutorial is a static non-editable version. Interactive, editable versions are available via the following links:
    * `Google Colab <https://colab.research.google.com/github/jeshraghian/snntorch/blob/master/examples/tutorial_3_feedforward_snn.ipynb>`_
    * `Local Notebook (download via GitHub) <https://github.com/jeshraghian/snntorch/tree/master/examples>`_


Introduction
-------------

In this tutorial, you will: 

* Learn how to simplify the leaky integrate-and-fire (LIF) neuron to make it deep learning-friendly 
* Implement a feedforward spiking neural network (SNN)

Install the latest PyPi distribution of snnTorch:

::

    $ pip install snntorch

::

    # imports
    import snntorch as snn
    from snntorch import spikeplot as splt
    from snntorch import spikegen
    
    import torch
    import torch.nn as nn
    import matplotlib.pyplot as plt


1. Simplifying the Leaky Integrate-and-Fire Neuron Model
----------------------------------------------------------

In the previous tutorial, we designed our own LIF neuron model. But it was quite complex, and added an array of
hyperparameters to tune, including :math:`R`, :math:`C`,
:math:`\Delta t`, :math:`U_{\rm thr}`, and the choice of reset
mechanism. This is a lot to keep track of, and only grows more cumbersome
when scaled up to full-blown SNN. So let’s make a few
simplfications.

1.1 The Decay Rate: beta
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In the previous tutorial, the Euler method was used to derive the
following solution to the passive membrane model:

.. math:: U(t+\Delta t) = (1-\frac{\Delta t}{\tau})U(t) + \frac{\Delta t}{\tau} I_{\rm in}(t)R \tag{1}

Now assume there is no input current, :math:`I_{\rm in}(t)=0 A`:

.. math:: U(t+\Delta t) = (1-\frac{\Delta t}{\tau})U(t) \tag{2}

Let the ratio of subsequent values of :math:`U`, i.e.,
:math:`U(t+\Delta t)/U(t)` be the decay rate of the membrane potential,
also known as the inverse time constant:

.. math:: U(t+\Delta t) = \beta U(t) \tag{3}

From :math:`(1)`, this implies that:

.. math:: \beta = (1-\frac{\Delta t}{\tau}) \tag{4}

For reasonable accuracy, :math:`\Delta t << \tau`.

1.2 Weighted Input Current
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

If we assume :math:`t` represents time-steps in a sequence rather than
continuous time, then we can set :math:`\Delta t = 1`. To
further reduce the number of hyperparameters, assume :math:`R=1`. From
:math:`(4)`, these assumptions lead to:

.. math:: \beta = (1-\frac{1}{C}) \implies (1-\beta)I_{\rm in} = \frac{1}{\tau}I_{\rm in} \tag{5}

The input current is weighted by :math:`(1-\beta)`. 
By additionally assuming input current instantaneously contributes to the membrane potential:

.. math:: U[t+1] = \beta U[t] + (1-\beta)I_{\rm in}[t+1] \tag{6}

Note that the discretization of time means we are assuming that each
time bin :math:`t` is brief enough such that a neuron may only emit a
maximum of one spike in this interval.

In deep learning, the weighting factor of an input is often a learnable
parameter. Taking a step away from the physically viable assumptions
made thus far, we subsume the effect of :math:`(1-\beta)` from
:math:`(6)` into a learnable weight :math:`W`, and replace
:math:`I_{\rm in}[t]` accordingly with an input :math:`X[t]`:

.. math:: WX[t] = I_{\rm in}[t] \tag{7}

This can be interpreted in the following way. :math:`X[t]` is an input
voltage, or spike, and is scaled by the synaptic conductance of
:math:`W` to generate a current injection to the neuron. This gives us
the following result:

.. math:: U[t+1] = \beta U[t] + WX[t+1] \tag{8}

In future simulations, the effects of :math:`W` and :math:`\beta` are decoupled.
:math:`W` is a learnable parameter that is updated independently of :math:`\beta`.

1.3 Spiking and Reset
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

We now introduce the spiking and reset mechanisms. Recall that if
the membrane exceeds the threshold, then the neuron emits an output
spike:

.. math::

   S[t] = \begin{cases} 1, &\text{if}~U[t] > U_{\rm thr} \\
   0, &\text{otherwise} \end{cases}

.. math::
   
   \tag{9}

If a spike is triggered, the membrane potential should be reset. The
*reset-by-subtraction* mechanism is modeled by:

.. math:: U[t+1] = \underbrace{\beta U[t]}_\text{decay} + \underbrace{WX[t+1]}_\text{input} - \underbrace{S[t]U_{\rm thr}}_\text{reset} \tag{10}

As :math:`W` is a learnable parameter, and :math:`U_{\rm thr}` is often
just set to :math:`1` (though can be tuned), this leaves the decay rate
:math:`\beta` as the only hyperparameter left to be specified. This
completes the painful part of this tutorial.

.. note::

   Some implementations might make slightly different assumptions.
   E.g., :math:`S[t] \rightarrow S[t+1]` in :math:`(9)`, or
   :math:`X[t] \rightarrow X[t+1]` in :math:`(10)`. This above
   derivation is what is used in snnTorch as we find it maps intuitively
   to a recurrent neural network representation, without any change in
   performance.

1.4 Code Implementation
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Implementing this neuron in Python looks like this:

::

    def leaky_integrate_and_fire(mem, x, w, beta, threshold=1):
      spk = (mem > threshold) # if membrane exceeds threshold, spk=1, else, 0
      mem = beta * mem + w*x - spk*threshold
      return spk, mem

To set :math:`\beta`, we have the option of either using Equation
:math:`(3)` to define it, or hard-coding it directly. Here, we will use
:math:`(3)` for the sake of a demonstration, but in future, it will just be hard-coded as we
are more focused on something that works rather than biological precision.

Equation :math:`(3)` tells us that :math:`\beta` is the ratio of
membrane potential across two subsequent time steps. Solve
this using the continuous time-dependent form of the equation (assuming
no current injection), which was derived in `Tutorial
2 <https://snntorch.readthedocs.io/en/latest/tutorials/index.html>`__:

.. math:: U(t) = U_0e^{-\frac{t}{\tau}}

:math:`U_0` is the initial membrane potential at :math:`t=0`. Assume the
time-dependent equation is computed at discrete steps of
:math:`t, (t+\Delta t), (t+2\Delta t)~...~`, then we can find the ratio
of membrane potential between subsequent steps using:

.. math:: \beta = \frac{U_0e^{-\frac{t+\Delta t}{\tau}}}{U_0e^{-\frac{t}{\tau}}} = \frac{U_0e^{-\frac{t + 2\Delta t}{\tau}}}{U_0e^{-\frac{t+\Delta t}{\tau}}} =~~...

.. math:: \implies \beta = e^{-\frac{\Delta t}{\tau}} 

::

    # set neuronal parameters
    delta_t = torch.tensor(1e-3)
    tau = torch.tensor(5e-3)
    beta = torch.exp(-delta_t/tau)
   
::

    >>> print(f"The decay rate is: {beta:.3f}")
    The decay rate is: 0.819

Run a quick simulation to check the neuron behaves correctly in
response to a step voltage input:

::

    num_steps = 200
    
    # initialize inputs/outputs + small step current input
    x = torch.cat((torch.zeros(10), torch.ones(190)*0.5), 0)
    mem = torch.zeros(1)
    spk_out = torch.zeros(1)
    mem_rec = []
    spk_rec = []
    
    # neuron parameters
    w = 0.4
    beta = 0.819
    
    # neuron simulation
    for step in range(num_steps):
      spk, mem = leaky_integrate_and_fire(mem, x[step], w=w, beta=beta)
      mem_rec.append(mem)
      spk_rec.append(spk)
    
    # convert lists to tensors
    mem_rec = torch.stack(mem_rec)
    spk_rec = torch.stack(spk_rec)
    
    plot_cur_mem_spk(x*w, mem_rec, spk_rec, thr_line=1,ylim_max1=0.5,
                     title="LIF Neuron Model With Weighted Step Voltage")

.. image:: https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial3/_static/lif_step.png?raw=true
        :align: center
        :width: 400


2. Leaky Neuron Model in snnTorch
---------------------------------------

This same thing can be achieved by instantiating ``snn.Leaky``, in a
similar way to how we used ``snn.Lapicque`` in the previous tutorial, but with less hyperparameters:

::

    lif1 = snn.Leaky(beta=0.8)

The neuron model is now stored in ``lif1``. To use this neuron:

**Inputs** 

* ``cur_in``: each element of :math:`W\times X[t]` is sequentially passed as an input 
* ``mem``: the previous step membrane potential, :math:`U[t-1]`, is also passed as input.

**Outputs** 

* ``spk_out``: output spike :math:`S[t]` (‘1’ if there is a spike; ‘0’ if there is no spike) 
* ``mem``: membrane potential :math:`U[t]` of the present step

These all need to be of type ``torch.Tensor``. Note that here, we assume
the input current has already been weighted before passing into the
``snn.Leaky`` neuron. This will make more sense when we construct a
network-scale model. Also, equation :math:`(10)` has been time-shifted
back one step without loss of generality.

::

    # Small step current input
    w=0.21
    cur_in = torch.cat((torch.zeros(10), torch.ones(190)*w), 0)
    mem = torch.zeros(1)
    spk = torch.zeros(1)
    mem_rec = []
    spk_rec = []
    
    # neuron simulation
    for step in range(num_steps):
      spk, mem = lif1(cur_in[step], mem)
      mem_rec.append(mem)
      spk_rec.append(spk)
    
    # convert lists to tensors
    mem_rec = torch.stack(mem_rec)
    spk_rec = torch.stack(spk_rec)
    
    plot_cur_mem_spk(cur_in, mem_rec, spk_rec, thr_line=1, ylim_max1=0.5,
                     title="snn.Leaky Neuron Model")

Compare this plot with the manually derived leaky integrate-and-fire neuron. 
The membrane potential reset is slightly weaker: i.e., it uses a *soft reset*. 
This has been done intentionally because it enables better performance on a few deep learning benchmarks. 
The equation used instead is:

.. math:: U[t+1] = \underbrace{\beta U[t]}_\text{decay} + \underbrace{WX[t+1]}_\text{input} - \underbrace{\beta S[t]U_{\rm thr}}_\text{soft reset} \tag{11}


This model has the same optional input arguments of ``reset_mechanism``
and ``threshold`` as described for Lapicque’s neuron model.

.. image:: https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial3/_static/snn.leaky_step.png?raw=true
        :align: center
        :width: 450


3. A Feedforward Spiking Neural Network
---------------------------------------------

So far, we have only considered how a single neuron responds to input
stimulus. snnTorch makes it straightforward to scale this up to a deep
neural network. In this section, we will create a 3-layer fully-connected neural
network of dimensions 784-1000-10. Compared to our simulations so far, each neuron will now integrate over
many more incoming input spikes.

.. image:: https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial2/2_8_fcn.png?raw=true
        :align: center
        :width: 600


PyTorch is used to form the connections between neurons, and
snnTorch to create the neurons. First, initialize all layers.

::

    # layer parameters
    num_inputs = 784
    num_hidden = 1000
    num_outputs = 10
    beta = 0.99
    
    # initialize layers
    fc1 = nn.Linear(num_inputs, num_hidden)
    lif1 = snn.Leaky(beta=beta)
    fc2 = nn.Linear(num_hidden, num_outputs)
    lif2 = snn.Leaky(beta=beta)

Next, initialize the hidden variables and outputs of each spiking
neuron. As networks increase in size, this becomes more tedious.
The static method ``init_leaky()`` can be used to take care of
this. All neurons in snnTorch have their own initialization methods that
follow this same syntax, e.g., ``init_lapicque()``. The shape of the
hidden states are automatically initialized based on the input data
dimensions during the first forward pass.

::

    # Initialize hidden states
    mem1 = lif1.init_leaky()
    mem2 = lif2.init_leaky()
    
    # record outputs
    mem2_rec = []
    spk1_rec = []
    spk2_rec = []

Create an input spike train to pass to the network. There are 200 time
steps to simulate across 784 input neurons, i.e., the input originally
has dimensions of :math:`200 \times 784`. However, neural nets typically process data in minibatches. 
snnTorch, uses time-first dimensionality:

[:math:`time \times batch\_size \times feature\_dimensions`]

So ‘unsqueeze’ the input along ``dim=1`` to indicate ‘one batch’
of data. The dimensions of this input tensor must be 200 :math:`\times`
1 :math:`\times` 784:

::

    spk_in = spikegen.rate_conv(torch.rand((200, 784))).unsqueeze(1)
    >>> print(f"Dimensions of spk_in: {spk_in.size()}")
    "Dimensions of spk_in: torch.Size([200, 1, 784])"

Now it’s finally time to run a full simulation. An intuitive way to
think about how PyTorch and snnTorch work together is that PyTorch
routes the neurons together, and snnTorch loads the results into spiking
neuron models. In terms of coding up a network, these spiking neurons
can be treated like time-varying activation functions.

Here is a sequential account of what’s going on:

-  The :math:`i^{th}` input from ``spk_in`` to the :math:`j^{th}` neuron 
   is weighted by the parameters initialized in ``nn.Linear``:
   :math:`X_{i} \times W_{ij}`
-  This generates the input current term from Equation :math:`(10)`,
   contributing to :math:`U[t+1]` of the spiking neuron
-  If :math:`U[t+1] > U_{\rm thr}`, then a spike is triggered from this
   neuron
-  This spike is weighted by the second layer weight, and the above
   process is repeated for all inputs, weights, and neurons.
-  If there is no spike, then nothing is passed to the post-synaptic
   neuron.

The only difference from our simulations thus far is that we are now
scaling the input current with a weight generated by ``nn.Linear``,
rather than manually setting :math:`W` ourselves.

::

    # network simulation
    for step in range(num_steps):
        cur1 = fc1(spk_in[step]) # post-synaptic current <-- spk_in x weight
        spk1, mem1 = lif1(cur1, mem1) # mem[t+1] <--post-syn current + decayed membrane
        cur2 = fc2(spk1)
        spk2, mem2 = lif2(cur2, mem2)
    
        mem2_rec.append(mem2)
        spk1_rec.append(spk1)
        spk2_rec.append(spk2)
    
    # convert lists to tensors
    mem2_rec = torch.stack(mem2_rec)
    spk1_rec = torch.stack(spk1_rec)
    spk2_rec = torch.stack(spk2_rec)
    
    plot_snn_spikes(spk_in, spk1_rec, spk2_rec, "Fully Connected Spiking Neural Network")

.. image:: https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial3/_static/mlp_raster.png?raw=true
        :align: center
        :width: 450

At this stage, the spikes don’t have any real meaning. The inputs and
weights are all randomly initialized, and no training has taken place.
But the spikes should appear to be propagating from the first layer
through to the output. If you are not seeing any spikes, then you might have
 been unlucky in the weight initialization lottery - you might want
to try re-running the last four code-blocks.

``spikeplot.spike_count`` can create a spike counter of
the output layer. The following animation will take some time to
generate.

   Note: if you are running the notebook locally on your desktop, please
   uncomment the line below and modify the path to your ffmpeg.exe

::

    from IPython.display import HTML
    
    fig, ax = plt.subplots(facecolor='w', figsize=(12, 7))
    labels=['0', '1', '2', '3', '4', '5', '6', '7', '8','9']
    spk2_rec = spk2_rec.squeeze(1).detach().cpu()
    
    # plt.rcParams['animation.ffmpeg_path'] = 'C:\\path\\to\\your\\ffmpeg.exe'
    
    #  Plot spike count histogram
    anim = splt.spike_count(spk2_rec, fig, ax, labels=labels, animate=True)
    HTML(anim.to_html5_video())
    # anim.save("spike_bar.mp4")

.. raw:: html

  <center>
    <video controls src="https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial3/_static/spike_bar.mp4?raw=true"></video>
  </center>

``spikeplot.traces`` lets you visualize the membrane potential traces. We will plot 9 out of 10 output neurons. 
Compare it to the animation and raster plot above to see if you can match the traces to the neuron.

::

    # plot membrane potential traces
    splt.traces(mem2_rec.squeeze(1), spk=spk2_rec.squeeze(1))
    fig = plt.gcf() 
    fig.set_size_inches(8, 6)

.. image:: https://github.com/jeshraghian/snntorch/blob/master/docs/_static/img/examples/tutorial3/_static/traces.png?raw=true
        :align: center
        :width: 450

It is fairly normal if some neurons are firing while others are
completely dead. Again, none of these spikes have any real meaning until
the weights have been trained.

Conclusion
-----------

That covers how to simplify the leaky integrate-and-fire neuron model,
and then using it to build a spiking neural network. In practice, we
will almost always prefer to use ``snn.Leaky`` over ``snn.Lapicque`` for
training networks, as there is a smaller hyperparameter search space.

`Tutorial
4 <https://snntorch.readthedocs.io/en/latest/tutorials/index.html>`__
goes into detail with the 2nd-order ``snn.Synaptic`` and ``snn.Alpha``
models. This next tutorial is not necessary for training a network, so if you wish to go straight
to deep learning with snnTorch, then skip ahead to `Tutorial
5 <https://snntorch.readthedocs.io/en/latest/tutorials/index.html>`__.

If you like this project, please consider starring ⭐ the repo on GitHub as it is the easiest and best way to support it.

For reference, the documentation `can be found
here <https://snntorch.readthedocs.io/en/latest/snntorch.html>`__.

Further Reading
---------------

-  `Check out the snnTorch GitHub project here. <https://github.com/jeshraghian/snntorch>`__
-  `snnTorch
   documentation <https://snntorch.readthedocs.io/en/latest/snntorch.html>`__
   of the Lapicque, Leaky, Synaptic, and Alpha models
-  `Neuronal Dynamics: From single neurons to networks and models of
   cognition <https://neuronaldynamics.epfl.ch/index.html>`__ by Wulfram
   Gerstner, Werner M. Kistler, Richard Naud and Liam Paninski.
-  `Theoretical Neuroscience: Computational and Mathematical Modeling of
   Neural
   Systems <https://mitpress.mit.edu/books/theoretical-neuroscience>`__
   by Laurence F. Abbott and Peter Dayan