Download

Download this notebook: plot_06_calcium_imaging.ipynb!

Fit Calcium Imaging#

For the example dataset, we will be working with a recording of a freely-moving mouse imaged with a Miniscope (1-photon imaging at 30Hz using the genetically encoded calcium indicator GCaMP6f). The area recorded for this experiment is the postsubiculum - a region that is known to contain head-direction cells, or cells that fire when the animal’s head is pointing in a specific direction.

The data were collected by Sofia Skromne Carrasco from the Peyrache Lab.

import jax
import jax.numpy as jnp
import matplotlib.pyplot as plt
import pynapple as nap

import nemos as nmo

configure plots

plt.style.use(nmo.styles.plot_style)

Data Streaming#

Here we load the data from OSF. The data is a NWB file.

path = nmo.fetch.fetch_data("A0670-221213.nwb")

Downloading file 'A0670-221213.nwb' from 'https://osf.io/download/sbnaw/' to '/home/docs/.cache/nemos'.

pynapple preprocessing#

Now that we have the file, let’s load the data. The NWB file contains multiple entries.

data = nap.load_file(path)
print(data)

A0670-221213
┍━━━━━━━━━━━━━━━━━━━━━━━┯━━━━━━━━━━━━━┑
│ Keys                  │ Type        │
┝━━━━━━━━━━━━━━━━━━━━━━━┿━━━━━━━━━━━━━┥
│ position_time_support │ IntervalSet │
│ RoiResponseSeries     │ TsdFrame    │
│ z                     │ Tsd         │
│ y                     │ Tsd         │
│ x                     │ Tsd         │
│ rz                    │ Tsd         │
│ ry                    │ Tsd         │
│ rx                    │ Tsd         │
┕━━━━━━━━━━━━━━━━━━━━━━━┷━━━━━━━━━━━━━┙

In the NWB file, the calcium traces are saved the RoiResponseSeries field. Let’s save them in a variable called ‘transients’ and print it.

transients = data['RoiResponseSeries']
print(transients)

Time (s)          0        1         2         3         4  ...
----------  -------  -------  --------  --------  --------  -----
1187      0.27546  0.79973  0.16383   0.20118   0.029255  ...
15225     0.26665  0.86751  0.15879   0.23682   0.027189  ...
18585     0.25796  0.89419  0.15352   0.25074   0.036514  ...
2194      0.24943  0.89513  0.14812   0.25215   0.056273  ...
253       0.24111  0.88023  0.14898   0.24651   0.070954  ...
28655     0.233    0.85584  0.14858   0.23706   0.081469  ...
32015     0.22513  1.0996   0.14715   0.22572   0.088588  ...
...                                                         ...
38945  0.20815  0.17535  0.12126   0.094461  0.87427   ...
42305  0.20247  0.17243  0.11807   0.089918  1.2578    ...
4566   0.19654  0.17056  0.11461   0.085079  1.62      ...
4902   0.19052  0.16645  0.11096   0.080197  1.8811    ...
52375  0.18449  0.16105  0.10717   0.075416  2.0599    ...
55735  0.17851  0.15494  0.10331   0.070814  2.2176    ...
5909   0.17264  0.14851  0.099416  0.066429  2.311     ...
dtype: float64, shape: (35757, 65)

transients is a TsdFrame. Each column contains the activity of one neuron.

The mouse was recorded for a 20 minute recording epoch as we can see from the time_support property of the transients object.

ep = transients.time_support
print(ep)

  index    start      end
      0   3.1187  1203.59
shape: (1, 2), time unit: sec.

There are a few different ways we can explore the data. First, let’s inspect the raw calcium traces for neurons 4 and 35 for the first 250 seconds of the experiment.

fig, ax = plt.subplots(1, 2, figsize=(12, 4))
ax[0].plot(transients[:, 4].get(0,250))
ax[0].set_ylabel("Firing rate (Hz)")
ax[0].set_title("Trace 4")
ax[0].set_xlabel("Time(s)")
ax[1].plot(transients[:, 35].get(0,250))
ax[1].set_title("Trace 35")
ax[1].set_xlabel("Time(s)")
plt.tight_layout()

../_images/d94be469caf5788e1873d03e44303f424d3904073fe4a6f7c8900d465a4642b1.png

You can see that the calcium signals are both nonnegative, and noisy. One (neuron 4) has much higher SNR than the other. We cannot typically resolve individual action potentials, but instead see slow calcium fluctuations that result from an unknown underlying electrical signal (estimating the spikes from calcium traces is known as deconvolution and is beyond the scope of this demo).

We can also plot tuning curves, plotting mean calcium activity as a function of head direction, using the function compute_tuning_curves_continuous. Here data['ry'] is a Tsd that contains the angular head-direction of the animal between 0 and 2\(\pi\).

tcurves = nap.compute_tuning_curves(transients, data['ry'], 120, feature_names=["angles"])

The function returns a pandas DataFrame. Let’s plot the tuning curves for neurons 4 and 35.

fig, ax = plt.subplots(1, 2, figsize=(12, 4))
ax[0].plot(tcurves.angles, tcurves[4])
ax[0].set_xlabel("Angle (rad)")
ax[0].set_ylabel("Firing rate (Hz)")
ax[0].set_title("Trace 4")
ax[1].plot(tcurves.angles, tcurves[35])
ax[1].set_xlabel("Angle (rad)")
ax[1].set_title("Trace 35")
plt.tight_layout()

../_images/4e6880737fcf2af89d42d9d67869dd16191097dc5aa179caa3d77117300340c6.png

As a first processing step, let’s bin the calcium traces to a 100ms resolution.

Y = transients.bin_average(0.1, ep)

We can visualize the downsampled transients for the first 50 seconds of data.

plt.figure()
plt.plot(transients[:,0].get(0, 50), linewidth=5, label="30 Hz")
plt.plot(Y[:,0].get(0, 50), '--', linewidth=2, label="10 Hz")
plt.xlabel("Time (s)")
plt.ylabel("Fluorescence")
plt.legend()
plt.show()

../_images/233714967eef38c74b6a41cb46a200ac417ff0a8f52d55f9da22462a679a0874.png

The downsampling did not destroy the fast transient dynamics, so seems fine to use. We can now move on to using NeMoS to fit a model.

Basis instantiation#

We can define a cyclic-BSpline for capturing the encoding of the heading angle, and a log-spaced raised cosine basis for the coupling filters between neurons. Note that we are not including a self-coupling (spike history) filter, because in practice we have found it results in overfitting.

We can combine the two bases.

heading_basis = nmo.basis.CyclicBSplineEval(n_basis_funcs=12, label="heading")
coupling_basis = nmo.basis.RaisedCosineLogConv(3, window_size=10, label="coupling")

Let’s combine both basis into a single additive element.

basis = heading_basis + coupling_basis
basis

'(heading + coupling)': AdditiveBasis(
    basis1='heading': CyclicBSplineEval(n_basis_funcs=12, order=4),
    basis2='coupling': RaisedCosineLogConv(n_basis_funcs=3, window_size=10, width=2.0, time_scaling=50.0, enforce_decay_to_zero=True),
)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

Gamma GLM#

Until now, we have been modeling spike trains, and have used a Poisson distribution for the observation model. With calcium traces, things are quite different: we no longer have counts but continuous signals, so the Poisson assumption is no longer appropriate. A Gaussian model is also not ideal since the calcium traces are non-negative. To satisfy these constraints, we will use a Gamma distribution from NeMoS with a soft-plus non linearity.

Non-linearity

Different option are possible. With a soft-plus we are assuming an “additive” effect of the predictors, while an exponential non-linearity assumes multiplicative effects. Deciding which firing rate model works best is an empirical question. You can fit different configurations to see which one capture best the neural activity.

gamma_model = nmo.glm.GLM(
    solver_kwargs=dict(tol=10**-13),
    regularizer="Ridge",
    regularizer_strength=0.02,
    observation_model="Gamma",
    inverse_link_function=jax.nn.softplus,
)

We select one neuron to fit later, so remove it from the list of predictors

neu = 4
selected_neurons = jnp.hstack(
    (jnp.arange(0, neu), jnp.arange(neu+1, Y.shape[1]))
)

print(selected_neurons)

[ 0  1  2  3  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48
 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64]

We need to bring the head-direction of the animal to the same size as the transients matrix. We can use the function bin_average of pynapple. Notice how we pass the parameter ep that is the time_support of the transients.

head_direction = data['ry'].bin_average(0.1, ep)

Let’s check that head_direction and Y are of the same size.

print(head_direction.shape)
print(Y.shape)

(12005,)
(12005, 65)

Design matrix#

We can now create the design matrix by combining the head-direction of the animal and the activity of all other neurons.

X = basis.compute_features(head_direction, Y[:, selected_neurons])

/home/docs/checkouts/readthedocs.org/user_builds/nemos/envs/stable/lib/python3.12/site-packages/pynapple/core/utils.py:198: UserWarning: Converting 'd' to numpy.array. The provided array was of type 'ArrayImpl'.
  warnings.warn(

Train & test set#

Let’s create a train epoch and a test epoch to fit and test the models. Since X is a pynapple time series, we can create IntervalSet objects to restrict them into a train set and test set.

train_ep = nap.IntervalSet(start=X.time_support.start, end=X.time_support.get_intervals_center().t)
test_ep = X.time_support.set_diff(train_ep) # Removing the train_ep from time_support

print(train_ep)
print(test_ep)

  index    start      end
      0   3.1187  603.355
shape: (1, 2), time unit: sec.
  index    start      end
      0  603.355  1203.59
shape: (1, 2), time unit: sec.

We can now restrict the X and Y to create our train set and test set.

Xtrain = X.restrict(train_ep)
Ytrain = Y.restrict(train_ep)

Xtest = X.restrict(test_ep)
Ytest = Y.restrict(test_ep)

Model fitting#

It’s time to fit the model on the data from the neuron we left out.

gamma_model.fit(Xtrain, Ytrain[:, neu])

/home/docs/checkouts/readthedocs.org/user_builds/nemos/envs/stable/lib/python3.12/site-packages/nemos/glm/glm.py:841: RuntimeWarning: The fit did not converge. Consider the following:
1) Enable float64 with ``jax.config.update('jax_enable_x64', True)`` 
2) Increase the max number of iterations or increase tolerance (if reasonable). These parameters can be specified by providing a ``solver_kwargs`` dictionary. For the available options see the ``self.solver.__init__`` docstrings.
  warnings.warn(

GLM(
    observation_model=GammaObservations(),
    inverse_link_function=softplus,
    regularizer=Ridge(),
    regularizer_strength=0.02,
    solver_name='LBFGS',
    solver_kwargs={'tol': 1e-13}
)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

GLM

iFitted

Parameters

	observation_model	GammaObservations()
	inverse_link_function	<PjitFunction...738fb6963ec0>>
	regularizer	Ridge()
	regularizer_strength	0.02
	solver_name	'LBFGS'
	solver_kwargs	{'tol': 1e-13}

Fitted attributes

Name	Type	Value
aux_	NoneType	None
coef_	ArrayImpl[float32](204,)	Array([-3.927...dtype=float32)
dof_resid_	ArrayImpl[float32](1,)	Array([5797.], dtype=float32)
intercept_	ArrayImpl[float32](1,)	Array([-2.713...dtype=float32)
scale_	ArrayImpl[float32](1,)	Array([1.5165...dtype=float32)
solver_state_	OptimistixAdapterState	OptimistixAda...k_bool[] ) )

Model comparison#

We can compare the Gamma GLM to a standard Gaussian GLM. Nemos implements Gaussian GLMs as well.

gaussian_model = nmo.glm.GLM(
    observation_model="Gaussian",    
    regularizer="Ridge",
    regularizer_strength=0.02,
    solver_kwargs=dict(tol=10**-13),
)

gaussian_model.fit(Xtrain, Ytrain[:, neu])

/home/docs/checkouts/readthedocs.org/user_builds/nemos/envs/stable/lib/python3.12/site-packages/nemos/glm/glm.py:841: RuntimeWarning: The fit did not converge. Consider the following:
1) Enable float64 with ``jax.config.update('jax_enable_x64', True)`` 
2) Increase the max number of iterations or increase tolerance (if reasonable). These parameters can be specified by providing a ``solver_kwargs`` dictionary. For the available options see the ``self.solver.__init__`` docstrings.
  warnings.warn(

GLM(
    observation_model=GaussianObservations(),
    inverse_link_function=identity,
    regularizer=Ridge(),
    regularizer_strength=0.02,
    solver_name='LBFGS',
    solver_kwargs={'tol': 1e-13}
)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

GLM

iFitted

Parameters

	observation_model	GaussianObservations()
	inverse_link_function	<function <la...x738f8f8a37e0>
	regularizer	Ridge()
	regularizer_strength	0.02
	solver_name	'LBFGS'
	solver_kwargs	{'tol': 1e-13}

Fitted attributes

Name	Type	Value
aux_	NoneType	None
coef_	ArrayImpl[float32](204,)	Array([-6.120...dtype=float32)
dof_resid_	ArrayImpl[float32](1,)	Array([5797.], dtype=float32)
intercept_	ArrayImpl[float32](1,)	Array([0.1373...dtype=float32)
scale_	ArrayImpl[float32](1,)	Array([0.1137...dtype=float32)
solver_state_	OptimistixAdapterState	OptimistixAda...k_bool[] ) )

We now have 2 models we can compare. Let’s predict the activity of the neuron during the test epoch.

yp = gamma_model.predict(Xtest)
ylreg = gaussian_model.predict(Xtest)

/home/docs/checkouts/readthedocs.org/user_builds/nemos/envs/stable/lib/python3.12/site-packages/pynapple/core/utils.py:198: UserWarning: Converting 'd' to numpy.array. The provided array was of type 'ArrayImpl'.
  warnings.warn(

Let’s plot the predicted activity for the first 60 seconds of data.

# mkdocs_gallery_thumbnail_number = 3

ep_to_plot = nap.IntervalSet(test_ep.start+20, test_ep.start+80)

plt.figure()
plt.plot(Ytest[:,neu].restrict(ep_to_plot), "r", label="true", linewidth=2)
plt.plot(yp.restrict(ep_to_plot), "k", label="gamma-nemos", alpha=1)
plt.plot(ylreg.restrict(ep_to_plot), "g", label="gaussian-nemos", alpha=0.5)
plt.legend(loc='best')
plt.xlabel("Time (s)")
plt.ylabel("Fluorescence")
plt.show()

../_images/8469abce6d8b23a76c6efc1a786fef25c36bda7202fd72e56f46eca3473d2d8c.png

While there is some variability in the fit for both models, one advantage of the gamma distribution is clear: the nonnegativity constraint is followed with the data. This is required for using GLMs to predict the firing rate, which must be positive, in response to simulated inputs. See Peyrache et al. 2018\(^{[1]}\) for an example of simulating activity with a GLM.

Another way to compare models is to compute tuning curves. Here we use the function compute_tuning_curves from pynapple.

real_tcurves = nap.compute_tuning_curves(transients, data['ry'], 120, epochs=test_ep, feature_names=["hd"])
gamma_tcurves = nap.compute_tuning_curves(yp, data['ry'], 120, epochs=test_ep, feature_names=["hd"])
linreg_tcurves = nap.compute_tuning_curves(ylreg, data['ry'], 120, epochs=test_ep, feature_names=["hd"])

Let’s plot them.

fig = plt.figure()
plt.plot(real_tcurves.hd, real_tcurves.sel(unit=neu), "r", label="true", linewidth=2)
plt.plot(gamma_tcurves.hd, gamma_tcurves[0], "k", label="gamma-nemos", alpha=1)
plt.plot(linreg_tcurves.hd, linreg_tcurves[0], "g", label="gaussian-nemos", alpha=0.5)
plt.legend(loc='best')
plt.ylabel("Fluorescence")
plt.xlabel("Head-direction (rad)")
plt.show()

../_images/de6f70dcbef88738bca7761725249bb356dd606303e9333c214bd598c582a1b4.png

Gamma-GLM for Calcium Imaging Analysis

Using Gamma-GLMs for fitting calcium imaging data is still in early stages, and hasn’t been through the levels of review and validation that they have for fitting spike data. Users should consider this a relatively unexplored territory, and we hope that we hope that NeMoS will help researchers explore this new space of models.

References#

[1] Peyrache, A., Schieferstein, N. & Buzsáki, G. Transformation of the head-direction signal into a spatial code. Nat Commun 8, 1752 (2017). https://doi.org/10.1038/s41467-017-01908-3

	label	'(heading + coupling)'
	heading__bounds	None
	heading__fill_value	nan
	heading__label	'heading'
	heading__n_basis_funcs	12
	heading__order	4
	heading	'heading': Cy...s=12, order=4)
	coupling__conv_kwargs	{}
	coupling__enforce_decay_to_zero	True
	coupling__label	'coupling'
	coupling__n_basis_funcs	3
	coupling__time_scaling	50.0
	coupling__width	2.0
	coupling__window_size	10
	coupling	'coupling': R..._to_zero=True)