brainbox.population.cca

Functions

`bin_spikes_trials`	Binarizes the spike times into a raster and assigns a trial number to each bin
`fit_cca`	Initialize and fit CCA sklearn object
`get_cca_projection`	Project data into CCA dimensions
`get_correlations`
`get_event_bin_indexes`	Get the indexes of the bins corresponding to a specific behavioral event within a window
`plot_correlations`	Correlation vs CCA dimension
`plot_pairwise_correlations`	Plot CCA correlations for multiple pairs of regions
`plot_pairwise_correlations_mult`	Plot CCA correlations for multiple pairs of regions, for multiple behavioural events
`preprocess`	Preprocess neural data for cca analysis with smoothing and pca
`shuffle_analysis`	Perform CCA on shuffled data
`split_by_area`	This function converts a matrix of binned spikes into a list of matrices, with the clusters grouped by brain areas
`split_timepoints`	Assign each time point to testing or training fold
`split_trials`	Assign each trial to testing or training fold

preprocess(data, smoothing_sd=25, n_pcs=20)[source]

Preprocess neural data for cca analysis with smoothing and pca

Parameters:

data (array-like) – array of shape (n_samples, n_features)
smoothing_sd (float) – gaussian smoothing kernel standard deviation (ms)
n_pcs (int) – number of pca dimensions to retain

Returns:

preprocessed neural data

Return type:

array-like, shape (n_samples, pca_dims)

split_trials(trial_ids, n_splits=5, rng_seed=0)[source]

Assign each trial to testing or training fold

Parameters:

trial_ids (array-like)
n_splits (int) – one split used for testing; remaining splits used for training
rng_seed (int) – set random state for shuffling trials

Returns:

list of dicts of indices with keys train and test

split_timepoints(trial_ids, idxs_trial)[source]

Assign each time point to testing or training fold

Parameters:

trial_ids (array-like) – trial id for each timepoint
idxs_trial (list) – list of dicts that define which trials are in train or test folds

Returns:

list of dicts that define which time points are in train and test folds

fit_cca(data_0, data_1, n_cca_dims=10)[source]

Initialize and fit CCA sklearn object

Parameters:

data_0 (array-like) – shape (n_samples, n_features_0)
data_1 (array-like) – shape (n_samples, n_features_1)
n_cca_dims (int) – number of CCA dimensions to fit

Returns:

sklearn cca object

get_cca_projection(cca, data_0, data_1)[source]

Project data into CCA dimensions

Parameters:

cca
data_0
data_1

Returns:

tuple; (data_0 projection, data_1 projection)

get_correlations(cca, data_0, data_1)[source]

Parameters:

cca
data_0
data_1

Returns:

shuffle_analysis(data_0, data_1, n_shuffles=100, **cca_kwargs)[source]

Perform CCA on shuffled data

Parameters:

data_0
data_1
n_shuffles

Returns:

plot_correlations(corrs, errors=None, ax=None, **plot_kwargs)[source]

Correlation vs CCA dimension

Parameters:

corrs (1-D vector) – correlation values for the CCA dimensions
errors – error values
ax (matplotlib axis object) – axis to plot on (default None)

Returns:

axis if specified, or plot if axis = None

plot_pairwise_correlations(means, stderrs=None, n_dims=None, region_strs=None, **kwargs)[source]

Plot CCA correlations for multiple pairs of regions

Parameters:

means – list of lists; means[i][j] contains the mean corrs between regions i, j
stderrs – list of lists; stderrs[i][j] contains std errors of corrs between regions i, j
n_dims – number of CCA dimensions to plot
region_strs – list of strings identifying each region
kwargs – keyword arguments for plot

Returns:

matplotlib figure handle

plot_pairwise_correlations_mult(means, stderrs, colvec, n_dims=None, region_strs=None, **kwargs)[source]

Plot CCA correlations for multiple pairs of regions, for multiple behavioural events

Parameters:

means – list of lists; means[k][i][j] contains the mean corrs between regions i, j for behavioral event k
stderrs – list of lists; stderrs[k][i][j] contains std errors of corrs between regions i, j for behavioral event k
colvec – color vector [must be a better way for this]
n_dims – number of CCA dimensions to plot
region_strs – list of strings identifying each region
kwargs – keyword arguments for plot

Returns:

matplotlib figure handle

bin_spikes_trials(spikes, trials, bin_size=0.01)[source]

Binarizes the spike times into a raster and assigns a trial number to each bin

Parameters:

spikes (Bunch) – spikes object
trials (Bunch) – trials object
bin_size (float) – size, in s, of the bins

Returns:

a matrix (bins, SpikeCounts), and a vector of bins size with trial ID,

and a vector bins size with the time that the bins start

split_by_area(binned_spikes, cl_brainAcronyms, active_clusters, brain_areas)[source]

This function converts a matrix of binned spikes into a list of matrices, with the clusters grouped by brain areas

Parameters:

binned_spikes (numpy.ndarray) – binned spike data of shape (n_bins, n_lusters)
cl_brainAcronyms (pandas.core.frame.DataFrame) – brain region for each cluster
brain_areas (numpy.ndarray) – list of brain areas to select
active_clusters (numpy.ndarray) – list of clusterIDs

Returns:

list of numpy.ndarrays of size brain_areas

get_event_bin_indexes(event_times, bin_times, window)[source]

Get the indexes of the bins corresponding to a specific behavioral event within a window

Parameters:

event_times (numpy.array) – time series of an event
bin_times (numpy.array) – time series pf starting point of bins
window (numpy.array) – list of size 2 specifying the window in seconds [-time before, time after]

Returns:

array of indexes