Understanding The Convolutional Layer in State Space Models

22 March 2025 in Research

Nu Wave Visualization of traveling waves found in the original Mamba architecture (left) and the variable velocity traveling waves we introduce in the Nu-Wave Mamba model (left). We see the variable velocity model learns exponentially faster and reaches lower error than the original counterpart.

A Spacetime Perspective on Dynamical Computation in Neural Information Processing Systems

24 September 2024 in Research

Spacetime_Inseparability Illustration of the fundamental mathematical difference between standing waves and traveling waves, which are not `spacetime separable’. We suggest these inseparable dynamics in neural activity may have a fundamental role in efficient and generalizable computation.

Relative Representations for Model-to-Brain Mappings

19 April 2024 in Research

Relative_Reps Relative Representations are a method for mapping points (such as the green circle) from a high dimensional space (left) to a lower dimensional space (right), by represeniting it in a new coordinate system relative to a select set of anchor points (red and blue star). In this work we apply such an idea of relative representations to model-brain mappings and show that it improves interpretability and computational efficiency – surprisingly model-brain RSA scores are roughly consistent even with as few as 10 randomly selected anchor points (10 dimensions) compared to the original 1000’s of dimensions.

Natural Inductive Biases for Artificial Intelligence (PhD Thesis)

7 November 2023 in Research

thesis_cover My PhD Thesis, studying the inductive biases that enable the efficiency and generalization capability of natural intelligence, yet unmatched by artificial intelligence.

Image segmentation with traveling waves in an exactly solvable recurrent neural network

5 November 2023 in Research

Relative_Reps Visualization of the phases of a network of locally coupled kuramoto oscillatos, driven by an input image (left), which converge in phase to segment the image into different shapes, with different oscillatory dynamics for each shape.

Flow Factorized Representation Learning

22 September 2023 in Research

ffrl Illustration of our flow factorized representation learning: at each point in the latent space we have a distinct set of tangent directions \(\nabla u^k\) which define different transformations we would like to model in the image space. For each path, the latent sample evolves to the target on the potential landscape following dynamic optimal transport.

Traveling Waves Encode the Recent Past and Enhance Sequence Learning

9 September 2023 in Research

WaveField Illustration of three input signals (top) and a corresponding wave-field with induced traveling waves (bottom). From an instantaneous snapshot of the wave-field at each timestep we are able decode both the time of onset and input channel of each input spike. Furthermore, subsequent spikes in the same channel do not overwrite one-another.

DEUT -- 2D Structured and Approximately Equivariant Representations

28 July 2023 in Research

duet Visualization of the DUET framework. The backbone \(f\) yields a 2d representation for each transformed image \(f(\tau_g(\mathbf{x}))\) (e.g. \(\tau_g\) is a rotation by \(g\) degrees). The group marginal is obtained as the softmax (sm) of the sum of the rows, and is compared to the prescribed target (red) with our group loss \(L_G\). The content is obtained by summing the columns, and contrasted (\(L_C\)) with the other view through a projection head \(h\). The final representation for downstream tasks is the 2d one, which has been optimized through its marginals.

Latent Traversals in Generative Models as Potential Flows

23 July 2023 in Research

poflow Comparison of latent traversals found with our method compared with state of the art baselines (WarpedSpace and SeFa). We see prior work tends to conflate multiple semantic concepts simultaneously due to the enforced linearity of the transformations. In our work, the inherently non-linear nature of the potential flow transformations more accurately disentangles semantically separate transformations.

Locally Coupled Oscillatory Recurrent Networks Learn Topographic Organization

20 December 2022 in Research

Measured orientation selectivity of neurons, as color coded by the bars on the left. We see our LocoRNN’s simulated cortical sheet learns selectivity reminiscent of the orientation columns observed in the Macaque primary visual cortex (source: Principles of Neural Science. E. Kandel, J. Schwartz, T. Jessell, S. Siegelbaum, & A. Hudspeth. 2013.).

T. Anderson Keller