Abstract
In this paper, we propose a simplified and robust model for place cell generation based on the oscillatory interference (OI) model concept. Aiming toward hardware implementation in bio-inspired simultaneous localization and mapping (SLAM) systems for mobile robotics, we base our model on logic operations that reduce its computational complexity. The model compensates for parameter variations in the behaviors of the population of constituent theta cells, and allows the theta cells to have square-wave oscillation profiles. The robustness of the model, with respect to mismatch in the theta cell's base oscillation frequency and gain—as a function of modulatory inputs—is demonstrated. Place cell composed of 48 theta cells with base frequency variations with a 25% standard deviation from the mean and a gain error with 20% standard deviation from the mean only result in a 20% deformations within the place field and 0.24% outer side lobes, and an overall pattern with 0.0015 mean squared error on average. We also present how the model can be used to achieve the localization and path-tracking functionalities of SLAM. Hence, we propose a model for spatial cell formation using theta cells with behaviors that are biologically plausible and hardware implementable for real world application in neurally-inspired SLAM.
Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
Most animals, including humans, have the capability to navigate through complex environments. They can keep track of their locations relative to their homes, and can plan and memorize paths as they advance toward their targets. When exploring unfamiliar environments, animals can also store memories of travled terrains, locate scents and visual cues and, ultimately, assign meaning to various aspects of the environment. Such capability is of interest to the robotics community and is known as simultaneous localization and mapping, or SLAM, which is a computational problem aimed at constructing a map for an unknown environment and localizing the robot in that environment while exploring it [1].
SLAM has long attracted interest in mobile robotics, and typically gets inputs from heavily over-sensored robots. It often uses complex mathematical algorithms, requiring high computation rates from high energy consuming systems. However, animals can achieve high-performance SLAM without, to our understanding, the ability to perform any overt mathematical computation [2–6]. Neuroscience researchers are attempting to explain how the nervous system encodes the spatial environment in which we navigate, and how that contributes to path planning and navigation [7–11, 19–23]. We believe that neuroscience research can inspire more efficient SLAM implementations compared to traditional approaches emerging from the robotics community. The key metric for efficiency will be compactness of the models, simplicity of the computations, and ease of hardware implementation.
Many nerve cells located in the hippocampus have been found to encode location or features in the environment and participate in navigation [2–5]. Distinguished by their behaviors, two types of such cells were termed 'grid cells' [2] and 'place cells' [3]. In particular, in 1971, O'Keefe discovered place cells in the hippocampus of the rat, which firing rate increase only when the animal is within a particular spatial location, defined as place field [2]. In 2005, Moser found grid cells that are activated periodically and regularly as the animal explores a given space [3]. These results were significant enough to warrant a Nobel Prize in 2014. As animals explore unfamiliar environments, new location-specific cells and maps are generated and remembered. Grid and place neurons are believed to play a critical role in spatial encoding and subsequently, localization and navigation functionalities [4–6].
To explain how the place and grid cells may emerge from the known oscillatory functions of neurons in the hippocampus, the oscillatory interference (OI) model was introduced by Burgess et al [7]. It attributes the signal of the place and grid cells to summations of lower-level neurons, named 'theta cells', which have specific frequency tunings to head direction and speed of movement in space. This was further supported by Welday et al discovery of the theta cells' frequency response to the animal's travel speed and direction [8]. The model presented in this paper will leverage these results because of their elegance and ease of implementation in hardware.
Another class of models used to explain the formation of place and grid cells is the continuous attractor networks (CAN) model which generates grid cell activity based on a collection of cells that projects and receives unsymmetric inhibition connections with their surrounding neighbors, as well as distinct directional tuned inputs for each of those cells [9–11].
Enlightened by these findings, many biologically inspired SLAM algorithms, such as NeuroSLAM or RatSLAM, offer methods that operate with limited small number of sensors and consume low power [12–16]. NeuroSLAM promises performance that can mimic living organisms, with the potential to surpass the performance of current computational approaches. While previous neural inspired SLAM implementations take inspiration from general behavior models of spatial encoding and navigation in rat hippocampus and para-hippocampal regions, their implementation, which is mainly in hardware, still employs complex computational techniques [17]. Hence, by failing to model the elegancy of lower-level neural circuitries found in the hippocampus, these models are typically cumbersome and cannot take advantage of the computational and power consumption efficiency offered by neuromorphic systems, in both software and hardware incarnations.
One reason that the traditional SLAM approach has shied away from the neuromorphic approach is because the latter may require a large and complex network of neurons [10]. However, the neuromorphic approach, particularly when implemented in hardware, can take advantage of abstractions that will result in smaller hardware complexity. Consequently, we follow a philosophy of abstracted neuromorphism in this work, rather than of direct neural mimicry.
As an example of the type of abstracted neuromorphic approach that we have taken, we can consider the method for implementing the theta cells, which are the velocity-sensitive neurons found in the hippocampus. These theta cells, which are required by the OI model to generate grid cells [7], are assumed to have a base oscillatory frequency that varies according to a cosine tuning curve that scales according to the dot-product between the input velocity vector and the preferred velocity vector of the cell, as indicated in equation (1). In Welday et al proposed model for place cell formation [8], a uniform gain for the frequency change as a function of travel velocity, and a uniform base frequency, are necessary when interfering independent oscillators with distinct preferred velocities. However, it is unrealistic to expect biological neurons to have these uniform behaviors. Similarly, it is impossible for different circuit modules implementing theta cells, particularly when implementated with analog circuit, to have the same base frequency and vary in the same way when the inputs are varied. Hence, any model that uses hardware neurons or oscillators must provide a pathway to reduce the impact of these non-uniformity among various theta cells.
The CAN model offers its own implementation challenges. In the CAN model, a collection of oscillating cells' frequencies are regulated by input velocity and their weighted inhibitory connections with its surrounding cells, as shown in figure 1, making the interconnections very complex for hardware implementation. Furthermore, as a population based model, scaling down the number of such cells to reduce the complexity would reduce its robustness or even malfunction, which places a burden on system complexity.
In the following sections, we introduce and compare our proposed neuromorphic multiplicative OI model with the prevailing models for spatial cell formation. To take ease of implementation and biological fidelity into account, we tested both our model and other OI models [7, 8] with variations among theta cells behavior (i.e. idle frequency and velocity-tuning response gain errors), and find that they initially are very sensitive to these variations. We subsequently improve our model by formulating a method to reduce its sensitivity to the offsets in the idling frequency and the gain of velocity-tuning. Ironically, through simulations, we also find that the variations in the idling frequency and gain of the velocity-tuning contribute to the uniqueness of place cell's spatial field! Hence, the variations become a feature, and not just a bug. Moreover, using our model, we also demonstrate a path-tracking mechanism that includes path-integration error reduction using a phase-resetting technique, which is a well know problem with 'dead-reckoning' based path tracking [24]. Lastly, we discuss prelimary findings of the neuromorphic implementations of our model and future work. Hence, we describe a model for spatial cell formation using hardware theta cells, or silicon velocity-tuned oscillators, for real world application in neurally-inspired simultaneous localization and mapping.
2. Backgrounds
As mentioned previously, there exist two major theories on how place and grid cells generate their response to an animal's location. The first model, the OI model proposed by Burgess et al concisely describes which grid cell patterns are formed with a summation of distinct theta cells' signals, with each theta cell having a distinctive preferred direction. Individual theta cells oscillate at a frequency that scales linearly with the dot-product between its preferred velocity and the travel velocity. The OI model has a nice modular structure in which each grid cell is independent, and the operation for creating grid cells requires only summation and thresholding. Furthermore, the model has a 'Fourier' construction nature, which allows the place and grid cells to be more compact when more base theta cells are introduced in the summation. However, the model expects each theta cell to have the same base frequency of operation, e.g. 8 Hz, and is susceptible to mismatch between theta cells' behaviors. When the theta cells have different base frequencies and/or different tuning gains, the model does not produce compact place and grid cells, and they may have multiple response lobes. However, Welday's discovery [8] that biological theta cells vary in their frequencies, either idling or under movement, is not surprising. Hence, if theta cells are implemented as ring oscillators as they suggested, then it would be difficult to maintain an array of independent ring oscillators with both the same oscillation frequency as well as individual frequency response curves. Figures 2 and 3 conceptually demonstrate the structure of two OI model implementations, along with detailed discussions in sections 3.1 and 3.2.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageThe other prevailing model is the continuous attractor network (CAN) model. It is a population based model where each cell needs the interconnection around its neighbors to operate. In the CAN model, each cell has a unsymmetric inhibition connection to its neighbors based on a preferred direction. Each cell also receives an theta cell input that is stronger when the subject is traveling along the preferred direction, as illustrated in figure 1. Thus when the subject is traveling along that preferred direction, the cell will become more active then push the pattern toward that direction through its unsymmetric inhibition projecting to the neighboring cells. The model maintains stability through the population that has various preferred directions that cancel out the asymmetry and exhibits an equilibrium pattern overall. The downside is also obvious since it needs a population to function which is not a modular design. Additionally, a large number of carefully weighted inhibition interconnection is also needed to form such a network, making it less attractive to a neuromorphic implementation.
After evaluating the two models on computational complexity and robustness to variations, we adopted the OI model for hardware implementation due to its simplicity, modularity, and affordance to error correction. As listed in table 1, the OI model only requires excitatory type connections from the oscillating theta cells to the potential grid and place cells, while the CAN model requires both excitatory and inhibitory connections. More importantly, the grid cells in CAN model is population based and can only function with other grid cells, meaning that a large number of connections and theta cells have to function collectively to ensure a grid pattern. On the other hand, the very interference of the theta cells in the OI model naturally produces grid patterns, hence making a single grid cell model is possible with a few theta cells. Unlike the CAN grid cells, the OI grid cell does not depend on other grid cells for its existence. This allows the implementation to be modular, which makes it easier, i.e. requires less complex networks, to scale to larger numbers of spatially distributed grid and place cells and to debug. Furthermore, should some part of the components have defects, the robustness of the OI model means that the defective parts can be easily discarded without drastically affecting the behavior of the entire network.
Table 1. A comparison of properties of OI, CAN, and proposed model regarding neuromorphic implementations.
OI | CAN | Proposed model | |
---|---|---|---|
Grid cell model structure | Single cell modular | Population map | Single cell modular |
Sensitivity to theta cell variations | High | Moderate | Moderate |
Type of network connections | Additive excitatory | Additive excitatory and inhibitory | Multiplicative/logical |
Network topography | Feedforward | Interconnect and feedforward | Feedforward |
Weight connection | Optional | Necessary | None |
Literature suggests that grid cells plays a crucial role for formation of place cells to achieve path integration when visual cues are absent [18, 19]. Solstad et al [20] and Blair et al [21] proposed models that form place cells by summing sophisticatedly weighted grid cells, suggesting a hierarchical structure from theta cells to grid cells then to place cells. However, such weighting imposes another burden on circuit design, especially when a large number of place cells are needed to tessellate a space. Hence, a simplified place cell construction approach is proposed in this paper. In addition, we also discuss a explanation for the necessity of such hierarchy.
3. Methods for grid and place cell formation
3.1. OI models under ideal conditions
In order to preserve simplicity, our model takes the OI model as its fundamental structure. As predicted in [7, 22,23,25,26] and later discovered by Welday et al [8], the basic component of the OI model originates from the theta cells signals that act like velocity-controlled oscillators. Each theta cell has an oscillating frequency that is modulated by the animal's movement direction and speed, as shown in equation (1), in which the firing phase accumulation is synchronized with displacement along a preferred direction.
Here F denotes the oscillation frequency of the theta cell, which depends on the dot-product between its preferred velocity and the animal's travel velocity . β is the frequency response factor or gain, which relates the traveling speed with oscillation frequency, with units of Hz/unit speed. The idling frequency Fidle denotes the frequency when the input velocity is zero. Welday et al experimentally observed that theta cells have various preferred velocity vectors, and believed that they generate the fundamental signals for spatial encoding through oscillatory interference.
3.1.1. Oscillatory interference to form grid cells
The core concept of the OI model is to form place cells and grid cells simply by summing one theta cell's velocity-regulated oscillation and another reference oscillator with the idling frequency Fidle, using the standard trigonometric identity below:
The cosine containing the difference between the two frequencies would represent one theta cell's deviation from either a reference cell or another theta cell. Burgess et al [7] suggested that forming grid cells is possible, and indeed concise, by interference between merely a few theta cells. A one-dimensional grid cell with a periodic response to traveling in the theta cell's preferred direction can be formed by just interfering that theta cell with an idling frequency, or, equivalently, with a theta cell with 0-valued preferred velocity, resulting in the interference pattern shown in figure 2 and its signal below:
Each point on the spatial firing pattern figure is calculated by traveling along a direct path from the origin to that point with a preset fixed speed , then plotting the instantaneous signal value at that location through equation (3). The vertical gratings signify that this grid cell will fire periodically while the subject is traveling in its preferred direction, as the gratings will always be perpendicular to the preferred velocity vector. According to equation (3), the resulting signal will have a low-frequency component that depends only on the theta cell's frequency response to an input velocity. After forming the grating, it is straightforward to form the hexagonal grid cells by interfering with an additional theta cell which has a preferred direction, say 60° apart, to form an interference pattern that resembles the response of grid cells, shown in figure 2.
3.1.2. Oscillatory interference to form place cells
Although many researchers suggest that place cells are formed by grid cells, Welday et al proposed a hypothesized method for forming place cells directly from theta cells. If we focus on the low-frequency component, i.e. the envelope, of the interference result in equation (3). The overall envelope of the interference result from N theta cells can be described by equation (4).
Here, is the envelope evaluated at the spatial location x through a vector with respect to the origin, wn is a weighting factor, and i is the imaginary unit to make use of Euler's form for the phase . It is the phase accumulation that deviates from the idling frequency, as mentioned in equation (3). It can be computed as equation (5), with an additional initial phase shift to add a degree of freedom for distinguishing .
Welday et al suggested a possible setup for place cells. By interfering 12 theta cells with a uniform weight of 1 and a uniform coefficient β but with preferred directions spaced 30° apart, a place cell can be formed, using a properly chosen threshold, with its pattern shown in figure 3 with initial theta phases calculated from equations (4) and (5). Specifically, at the desired location x, set equal to 1 then solve for for the nth theta cell's initial phase through equation (5).
Though under Welday's model, a place cell can be formed with a population of theta cells with various preferred directions other than those shown in figure 3, issues arise when constructing such a system. First, the restrictions on the setup of preferred directions of theta cells is unclear. For example, the minimum number of directions and their orientations are not specified. Second, the nonlinear thresholding operation requires a heuristic value for the threshold that introduce ambiguity, where a threshold too high will reduce the place field quickly whereas a value too small results in a number of side lobes. The latter is not easily resolvable if the oscillations are to be implemented as square waves. Since the square wave essentially normalize all values of a sinsoid wave that is greater than zero, the interferences lined up at the peaks have no difference from those that just have positive overalappings. This losing of uniquess will cause side lobes even after thresholding, as shown in figure 4.
Download figure:
Standard image High-resolution imageAnother important aspect of the model requires a homogenous oscillation at the idling frequency of the other speed dependent theta cells. Such synchrony might be realized in the neural circuit through shunting inhibition as suggested in [27] for the case of gamma oscillation and in [28] for pyramidal neurons both in the hippocampal subfield CA3 where place cells resides. Shunting inhibition is a type of inhibition that can regulate the membrane potential multiplicatively, resembles the logical AND operation.
3.1.3. Multiplicative interference and a constructive interference method for generating place cells
To simplify hardware implementation complexity, and to remove ambiguities regarding thresholds, we adopt the logical AND operation for interference rather than addition then thresholding. Such cumulative ANDing procedure enforces constructive interference of all selected theta cells at the location of interest, while any other side lobe will be cleared to zero by any theta envelope that is not 1 due to the multiplication in equation (9). We also assume that a hardware-emulated theta cell is likely to output square waves rather than sinusoids. In addition, quantized phase shifts are adopted here to better emulate both the neural circuitry and hardware realization of theta cell as ring attractor [29, 30] or oscillator. The AND interference operation can then be made analogous to the multiplication of two sinusoids, with the following form
In the case of the logical AND operation, the one-half coefficient would drop off, resulting in only a superposition of the sum and difference of the frequencies, instead of modulation which made separation of the two components easier. Aside from that, the result of the envelope function is identical to that of the summation method.
To simplify the interference, using fewer theta cells, and to reduce the complexity of phase computation, we adopt a construction method for grid- and place-cell formation inspired by the grid cell construction method from Burgess et al [7], which avoids the weight computation needed by the models of Solstad et al [20] and Blair et al [21]. Furthermore, unlike the structure proposed by Welday et al, we form place cells from theta cells with different frequency response factors. By introducing another degree of freedom, we can reduce the number of theta cells needed and make the model more flexible and robust. We start by manipulating the simplest interference pattern where interference happens between one idling frequency oscillator and one theta cell, as shown in figure 5(a); this pair acts as a one-dimensional grid cell. Then, we form just one stripe in the field of interest by interfering gratings with the same directivity but different frequency response factors, as shown by the top and bottom rows of figure 5. This process can be viewed as summing multiple sinusoidal waves with different frequencies to form one peak. However, the Fourier analysis on either an impulse-, sinc- or Gaussian-shaped place field would result in a continuous spectrum, which is not realistically possible from a limited number of oscillations. So, we step back to redefine a place cell as being a periodic signal with very large wavelength relative to our area of interest. According to [5], the peak separation L for the grating pattern can be computed with equation (7), where β is the frequency response factor for the theta cell in equation (1).
Download figure:
Standard image High-resolution imageHowever, covering large L with small β directly is difficult in hardware implementation, and requires precise control. So instead of applying a particular value of β directly, we propose an interference strategy that can achieve a large value for L as well. Consider β1 = pβ and β2 = qβ, which results in the pattern having p or q peaks before reaching L. Then, the sum of the two associated signals will have their first lined-up peak at the least common multiples of p and q. Adding more such oscillators with mutually prime frequency response factors (i.e. not multiples of the others), can create gratings with very large periods, as shown in figures 5(d) and (h), which are generated from three theta cells with distinctive response coefficients β. This principle has been proposed in [19, 34] for OI models.
After forming a single stripe within an acceptable region, we can offset the stripes, by adding a constant phase term, in the direction of its preferred velocity vector to guarantee that a firing stripe crosses through the desired location. This is similar to constructing a surface from two orthogonal bases. To compute the phase term, we treat the desired location point, denoted as in polar coordinates, as a vector projected onto the preferred velocity vector to compute its angular frequency of oscillation along the preferred direction, . Then, we assign one of the eight quantized phase fraction values φi , which range from 0 to 7, to adjust the oscillation signal to have a peak close to the desired point, referring to equation (8). The resulting equation for the stripe pattern is illustrated in (9), where Fi is the frequency response of the ith theta cell as defined in equation (1). Here the multiplication resembles the multiplicative effect of the shunting inhibition and the logical AND operation should the result of the term be digitized to 0 or 1 based on the sign of the value at any time instance.
If applying the phase computation to all the oscillators used to form a single stripe in the same direction, we guarantee that the stripe will form along the perpendicular direction and that it passes through the desired point. Repeating this same process for oscillators with other directivities, we can intersect their stripes at the desired point, resulting in a single-place field as shown in figure 5(j), where we set the other group of theta cells to have a 60° difference in their preferred velocity direction. The phase shift operation hence replaces the weight computation problem with a phase connectivity problem, or a binary weight system where only one of the eight phases of each theta cell will be weighted 1.
As shown by the figures, the constructed stripes and place cells have small ripples within, that reflect the high-frequency component in equation (3) and provide a potential explanation for how the phase procession exists [7]. But, for the purpose of spatial tuning, we only consider the low-frequency component, i.e. the spike envelope, which is a standard process of the OI model. In further discussions, we would filter the high-frequency component out. Thus, we implement an envelope-detection or low-pass filter stage, much like the integration process of synapses in a biological system. Eliminating the high-frequency component creates a solid place cell that reacts to its entire place field, which is important when the number of interferences increases: as shown in figure 5, the trough of the high-frequency components of the two stripes would destructively interfere in the AND process, creating an intersection with a very small positive region. Thus, for our demonstration purposes, we define the interference operation in the frequency domain as follows:
The proposed model of forming place cells is very simple and powerful since it only needs two stripes as a basis. Moreover, there is no weighting, thresholding, or memory necessary for the interference operation, which can be implemented as combinational logic. The number of theta cells with distinct frequency response factors needed is related to how large a region of interest is needed, since they govern the periods of the gratings. The model could be used to form any number of place cells by applying different sets of constant phase terms to the same theta cells' oscillation setup. This allows us to implement the model in hardware with a rudimentary feedforward network, where a small number of voltage-controlled oscillators can serve as inputs to generate any place cell.
Additionally, the whole process for the OI model only comprises ANDing operation, the interference for forming a single stripe for different directions can be interchanged to form grid cells as shown in figures 5(i)–(iii) through red dashed path. This is an interesting discovery hinting at how the grid cells can come into existence. But it also raises the question of whether the existence of grid cells is required: interference could be done once at the place cell directly, without the need for middle steps. We will, however, discover another crucial reason for why grid cells are necessary when we start to introduce variations into the OI model, reflecting the variations inevitable in both neuromorphic hardware implementation and biological systems. We will also demonstrate that the constructive methods can even take advantage of theta cell variations to form place cells having unique place field with minimal side lobes over a large region.
Download figure:
Standard image High-resolution image3.2. OI models under theta cell variations
3.2.1. Impact of theta cell variations
Although all the models above could generate reasonable grid- and place-cell firing patterns, they are all based on the theta cell model described by equation (1). This model, however, is very likely to be an oversimplification. In research on theta cells in rats [8], and when implementing theta cells as ring oscillators in silicon, the frequency response factor β cannot be exactly uniform across all theta cells. More importantly, Fidle also varies between theta cells. Thus, we adjust the theta cell equation with a constant offset frequency term, as shown in equation (11).
Under such variation condition, the idling frequency Fidle is replaced by the sum between an average frequency FAvg that is computed over the whole population of theta cells under zero-velocity input, and an offset frequency Foff_i for each theta cell to represent variations. This offset term does not depend on travel velocity, and significantly alters how the phase accumulation model represents space. This perspective implies—or tacitly assumes—that phase accumulation deviation happens even when the input velocity is zero. If one theta cell interferes with another with an idling frequency at FAvg as suggested by Burgess et al, the resulting frequency will be:
and the signal in the time domain becomes
Thus, after interference involving an average frequency, the phase accumulation is no longer solely dependent on the first term of a displacement vector but also on the second, time-dependent, term. On a one-dimensional interference pattern graph, the phase accumulation induces curvature in the grating as shown in figure 5(a), corresponding to the greater travel time needed to reach points farther from the origin, so that the time-dependent term takes precedence. The overall frequency after interference is also adjusted by Foffset and can result in a shrinkage or expansion of the inter-peak distance L, resulting in the deformation of both its geometric structure and scale as shown in figure 7 with a conceptual signal demostration in time domain shown in figure 6.
Download figure:
Standard image High-resolution imageSuch patterns and the equation imply that the OI model under the influence of an offset idling frequency will not provide a unique pattern that depends only on the spatial location through velocity integration. Differing routes and varying speeds during travel could disturb the phase accumulation through the time-dependent term, resulting in a completely different set of phase inputs for the place cell, so the place cell would not respond to its designated location. Moreover, the variations in the frequency response factor β even prevent the place cell generation model proposed by Welday et al, where a set of theta cells with uniform β is needed. Furthermore, as shown in figure 7(d), merely introducing the offset frequency compromises the formation of the place cell.
We do not consider such variation merely an artifact of analog hardware implementation: achieving a totally uniform idling frequency across all theta cells, yet retaining their ability to change frequency independently, is not consistent with biological behavior either. So, we decided that an improved version of the OI model for forming grid cells and place cells is critical to understanding the nature of hippocampus spatial encoding, and critical also for feasible and robust implementation in hardware. Along with the development of this improved model, we present a potential function for the grid cells below.
3.2.2. Improved OI model for variation compensation
The variations among the frequency response factors βi do not affect our proposed construction method for place cells. In the ideal scenario, we need to deliberately construct a set of mutually prime βi to form a unique stripe in the region of interest in figure 5. Under the condition of variations, the different values of the βi naturally construct a large lowest common multiple that results in a large L, covering a large distance. In other words, our proposal does not require the capability to program the frequency response factors, as long as the actual βi can be characterized. We believe that this property simplifies the place cell model and removes substantial burden from hardware design, which we illustrate with a simulated scenario in the results section below.
The additional offset for idling frequencies across theta cells for interference poses a more significant challenge to the principal idea of the OI model, with respect to achieving path integration through phase accumulation. In our improved model, we propose the solution in two steps: first, we propose an offset-reduction strategy of theta cells to reduce the impact of frequency offsets. Then, we modify the phase-computation equation to accommodate the residue of the offset frequency.
Inspecting the oscillatory interference equation (3), we can observe that interference with an idling frequency signal has the effect of removing it from the oscillation, so that the resulting signal frequency depends on the product of speed and time, which is effectively the displacement, as shown in equation (10), in the ideal case when Foff_i = 0. A similar approach could be applied to remove the offset frequency. Implementing that in the basic structure of the OI model would preserve its simplicity, and thus lead to simpler hardware requirements.
Our first step in removing the offset frequency is to pair oscillators with the closest idling frequency whose interference eliminates the magnitude of the offset frequency as much as possible, as shown in equation (14). However, if we just set the second oscillator to have a preferred velocity of [0, 0], it is possible to run into the circumstance that, at a certain input speed, the first oscillator's frequency becomes equal to the second oscillator's idling frequency, so that the superposed oscillations generate a stationary pattern that cannot represent the distance traveled. To avoid this, we set the second oscillator to have a preferred velocity vector 180° apart from the first one. In this configuration, we effectively obtain a new oscillator that operates as shown in equation (14), with the structure as in figure 8, configured so that Foff_a > Foff_b .
Download figure:
Standard image High-resolution imageThe frequency response factor is summed so to make the velocity-dependent term more predominant for the effective oscillator, while the offset frequency is suppressed. Figure 9 illustrates the suppression effect of offset-reduction between a pair of theta cells with preferred directions 0° and 180°, and offset frequencies 0.5 Hz and 0.6 Hz, respectively. This kind of offset-reduction strategy could be repeated to obtain an acceptable offset frequency for phase computation, in terms of the number of theta cells needed. The number of effective theta cells would then be divided by 2N where N is the number of layers. Referring to the previous discussion, the resulting effective frequency response factor can be of any value, and contributes to the feasibility of the pairing process in the framework of our construction method for place cell generation.
Download figure:
Standard image High-resolution imageA similar remedy for idling frequency drift was proposed in [33], where the authors introduce a ring attractor oscillator that is entrained to oscillate at the mean frequency of a group of theta cell's oscillating frequency and could provide an alternative explanation of grid cell through a group of three theta cells with 120 degree difference in preferred directions. Our offset-reduction method offers noise reduction through a similar concept, but without the need for an extra oscillator that needs connections from other theta cells to be trained. This maintains the independence between theta cells and keeps the place cell network feedforward only. Moreover, as shown in equation (14) and in figure 9, the output signal of the proposed offset-reduction approach behaves similarly to a theta cell. This property allows multiple layers of such reduction to be concatenated as required by the actual implementation, as shown in figure 8 with N layers. Then grid cells could be formed by treating the output of offset-reduction signals as effective theta cells, with the approach in figures 5(i)–(iii) following the red dotted line. The resulting pattern not only resembles the firing pattern of a grid cell, but also realizes two layers of offset reduction.
Though the magnitude of the offset frequency is reduced, the remainder, , would still contribute to displacement-independent phase accumulation, thus affecting the place cell's spatial encoding accuracy. To mediate this problem, we adjust the phase computation applied to the effective theta oscillator so that it takes the effective offset frequency into consideration, resulting in equation (15), where the phase shift φi of the ith effective theta cell is computed by an effective βi = (βa + βb ) and an effective .
The combination of the offset-reduction and modified phase-shift computation allows us to form place cells in a way that handles variations in both frequency response factors and idling frequencies. To design our system succinctly, we perform AND operations between all the interference signals. This means that the theta cell with the largest frequency response factor β, and thus the thinnest stripes according to equation (4), will determine the size of the place field. Due to the complication of the introduced offset-reduction, the size of the place fields is controlled not through the individual theta cells, but through the effective oscillator after offset-reduction.
4. Methods for path tracking
With the capability of forming a place cell at any spatial location with realistic performance of theta cells, it is possible to produce a number of place cells to tessellate the space for location- and path-tracking. Based on the size of the place field of the place cells formed and the region of interest, the required density of the place cells can be assessed. This allows for at least one place cell to be active during a tracking event, which enables the system to keep track of the subject's location based on which place cell fires in an egocentric frame.
As stated before, the phase accumulation for a place cell has a second term 2πFoff t from equations (13) and (14), that is only dependent on the time involved in traveling from origin to the place cell's designated spatial location. This makes the place cell function only when the subject is traveling with the preset speed and route, such as a straight line as assumed in equation (12). When traveling at a different speed, the place cells near the origin would still track normally because the time needed to travel to those locations is small, and not much phase is accumulated by the offset frequency. Thus, for the collection of place cells to function, we imposed a restriction that, during each tracking event, the velocity is held constant.
However, it is natural to change velocity while navigating through the environment, thus, tracking across variation in both speed and direction is crucial for reaching a desired target and path encoding. But the proposed place cells will only function properly when two conditions are met: velocity is constant, and the tracking does not persist for too long, due to the periodicity of the place cells and noise. To resolve this conflict, we introduce a phase-reset mechanism for the collection of place cells and their contributing grid and theta cells. It will pull all the theta oscillators back to a synchronized zero phase, which is equivalent to clearing the phase accumulation and bringing the tracking back to the origin, setting up a new egocentric frame. Upon this resetting, the latest-fired place cell will be recorded. The reset will be triggered by two events: if the velocity changes, or if the time since the last reset has reached a preset interval which shall be smaller than the place cells' common period. Similar phase resetting phenomena have been observed in theta rhythm of hippocampal cells, though without clear evidence of whether it correlates to a change in velocity as we are proposing here [31]. In both the OI model and CAN model, phase resetting is also introduced to clear off the effect of path integration accumulated error in both the direction and magnitude estimation of the travel velocity [7, 8, 10], and could serve the same purpose in this model as well.
Referring to equation (15) for place cell phase computation, the place cells' connections only depend on the speed, due to the nonzero offset frequency term, which means that one collection of place cells could track travels with a constant speed regardless of direction. A new collection of place cells is then necessary for tracking under different speeds, because of the different times it takes to reach a specific point resulting in different phase accumulations through the term 2πFoff t. As a result our phase shift computation equation is dependent on the magnitude of velocity. So, in our model, a set of precomputed collections of place cells is used for tracking. Each collection is differentiated by the speed it accommodates. Upon the phase-reset event, if the velocity changes only in direction, the tracking will still happen within the same collection of place cells. But if the magnitude changes, then tracking needs to be taken over by another collection of place cells. Then, the place cell that gets recorded will implicitly contain the speed information while traversing through this segment. And because we assume the same number of place cells to cover the region, and they operate with the same time interval, each collection operates on a different spatial scale and accuracy as well. This is, potentially, a nice feature: while the speed is slow, the place field is smaller and thus more accurate in tracking and recording, and vice versa. Moreover, this change of scale will also be reflected in the grid cell model, resulting in the grids having different spatial densities, which provides a potential explanation for parallel multi-scaled grid cells observed in rodents [25], and further support the models that take the advantage of multi-scaled grid cells for place cell formation [16, 29].
5. Results
The performance of the proposed model is now assessed in simulations, assuming Gaussian distribution of both frequency response factor β and offset frequency Foff. Though the 60° difference between the two sets of theta cells (as demonstrated in figure 5) is feasible, we want to emphasize the implementation's friendliness. Thus, in the performance assessment, we adopt a 90° difference in the preferred vectors to better correspond to the standard Cartesian coordinate system. Two sample place cells generated with our proposed model with β and Foff generated from their respective normal distributions are shown in figures 10(a) and (b). We then compute the mean squared error of the spatial pattern between each trial and an ideal place cell created through Welday et al method, as shown in figure 10(c). For each configuration of specified number of theta cells and standard deviation of offset frequency Foff, 100 trials were generated and an average mean squared error is plotted in figures 10(d) and (e).
Download figure:
Standard image High-resolution imageTwo setups of the experiment are shown here. In figure 10(d), we choose 3 mean values ( Hz per unit), with the same deviation (0.2 Hz per unit dot product) for the βi to simulate a similar construction setup as in figure 5. We then vary the number of theta cells from 16 to 48, and the standard deviation of the offset frequencies from 0 to 2 Hz, with a mean frequency of 8 Hz. The results in figure 10(d) show that, with a small number of theta cells, trials with small deviations of Foff would have similar errors as large deviations. Yet, due to the uncertainty of the βi , artifacts are likely to occur while forming a single stripe, resulting in several side stripes like the thin stripes in figures 10(d) and (h), which create error. Increasing the number of theta cells, a larger population of βi provides a more diverse spectrum basis that helps the formation of single stripes per direction, similar to the Fourier decomposition of a delta function, thus driving down the error. Moreover, the trials with larger deviations in offset frequencies introduce bending to the gratings, making overlapping at side stripes even less likely, which reduces the error even more with the sample place cell shown in figure 10(a). On the other hand, the offset-reduction strategy with a larger pool of choices confines the effective offset frequencies to small values that will not alter the directivity of the gratings as seen in figure 7(a). Together, the least amount of error occurs when there is a large deviation in the Foff with a large number of theta cells, suggesting that, under our proposed model, variations among the theta cells are in fact a contributing factor for accurate place cell formation. This may also be consistent with biological observations, where typically a large number of neurons converge to implement any particular functions in cortex, coincides with the place cell being a pyramidal cell [3, 5, 19].
This phenomenon can be further illustrated by the second experiment where βi are generated from a single normal distribution with a mean of 1 Hz per unit, and a standard deviation of 0.2 Hz per unit, shown in figure 10(e). The variations alone form a unique stripe and, further, a place cell in the space. It produces very similar results when there is a greater number of theta cells available; in these cases, trials with high deviations for offset frequency exhibit an advantage over those with lower deviations. But, unlike the previous setup, when the number of available theta cells is small, the trials with higher offset frequency deviations also prevail, since the model can only rely on the bending of stripes to avoid the formation of side stripes or dots when the diversity of β values is restricted. This restriction continues to affect the accuracy even when the number of theta cells increases, resulting in a larger place field as in figure 10(b), and also a higher overall MSE than the previous setup. This result further demonstrates that the combined effect of diversity in both frequency response factors and offset frequencies contribute to a unique place field for the place cell in our model.
To assess localization, we first need to populate the area of interest with place cells. We use the same theta cell setup from the first experiment—48 theta cells and a 2 Hz standard deviation—to conduct the localization and path tracking experiment. Based on the size of the place field, as shown in figure 10(a), we can determine the density of place cells to cover the region. In the following simulations, we generate a collection of 20 × 20 grid of place cells to tessellate a 2D space where coordinates x and y range from −1 to 1 (figures 10(a)–(c)), so at least one place cell is available to fire at any given location. Such coverage enables the system to keep track of the location of the subject based on which place cell fires in an egocentric frame.
A localization event is demonstrated in figure 11, where we generated three sets of 20 × 20 (i.e. 400) place cells. The three sets are designated for three different speeds of movement. We set a maximum time interval for phase resetting, thus the space that can be covered within that time varies, as specified by the speed of movement, as conceptually shown by figures 11(d), (h) and (j). The place cells assigned to the locations along the travel direction fire sequentially while the subject moves at a constant velocity. If the subject changes velocity, the reset event is triggered, which includes recording which place cell last fired. Each place cell that gets recorded coincides with the mathematical concept of a vector where a place cell encodes its terminal point. Upon each phase-resetting event, the place cells form a new egocentric field and record its relationship with the previous field much as a transformation matrix computes frame relations in robotics, as shown in figures 11(d), (h), (l) and (j). A path can be retrieved by sequentially going through those recorded place cells, just like vector addition. Yet the place cells also have a third dimension to record the speed for each segment, since the offset frequency forces us to use a multi-scale place cell organization that implicitly has a dynamic spatial resolution and speed encoding as discussed in section 6.
Download figure:
Standard image High-resolution image6. Discussion
In this paper, we first present an OI model for constructing place cells through a set of theta cells with mutually prime frequency response factors and two distinct preferred directions. We then improved the model to accommodate the possible variations of idling frequencies and frequency response factors among the population of theta cells. We proposed a novel offset-reduction strategy for interference, and a modified equation (15) to accommodate remaining offset frequencies and variations of frequency response factors. Through simulations, we found that the proposed model benefits from the variations of frequency response factors among theta cells for forming a unique place field. Moreover, we proposed a subsequent localization and path-tracking mechanism, incorporating phase resets, to correct the deviation and drift caused by the variations in idling frequencies and sensory errors.
When experimenting with the model, we adopt square-wave oscillation and quantized phases to demonstrate the model's suitability for hardware implementation. Our team built a mixed-mode theta chip [32] to imitate the behavior of theta cells described in equation (1), which has substantial variations across all theta cells, as figure 12 shows. With the proposed model, variations might be a beneficial feature for construction of place cells.
Download figure:
Standard image High-resolution imageTo better incorporate the square wave oscillation, our model utilizes a multiplicative interference rather than additive. In terms of biology, it resembles the conductance inhibition that has been observed in the hippocampus CA3 [28]. In terms of computation complexity, it can be more simply implemented with a combination logic AND operation in hardware, compared to additive interference which requires two stages of operation and a memory unit. As an example, a place cell formed through Welday et al model using 12 ideal theta cells, i.e. perfectly matched in frequency and gain, needs 11 additions and 1 comparison to compute the status of place cell in a time slice, whereas our model requires 47 Boolean AND operations from 48 non-ideal theta cells that have variations. As a reference, a typical cell in the continuous attractor model requires a weighted inhibition from all the neighboring cells within a certain radius of closeness. Even for a small radius of 4 surrounding layers of cells, 64 multiplications and additions needed to be computed for one timestep for that cell only.
Throughout our model, as many have, we hypothesize a purpose for the grid cells. In figure 5, we demonstrate that grid cells could be formed as a middle step for generating place cells. Combined with our offset-reduction mechanism, grid cells might have a further purpose of pairing two theta cells with closely matched base frequencies together to suppress its impact on the grid cell behavior just as we did. The only difference is that, instead of pairing two theta cells with preferred directions 180° apart, grid cells pair those with 60° or 120° differences. A construction rule for forming the grid cell would then just entail reinforcing synapses from the theta cells with the closest frequencies.
One development of the current model that we plan to investigate is to reduce the connections involved in forming a place cell. As shown in figure 10, it seems that, the more theta cells, the better, when we have a fully connected interference network forming each place cell from all available theta-cell pairs. However, we would like to develop a standard to identify each theta-cell pair's contribution to the place cell, and lesion/prune those that have negative or neutral impact on forming the place cells, so as to reduce the network complexity. We would also implement the model in hardware using our theta cell chip, to further validate the model. The latter will be the subject of a future submission.
Acknowledgments
This work was partially funded via a Cooperative Agreement between JHU and Toshiba Corporation, and a Graduate Student Fellowship to Alia Nasrallah from the Government of Kuwait. The authors would like to thank Akwasi Akwaboah for providing computational resources.
Data availability statement
The data that support the findings of this study are available upon reasonable request from the authors.