DESCRIPTION
SYSTEM AND METHOD FOR IMPLEMENTING RESERVOIR COMPUTING IN MAGNETIC RESONANCE IMAGING DEVICE USING ELASTOGRAPHY TECHNIQUES
Field of the invention
The present invention relates to a system and method for implementing a specific class of a recurrent neural network algorithm called reservoir computing using magnetic resonance imaging device and its principles in elastography.
Background of the invention
Recurrent Neural Networks (RNNs) are connectionist computational models that utilize distributed representation and nonlinear dynamics of its units. Information in RNNs is propagated and processed in time through the states of its hidden units, which make them appropriate tools for sequential information processing. There are two broad types of RNNs: stochastic energy based with symmetric connections, and deterministic with directed connections.
RNNs are known to be Turing complete computational models (Siegelmann and Sontag, 1995) and universal approximators of dynamical systems (Funahashi and Nakamura, 1993). They are especially appealing for problems that require remembering long-range statistical relationships such as speech, natural language processing, video processing, financial data analysis etc. Additionally, RNNs are shown to be very successful generative models for data completion tasks (Salakhutdinov and Hinton, 2012).
Despite their immense potential as universal computers, difficulties in training RNNs arise due to the inherent difficulty of learning long-term dependencies (Hochreiter, 1991 ; Bengio et al., 1994; and see Hochreiter and Schmidhuber,
1997) and convergence issues (Doya, 1992). However, recent advances suggest promising approaches in overcoming these issues, such as utilizing a reservoir of coupled oscillators (Maass et al., 2002; Jaeger, 2001). Reservoir computing (echo state networks or liquid state machines) alleviates the problem of training in a recurrent network by using a static dynamical reservoir of coupled oscillators, which are operating at the edge of chaos. It is claimed that many of these type of dynamical systems possess high computational power (Bertschinger and Natschlager, 2004; Legenstein and Maass, 2007). In this approach, due to rich dynamics already provided by the reservoir, there is no need to train many recurrent layers and learning takes place only at the output (or readout stage) layer. This simplification enables usage of recurrent neural networks in complicated tasks that require memory for long-range (both spatially and temporally) statistical relationships.
Previously Fernando and Sojakka (2003) implemented reservoir computing in a system consisting of a bucket of water and a camera mounted on top of the water surface. In their approach, the water waves act as the reservoir of dynamical activity that maps the input onto a high dimensional nonlinear space. Using diffusive wave front interactions on water surface produces the necessary nonlinearity in the reservoir. The water reservoir is vibrated by multiple mechanical actuators mounted on the bucket surfaces that act as the input device. It is necessary to encode the set of actual inputs (speech signals, images, financial data etc.) into mechanical actuator commands. The camera captures the water wave evolution caused by actuator vibration, and uses the image properties (edge strength in a N*M grid) as the output of the reservoir. The output vector of the reservoir is then used to train the network using classification methods for speech recognition task ("one" vs "zero" speech recognition). Adamatzky (2001 and 2002) previously analyzed usage of various nonlinear media for computing. Walmsley (2001) and Duport et al. (2012) and Paquot et al.
(2012) proposed optical devices and Adamatzky (2004) proposed chemical reactions for implementing nonlinear medium. US patent 7,392,230 B2 proposed a method for implementing reservoir system using nanotechnology, i.e. molecular interactions modulated by electrical input. In addition to these physical implementation studies, there are a patents that implement reservoir computing in software for specific purposes. Patents US 20130060772 Al and US 8301628 B2 suggest using an echo state network for ontology generation, and patent EP 2389669 Al proposes using reservoir computing based neural network for geodatabase information processing.
Muthupillai et al. (1996) proposed a method for magnetic resonance (MR) imaging elastography, in which the tissue stiffness properties are measured using a setup in an MR machine. In this setup, the tissue is mechanically vibrated using actuators and the acoustic strain waves that propagate in the tissue are captured in MR image using special type of magnetic gradients called "motion sensitizing gradients". The method output is a 3D MR image volume of the acoustic waves that vibrate the tissue. In their experiment, they used a homogeneous phantom gel for emulating tissue. Objects of the invention
The object of the invention is to provide a system for implementing reservoir computing in a magnetic resonance imaging device and utilizing both acoustic and magnetic wavefront interactions, and the inhomogeneity in the phantom gel as the nonlinearity in the medium.
Another object of the invention is to provide a method for preparing the physical medium of an inhomogeneous gel volume that is specifically tailored for the task of the reservoir computer. This method is very similar to unsupervised pre- training of neural networks with unlabeled data (Hinton et al., 2006).
Detailed description of the invention
In Figure 1, the physical construct (100) of the reservoir computing device is shown and it is composed of a block of gel (101), inhomogeneity introduced in the gel (102), mechanical actuators that vibrate the gel (103), an MR imaging machine (104) and a computing device (105) for processing and communicating reservoir data. In a preferred implementation of the system, the gel is an agarose compound, however it can be chosen as any semisolid gel that can be vibrated by the actuators. Any shape can be used for the container of the gel, however using a cube enables systematic placement of mechanical actuators (103). As given in method 600, the inhomogeneity in the gel can be a combination of a non-metal object of any shape that distorts the acoustic wave propagation, and stiffness inhomogeneity due to varying concentration of gel ingredient. An object can be inserted into the gel during preparation, or the inhomogeneity can be introduced in a region by using a different concentration of gel ingredient which creates different stiffness in that region, or both. In a preferred implementation, the actuators are located on 5 sides of the cube (bottom side is omitted). The actuators are able to create point source or planar acoustic waves, depending on the input code generated in encoding stage (202). Actuators can be any type available in elastography methods: electromechanical driver, piezoelectric stack driver, focused-ultrasound based or acoustic speaker based. MR imaging machine (104) is the both one of the input (via motion sensitizing gradient) devices and the output device in the system that can be used for phase-contrast MRI (Muthupillai et al., 1996). This technique provides the 3D volume images of propagating acoustic waves in the gel. A computing device (105) is connected to the system to receive input from the outside world (201), to provide processing medium for encoding (202) and decoding (205) stages, to process (206) the reservoir output for a specific task (classification etc.) and to communicate the output (207) of the system to the outside world.
In Figure 2, the algorithmic flow of the system (200) is given. The reservoir computing system receives the input data (201). The encoding stage (202) translates the input into a code that drives the mechanical actuators and modulates the motion sensitizing magnetic gradient. At the end of this stage input data is transformed into a set of instructions that drives the physical system. Then, the physical system is excited (203) according to these instructions generated in the previous stage and the phase-contrast MR imaging (204) is performed that gives an image volume of the gel. The MR image volume of the gel is decoded (205). In the decoding stage, the MR image volume is converted into a data vector that represents the state of the complex acoustic wave patterns in the gel. The decoded data vector is then further processed (206) according to the task at hand (eg. recognition, data compression, clustering etc.). The stages (203) and (204) take place in the physical construct of the system whereas the rest of the stages are part of software implemented in the computing device (105). This computer accesses input data (201) and fetches MR image volume (204) for decoding (205), and then applies processing algorithms (206) for the assigned task.
The design of encoding and decoding stages is essential for efficient use of the reservoir computing system. In order to understand the encoding stage of the system, the principles of the MR elastography need to be visited. In elastography, the phase shift of the MR signal is acquired by the machine and the image intensity at a voxel is a function of this shift. The shift at location r in the gel is given by:
φ (r) = — fi . ξ) s{k . r -f- ) The MR image is dependent on the gyromagnetic ratio (γ) of the material at location r, the angular frequency (ω), the wavenumber (k), initial phase offset (a) and peak displacement (ξ) of the mechanical excitation and the magnetic gradient (G). G can be a temporally periodic signal:
By looking at the formula that converts the acoustic wave vibration into MR image, and inspecting the physical architecture of the proposed system (Figure 1), we can list the different variables that can be used for encoding of the input signal into instructions of the physical system:
Mechanical wave type (point source or planar)
Mechanical wave location (x,y,z) on the gel volume, see (103) in Figure 1
Mechanical wave parameters (ω, k, α, ξ)
Magnetic gradient temporal frequency ω χ, ω γ, ω ζ in each dimension. Magnetic gradient strength Gox, Goy, Goz in each dimension.
There are total of at least 14 (V) variables for encoding. Using these variables, the input data (201) needs to be converted into a specific set of system instructions and the system is excited (203) with these instructions. In a similar but simpler system proposed by Fernando and Sojakka (2003), actuator locations represented different frequency channels of the speech data, and time varying peak displacement of the waves represented the magnitude of the specific frequency channel.
The steps of the encoding stage (202) are shown in Figure 3. The input data is first pre-processed (301), and this can be a sequence of operations (filter, whiten, transformations such as fourier or wavelet, dimensionality reduction etc.) that modifies and transforms the data to make it more appropriate for the subsequent stages. At the end of pre-processing, the data have an inherent number of dimensions, K. In the mapping stage (302), these dimensions are mapped onto the system variables given above. K can be much larger than the number of system variables (V >= 14), then in that case the mapping algorithm exploits time. Making the system variables time varying and presenting different dimensions of the pre-processed input in different excitation epochs (e=l,2,..T), allows the encoding stage to map as many dimensions as needed. As the dimensionality of the input data (201) increases it takes more time to encode the input, and the complexity of the mapping algorithm as well as the need for dimensionality reducing pre-processing steps (i.e. principle component analysis, wavelet
transform etc.) also increases. Suppose we have a K dimensional pre-processed data vector and XK is the kth dimension of the vector. Then in mapping stage, a function maps the component of the input (can be real, integer or binary) onto the value of mth reservoir variable at epoch e, ¾:
(302)
For all k = 1,2, ><*K there is a combination of m and e in set: m. = 1,2,,. and e = 1,2,
The steps of decoding stage (205) are shown in Figure 4. The MR image volume passed from the imaging stage (204) is first post-processed (401) with a sequence of algorithms such as high pass filtering and edge extraction. This stage filters out low spatial frequency information and enhances the wave propagation information in the image. Then in the subsequent step (402), for dimensionality reduction, the image volume can be divided into a coarser grid (M by N by P) than the MR image volume (many voxels falls inside a grid). A single feature or a set of features (length Q) are computed for each grid cell. The features can be average, standard deviation, or any spatial transformation output which can be applied to a neighborhood of voxels inside a grid cell. The feature values for each grid cell are concatenated to form a data vector (403) of size (M*N*P*Q). This vector is the reservoir output for a given input.
In (206), the output vector of the reservoir is used as an input to another algorithm designed for a specific task such as classification, clustering, dimensionality reduction, data completion etc. The output of the overall system is computed from this stage but the specifics are irrelevant for this patent.
The acoustic waves that are captured by the MR imaging device need to be harnessing the nonlinearity provided by diffusive wavefront interactions and inhomogeneity in the gel. The nonlinearity in the reservoir maps the input onto a high dimensional nonlinear manifold. In order to illustrate the complexity in the
acoustic waves of the proposed system in a controlled manner, a series of simulations are performed in 2D acoustic waves. Figure 5 gives an instance of waveforms generated by 3 different scenarios. In (501), 4 point acoustic sources with different phases are used in a homogeneous medium and waves seem to exhibit very complex patterns. A single point source and a very stiff line object are used in (502), and this scenario shows complex diffraction patterns. In (503), a single point source and many small inhomogeneities are used. The inhomogeneous small objects also distort the waves to create wave patterns. Both simulations and experiments show that proposed system is able to exhibit nonlinear operations essential to reservoir computing, through reflection, refraction and dispersion of acoustic waves. The suggested system is a complicated combination of acoustic waves, magnetic waves, spatial nonlinearities, and is able to provide a very rich nonlinear projection of the input. Recent advances (Hinton, 2006) in recurrent neural networks have shown the importance of unsupervised pre-training of the network for robustness and high performance. Traditional pre-training in software recurrent neural networks is executed using a big unlabeled dataset and unsupervised learning peinciples, tailoring the network connections for a specific task and data. The pre-training stage iteratively minimizes an energy function defined on the connection of the nodes. We propose a method for performing a hardware pre-training stage (600) in the reservoir of our system (Figure 6). This pre-training phase prepares a specific gel for a specific dataset. In (601) a homogeneous and hot (in its liquid form) gel is prepared in its container (101). Then a number of high density objects (plastic, wood, etc.) are inserted into the hot gel (602) and stirred for homogeneous diffusion of the objects. These objects act as nonlinear operators on the acoustic wave. In addition to these objects stiffness inhomogeneity can be used. Using a different concentration of gel ingredient creates different stiffness in a region, and inhomogeneity in stiffness can be achieved by mixing gel ingredient (603) without stirring the hot solution. Data is composed of many instances (i.e. images), and the system instructions (303) of each one of the instances can be
computed offline and stored (605) in the computing device. In annealing phase (604), the hot gel is excited (acoustic waves) (203) with the system instructions of the whole data (605) one instance at a time, continuously, repeatedly and in random instance order until the gel becomes solid. During this annealing phase, the acoustic waves applied onto the hot gel, stir and mix the gel guiding gel ingredient diffusion and move the inserted objects. The inhomogeneities (object locations and stiffness gradients) are created according to the applied excitation. The diffusion/motion process minimizes the total energy of the gel, optimizing it for the data. In preferred implementations, object insertion (602) and gel ingredient mixing (603) can be selected as the only source of inhomogeneity, or can be applied together.