US9189870B2 - Method, computer readable medium and system for tomographic reconstruction - Google Patents
Method, computer readable medium and system for tomographic reconstruction Download PDFInfo
- Publication number
- US9189870B2 US9189870B2 US14/416,533 US201214416533A US9189870B2 US 9189870 B2 US9189870 B2 US 9189870B2 US 201214416533 A US201214416533 A US 201214416533A US 9189870 B2 US9189870 B2 US 9189870B2
- Authority
- US
- United States
- Prior art keywords
- detector
- photon
- measurement data
- projection
- reconstruction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/003—Reconstruction from projections, e.g. tomography
- G06T11/005—Specific pre-processing for tomographic reconstruction, e.g. calibration, source positioning, rebinning, scatter correction, retrospective gating
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/003—Reconstruction from projections, e.g. tomography
- G06T11/006—Inverse problem, transformation from projection-space into object-space, e.g. transform methods, back-projection, algebraic methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10072—Tomographic images
- G06T2207/10104—Positron emission tomography [PET]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2210/00—Indexing scheme for image generation or computer graphics
- G06T2210/41—Medical
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2210/00—Indexing scheme for image generation or computer graphics
- G06T2210/52—Parallel processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2211/00—Image generation
- G06T2211/40—Computed tomography
- G06T2211/416—Exact reconstruction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2211/00—Image generation
- G06T2211/40—Computed tomography
- G06T2211/424—Iterative
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2211/00—Image generation
- G06T2211/40—Computed tomography
- G06T2211/428—Real-time
Definitions
- the invention relates to a method, a computer readable medium and a system for tomographic reconstruction from measurement data.
- emission tomography the density data of particle emissions (in the following: emission density data) in a volume has to be found on the basis of measurement data.
- Emitting particles are for instance positrons or gamma photons. Emitted particles may get scattered or get absorbed, and may also change their type in the volume or before detection. Emitted particles or the particles originating from the emitted particles arrive in detector devices that report detection events. The above mentioned physical phenomena complicate the tomographic reconstruction, and for an appropriate result these have to be taken into account in the course of the reconstruction.
- An emission tomographic reconstruction method reconstructs the emission density data based on the measurement data obtained from signals of detector devices comprising a plurality of detector elements.
- PET Positron Emission Tomography
- the density of radioactive tracer materials has to be found, which is proportional with its emission density data.
- the particles emitted from the tracer material are positrons, which are replaced by pairs of approximately oppositely directed gamma photons when they annihilate with electrons (cf. Physics reference manual, Geant4 9.1. Technical Report, CERN, 2007).
- These gamma photons may get scattered in the body and/or in the detector elements before they get finally absorbed in a detector element comprised in the detector device or module.
- the detector elements are usually in the form of detector crystals.
- the detector device should detect pairs of gamma photons arriving almost simultaneously, so an event occurs if two detector elements nearly simultaneously detect two photons.
- a pair of detector elements sometimes called detector crystals, represents a conceptual detector, which is also called Line Of Response (LOR).
- LOR Line Of Response
- emitted particles are gamma photons.
- a particle may also get scattered or absorbed in the object, in the collimator or in the detector crystals comprised in a detector device rotating around the object to be investigated.
- a detector crystal i.e. a detector element—is able to detect gamma photons that went through a collimator associated to it.
- the collimator can be for example parallel hole, pinhole, multi-pinhole, fan or cone beam, slit-slat type or any other type.
- SPECT a conceptual detector is defined by the actual orientation of the detector device, i.e. of its detector elements and the corresponding collimators. The line characterizing this orientation is called the projection line.
- the reconstruction process is said to be based on binned data.
- the inputs comprise the total number of hits in conceptual detectors acquired during the measurement. Consequently, when binned reconstruction is applied, individual detector events undergo binning and the emission density data has to be reconstructed from the spatial histogram of photon pair hits in the case of PET or from the photon hits in the case of SPECT.
- a list mode reconstruction may involve more information, like the timing of individual events. The differences between binned and list mode reconstruction are given in more detail below.
- the required output of a reconstruction method of any type is the emission density function x( ⁇ right arrow over (v) ⁇ ) in points ⁇ right arrow over (v) ⁇ of a volume of interest ⁇ , which is typically discretized on a voxel grid.
- the input of the reconstruction method is the measurement data, the measured number of hits in the conceptual detectors, which is connected to the output by a system matrix.
- An element of the system matrix represents the probability that a particle generated in a voxel is detected by a conceptual detector.
- both the number of conceptual detectors and the number of voxels may be in the range of several hundred millions, thus the system matrix has a very large size and the reconstruction method must scale up well and must be appropriate for high performance computation platforms.
- the known time-consuming processes of iterative tomography reconstruction are targeted by pre-computing system dependent information and always needed more efficient computational platforms (A. J. Reader and H. Zaidi: Advances in PET Image Reconstruction, PET Clinics , Vol. 2, pp. 173-190 (2007)).
- FPGA Field-Programmable Gate Arrays
- multi-CPU systems D. W. Shattuck, J. Rapela, E. Asma, A. Chatzioannou, J. Qi, and R. M. Leahy: Internet2-based 3D PET image reconstruction using a PC cluster, Physics in Medicine and Biology , Vol. 47, pp.
- GPUs can be programmed with two different programming models.
- Shader APIs Application Programming Interface
- GPGPU General Purpose GPU
- CUDA Computer Unified Device Architecture
- OpenCL Open Computing Language
- CUDA Computer Unified Device Architecture
- SIMD Single Instruction. Multiple Data
- forward projection computing the expected detector hits from an actual estimation on emission density data and back projection correcting the emission density data based on the measured and expected detector responses alternate.
- Equations of forward projection and back projection are similar in the way that they take many input values—voxel intensities and data of conceptual detectors, respectively—and compute many unknown values—again, data of conceptual detectors and voxel intensities, respectively.
- This kind of “many to many” computation can be organized in two different ways. Known values can be taken one-by-one obtaining the contribution of a single known value to all of the unknowns, and accumulate the contributions as the different known values are visited. This scheme is called scattering.
- a tomographic reconstruction method is disclosed in M. Magdics, L. Szirmay-Kalos, ⁇ . Szlavecz, G. Hesz, B. Benyó, ⁇ . Cserkaszky, J. Lantos, D. Légrády, Sz. Czifrus, A. Wirth, B. Kári, G. Patay, D. Vögyes, T. Bükki, P. Major, G. Németh, B. Domonkos: TeraTomo project: a fully 3D GPU based reconstruction code for exploiting the imaging capability of the NanoPET/CT system, 2010 World Molecular Imaging Congress , 11 Sep. 2010.
- This paper discusses direct Monte Carlo particle transport and presents an adjoint Monte Carlo method which applies gathering type approach both in forward and back projection. Both direct photon hits and scattered photon hits are considered.
- a photon transport simulation method that executes the very same instruction in parallel threads at a time and eliminates the conditional instructions, is disclosed in L. Szirmay-Kalos, B. Tóth, M. Magdics, D. Légrády, and A. Penzov: Gamma Photon Transport on the GPU for PET, in: Large - scale scientific computing , Springer Berlin, Heidelberg, pp. 435-442, 4 Jun. 2009.
- an object of the invention is to provide an emission tomographic reconstruction method, which is free of the disadvantages of prior art solutions.
- a further object of the invention is to provide an emission tomographic reconstruction method which is highly optimized for parallel architectures.
- the object of an embodiment of the invention is to get an algorithm that is accurate for all types of data, i.e. applicable either for homogeneous regions or small, high intensity regions to be reconstructed and is fast on the target parallel architecture.
- List mode reconstruction processes a list of individual events, thus event specific information can be assigned to each detector hit, like photon energy or the time difference of detections (time of flight).
- Binned mode reconstruction works with accumulated data, which comprise the number of hits in each conceptual detector and, optionally in the case of time of flight measurements, associated histograms of energy levels of time of flight measurements.
- a binned mode reconstruction therefore, in spite of dropping out the additional information handled in a list mode reconstruction, can be appropriate for fast reconstructions.
- high accuracy can be reached by careful implementation as detailed below.
- list mode processes events one-by-one, it cannot reuse computation done previously when another event in the same conceptual detector was processed. Unlike the list mode reconstruction, the reuse of previously done computations can be applied in binned mode reconstruction. To make this reuse even more effective, the invention proposes the application of factoring in some embodiments, which improves the performance of binned mode reconstruction and would not be feasible in a list mode approach.
- a further difference between the prior art solutions and some embodiments of the method according to the invention is that the prior art solutions involve either gathering type approaches, which are good for GPU implementation but cannot focus on important, i.e. high contribution subsets of the data, or scattering type approaches, which may reconstruct well important regions but perform poorly on the GPU.
- the invention combines the advantages of scattering and gathering approaches with the application of multiple importance sampling.
- US 2011/0182491 A1 that addresses list mode reconstruction with gather-only forward projection and scatter-only back projection
- some embodiments of the invention target binned mode reconstruction with combined forward and combined back projections.
- the invention is based on the recognition that a binned type reconstruction method can be more thoroughly optimized on parallel architectures than a list mode reconstruction method.
- FIG. 1 shows a volume of interest placed between two parallel detector devices
- FIG. 2A shows an exemplary ray bundle between sources and detector devices
- FIG. 2B shows the exemplary ray bundle with introducing a virtual detector device/virtual source in between the sources and detector devices
- FIG. 3 represents an embodiment of the inventive method in a flowchart
- FIG. 4 shows a voxel being on the line connecting two detector elements
- FIG. 5A shows an exemplary independent random distribution of points in a rectangular area
- FIG. 5B shows an exemplary Poisson-disk distribution of points in a rectangular area
- FIG. 6 shows an object placed between two parallel detector devices and the notations in connection with the identification of a LOR and a detector element in a detector device
- FIG. 7 shows a voxel placed between two parallel detector devices and lines connecting pairs of detector elements
- FIG. 8 shows a group of detector elements in which a particle is propagating
- FIG. 9 shows a voxel placed between two parallel detector devices and a LOR giving contribution to the voxel
- FIG. 10 represents an embodiment, where no filtered sampling is applied
- FIG. 11 represents a further embodiment with filtered sampling involved
- FIG. 12A shows a Derenzo phantom
- FIG. 12B shows a homogeneity phantom
- FIGS. 13A-13C show comparison of various embodiments of the invention for point phantom, homogeneity phantom and Derenzo phantom,
- FIGS. 14A and 14B show a reconstruction result for an animal obtained by various embodiments of the method according to the invention.
- the emission tomographic reconstruction method according to the invention is carried out in a parallel architecture.
- the parallel hardware architecture is at least one Graphics Processing Unit (GPU).
- GPU Graphics Processing Unit
- the detailed embodiment of the reconstruction method according to the invention can also be applied in other parallel architectures with slight changes within the scope of the invention.
- most of the details of the embodiments are described in connection with PET reconstruction, but the most of the embodiments described below can be used for other tomographic reconstructions, such as SPECT.
- the invention is a method for emission tomographic reconstruction from measurement data, the method comprising the steps of: obtaining the measurement data corresponding to activity of particle emissions of a volume from a tomographic imaging system, and performing at least one iteration step to obtain emission density data of particle emissions of the volume from the measurement data, wherein the at least one iteration step comprises a forward projection being a projection from the emission density data to the measurement data, and a back projection being a projection from the measurement data to the emission density data, and the at least one iteration step is carried out in a parallel hardware architecture, wherein both the forward projection and the back projection are of at least partially gathering type, and the measurement data is of binned type.
- the tomographic imaging system can be preferably a positron emission tomograph or a single photon emission computed tomograph as detailed above. The details of the forward and back projections and other details of the invention are given below.
- An embodiment may be embodied in the form of computer-implemented processes and apparatuses for practicing those processes.
- An embodiment may also be embodied in the form of a computer program code containing instructions embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other computer readable storage medium, wherein, when the computer program code is loaded into and executed by a computer, the computer becomes an apparatus for carrying out the method.
- An embodiment may also be embodied in the form of computer program code, for example, whether stored in a storage medium, loaded into and/or executed by a computer, or transmitted over some transmission medium, such as over electrical wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein when the computer program code is loaded into and executed by a computer, the computer becomes an apparatus for carrying out the method.
- the computer program code segments configure the microprocessor to create specific logic circuits.
- the invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system.
- a computer-usable or computer readable medium can be any apparatus that may include, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
- the medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium.
- the invention can take the form of a computer readable medium comprising a computer readable program for processing tomographic reconstruction from measurement data, wherein the computer readable program when executed on a computer causes the computer to perform the steps of any embodiments of the method according to the invention.
- Embodiments can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment including both hardware and software elements.
- the present invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
- Embodiments can take the form of a system for processing tomographic reconstruction from measurement data, characterized by comprising an obtaining unit for obtaining the measurement data corresponding to activity of particle emissions of a volume from a tomographic imaging system; and an iteration unit for performing at least one iteration step to obtain emission density data of particle emissions of the volume from the measurement data, wherein the iteration unit comprises a forward projection unit for performing a forward projection for projecting the emission density data to the measurement data: and a back projection unit for performing a back projection for projecting the measurement data to the emission density data; and wherein the iteration unit comprises a parallel hardware architecture, the forward projection and the back projection are of at least partially gathering type, and the measurement data is of binned type.
- the reconstruction method according to the invention is implemented on GPU in some embodiments.
- the critical issue of the GPU programming and parallel programming in general, is thread mapping, i.e. the decomposition of the algorithm to parallel threads of a parallel architecture, so that the algorithm can run efficiently.
- a GPU is a collection of multiprocessors, where each multiprocessor has several SIMD scalar processors that share the instruction unit and thus always execute the same machine instruction.
- the threads allocated to a single multiprocessor preferably have the same control path. If two threads of the same multiprocessor execute different branches of a conditional instruction due to data dependencies, then the scalar processor should go through both branches, what would degrade performance.
- the dependence of flow control on input data is preferably minimized.
- data independent flow control is performed in at least a part of the scalar processors and/or the control path of parallel threads is identical for all possible input data.
- the tomographic reconstruction method according to the invention is preferably free of conditional instructions and the algorithm connected to the method is preferably decomposed to phases or steps implemented by different kernels.
- the specific kernels are started only if the data requires execution.
- the iteration steps of the inventive method are carried out in parallel threads of a parallel architecture.
- algorithms corresponding to the step are carried out in parallel threads being of identical length. This can be achieved by eliminating variable length loops from the program implementing a respective step, even if the constant length loops have more cycles than needed by the required accuracy.
- Synchronization of the threads and atomic operations are an available option on a GPU but they are expensive to use, so these are preferably avoided in the inventive method.
- No atomic operation is needed if threads modify only their own private memory location. However, different threads may read the same memory location while the data remains constant.
- global memory access is slow on the GPU, especially when atomic writes are needed to resolve thread collisions, so it is preferably avoided according to the invention.
- An iterative reconstruction method comprises a forward projection and a back projection, therefore there are four different ways to implement the method depending on whether scattering or gathering type approach is applied in forward projection and back projection. It is emphasized that the distinction of these cases might be just the order of loops in a CPU implementation, but is a crucial design decision when the algorithm runs on the GPU since it defines which loop is executed in parallel on the shader processors, while the scattering type steps are not favored in a parallel architecture. GPUs and in general parallel algorithms favor gathering since it computes a single result from the available data, which can be written out without communication between the computational threads and the synchronization of them.
- the communication between threads of different multiprocessors is done via an external memory, which is slow. Therefore in general, external memory accesses are preferably minimized and all communication is avoided between threads in the method according to the invention.
- Running independent threads on different processors boosts performance proportionally with the number of processors.
- temporary results obtained by different processors cannot be generally reused, which is obviously possible and worthwhile in a single thread implementation.
- efficiency improvement via reuse and performance enhancement via parallelism are contradicting requirements.
- the algorithm should be broken into phases, i.e. the system matrix of the inventive method is preferably factored—as detailed below—similarly as described in an above mentioned paper (J.
- the global memory is generally very slow compared to the register and local memory access and to the computational performance of processors. There is, however, one exception to this rule. If threads of the same scalar processor access neighboring data elements, then the transfer is executed in a single step which decreases the access time. Therefore, neighboring threads are preferably designed to access neighboring data elements. For example, in pure Monte Carlo particle tracing threads fetch the memory randomly, thus coherent access cannot be guaranteed. However, if particles are transferred by ray-bundles, i.e. in a step a larger set of coherent rays are processed, memory access speed can be greatly increased.
- the required output of tomographic reconstruction methods is the emission density data for a binned type reconstruction.
- the emission density data is given e.g. by a function x( ⁇ right arrow over (v) ⁇ ) in points ⁇ right arrow over (v) ⁇ of a volume of interest ⁇ .
- the volume of interest ⁇ is the volume where the positron emitting tracer material is dispersed, for example in a body to be investigated.
- the input of the reconstruction method is the measurement data, i.e. the measured number of hits in the conceptual detectors, e.g. LORs for example in the case of PET.
- a pair of detector elements 13 ′ and 13 ′′ comprised in respective detector devices 10 ′ and 10 ′′ constitutes a conceptual detector, i.e. a LOR in the case of the depicted PET system.
- the detector device 10 ′ and 10 ′′ comprises other detector elements 12 as well.
- Propagation line 14 of a photon pair in direction ⁇ right arrow over ( 107 ) ⁇ is illustrated in the figure. This photon pair is detected on detector elements 13 ′ and 13 ′′, thus gives a contribution to the LOR of the detector elements 13 ′ and 13 ′′.
- the photon pair is emitted roughly in parallel by the annihilation of a positron.
- the point of emission of the photon pair is denoted by ⁇ right arrow over (v) ⁇ in FIG. 1 and the emitted photons are propagating in the ⁇ right arrow over ( 107 ) ⁇ and ⁇ right arrow over ( 107 ) ⁇ directions, respectively.
- the measurement data of the LOR connecting detector elements 13 ′ and 13 ′′ can also comprise other events.
- a volume of interest is also shown in FIG. 1 , which comprises a head in the shown example.
- x v is a V th voxel value.
- b v ( ⁇ right arrow over (v) ⁇ ) functions are pre-defined basis functions for each voxel.
- ⁇ tilde over (y) ⁇ D is the expected number of hits for conceptual detector D corresponding to a certain x( ⁇ right arrow over (v) ⁇ ) emission density data
- T( ⁇ right arrow over (v) ⁇ D) is the system sensitivity, i.e. the probability that an emission in point ⁇ right arrow over (v) ⁇ is detected by the conceptual detector D
- ⁇ is the volume of interest
- a DV ⁇ V ⁇ b V ⁇ ( v ⁇ ) ⁇ T ⁇ ( v ⁇ ⁇ D ) ⁇ ⁇ d v is an element of the system matrix A which connects the conceptual detector D and voxel V.
- the difference between y D and ⁇ tilde over (y) ⁇ D should be emphasized; y D is the measured number of hits for conceptual detector D, i.e. the measurement data and ⁇ tilde over (y) ⁇ D is the expected number of hits for a given x( ⁇ right arrow over (v) ⁇ ) distribution, therefore its value changes during the reconstruction process as x( ⁇ right arrow over (v) ⁇ ) changes during the iteration steps.
- FIG. 2A shows the conceptual model of emission tomography.
- the 3D source of emitted particles is divided into several voxels, e.g. voxels 16 ′, 16 ′′, 16 ′′′.
- Emitted particles may end up in detector elements 12 ′, 12 ′′, 12 ′′′ after traveling in space including possible scattering and type-changes as demonstrated by particle paths 15 .
- the expected values of the conceptual detector hits have to be computed in the forward projection, which require the integration of the contribution of all possible particle paths.
- the contribution of sources to detector elements is a high-dimensional integral in the domain of source points, detector points and arbitrary number of scattering points.
- Such high-dimensional integrals are preferably calculated by a Monte Carlo quadrature that randomly traces sample paths. The more paths are computed, the higher precision reconstruction is obtained.
- the particle transport process according to the forward and back projections is preferably decomposed to phases by introducing virtual detector devices comprising virtual detector elements.
- this factorization has other advantages as well.
- virtual detector elements 18 ′, 18 ′′, 18 ′′′ are illustrated.
- the particle transport process is factorized. It is clear from the figure that the factorization leads to the segmentation of particle paths 15 into path segments 17 and 19 .
- their paths are first executed to the virtual detector elements—path segments 17 in FIG. 2B , then virtual detector elements 18 ′, 18 ′′, 18 ′′′ become virtual sources and the next phase simulates transport from the virtual sources to the real detector elements—path segments 19 .
- the idea of factoring is that the transport process. i.e. the whole propagation from the emission of a particle until it is detected in a detector element, is decomposed to phases with the introduction of virtual detector elements.
- FIG. 2B shows decomposition into two phases, but more than two phases are also possible in a tomography reconstruction method.
- the expected values in the first layer of virtual detector elements are computed from the source.
- the first layer of these detector elements become sources and a similar algorithm is executed until the real detector elements are reached.
- the advantages of the factorization are the following:
- y ⁇ D ⁇ V ⁇ x ⁇ ( v ⁇ ) ⁇ T ⁇ ( v ⁇ ⁇ D ) ⁇ ⁇ d v should be reinterpreted in the factored case.
- ⁇ is the domain of source distribution considered in the investigated phase
- ⁇ tilde over (y) ⁇ D is the expected number of hits in detector element D of the phase, which can be virtual detector elements
- T( ⁇ right arrow over (v) ⁇ D) is the probability density of transporting from virtual source point ⁇ right arrow over (v) ⁇ to virtual detector element D.
- both gathering and scattering type algorithms can be proposed that may have significantly different accuracy depending on the data to be reconstructed.
- one type of the algorithms can be good to estimate larger homogeneous regions, while the other type to estimate small high intensity features.
- the gathering type and the scattering type algorithms may be associated with significantly different computational cost on the parallel hardware.
- forward projection and back projection are preferably factored to phases or steps that can be executed independently and without communication.
- the factored steps can be based on different computation paradigms, including analytic calculations, numerical quadrature, or exploitation of measured or simulated data.
- all steps are at least partly of gathering type in the reconstruction method according to the invention. Gathering is implemented in a way that a thread computes a single output value from all relevant inputs.
- the computation process follows the direction from the voxels of positron density towards the expected LOR hits.
- back projection of it the direction is from the ratios of expected and measured LOR values to the new voxel values, so the factored phases are computed in the reverse order and the input and the output of the phases are exchanged.
- Multiple importance sampling is a general mathematical approach to combine different Monte Carlo estimators (E. Veach and L. Guibas: Optimally combining sampling techniques for Monte Carlo rendering. Computer Graphics Proceedings, Annual Conference Series , 1995 (ACM SIGGRAPH '95 Proceedings), pp. 419-428 (1995)).
- multiple importance sampling is proposed for application in phases of emission tomography to meet the goals of accuracy and efficient parallel implementation. These phases consider the transport from either virtual or real sources to either virtual or real detector devices.
- y ⁇ D ⁇ V ⁇ x ⁇ ( v ⁇ ) ⁇ T ⁇ ( v ⁇ ⁇ D ) ⁇ ⁇ d v is presented below when two different methods, an output-driven i.e. gathering type and an input-driven, i.e. scattering type are used simultaneously.
- the special case of using just a single—i.e. gathering or scattering—type method is also obtained by setting the sample numbers of other methods to zero.
- the reconstruction method in these embodiments becomes highly accurate for all types of data, i.e. applicable either for homogeneous regions or small, high intensity regions to be reconstructed and is fast on the target parallel architecture.
- a set of (virtual) sources i.e. voxels and a set of (virtual) detector elements are considered as introduced above.
- N o output-driven i.e. detector oriented samples in the points ⁇ right arrow over (v) ⁇ 1 , . . . , ⁇ right arrow over (v) ⁇ N o of the source domain are taken for each detector element D, drawn from probability density p o ( ⁇ right arrow over (v) ⁇ , D) which is at least approximately proportional to the transport probability, T( ⁇ right arrow over (v) ⁇ D).
- Individual densities d i or d o do not have to cover the total integration domain, it is enough if at least one of the strategies generates a given source point of non-zero emission with non-zero probability. Note that in the case of forward projection, detector oriented sampling leads to an output-driven, i.e.
- N i and N o must be set to consider the requirements that combined density d o ( ⁇ right arrow over (v) ⁇ , D)+d i ( ⁇ right arrow over (v) ⁇ ) should be roughly proportional to integrand x( ⁇ right arrow over (v) ⁇ )T( ⁇ right arrow over (v) ⁇ D) and the total computational cost of N i source oriented and N o detector oriented samples should be minimal taking into account the parallel hardware.
- FIG. 3 The overview of a PET reconstruction algorithm is shown in FIG. 3 .
- FIG. 3 the details of an iteration step are shown in one embodiment of the inventive reconstruction method.
- the transport process of the forward projection is decomposed into three factored steps in this embodiment:
- the system matrix connecting the measurement data and the emission density data is factorized in this embodiment into sub-matrices according to the photon transport step, the detector evaluation step and the positron transport step.
- Each of these phases is detailed below, but the whole iteration cycle is detailed here on the basis of FIG. 3 .
- the positron emission density data x( ⁇ right arrow over (v) ⁇ ) denoted by a dot on the left hand side of the figure
- the annihilation density, x a ( ⁇ right arrow over (v) ⁇ ) is calculated from x( ⁇ right arrow over (v) ⁇ ) by the application of a positron transport step.
- the virtual detector element data ⁇ tilde over (y) ⁇ L surf
- the measurement data have to be compared to the virtual detector element data obtained after the detector response step, i.e. ⁇ tilde over (y) ⁇ L det .
- the virtual detector elements inserted after the detector response phase are detector elements of the respective conceptual detectors, i.e. LORs in the case of PET. It is shown in FIG. 3 that multiple importance sampling is taken into account also in the phase of detector response. In back projection, the results obtained from the forward projections, i.e.
- ⁇ tilde over (y) ⁇ L det are compared to the measurement data of the respective LORs.
- multiple importance sampling is involved in the present embodiment in such a way that the photon transport step and/or the detector response/evaluation step are of at least partly scattering type.
- the scattering methods are introduced in order to make the present embodiment of the inventive method more suitable for the reconstruction of any kind of measured data. Note, that the usage of only a small number of input-driven threads improves the results of the reconstruction.
- the reads and writes indicated by arrows are also distinguished by their types in FIG. 3 .
- Arrows with a solid line correspond to parallel non-exclusive reads from arrays that are constant during the thread run. This means that each data of the array can be used by several threads, but these threads do not alter the value of the data.
- Such a read is used e.g. when contributing voxels are collected for a LOR, value of a single voxel can give contribution for more than one LOR.
- Arrows with a dotted line correspond to the case of non-colliding writes, a thread in this case writes only its own private data element. This kind of write is used in the case of output-driven, i.e. gathering type threads which are preferred for a parallel architecture.
- Atomic writes only have to be used for threads denoted by a dashed arrow in FIG. 3 , i.e. an atomic write has to be used in the case of an input-driven thread.
- an atomic write has to be used in the case of an input-driven thread.
- registered CT data can be also used for taking into account material dependent positron transport, i.e. the type of the material comprised in a voxel can be taken into account in the positron transport phase.
- the embodiment introduced in FIG. 3 can be modified in such a way that only the output-driven algorithms are used in photon transport step and/or detector evaluation step. Also the amount of input-driven data used in the reconstruction can be varied e.g. by the type of the investigated material.
- forward projection simulates the physical phenomena from the emission of positrons to the output of detector electronics and executes the factored steps of positron transport, photon transport (also referred to as geometric forward projection if scattering is computed separately), and detector evaluation that itself consists of detector blurring and sensitivity simulation.
- the annihilation density x a ( ⁇ right arrow over (v) ⁇ a ) due to positron range is computed as a blurring of the current estimate of the positron density with a filter kernel storing the probabilities of positron offsets between positron generation and annihilation:
- x a ⁇ ( v ⁇ a ) ⁇ V ⁇ x ⁇ ( v ⁇ p ) ⁇ P ⁇ ( v ⁇ a - v ⁇ p ) ⁇ ⁇ d v p
- P( ⁇ right arrow over (v) ⁇ a ⁇ right arrow over (v) ⁇ p ) is the probability density that a positron born in ⁇ right arrow over (v) ⁇ p annihilates in ⁇ right arrow over (v) ⁇ a .
- x a ⁇ ( v -> a ) ⁇ m ⁇ ⁇ v ⁇ x ⁇ ( v -> p ) ⁇ ⁇ m ⁇ ( v -> v ) ⁇ P m ⁇ ( v -> a - v -> p ) ⁇ d v p is obtained.
- ⁇ m ( ⁇ right arrow over (v) ⁇ p ) is an indicator function that is 1 if in point ⁇ right arrow over (v) ⁇ p there is material of index m and zero otherwise.
- the kernel of the material at the position of the annihilation can also be used, which leads to the following convolution:
- x a ⁇ ( v -> a ) ⁇ m ⁇ ⁇ ⁇ m ⁇ ( v -> a ) ⁇ ⁇ v ⁇ x ⁇ ( v -> p ) ⁇ P m ⁇ ( v -> a - v -> p ) ⁇ ⁇ d v p .
- the convolutions in the sum can also be computed via Fourier transformations:
- x a ⁇ ( v -> ) ⁇ m ⁇ ⁇ ⁇ m ⁇ ( v -> ) ⁇ F - 1 ⁇ ⁇ F ⁇ ⁇ x ⁇ ( v -> ) ⁇ ⁇ ⁇ F ⁇ [ P m ⁇ ( v -> ) ] ⁇ .
- a gathering approach leads to a parallelization scheme where a thread is responsible for computing a single LOR from all relevant voxels.
- a detector element pair can be affected only if its detector elements are seen at opposite directions from annihilation point ⁇ right arrow over (v) ⁇ .
- the photon non-collinearity can be modeled not only during the geometric part of the photon pair transport, but can also be approximated as a LOR-space blurring (A. Rahmim, J. Tang, M. Lodge. S. Lashkari, M. Ay, R. Lautamaki, B. Tsui, F. Bengel: Analytic system matrix resolution modeling in PET: an application to Rb-82 cardiac imaging, Phys Med Biol ., Vol. 53, pp.
- the above assumption also means that annihilation point ⁇ right arrow over (v) ⁇ where the photon pair is born and their directions ⁇ right arrow over ( 107 ) ⁇ and ⁇ right arrow over ( 107 ) ⁇ unambiguously identify detector element hit points ⁇ right arrow over (z) ⁇ 1 and ⁇ right arrow over (z) ⁇ 2 , or alternatively, from detector element hit points ⁇ right arrow over (z) ⁇ 1 and ⁇ right arrow over (z) ⁇ 2 , those points ⁇ right arrow over (v) ⁇ and direction ⁇ right arrow over ( 107 ) ⁇ , which can contribute can be determined, as it is shown in FIG. 4 .
- the view point is modified herebelow from the annihilation points and directions to detector element hit points, and using the correspondence between them, the detector response is expressed as an integral over the detector element surfaces and an integral along lines between the surface points.
- the integrals over the detector element surfaces can be estimated by N detline discrete line samples, i.e. point pairs ( ⁇ right arrow over (z) ⁇ 1 (s) , ⁇ right arrow over (z) ⁇ 2 (s) ) on the surface of detector elements 13 ′ and 13 ′′.
- N detline discrete line samples i.e. point pairs ( ⁇ right arrow over (z) ⁇ 1 (s) , ⁇ right arrow over (z) ⁇ 2 (s) ) on the surface of detector elements 13 ′ and 13 ′′.
- Siddon's algorithm R. L. Siddon: Fast calculation of the exact radiological path for a three-dimensional ct array, Medical Physics , Vol. 12. pp. 252-257 (1985)
- N march equidistant points ⁇ right arrow over (l) ⁇ s,r index r runs from 1 to N march —are taken along each line segment s ( ⁇ right arrow over (z) ⁇ 1 (s) , ⁇ right arrow over (z) ⁇ 2 (s) ) and the line integral is preferably evaluated with trapezoidal quadrature.
- this scheme is applicable not only for piece-wise constant but for other basis-functions as well, its implementation does not need conditional instructions, and is very fast if x a ( ⁇ right arrow over (l) ⁇ s,r ) is read from a 3D texture since tri-linear interpolation is directly supported by the hardware, and the probability that neighboring threads need neighboring voxels is high, which improves cache utilization.
- This approach is similar to Joseph's method (P. M. Joseph: An improved algorithm for reprojecting rays through pixel images, IEEE Transactions on Medical Imaging , Vol. 1, pp. 192-196 (1982)) but may take more than one sample in a voxel.
- the extra samples slightly increases the accuracy since tri-linear basis functions lead to cubic integrands in the line integral.
- the extra samples are not worth being eliminated to improve performance since the SIMD architecture does not prefer different loop cycles in different threads computing different line integrals.
- the error of the estimation of the forward projection depends on the number of line samples N detline and point samples per line N march , and also on the discrepancy of the samples in the five-dimensional space defined by the two two-dimensional spaces of the detector element surfaces and the one-dimensional space of the line (H. Niederreiter: Random number generation and quasi-Monte Carlo methods, SIAM, Pennsylvania, 1992).
- Poisson-disk distributed sample points can be used for sampling each of the detector element surfaces, respectively. Sampling a surface by independent random distribution and by Poisson-disk distribution are both demonstrated in FIGS. 5A and 5B , respectively. It is clear from the figure that the surface of a detector element can be covered in a more uniform way using Poisson-disk distribution. Sets of sample points should be preferably computed only once and the result can be stored in a constant array.
- multiple importance sampling is applied to the computation of photon pair transport, i.e. this phase is of partly scattering type. Consequently input-driven sampling is used besides output-driven sampling.
- input-driven sampling first N v volume points ⁇ right arrow over (v) ⁇ s are obtained with a density that is proportional to the activity. Then, line directions are sampled by placing uniformly distributed hit points ⁇ right arrow over (z) ⁇ 1 on detector element surface D 1 of the detector element being farther from the two modules of coincidence.
- volume point ⁇ right arrow over (v) ⁇ s and surface point ⁇ right arrow over (z) ⁇ 1 unambiguously define a line of direction ⁇ right arrow over ( 107 ) ⁇ s , which identifies detector element pair L by the intersected detector element surface on the closer detector element. Putting just a single sample on each detector element surface D 1 , the expected number of hits in LOR L is
- d i ⁇ ( v -> , ⁇ -> ) N v ⁇ x a ⁇ ( v -> ) ⁇ ⁇ v -> - z -> 1 ⁇ 2 XD 1 ⁇ cos ⁇ ⁇ ⁇ z -> 1 where X is the total activity.
- FIG. 7 shows a voxel 22 which contributes through exemplary lines 20 to many of the detector elements 12 of the detector devices 10 ′, 10 ′′.
- the positron density i.e. the activity is taken into account during sampling.
- the activity is identified at the end of the computation process, thus the activity cannot be taken into account in the density of sampling. Consequently, an input-driven approach is typically more accurate when the activity distribution has a very high variation, for example, when the activity is concentrated in a small region.
- the output-driven method is better than the input-driven in reconstructing colder—low-activity—regions.
- the photon pair transport is detailed with the assumption that there is neither scattering nor random events.
- the proposed invention is compatible with various types of scattering compensation algorithms (M. Magdics, L. Szirmay-Kalos, B. Tóth, ⁇ . Csendesi, and A. Penzov. Scatter estimation for PET reconstruction, in: Proceedings of the 7 th international conference on Numerical methods and applications , NMA '10, pp. 77-86, Berlin. Heidelberg, Springer-Verlag (2011)), random event, dead-time compensation algorithms and can involve physically accurate detector elements or crystals, gamma-photon absorption models and can be extended to take binned time-of-flight data into account.
- Detector elements are parts of planar detector devices or modules and form a 2D discrete grid.
- the detector evaluation step comprises a detector blurring step for describing photon propagation from the surface of a detector element to absorption of the photon, and a detector measurement step for describing an evaluation from the absorption of the photon to the output of detection, wherein the system matrix is factorized into sub-matrices also according to the detector blurring step and the detector measurement step.
- each detector element c is characterized by a detector—or crystal—sensitivity value ⁇ c .
- the detector sensitivity value is the expected number of events reported in this detector element or crystal by the output of the measuring system, provided that a photon has been absorbed here. It is assumed that this parameter is available for every detector elements, or can be obtained by indirect measurements.
- Scattering inside the detector elements can be modeled by a crystal transport probability p i ⁇ c ( ⁇ right arrow over ( 107 ) ⁇ ) that specifies the conditional probability that a photon is absorbed in detector element c provided that it arrives at the surface of detector element i from direction ⁇ right arrow over ( 107 ) ⁇ as demonstrated in FIG. 8 .
- the sum of the crystal transport probabilities is the detection probability, i.e. the probability that the photon does not get lost, or from a different point of view, the photon does not leave the module without absorption:
- the detector response calculation is a convolution operation in four-dimensional LOR space.
- the space of the LORs is four-dimensional since a LOR is determined by its two end points, each given by a coordinate pair on the detector surfaces.
- the detector elements of an exemplary PET system are LYSO crystals, the average free path of gamma-photons of 511 keV is about 5 mm, while detector elements are packed at about 1.12 mm distance, thus with non-negligible probability, photons of larger incident angles can be absorbed farther than the tenth neighbor of the detector element where the photon entered the detector device.
- the convolution of the above expression should cover at least 20 ⁇ 20 detector elements on both ends of the LOR, which makes the number of terms to be summed greater than 20 4 .
- the prohibitive cost of such large LOR convolution cannot be reduced by filtering in the frequency domain since the expression also depends on the incident directions, and therefore spatially variant in LOR space.
- Relaxed sample sets are preferably pre-generated sample sets containing N o samples by minimizing the difference between their distribution and the real distribution, which is measured or calculated with a transport Monte Carlo simulation off-line. These pre-generated samples are preferably used in the detector blurring step. Thanks to the translation invariance, it is sufficient to carry out the measurement or the off-line simulation and sample generation only once and use the same set of samples in each LOR.
- LORs L(i(N o +1), j(N o +1)), . . . , L(i(N o +N i ), j(N o +N i )) are sampled proportionally to ⁇ tilde over (y) ⁇ L surf , where LOR L(i, j) is selected with density
- the estimator of the detector response is:
- the back projection considers the ratios of measured and estimated LOR values and executes the steps of forward projection backwards in reverse order to update the voxel estimates.
- the detector response phase computes LOR values from LOR values, and the positron range phase voxel values from voxel values, their backwards execution is relatively simple.
- the reversed detector response operation multiplies the ratios of measured and estimated LOR values first with the detector element sensitivities and then blurs in LOR space with the detector blurring kernel.
- the critical step of back projection is the geometric step, i.e. the photon transport step, since it computes LORs from voxels in forward projection, and thus voxels from LORs in back projection.
- the geometric back projection must be different from the forward projection.
- the geometric back projection is essential in the iterative reconstruction algorithm but positron range and even the detector response may be skipped in this phase, making forward and back projections unmatching with respect to each other.
- the application of unmatched back projector can reduce the time needed by a single iteration and can improve convergence, but skipping the detector response makes the system less stable and accurate for low-dose and therefore low-statistics measurements.
- the positron transport step and/or the detector response step may be also omitted or other simplified steps can be applied instead of the above detailed positron transport step and detector response step in the forward projection.
- the elimination of these steps result only in that the transport of the positrons and the photon transport in the detectors are not taken into account but this does not alter the type of the input and the output of the forward and back projections, namely, while both the input and output are voxel values in the positron transport step and both the input and output are detector element data in the detector response step.
- Geometric back projection computes the scaling factors of voxels from the ratios of measured and computed LOR values as described by the back projection equation introduced above, thus a gathering approach leads to a parallelization scheme where a thread is responsible for computing the update ratio of a single voxel from all relevant LORs.
- the elements in the column of the system matrix associated with the current voxel are estimated by taking discrete position samples inside the voxel. These sample points are generated using a basis function as the density during pre-processing and are used for each voxel. Each detector element pair is considered and the surface of detector elements on the farther detector device is projected onto the plane of the other detector element via position samples. Note that with a fixed position sample, central projection is a homogeneous linear function whose parameters need to be computed only once for every detector element pair. Central projection maps the rectangle onto another rectangle if modules are parallel and to a trapezoid if only the axial detector element edges are parallel but the tangential edges are not.
- FIG. 9 The figure shows a single computational thread of the geometric back projection corresponding to LOR L, which takes point samples ⁇ right arrow over (v) ⁇ and projects the surface of a detector element 13 ′′ of the farther detector device 10 ′′ onto the plane of closer detector device 10 ′ to obtain the solid angles of LORs.
- the figure shows a single point sample in ⁇ right arrow over (v) ⁇ and a pair of parallel detector devices 10 ′, 10 ′′, but the thread processes all detector element pairs and all point samples associated with this voxel. If detector devices are not parallel, then the central projection of a rectangle is a trapezoid.
- the system matrix element can be computed from the solid angle, the length of the line segment inside the voxel, and the total attenuation of the line between two detector elements.
- the attenuation is primarily caused by Compton scattering.
- the result of the attenuation is that some of the particles scatter out from a specific LOR.
- the attenuation can be preferably taken into account in some embodiments of the reconstruction method according to the invention.
- the projection of the farther detector element can cover several neighboring detector elements partly on the closer detector device.
- the contribution of the farther detector device can be estimated by the contribution of every contributing detector elements by weighting.
- the expectation maximization algorithm used in the iteration steps in preferred embodiments of the method according to the invention—is known to be ill-conditioned, which means that enforcing the maximization of a likelihood function may result in a solution with drastic oscillations and noisy behavior.
- This problem can be attacked by regularization methods that include additional information (P. J. Green: Bayesian reconstructions from emission tomography data using a modified EM algorithm, IEEE Trans. Med. Im ., Vol. 9, pp. 84-93 (1990)), e.g. as a penalty term.
- An appropriate penalty term is the total variation (TV) of the solution (M. Persson, D. Bone, H. Elmqvist: Total variation norm for three-dimensional iterative reconstruction in limited view angle tomography, Physics in Medicine and Biology , Vol. 46, pp. 853-866 (2001)).
- System matrix elements at least some of matrix elements of the sub-matrices—are preferably computed on-the-fly by a numerical quadrature, for which discrete random points are used as detailed above.
- the method according to the invention can preferably use two types of error reduction techniques, stochastic iteration (L. Szirmay-Kalos: Stochastic iteration for non-diffuse global illumination. Computer Graphics Forum , Vol 18, pp. 233-244 (1999)), which is a filtering executed in the time-domain, and filtered sampling (J. Krivánek, M. Colbert: Real-time shading with filtered importance sampling. Computer Graphics Forum , Vol. 27, pp. 1147-1154 (2008)), which is a filtering executed in the spatial domain. Both filtering schemes improve the accuracy without essentially increasing the computation time.
- Stochastic iteration uses different samples in different iteration steps making sure that the expectation of the error is zero, which results in computed LOR responses ⁇ tilde over (y) ⁇ L that fluctuate around their expected value, and averages recent forward projection estimates during the iteration sequence. Concerning accuracy, stochastic iteration is similar to iterating with the average of the system matrices, i.e. with the matrix that would be computed using more samples.
- filtered sampling replaces the integrand of forward projection by another function that has a similar integral but a smaller variation, therefore, its integral can be estimated more precisely.
- the control loops of classical iteration and with the application of filtered sampling are shown in FIG. 10 and FIG. 11 , respectively.
- forward projection is not ideal and the variation of its input poses problems, the variation of the input can be reduced with the added filter.
- the performance of the different embodiments of the method according to the invention is demonstrated by showing some exemplary results.
- the presented examples are produced with the CUDA Implementation that runs on NVidia GeForce GTX 480 GPUs.
- the simulated or measured data have been obtained assuming a known nanoPET/CT system (NanoPET/CT in-vivo preclinical imager, 2010, http://www.bioscan.com/molecular-imaging/nanopet-ct) that consists of twelve square detector devices or modules consisting of 81 ⁇ 39 crystals detectors, i.e. detector elements.
- the exemplary detector elements are of surface size 1.12 ⁇ 1.12 mm 2 .
- registered CT data can also be relied on, which can be used in object and material dependent positron range and attenuation/scattering computations.
- the mathematical phantoms also make it possible to test and evaluate the factored steps separately.
- physical effects are turned off and on to produce ideal geometric measurement and data simulating physical effects as well.
- the LYSO crystals—i.e. the detector elements—of nanoPET/CT are replaced in the GATE by a very dense material, back-to-back gamma sources are placed in the voxels, and the phantom attenuation and scattering are turned off.
- the detector model is made realistic, i.e.
- LYSO crystals are specified for GATE, isotope sources are defined, and attenuation and scattering are also allowed to happen. From these physical phenomena, positron range and detector response are critical in small animal PET, attenuation has a moderate effect and scattering just a minor one.
- the off-axis point source is expected to favor the input-driven sampling, the homogeneity phantom the output-driven sampling, while the Derenzo is a typical case in between the two previous extreme cases.
- FIGS. 13A-13C the Cross Correlation (CC) error of the reconstruction is shown as the function of the computation time for the three above introduced phantoms, respectively.
- the definition of CC is given at http://mathworld.wolfram.com/CorrelationCoefficient.html and denoted by r.
- the number r is the cross-correlation coefficient of a currently obtained reconstruction and the original phantom.
- FIGS. 13A-13C the performances of fully output-driven and fully input-driven embodiments are compared to an embodiment combining the output-driven and input-driven schemes (Multiple Importance Sampling).
- the data corresponding to output-driven, input-driven and combined schemes are denoted by continuous, long-dashed and short-dashed lines, respectively.
- different parameters are used in the different embodiments of the reconstructions.
- the number of line samples N detline and the number of steps N march can be reduced, and relatively few N v volume points should be added to compensate the missing samples at important regions.
- 10 4 added volume samples seem satisfactory. The reason is that at high activity, point like features, a few samples can result in accurate projections, while at larger homogeneous regions, the output-driven method already does a fairly good job, which is not compromised by the added rare volume samples.
- the random selection and projection of 10 4 volume samples onto 180 million LORs needs just 0.3 seconds on the GPU, which is negligible with respect to the times of other processing steps.
- FIGS. 14A and 14B demonstrate the reconstruction result of a physical measurement taken by a known nanoPET/CT system.
- the target resolution is 324 ⁇ 315 ⁇ 315 voxels.
- the difference of the reconstruction methods performed for obtaining FIGS. 14A and 14B is that the result shown in FIG. 14A is obtained by an embodiment of the invention comprising the above detailed photon transport step but not comprising the above detailed detector response step, while both that photon transport and detector response steps are utilized for the reconstruction result shown in FIG. 14B .
- the detector response was computed with 64 output-driven samples in forward projection.
- the forward and back projections are unmatching since in the detector response step of the back projection a simplified detector model is used which is proposed in M. Magdics, B. Tóth, L. Szécsi, B. Csébfalvi, L. Szirmay-Kalos, ⁇ .
- Gaussian filtering is applied to the reconstructed volume between iteration steps with standard deviation equal to the double of the linear size of a voxel. Stochastic iteration was also turned on after the fifth iteration step, i.e. took statistically independent sets of random samples and computed the average of expected LOR hits. Total variation regularization is applied during the iteration.
- the positron transport step is not carried out in the reconstruction of the FIGS. 14A and 14B .
- Table 1 presents the execution times of the factored steps needed to execute a single EM (Expectation Maximization) iteration and to process a subset in OSEM (Ordered Subset Expectation Maximization), when reconstruction is performed at 128 3 and 256 3 voxel resolutions, respectively.
- the illustrated reconstruction times correspond to an unmatched back projection scheme, i.e. detector response simulation and positron range simulation are not taken into account in the course of the back projection. Note that the geometric back projection takes longer than forward projection and dominates the computation time, which is due to the fact that the voxel driven, i.e.
- output-driven, back projection visits all relevant LORs for each voxel in a thread, but the LOR data structure is essentially a four-dimensional array (an array element is identified by the four indices of the four spatial coordinates determines a certain LOR) which cannot be as efficiently placed in textures as 3D voxels.
- forward projection is LOR driven in connection with the data shown in the table, i.e. similarly output-driven as the back projection.
- all relevant voxels are accessed for each LOR in a thread, and the 3D texture cache can be highly utilized.
- more than one GPU card can be used.
- a single GPU card maintains just a portion of the voxel array and computes the geometric forward and back projections as if only these voxels existed. Between the forward and back projections, LOR images are exchanged and every GPU adds the LOR values of other cards to its own result.
- This multi-GPU parallelization linearly speeds up geometric forward and back projections and its overhead is only the time needed for the exchange and the addition of the computed LOR values.
Abstract
Description
where xv is a Vth voxel value. Voxel values are elements of vector x=(x1, x2, . . . ), which has Nvoxel components. In a reconstruction method these voxel values are the unknown coefficients which have to be found based on measurement data. In the above expression bv({right arrow over (v)}) functions are pre-defined basis functions for each voxel.
where {tilde over (y)}D is the expected number of hits for conceptual detector D corresponding to a certain x({right arrow over (v)}) emission density data, T({right arrow over (v)}→D) is the system sensitivity, i.e. the probability that an emission in point {right arrow over (v)} is detected by the conceptual detector D, □ is the volume of interest, and
is an element of the system matrix A which connects the conceptual detector D and voxel V. The difference between yD and {tilde over (y)}D should be emphasized; yD is the measured number of hits for conceptual detector D, i.e. the measurement data and {tilde over (y)}D is the expected number of hits for a given x({right arrow over (v)}) distribution, therefore its value changes during the reconstruction process as x({right arrow over (v)}) changes during the iteration steps.
and back projection
in each of the n=1, 2, . . . iteration steps. Expectation maximization used in preferred embodiments of the invention. As the formula of back projection shows, in back projection the estimation of emission density data is corrected based on the ratio of measured hit numbers and expected hit numbers.
-
- The calculation of a single phase can be much simpler than the complete transport process, thus all conditional statements can be preferably eliminated, which increases GPU efficiency.
- As a computed sample path ended in a virtual detector element is continued by all paths started here in the next phase, a much higher number of sample paths will be available to compute high dimensional integrals, thus the result is more accurate. Note that the number of sample paths increased from 4 to 16, in the arrangements of
FIG. 2A toFIG. 2B . - Each phase is computed in parallel on the GPU where threads do not communicate. However, a next phase can reuse the results of all threads of a previous phase, so redundant computations can be eliminated.
should be reinterpreted in the factored case. When factorization is used, □ is the domain of source distribution considered in the investigated phase, {tilde over (y)}D is the expected number of hits in detector element D of the phase, which can be virtual detector elements, and T({right arrow over (v)}→D) is the probability density of transporting from virtual source point {right arrow over (v)}to virtual detector element D.
is presented below when two different methods, an output-driven i.e. gathering type and an input-driven, i.e. scattering type are used simultaneously. The special case of using just a single—i.e. gathering or scattering—type method is also obtained by setting the sample numbers of other methods to zero.
Then, additionally Ni input-driven, i.e. source oriented samples {right arrow over (v)}N
Instead of these estimators considering just a single sampling strategy, the use of the mixture of the two sample sets is proposed according to an embodiment of the invention to compute the expected number of detector element hits as:
Individual densities di or do do not have to cover the total integration domain, it is enough if at least one of the strategies generates a given source point of non-zero emission with non-zero probability. Note that in the case of forward projection, detector oriented sampling leads to an output-driven, i.e. gathering type algorithm and is thus more efficient on parallel hardware, while source oriented sampling leads to an input-driven, i.e. scattering type algorithm that may result in colliding writes. Number of samples Ni and No must be set to consider the requirements that combined density do({right arrow over (v)}, D)+di({right arrow over (v)}) should be roughly proportional to integrand x({right arrow over (v)})T({right arrow over (v)}→D) and the total computational cost of Ni source oriented and No detector oriented samples should be minimal taking into account the parallel hardware.
-
- 1. Positron transport step for describing the propagation of the positron from a point of origination to a point of annihilation where the positron annihilates and two oppositely directed gamma photons are born. This step computes an annihilation density xa({right arrow over (v)}) from positron emission density x({right arrow over (v)}), as an exemplary emission density data.
- 2. Photon transport step that describes photon propagation from the annihilation point—or point of origination—to a surface of a detector crystal, i.e. detector element. In this step the expected number of hits on the surfaces of the detector elements (LORs), {tilde over (y)}L surf, is computed from the annihilation density.
- 3. Detector response step (detector evaluation step) that describes photon propagation from the surface of the detector element to the output of the detector element. In other words this step is responsible for all phenomena happening in the detector elements, i.e. crystals, including inter-crystal scattering, absorption, and the sensitivity of crystals and electronics.
where P({right arrow over (v)}a−{right arrow over (v)}p) is the probability density that a positron born in {right arrow over (v)}p annihilates in {right arrow over (v)}a. This convolution like expression is valid only if the material is homogeneous.
is obtained. In the expression, ξm({right arrow over (v)}p) is an indicator function that is 1 if in point {right arrow over (v)}p there is material of index m and zero otherwise. As the computational complexity of filtering in spatial domain is proportional to the product of the voxel numbers in the positron density volume and the kernel, spatial filtering gets prohibitively expensive for large kernels. The feasible approach is the execution of the filtering in frequency domain, for which efficient 2D and 3D fast Fourier transformation solutions exist for the GPU (NVIDIA, http://developer.nvidia.com/cuda, in: The CUDA Homepage.).
is obtained. Note that this computation requires the Fourier transforms of the blurring functions computed during pre-processing for each material, the Fourier transformation of the positron density once for each material type (usually two or three), and a single inverse Fourier transformation.
where xa is the annihilation density and do is
where line segment end points {right arrow over (z)}1, {right arrow over (z)}2 and ray marching step size Δls can be unambiguously determined from the line of a place vector {right arrow over (l)}s,r and direction vector {right arrow over (107)}, and from the arrangement of the
where density di is:
where X is the total activity.
With this sample density, the modified output-driven projection and input-driven projection are
respectively. The final estimator is the sum of the two estimators: {tilde over (y)}L surf={tilde over (y)}L 0+{tilde over (y)}L i.
Let us consider a LOR connecting crystals c1 and c2 and having direction {right arrow over (107)}. The expected number of hits in this LOR is:
It is clear from the expression that the detector response calculation is a convolution operation in four-dimensional LOR space. The space of the LORs is four-dimensional since a LOR is determined by its two end points, each given by a coordinate pair on the detector surfaces.
Relaxed sample sets are preferably pre-generated sample sets containing No samples by minimizing the difference between their distribution and the real distribution, which is measured or calculated with a transport Monte Carlo simulation off-line. These pre-generated samples are preferably used in the detector blurring step. Thanks to the translation invariance, it is sufficient to carry out the measurement or the off-line simulation and sample generation only once and use the same set of samples in each LOR.
where Ni is the number of input-driven LOR samples and Y=Σij{tilde over (y)}L(i,j) surf.
Herebelow, the details of a back projection are given as used in some preferred embodiments of the invention. The back projection considers the ratios of measured and estimated LOR values and executes the steps of forward projection backwards in reverse order to update the voxel estimates. As the detector response phase computes LOR values from LOR values, and the positron range phase voxel values from voxel values, their backwards execution is relatively simple. The reversed detector response operation multiplies the ratios of measured and estimated LOR values first with the detector element sensitivities and then blurs in LOR space with the detector blurring kernel.
TABLE 1 | |||
EM | OSEM, 6 subsets |
Factored step | 1283 | 2563 | 1283 | 2563 |
Positron range with 3 | 0.6 |
2 sec | 0.6 |
2 sec |
materials | ||||
Geometric FP (4 line | 7 sec | 58 sec | 1 sec | 9 sec |
samples) | ||||
Geometric BP (1 |
12 sec | 90 |
2 |
15 sec |
per voxel) | ||||
LOR filtering (64 samples) | 9 sec | 9 sec | 1.4 sec | 1.4 sec |
Total | 29 sec | 159 |
5 sec | 27 sec |
Claims (20)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/HU2012/000066 WO2014016626A1 (en) | 2012-07-23 | 2012-07-23 | Method, computer readable medium and system for tomographic reconstruction |
Publications (2)
Publication Number | Publication Date |
---|---|
US20150213630A1 US20150213630A1 (en) | 2015-07-30 |
US9189870B2 true US9189870B2 (en) | 2015-11-17 |
Family
ID=46634466
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/416,533 Active US9189870B2 (en) | 2012-07-23 | 2012-07-23 | Method, computer readable medium and system for tomographic reconstruction |
Country Status (3)
Country | Link |
---|---|
US (1) | US9189870B2 (en) |
EP (1) | EP2875489B1 (en) |
WO (1) | WO2014016626A1 (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104750992B (en) * | 2015-04-01 | 2017-11-28 | 上海联影医疗科技有限公司 | Simulation particle transports and determined the method, apparatus and system of human dose in radiotherapy |
EP3350776B1 (en) | 2016-04-20 | 2021-09-22 | Shanghai United Imaging Healthcare Co., Ltd. | System and method for image reconstruction |
US10311631B2 (en) * | 2017-05-12 | 2019-06-04 | Siemens Healthcare Gmbh | Light path fusion for rendering surface and volume data in medical imaging |
US10403010B2 (en) * | 2017-09-19 | 2019-09-03 | General Electric Company | Methods and systems for reconstructing images |
WO2019057851A1 (en) * | 2017-09-21 | 2019-03-28 | Koninklijke Philips N.V. | A relaxed iterative maximum-likelihood expectation maximization for positron emission tomography random coincidence estimation |
JP6958794B2 (en) * | 2017-12-28 | 2021-11-02 | ボード オブ トラスティーズ オブ ノーザン イリノイ ユニバーシティー | Processing pipeline for immediate particle image reconstruction |
CN110269638B (en) * | 2019-06-25 | 2023-07-25 | 上海联影医疗科技股份有限公司 | Image reconstruction method, system, readable storage medium and apparatus |
US11335038B2 (en) * | 2019-11-04 | 2022-05-17 | Uih America, Inc. | System and method for computed tomographic imaging |
CN112991482B (en) * | 2021-04-12 | 2023-03-24 | 明峰医疗系统股份有限公司 | GPU-based rapid reconstruction imaging method and device and readable storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7840053B2 (en) * | 2007-04-05 | 2010-11-23 | Liao Hstau Y | System and methods for tomography image reconstruction |
Family Cites Families (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5414623A (en) | 1992-05-08 | 1995-05-09 | Iowa State University Research Foundation | Optoelectronic system for implementation of iterative computer tomography algorithms |
US6373059B1 (en) | 2000-10-31 | 2002-04-16 | Ge Medical Systems Global Technology Company, Llc | PET scanner septa |
US7818047B2 (en) | 2001-11-09 | 2010-10-19 | Nova R&D, Inc. | X-ray and gamma ray detector readout system |
US6804325B1 (en) | 2002-10-25 | 2004-10-12 | Southeastern Universities Research Assn. | Method for position emission mammography image reconstruction |
US7015477B2 (en) | 2003-03-27 | 2006-03-21 | Donald Lee Gunter | Filtered backprojection algorithms for compton cameras in nuclear medicine |
US7596202B2 (en) | 2003-12-16 | 2009-09-29 | Koninklijke Philips Electronics N.V. | Imaging method with filtered back projection |
US7693318B1 (en) | 2004-01-12 | 2010-04-06 | Pme Ip Australia Pty Ltd | Method and apparatus for reconstruction of 3D image volumes from projection images |
US7120283B2 (en) | 2004-01-12 | 2006-10-10 | Mercury Computer Systems, Inc. | Methods and apparatus for back-projection and forward-projection |
US7778392B1 (en) | 2004-11-02 | 2010-08-17 | Pme Ip Australia Pty Ltd | Method of reconstructing computed tomography (CT) volumes suitable for execution on commodity central processing units (CPUs) and graphics processors, and apparatus operating in accord with those methods (rotational X-ray on GPUs) |
US7332721B2 (en) | 2005-04-13 | 2008-02-19 | Photodetection Systems, Inc. | Separation of geometric system response matrix for three-dimensional image reconstruction |
WO2006119085A2 (en) | 2005-04-29 | 2006-11-09 | The Regents Of The University Of California | Integrated pet-mri scanner |
GB0512252D0 (en) | 2005-06-16 | 2005-07-27 | Hammersmith Imanet Ltd | Method of and software for conducting motion correction in tomographic scanning and system for tomographic scanning using the method |
US7333107B2 (en) | 2005-08-18 | 2008-02-19 | Voxar Limited | Volume rendering apparatus and process |
US7211799B2 (en) | 2005-09-14 | 2007-05-01 | General Electric Company | Method and system for calibrating a time of flight positron emission tomography system |
US8687869B2 (en) | 2005-11-30 | 2014-04-01 | The Research Foundation Of State Of University Of New York | System and method for acceleration of image reconstruction |
WO2007095312A2 (en) | 2006-02-13 | 2007-08-23 | University Of Chicago | Image reconstruction from limited or incomplete data |
US8314796B2 (en) | 2006-02-24 | 2012-11-20 | The Board Of Trustees Of The Leland Stanford Junior University | Method of reconstructing a tomographic image using a graphics processing unit |
US7769217B2 (en) | 2006-03-06 | 2010-08-03 | Siemens Medical Solutions Usa, Inc. | Fast iterative 3D PET image reconstruction using a set of 2D linogram transformations |
US20070221855A1 (en) | 2006-03-22 | 2007-09-27 | Jinhun Joung | Scintillation crystal surface treatment |
US8442353B2 (en) | 2006-08-03 | 2013-05-14 | The Regents Of The University Of California | Incorporation of mathematical constraints in methods for dose reduction and image enhancement in tomography |
US20080095300A1 (en) | 2006-10-05 | 2008-04-24 | General Electric Company | System and method for iterative reconstruction using parallel processing |
US7381960B1 (en) | 2006-11-11 | 2008-06-03 | National Tsing Hua University | Imaging system and method for the non-pure positron emission tomography |
US7759647B2 (en) | 2007-01-24 | 2010-07-20 | Siemens Medical Solutions Usa, Inc. | PET imaging system with APD-based PET detectors and three-dimensional positron-confining magnetic field |
US7856129B2 (en) | 2007-03-09 | 2010-12-21 | Siemens Medical Solutions Usa, Inc. | Acceleration of Joseph's method for full 3D reconstruction of nuclear medical images from projection data |
US7876941B2 (en) | 2007-03-09 | 2011-01-25 | Siemens Medical Solutions Usa, Inc. | Incorporation of axial system response in iterative reconstruction from axially compressed data of cylindrical scanner using on-the-fly computing |
CA2631004C (en) | 2007-05-09 | 2016-07-19 | Universite De Sherbrooke | Image reconstruction methods based on block circulant system matrices |
US7873236B2 (en) | 2007-08-28 | 2011-01-18 | General Electric Company | Systems, methods and apparatus for consistency-constrained filtered backprojection for out-of-focus artifacts in digital tomosythesis |
WO2009137794A2 (en) | 2008-05-08 | 2009-11-12 | The Johns Hopkins University | Real-time dose computation for radiation therapy using graphics processing unit acceleration of the convolution/superposition dose computation method |
US8000513B2 (en) | 2008-09-22 | 2011-08-16 | Siemens Medical Solutions Usa, Inc. | System and method for 3D time of flight PET forward projection based on an exact axial inverse rebinning relation in fourier space |
US8781198B2 (en) | 2008-10-10 | 2014-07-15 | Koninklijke Philips N.V. | High contrast imaging and fast imaging reconstruction |
US8698087B2 (en) | 2008-11-03 | 2014-04-15 | The Trustees Of The University Of Pennsylvania | Limited angle tomography with time-of-flight PET |
US20100128046A1 (en) | 2008-11-26 | 2010-05-27 | Microsoft Corporation | Parallel poisson disk sampling |
WO2011036436A1 (en) | 2009-09-22 | 2011-03-31 | Isis Innovation Limited | X-ray imaging |
US8076644B2 (en) | 2009-12-02 | 2011-12-13 | General Electric Company | Methods and systems for determining a medical system alignment |
US20110164031A1 (en) | 2010-01-06 | 2011-07-07 | Kabushiki Kaisha Toshiba | Novel implementation of total variation (tv) minimization iterative reconstruction algorithm suitable for parallel computation |
US9111381B2 (en) | 2010-01-27 | 2015-08-18 | Koninklijke Philips N.V. | Shift-varying line projection using graphics hardware |
-
2012
- 2012-07-23 US US14/416,533 patent/US9189870B2/en active Active
- 2012-07-23 WO PCT/HU2012/000066 patent/WO2014016626A1/en active Application Filing
- 2012-07-23 EP EP12743764.8A patent/EP2875489B1/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7840053B2 (en) * | 2007-04-05 | 2010-11-23 | Liao Hstau Y | System and methods for tomography image reconstruction |
Non-Patent Citations (4)
Title |
---|
European Patent Office, International Search Report and Written Opinion of the International Searching Authority, Mar. 21, 2013. |
L. Szipmay-Kalos et al, Gamma Photon Transport on the GPU for PET, Jun. 4, 2009, Large-Scale Scientific Computing, Springer Berlin Heidelberg, Berlin, Heidelberg. |
Milan Magdics et al, Detector Modeling Techniques for Pre-Clinical 3D PET Reconstruction on the GPU, Proceedings of the 11th International Meeting on Fully 3D Image Reconstruction in Radiology and Nuclear Medicine, Jul. 11, 2011. |
Milan Magdics et al, TeraTomo project: a fully 3D GPU based reconstruction code for exploiting the imaging capability of the NanoPET/CT system, 2010 World Molecular Imaging Congress, Sep. 11, 2010. |
Also Published As
Publication number | Publication date |
---|---|
EP2875489B1 (en) | 2016-08-24 |
WO2014016626A1 (en) | 2014-01-30 |
EP2875489A1 (en) | 2015-05-27 |
US20150213630A1 (en) | 2015-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9189870B2 (en) | Method, computer readable medium and system for tomographic reconstruction | |
Cui et al. | Fully 3D list‐mode time‐of‐flight PET image reconstruction on GPUs using CUDA | |
Pratx et al. | GPU computing in medical physics: A review | |
Reader et al. | Advances in PET image reconstruction | |
Vandenberghe et al. | Iterative reconstruction algorithms in nuclear medicine | |
Lewitt et al. | Overview of methods for image reconstruction from projections in emission computed tomography | |
Kachelriess et al. | Hyperfast parallel‐beam and cone‐beam backprojection using the cell general purpose hardware | |
Pratx et al. | Fast, accurate and shift-varying line projections for iterative reconstruction using the GPU | |
Chou et al. | A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction | |
US8620054B2 (en) | Image reconstruction based on accelerated method using polar symmetries | |
Pedemonte et al. | GPU accelerated rotation-based emission tomography reconstruction | |
Markiewicz et al. | NiftyPET: a high-throughput software platform for high quantitative accuracy and precision PET imaging and analysis | |
Matenine et al. | GPU‐accelerated regularized iterative reconstruction for few‐view cone beam CT | |
US9111381B2 (en) | Shift-varying line projection using graphics hardware | |
US20120288176A1 (en) | Method and apparatus for estimating monte-carlo simulation gamma-ray scattering in positron emission tomography using graphics processing unit | |
Liu et al. | GPU-based branchless distance-driven projection and backprojection | |
Herraiz et al. | GPU-based fast iterative reconstruction of fully 3-D PET sinograms | |
CN101278318A (en) | Method and system for PET image reconstruction using a surrogate image | |
Schellmann et al. | Parallel medical image reconstruction: from graphics processing units (GPU) to grids | |
KR101283266B1 (en) | Method and apparatus for monte-carlo simulation gamma-ray scattering estimation in positron emission tomography using graphics processing unit | |
Bexelius et al. | Implementation of GPU accelerated SPECT reconstruction with Monte Carlo-based scatter correction | |
Ha et al. | A GPU-accelerated multivoxel update scheme for iterative coordinate descent (ICD) optimization in statistical iterative CT reconstruction (SIR) | |
Schellmann et al. | Cost-effective medical image reconstruction: from clusters to graphics processing units | |
Pratx et al. | 3-D tomographic image reconstruction from randomly ordered lines with CUDA | |
Chou et al. | Accelerating image reconstruction in dual-head PET system by GPU and symmetry properties |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: MEDISO ORVOSI BERENDEZES FEJLESZTO ES SZERVIZ KFT. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SZIRMAY-KALOS, LASZLO;MAGDICS, MILAN;TOTH, BALAZS GYORGY;REEL/FRAME:035122/0723 Effective date: 20150126 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2551); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 4 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2552); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 8 |