WO2000030021A1 - System for detection of malignancy in pulmonary nodules - Google Patents

System for detection of malignancy in pulmonary nodules Download PDF

Info

Publication number
WO2000030021A1
WO2000030021A1 PCT/US1999/025998 US9925998W WO0030021A1 WO 2000030021 A1 WO2000030021 A1 WO 2000030021A1 US 9925998 W US9925998 W US 9925998W WO 0030021 A1 WO0030021 A1 WO 0030021A1
Authority
WO
WIPO (PCT)
Prior art keywords
nodule
neural network
training
artificial neural
mechanism configured
Prior art date
Application number
PCT/US1999/025998
Other languages
French (fr)
Other versions
WO2000030021A9 (en
Inventor
Kunio Doi
Katsumi Nakamura
Original Assignee
Arch Development Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Arch Development Corporation filed Critical Arch Development Corporation
Priority to JP2000582958A priority Critical patent/JP2002530133A/en
Priority to US09/830,574 priority patent/US6738499B1/en
Priority to EP99958770A priority patent/EP1129426A4/en
Priority to AU16064/00A priority patent/AU1606400A/en
Publication of WO2000030021A1 publication Critical patent/WO2000030021A1/en
Publication of WO2000030021A9 publication Critical patent/WO2000030021A9/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/69Microscopic objects, e.g. biological cells or cellular parts

Definitions

  • the invention relates generally to a method and system for the computerized analysis of radiographic images, and more specifically, to the determination of the likelihood of malignancy in pulmonary nodules using artificial neural networks (ANNs).
  • ANNs artificial neural networks
  • the present invention generally relates to computerized techniques for automated analysis of digital images, for example, as disclosed in one or more of U.S. Patents 4,839,807; 4,841,555; 4,851,984; 4,875,165; 4,907,156; 4,918,534; 5,072,384; 5,133,020; 5,150,292; 5,224,177; 5,289,374; 5,319,549; 5,343,390; 5,359,513; 5,452,367; 5,463,548; 5,491,627; 5,537,485; 5,598,481; 5,622,171; 5,638,458; 5,657,362; 5,666,434; 5,673,332; 5,668,888; 5,740,268; 5,790,690; and 5,832,103; as well as U.S.
  • the present invention includes the use of various technologies referenced and described in the above-noted U.S. Patents and Applications, as well as described in the references identified in the appended APPENDIX and cross-referenced throughout the specification by reference to the corresponding number, in brackets, of the respective references listed in the APPENDIX, the entire contents of which, including the related patents and applications listed above and the references listed in the APPENDIX, are incorporated herein by reference. Discussion of the Background
  • solitary pulmonary nodule is a common finding on a chest radiograph
  • the differential diagnosis of a solitary pulmonary nodule or chest radiograph is often a difficult task for radiologists [1-8].
  • a solitary pulmonary nodule may be the first sign of lung cancer, especially in its early stage, most patients undergo a further diagnostic evaluation that may include an imaging study with computed tomography (CT) [1].
  • CT computed tomography
  • Malignant diseases are estimated to occur in about 20% of patients with solitary pulmonary nodules in the population [9].
  • the majority of radiographically detected pulmonary nodules are benign [3,8,10-14].
  • CT has become a major diagnostic method to differentiate pulmonary nodules in recent years, a large number of CT examinations have been performed for benign cases that were suspected of being malignant.
  • a survey was conducted to obtain estimates for the relative numbers (percentages) of malignant and benign cases that were performed for chest CT study under the investigation of a solitary pulmonary nodule. The survey was performed at the University of Chicago Hospital and at four Hospitals in Japan (University of Occupational and Environmental Health Hospital, Fukuoka; Nagasaki University Hospital, Nagasaki; Iwate Prefectural Central Hospital, Morioka; and Tokyo Metropolitan Hospital, Tokyo).
  • pre-CT diagnosis included a lung nodule/lung mass
  • some of the cases may have involved suspected benign diseases; however, it was assumed that most of these cases involved suspected malignancy.
  • Table 1 shows the summary of the survey on the final diagnosis of solitary pulmonary nodules which underwent chest CT. Fifty-five out of 133 cases (or 41.4%) indicated malignant nodules including primary lung cancer and pulmonary metastases. Sixty-four cases (48.1%) indicated benign conditions including benign diseases and "negative" cases that had no apparent lung abnormality as a result of CT examination. Fourteen cases had inconclusive final diagnoses. The results obtained in this survey show that a large fraction of patients who underwent chest CT examination were ultimately identified as having benign conditions. Accordingly, some of the CT examinations may have been avoided if these benign conditions were diagnosed accurately and/or confidently on the initial chest radiographs.
  • Computer schemes capable of providing objective information on the nature of pulmonary nodules may aid radiologists in their classification of pulmonary nodules.
  • Various computerized schemes have been investigated for characterizing pulmonary nodules.
  • Another object of the present invention is to provide a method and system for implementing a computer-aided diagnostic (CAD) technique to assist radiologists in distinguishing benign and malignant lung nodules.
  • CAD computer-aided diagnostic
  • Another object of this invention is to provide a method and system for assisting radiologists in accurately identifying benign pulmonary nodules.
  • Techniques include the use of ANNs to merge subjective features extracted by radiologists to determine the likelihood of malignancy of solitary pulmonary nodules. Additional techniques include computerized extraction of objective measures of lung nodules that are correlated to the subjective features seen by radiologists, and the use of ANNs to estimate the likelihood of malignancy by merging the objective measures. The performance of the ANNs is evaluated by means of receiver operating characteristic (ROC) analysis. The performance of radiologists is evaluated in classifying benign and malignant nodules for comparison with the computerized methods.
  • ROC receiver operating characteristic
  • the present invention thus addresses the problems associated with the conventional diagnosis of pulmonary nodules.
  • the method and system of the invention using ANNs to merge subjective data obtained manually or objective data obtained with automated techniques, is thus able to estimate the likelihood of malignancy. This estimate assists radiologists in confidently and accurately identifying benign nodules, thereby helping to reduce the number of unnecessary CT examinations (i.e., CT examinations performed on patients with benign nodules).
  • Figure 1(a) is a flowchart of the method used to analyze the likelihood of malignancy in pulmonary nodules according to the present invention
  • Figure 1(b) is an image of a score sheet 10 to be used by radiologists to subjectively characterize lung nodules;
  • Figures 2(a) and 2(b) are respective images of (a) a malignant nodule and (b) a benign nodule;
  • Figure 2(c) is an image of a hand-drawn outline delineating the malignant nodule of Figure 2(a);
  • Figure 2(d) is an image of a hand-drawn outline delineating the benign nodule of Figure 2(b);
  • Figure 2(e) is an image of an ellipse calculated to fit the outline of the malignant nodule delineated (extracted) in Figure 2(c);
  • Figure 2(f) is an image of an ellipse calculated to fit the outline of the benign nodule delineated (extracted) in Figure 2(d);
  • Figure 3(a) is a one-dimensional representation of the nodule outline of Figure 2(c) as the distance from sample points distributed with equal intervals on the calculated ellipse of Figure 2(e);
  • Figure 3(b) is a one-dimensional representation of the nodule outline of Figure 2(d) as the distance from sample points distributed with equal intervals on the calculated ellipse of Figure 2(f);
  • Figure 4(a) is an illustration of an artificial neural network (ANN) for determining the likelihood of malignancy in a pulmonary nodule in accordance with the present invention
  • Figures 4(b), 4(c), and 4(d) are respective graphs, for attending radiologists (4(b)), radiology residents (4(c)), and all radiologists (4(d)), of receiver operating characteristic (ROC) curves showing how accurately benign and malignant nodules were identified using an ANN trained with selected subjective features and an ANN trained with all subjective features quantified by the respective radiologists.
  • ROC receiver operating characteristic
  • Figures 5(a), 5(b), and 5(c) are respective graphs comparing ROC curves of (a) attending radiologists versus an ANN trained using selected subjective features quantified by the attending radiologists, (b) radiology residents versus an ANN trained using selected subjective features quantified by the radiology residents, and (c) the average performance of all radiologists versus an ANN trained using selected subjective features quantified by all radiologists, in distinguishing between benign and malignant nodules;
  • Figures 6(a) and 6(b) are graphs showing the respective relationships between (a) the root mean square (RMS) value and subjective rating of marginal irregularity, and (b) the standard deviation of pixel values and subjective rating of homogeneity;
  • Figures 7(a), 7(b), 7(c), and 7(d) are graphs showing the respective relationships between (a) effective diameter and degree of ellipticity, (b) standard deviation of pixel values and average pixel value, (c) degree of circularity and RMS variation, (d) average edge gradient and degree of elliptical irregularity, for malignant and benign nodules;
  • Figure 8 is a graph of a series of ROC curves showing that an ANN trained with objective measures quantified by a computer outperforms both radiologists and an ANN trained with subjective features quantified by radiologists, when distinguishing between benign and malignant nodules;
  • Figure 9 is a schematic illustration of a general purpose computer 100 programmed according to the teachings of the present invention.
  • the likelihood of malignancy for a candidate pulmonary nodule may be determined in steps S5 through S7.
  • the selected features are quantified for the candidate nodule.
  • the quantified features are input to the respective input nodes of an ANN.
  • an output unit of the ANN determines the likelihood of malignancy of the candidate nodule based on the quantified features input in step S6.
  • a database was constructed from 56 chest radiographs selected from the cases that were used for development of computerized schemes for detection of lung nodules [29, 30]. Solitary pulmonary nodules larger than 3 cm were excluded in this study. None of the nodules showed calcification using CT, nor were scarlike linear opacities visible. The final diagnosis was established pathologically, and for some benign nodules, a presumed diagnosis of benign etiology was made because of no change or decrease in nodule size over a 2 year period. The 56 chest radiographs included 34 malignant nodules, all of which were pathologically proven primary bronchogenic carcinoma.
  • the radiographs further included 22 benign lesions, 2 of which were classified as pulmonary hamartoma, 12 of which were classified as granuloma, 7 of which were classified as inflammatory lesion, and 1 which was classified as pulmonary infarction.
  • the 56 chest radiographs were obtained in 33 women and 23 men who ranged in age from 24 to 86 years (mean, 58.4 years).
  • Chest radiographs were digitized with a laser scanner (model number 2905 manufactured by Abe Sekkei of Tokyo, Japan) with a pixel size of 0.175 mm and a 10- bit gray scale (1,024 gray levels). In an alternative embodiment, however, a database of more or fewer radiographs is used.
  • the nodules' sizes were measured by using a ruler.
  • the variations in measured nodule sizes between different observers was mainly due to the variation in subjective judgments on nodule edges.
  • Seven radiologists (4 attending radiologists ("attendings") and 3 radiology residents ("residents")) characterized the features of each nodule independently using a score sheet, such as the score sheet 10, with scales from 1 to 5.
  • Nodule shape was scored from round to elongated; marginal irregularity was scored from smooth to irregular; spiculation was scored from non-spiculated to spiculated; border definition was scored from well-defined to ill- defined; lobulation was scored from non-lobulated to lobulated; nodule density (contrast) was scored from low density to high density; homogeneity was scored from homogeneous to inhomogeneous.
  • the score sheet included images of two extreme examples (i.e., an example of a "1" and a "5") for each feature.
  • the diagrams served as a scoring guide for the radiologists; for example, an image of a non-spiculated nodule (i.e., a "1") and a spiculated nodule (i.e., a "5") were provided to help assess the degree of spiculation.
  • Table 2 shows examples of a radiologist's subjective ratings for the eight subjective radiological features for the malignant and benign pulmonary nodules shown in Figures 2(a) and 2(b), respectively. To eliminate the bias of the knowledge of "truth,” all radiologists rated each nodule without knowledge of the correct diagnosis.
  • the following twelve features were selected for computer characterization of pulmonary nodules that were extracted from digitized chest radiographs: effective diameter; degree of circularity; degree of ellipticity; root mean square (RMS) variation; first moment of power spectrum; degree of irregularity; average gradient; radial gradient index (RGI); tangential gradient index (TGI); line enhancement index (LEI); average pixel value; and standard deviation of pixel values.
  • RMS root mean square
  • RMS root mean square
  • first moment of power spectrum degree of irregularity
  • average gradient radial gradient index
  • RGI radial gradient index
  • TGI tangential gradient index
  • LEI line enhancement index
  • average pixel value and standard deviation of pixel values.
  • the objective measures are quantified or determined based on the outline (or contour) of the nodules manually extracted by a first radiologist.
  • Figure 2(c) shows the hand-drawn outlines (i.e., margins) for the malignant nodule of Figure 2(a).
  • Figure 2(d) shows the hand- drawn outlines for the benign nodule of Figure 2(b).
  • a second radiologist independently extracted a second set of nodule outlines to examine the variation in these objective measures derived from the outlines. The radiologists were not informed about correct diagnosis in order to avoid a bias to the outlines.
  • the effective diameter provides an objective measure of the nodule size.
  • the effective diameter [31, 32] is the diameter of an equivalent circle having the same area as that defined by the outline of the nodule.
  • the degree of circularity provides an objective measure of the nodule shape.
  • the degree of circularity is the ratio of the area of the nodule overlapped with the equivalent circle, to the total area of the nodule [31, 32].
  • the definition of marginal irregularity includes two independent factors: the magnitude and the coarseness (or fineness) of irregular edge patterns.
  • the edge pattern is the distance from the nodule outline to the calculated ellipse, as illustrated in Figures 3(a) and 3(b) for the malignant and benign nodules of Figures 2(a) and (b), respectively.
  • the irregular edge pattern was analyzed using Fourier transformation.
  • the RMS variation and the first moment of power spectrum [35] were each determined to provide separate measures of marginal irregularity.
  • the degree of irregularity [32] provides another measure of marginal irregularity.
  • the degree of irregularity is one minus the ratio of (a) the perimeter (i.e., circumference) of the ellipse used to determine degree of ellipticity to (b) the length of the extracted outline.
  • Border definition is quantified by the average gradient, which is obtained by the average edge gradients over a selected border region.
  • the border region is the area between an outer limit defined by the fitted ellipse plus twice the RMS variation and an inner limit defined by the fitted ellipse minus twice the RMS variation.
  • the border region has a width of four times the RMS variation as measured from the center of the calculated ellipse.
  • RGI provides an objective measure of the spiculation of a nodule.
  • the RGI is the average absolute value of the radial edge gradients projected in a direction along the radial direction [42].
  • the radial direction is a line from the center of mass of the calculated ellipse. For each pixel in the border region, the absolute value of the radial edge gradient projected in a direction along the radial direction is determined. The resulting absolute values are summed and averaged to determine the RGI according to the following formula:
  • P is the pixel or image point
  • L is the set of pixels in the border region
  • D x is the gradient in the x-direction
  • D y is the gradient in the y-direction
  • is the angle between the gradient vector and the radial direction.
  • TGI provides another measure of the spiculation of a nodule.
  • the TGI is obtained from a tangential component of the edge gradient at a pixel, which is projected in a direction pe ⁇ endicular to the radial direction. Accordingly, the following formula is used to determine TGI:
  • LEI provides yet another measure of spiculation and indicates the magnitude of line pattern components obtained by means of a line enhancement filter [36], in a direction within 45 degrees from the radial direction.
  • LEI indicates the number of pixels in the border region that form part of a line pattern having a direction within 45 degrees of the radial direction, divided by the total number of pixels in the border region. Whether a pixel forms part of a line pattern (i.e., whether a pixel is a line pattern component) is determined by the line enhancement filter [36].
  • the average pixel value provides a measure of the optical density of a nodule.
  • the average pixel value is the average pixel value (i.e., gray level) over the nodule as defined by the extracted outline.
  • the standard deviation of pixel values over the nodule provides a measure of the homogeneity of the nodule.
  • the standard deviation is the standard deviation of pixel values over the nodule as defined by the extracted outline.
  • Figure 4(a) is a diagram of a three-layer, feed-forward artificial neural network (ANN) trained with a back propagation algorithm [37] and used to determine the likelihood of malignancy for a pulmonary nodule.
  • ANN feed-forward artificial neural network
  • the features input (applied) to the ANN included two clinical parameters (patient's age and gender) and either the eight subjective radiological features quantified by radiologists or selected of the physical measures (objective features) quantified using computer analysis. The same features are used to test and train each ANN.
  • the number of input units equals the number of features input to the ANN.
  • the number of hidden units may also vary but is preferably half the number of input units.
  • Radiologists To evaluate the performance of radiologists in classifying pulmonary nodules, seven radiologists (4 attendings and 3 residents) participated in the observer study. Each observer (i.e., participating radiologist) was presented with a chest radiograph and two clinical parameters (patient's age and gender). Each observer was then asked to provide his or her confidence level regarding the likelihood of malignancy by using a continuous rating scale with a line checking method [29]. Confidence ratings of "definitely benign” and "definitely malignant" were marked above the left and the right end, respectively, of the line. Radiographs were presented in random order. ROC analysis was employed for comparison of the performance of observers with those of the computerized methods in distinguishing between benign and malignant nodules.
  • Table 3 shows the performance index, Az, when distinguishing between benign and malignant nodules, of the ANN trained with image features extracted by each radiologist separately, and by all radiologists combined.
  • Az the performance index
  • the performance of the ANN with features extracted by radiology residents were much lower than that of attending radiologists.
  • the selected features were patient age, nodule size, marginal irregularity, border definition, nodule density (contrast), and homogeneity.
  • ANNs it is generally desirable for ANNs to achieve a high performance even when only a small number of essential input units are applied.
  • 6 features including patient's age, the nodule size, marginal irregularity, spiculation, border definition, and homogeneity were selected.
  • the average Az value of 0.761 for all 7 radiologists with 6 selected subjective features was statistically significantly greater than that (0.710) with all 10 subjective features (p ⁇ 0.0007).
  • Figures 4(a), 4(b), 4(c), and 4(d) show ROC curves obtained by the ANN trained with 10 features (2 clinical parameters (age and gender) and all 8 subjective features) and an ANN trained with 6 features (1 clinical feature (age) and five subjective features (nodule size, marginal irregularity, spiculation, border definition, and homogeneity)) as input data.
  • the average Az value for the ANN with 6 features was greater than those with 10 features.
  • Figures 5(a), 5(b), and 5(c) compare the performances of the radiologists in differentiating pulmonary nodules with the ANN trained with six selected subjective features (1 clinical feature (age) and five subjective features (nodule size, marginal irregularity, spiculation, border definition, and homogeneity)).
  • the average performance of the ANN when trained with the six subjective features obtained from attendings was slightly greater than the average performance of the attendings.
  • the average performance of the ANN when trained with the six subjective features obtained from residents was comparable to the average performance of the residents.
  • Figures 6(a) and 6(b) respectively show (a) the relationships between the RMS value and subjective ratings of marginal irregularity, and (b) the relationships between the standard deviation of pixel values and subjective ratings of homogeneity.
  • the small squares and vertical bars in Figures 6(a) and 6(b) indicate the average and the standard deviation, respectively, of each physical measure obtained from all nodules included in each of five different subjective ratings.
  • Figures 7(a), 7(b), and 7(c) each show the relationships between two selected image features of pulmonary nodules. Although a considerable overlap between the malignant and benign pulmonary nodules is observed in general, there is also a trend, which would discriminate malignant and benign nodules, between the distribution of two groups. For example, in Figure 7(a) malignant nodules tend to have a larger effective diameter and smaller degree of ellipticity than benign nodules. This result appears to agree with general characteristics of lung nodules that malignant nodules are larger than benign nodules, and that the shape of benign nodules tends to be round.
  • Figure 7(d) shows that malignant nodules tend to have a larger average edge gradient and a larger degree of elliptical irregularity than benign nodules, which indicates potential for discriminating benign from malignant nodules. This tendency also agrees with general characteristics that malignant nodules tend to contain spiculations and irregular margins.
  • RMS RMS value
  • FM first moment of power spectrum
  • TGI tangential gradient index
  • RGI radial gradient index
  • LEI line enhancement index
  • AG average gradient
  • STD standard deviation of pixel value
  • APV average pixel value.
  • Figure 8 shows the comparison of ROC curves corresponding to: (1) the ANN with the computer extracted features, (2) the ANN with six selected subjective ratings by radiologists, and (3) the radiologists.
  • ANNs trained with subjective ratings by radiologists was better than that by radiologists themselves in distinguishing benign from malignant pulmonary nodules. Similar results were reported in studies on differential diagnosis of breast cancer [22, 23] and also interstitial lung diseases [40]. This may be because radiologists would not use all of the image features, which were obtained in subjective ratings in this study in their differential diagnosis of solitary pulmonary nodules. For example, due to their knowledge and own experience, a limited number of conspicuous features are likely to affect strongly their decision making in some cases, and radiologists generally tend not to consider all of features systematically. On the other hand, ANNs are affected by all of the data consistently and comprehensively. In addition, ANNs are superior to radiologists in merging large amounts of data.
  • An ANN has a unique capability to learn specific patterns between input and output data if the ANN is trained by examples repeatedly; however, this capability strongly depends on the quality of the input data. In other words, if input data are selected randomly and have no correlation with output data, the ANN is unlikely to learn any specific patterns between input and output data, resulting in a lower performance.
  • images were provided as a guide to radiologists to use the criterion consistently for extracting subjective image features, the ratings are highly subjective, and could be affected strongly by an individual radiologist's knowledge and experience, which may have caused the large variation in the ratings of the radiologists.
  • Another limitation of using subjective ratings for input data to train and test the ANN is that the quality of subjective ratings as input data for the ANN considerably depends on the ability of a radiologist to extract or quantify nodule features.
  • the performance of the ANN with subjective features extracted by radiology residents were much lower than that of attending radiologists, which indicates that less experienced radiologists could not extract the nodule features sufficiently, and consequently, the ANN could not learn well the specific patterns between input data and output data. Therefore, computer-aided diagnostic schemes that can extract nodule features automatically, objectively, and reproducibly are highly desirable.
  • the ANN having input features automatically determined by computer performed better (based on Az) than the average performance of the radiologists. Moreover, the ANN trained with features determined by computer performed better than the ANN trained with subjective features. Although the objective measures were selected initially on the basis of their expected correlation with the subjective features, these objective measures may contain different and/or additional information from the subjective features, which may explain why the objective measures contributed more effectively than the subjective features in distinguishing benign from malignant pulmonary nodules.
  • a goal of the inventive computerized classification scheme for pulmonary nodules is to reduce the number of benign nodules sent for further diagnostic evaluation.
  • the computerized method indicated a higher performance than the average radiologists (based on Az), even though the nodule outline was provided manually by a radiologist.
  • the computerized classification method may provide a useful aid to radiologists in diagnosing/differentiating benign from malignant pulmonary nodules, and therefore, it may be possible to reduce the number of "unnecessary" CT examinations.
  • This invention conveniently may be implemented using a conventional general pu ⁇ ose computer or micro-processor programmed according to the teachings of the present invention, as will be apparent to those skilled in the computer art.
  • Appropriate software can readily be prepared by programmers of ordinary skill based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
  • FIG. 9 is a schematic illustration of a computer system for the computerized analysis of the likelihood of malignancy in pulmonary nodules.
  • a computer 100 implements the method of the present invention, wherein the computer housing 102 houses a motherboard 104 which contains a CPU 106, memory 108 (e.g., DRAM, ROM, EPROM, EEPROM, SRAM, SDRAM, and Flash RAM), and other optional special pu ⁇ ose logic devices (e.g., ASICs) or configurable logic devices (e.g., GAL and reprogrammable FPGA).
  • the computer 100 also includes plural input devices, (e.g., a keyboard 122 and mouse 124), and a display card 110 for controlling monitor 120.
  • the computer 100 further includes a floppy disk drive 114; other removable media devices (e.g., compact disc 119, tape, and removable magneto-optical media (not shown)); and a hard disk 112, or other fixed, high density media drives, connected using an appropriate device bus (e.g., a SCSI bus, an Enhanced IDE bus, or a Ultra DMA bus). Also connected to the same device bus or another device bus, the computer 100 may additionally include a compact disc reader 118, a compact disc reader/writer unit (not shown) or a compact disc jukebox (not shown). Although compact disc 119 is shown in a CD caddy, the compact disc 119 can be inserted directly into CD- ROM drives which do not require caddies.
  • a floppy disk drive 114 other removable media devices (e.g., compact disc 119, tape, and removable magneto-optical media (not shown)); and a hard disk 112, or other fixed, high density media drives, connected using an appropriate device bus (e.g., a
  • the system includes at least one computer readable medium.
  • Examples of computer readable media are compact discs 119, hard disks 112, floppy disks, tape, magneto-optical disks, PROMs (EPROM, EEPROM, Flash EPROM), DRAM, SRAM, SDRAM, etc.
  • the present invention includes software for controlling both the hardware of the computer 100 and for enabling the computer 100 to interact with a human user.
  • Such software may include, but is not limited to, device drivers, operating systems and user applications, such as development tools.
  • Such computer readable media further includes the computer program product of the present invention for performing the inventive method described above, including steps SI through S7 of Figure 1(a).
  • the computer code devices of the present invention can be any inte ⁇ reted or executable code mechanism, including but not limited to scripts, inte ⁇ reters, dynamic link libraries, Java classes, and complete executable programs. Moreover, parts of the processing of the present invention may be distributed for better performance, reliability, and/or cost. For example, an outline or image may be selected on a first computer and sent to a second computer for remote diagnosis.
  • the invention may also be implemented by the preparation of application specific integrated circuits or by interconnecting an appropriate network of conventional component circuits, as will be readily apparent to those skilled in the art.
  • numerous modifications and variations of the present invention are possible in light of the above teachings.
  • the outline of the nodules may be extracted using any available automated technique, rather than manually. It is therefore to be understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.
  • Metz CE ROC methodology in radiologic imaging. Invest Radiol 1986; 21:720 733.
  • Metz CE Herman BA, Shen JH. Maximum-likelihood estimation of receiver operating (ROC) curves from continuously-distributed data. Statistics in Medicine (in press).
  • Anastasio MA Yoshida H. Nagel R. Nishikawa RM, Doi K. A genetic algorithm based method for optimizing the performance of a computer-aided diagnosis scheme for detection of clustered microcalcifications in mammograms.
  • Ashizawa K MacMahon H. Ishida T. Vyborny CJ, Katsuragawa S.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Quality & Reliability (AREA)
  • Radiology & Medical Imaging (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Apparatus For Radiation Diagnosis (AREA)
  • Image Processing (AREA)

Abstract

A method, computer program product, and system (100) for computerized analysis of the likelihood of malignancy in a pulmonary nodule using artificial neural networks (ANNs) (S4). The method, on which the computer program product and the system is based on, includes obtaining a digital outline of a nodule; generating objective measures corresponding to physical features of the outline of the nodule; applying the generated objective measures to an ANN; and determining a likelihood of malignancy of the nodule based on an output of the ANN. Techniques include novel developments and implementations of artificial neural networks and feature extraction for digital images. Output from the inventive method yields an estimate of the likelihood of malignancy (S7) for a pulmonary nodule.

Description

SYSTEM FOR DETECTION OF MALIGANCY IN PULMONARY NODULES
This application claims priority under 35 USC 119(e) to U.S. Provisional application Serial No. 60/108,167, filed November 13, 1998.
The present invention was made in part with U.S. Government support under USPHS Grants CA24806 and CA62625. The U.S. Government has certain rights in this invention.
BACKGROUND OF THE INVENTION Field of the Invention
The invention relates generally to a method and system for the computerized analysis of radiographic images, and more specifically, to the determination of the likelihood of malignancy in pulmonary nodules using artificial neural networks (ANNs).
The present invention generally relates to computerized techniques for automated analysis of digital images, for example, as disclosed in one or more of U.S. Patents 4,839,807; 4,841,555; 4,851,984; 4,875,165; 4,907,156; 4,918,534; 5,072,384; 5,133,020; 5,150,292; 5,224,177; 5,289,374; 5,319,549; 5,343,390; 5,359,513; 5,452,367; 5,463,548; 5,491,627; 5,537,485; 5,598,481; 5,622,171; 5,638,458; 5,657,362; 5,666,434; 5,673,332; 5,668,888; 5,740,268; 5,790,690; and 5,832,103; as well as U.S. patent applications 08/158,388 (PCT Publication WO 95/14431); 08/173,935; 08/220,917 (PCT Publication WO 95/26682); 08/398,307 (PCT Publication WO 96/27846); 08/523,210 (PCT Publication WO 95/15537); 08/536,149; 08/562,087; 08/757,611; 08/758,438; 08/900,191; 08/900,361; 08/900,362; 08/900,188; 08/900,189, 08/900,192; 08/979,623; 08/979,639; 08/982,282; 09/027,468; 09/027,685; 09/028,518; 09/053,798; 09/092,004; 09/098,504; 09/121,719; 09/131,162; 09/141,535; 09/156,413; 60/107,095 (Attorney Docket No. 0730-0060-20PROV, filed November 5, 1998) all of which are incorporated herein by reference.
The present invention includes the use of various technologies referenced and described in the above-noted U.S. Patents and Applications, as well as described in the references identified in the appended APPENDIX and cross-referenced throughout the specification by reference to the corresponding number, in brackets, of the respective references listed in the APPENDIX, the entire contents of which, including the related patents and applications listed above and the references listed in the APPENDIX, are incorporated herein by reference. Discussion of the Background
Although a solitary pulmonary nodule (SPN) is a common finding on a chest radiograph, the differential diagnosis of a solitary pulmonary nodule or chest radiograph is often a difficult task for radiologists [1-8]. Since a solitary pulmonary nodule may be the first sign of lung cancer, especially in its early stage, most patients undergo a further diagnostic evaluation that may include an imaging study with computed tomography (CT) [1]. Malignant diseases are estimated to occur in about 20% of patients with solitary pulmonary nodules in the population [9]. The majority of radiographically detected pulmonary nodules, however, are benign [3,8,10-14].
Although CT has become a major diagnostic method to differentiate pulmonary nodules in recent years, a large number of CT examinations have been performed for benign cases that were suspected of being malignant. A survey was conducted to obtain estimates for the relative numbers (percentages) of malignant and benign cases that were performed for chest CT study under the investigation of a solitary pulmonary nodule. The survey was performed at the University of Chicago Hospital and at four Hospitals in Japan (University of Occupational and Environmental Health Hospital, Fukuoka; Nagasaki University Hospital, Nagasaki; Iwate Prefectural Central Hospital, Morioka; and Tokyo Metropolitan Hospital, Tokyo). At each institution, patients who underwent chest CT examinations for suspicious pulmonary nodules on chest radiographs were assessed regarding pre-CT clinical diagnosis, the final diagnosis, patient's age, and patient's gender. Final diagnosis was established by a pathologic examination or clinical follow-up.
One hundred thirty-three patients (83 male, 50 female) who ranged in age from 25 to 85 (mean, 62.9 years) were identified at the five hospitals. Pre-CT clinical diagnosis consisted of "suspicion of lung cancer" for 43 of the patients, "lung nodule/lung mass" for 70 of the patients, "abnormal shadow" for 10 of the patients, "suspicion of pulmonary metastasis" for 6 of the patients, and benign diseases for four of the patients ("suspicion of pulmonary tuberculosis" for one patient, "suspicion of pulmonary abscess" for two patients, and "suspicion of pulmonary aspergillosis" for one patient). In the cases in which pre-CT diagnosis included a lung nodule/lung mass, some of the cases may have involved suspected benign diseases; however, it was assumed that most of these cases involved suspected malignancy. Table 1 shows the summary of the survey on the final diagnosis of solitary pulmonary nodules which underwent chest CT. Fifty-five out of 133 cases (or 41.4%) indicated malignant nodules including primary lung cancer and pulmonary metastases. Sixty-four cases (48.1%) indicated benign conditions including benign diseases and "negative" cases that had no apparent lung abnormality as a result of CT examination. Fourteen cases had inconclusive final diagnoses. The results obtained in this survey show that a large fraction of patients who underwent chest CT examination were ultimately identified as having benign conditions. Accordingly, some of the CT examinations may have been avoided if these benign conditions were diagnosed accurately and/or confidently on the initial chest radiographs.
Table 1
Institution Malignant Benign Benign Negative Unknown Total
Lesion
A 9 (32.0%) 15 (53.6%) 11 4 4 28 (100%)
B 8 (25.8%) 23 (74.2%) - - 0 31 (100%)
C 11 (40.7%) 11 (40.7%) 3 8 5 27 (100%)
D 6 (35.3%) 10 (58.8%) - - 1 17 (100%)
E 21 (70.0%) 5 (16.7%) - 4 30 (100%)
Total 55 (41.4%) 64 (48.1%) 14 133 (100%)
Computer schemes capable of providing objective information on the nature of pulmonary nodules may aid radiologists in their classification of pulmonary nodules. Various computerized schemes have been investigated for characterizing pulmonary nodules.
In most of these studies, however, radiographic features were manually extracted, and the computer was used only to determine the likelihood of malignancy by merging image features using rule-based or discriminant analysis. Swensen et al. [15] estimated the probability of malignancy in radiologically indeterminate SPNs by using multivariate logistic regression. They concluded that three clinical parameters (age, cigarette-smoking status, and history of cancer) and 3 radiological features (diameter, spiculation, and upper lobe location) were independent predictors of malignancy. Cummings et al. [16] estimated the probability of malignancy of pulmonary nodules by using Bayesian analysis based on the diameter of an SPN, the patient's age, history of cigarette smoking, and the prevalence of malignancy in SPNs. Gurney [17,18] also used Bayesian analysis to calculate the probability of malignancy, which was compared with the performance of radiologists.
Other investigators have used computer-extracted features to differentiate between malignant and benign lung nodules. Sherrier et al. [19] applied a gradient analysis for distinguishing benign nodules from malignant nodules, and they presented that benign calcified granuloma showed greater gradient number than malignant nodules. Sasaoka et al. [20] extracted nodule features using a computerized method. However, the extracted features, such as density gradient and density entropy, were not directly correlated with specific radiological findings, and thus the meaning of these features is inconclusive. Recently, artificial neural networks (ANNs) have been used in the field of diagnostic radiology to provide a potentially powerful classification tool [21-27]. Gurney et al. [28] reported that the Bayesian method was better than the neural network in the prediction of the probability of malignancy in pulmonary nodules. Despite these considerable efforts, a computerized scheme has not been applied in clinical situations to assist radiologists in their interpretation of malignancy of pulmonary nodules.
SUMMARY OF THE INVENTION
Accordingly, an object of this invention is to provide a new and improved method and system for the analysis of the likelihood of malignancy in solitary pulmonary nodules using artificial neural networks.
Another object of the present invention is to provide a method and system for implementing a computer-aided diagnostic (CAD) technique to assist radiologists in distinguishing benign and malignant lung nodules.
Another object of this invention is to provide a method and system for assisting radiologists in accurately identifying benign pulmonary nodules.
These and other objects are achieved according to the invention by providing (1) a new and improved method, (2) a storage medium storing a program for performing the steps of the method, and (3) a system for analyzing nodules. The method, on which a computer program product and system is based, includes obtaining a digital outline of a nodule; generating objective measures corresponding to physical features of the outline of the nodule; applying the generated objective measures to an artificial neural network (ANN); and determining a likelihood of malignancy of the nodule based on an output of the ANN.
Techniques include the use of ANNs to merge subjective features extracted by radiologists to determine the likelihood of malignancy of solitary pulmonary nodules. Additional techniques include computerized extraction of objective measures of lung nodules that are correlated to the subjective features seen by radiologists, and the use of ANNs to estimate the likelihood of malignancy by merging the objective measures. The performance of the ANNs is evaluated by means of receiver operating characteristic (ROC) analysis. The performance of radiologists is evaluated in classifying benign and malignant nodules for comparison with the computerized methods.
The present invention thus addresses the problems associated with the conventional diagnosis of pulmonary nodules. The method and system of the invention, using ANNs to merge subjective data obtained manually or objective data obtained with automated techniques, is thus able to estimate the likelihood of malignancy. This estimate assists radiologists in confidently and accurately identifying benign nodules, thereby helping to reduce the number of unnecessary CT examinations (i.e., CT examinations performed on patients with benign nodules).
BRIEF DESCRIPTION OF THE DRAWINGS
A more complete appreciation of the invention and many of the attendant advantages thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings, wherein:
Figure 1(a) is a flowchart of the method used to analyze the likelihood of malignancy in pulmonary nodules according to the present invention;
Figure 1(b) is an image of a score sheet 10 to be used by radiologists to subjectively characterize lung nodules;
Figures 2(a) and 2(b) are respective images of (a) a malignant nodule and (b) a benign nodule; Figure 2(c) is an image of a hand-drawn outline delineating the malignant nodule of Figure 2(a);
Figure 2(d) is an image of a hand-drawn outline delineating the benign nodule of Figure 2(b);
Figure 2(e) is an image of an ellipse calculated to fit the outline of the malignant nodule delineated (extracted) in Figure 2(c);
Figure 2(f) is an image of an ellipse calculated to fit the outline of the benign nodule delineated (extracted) in Figure 2(d);
Figure 3(a) is a one-dimensional representation of the nodule outline of Figure 2(c) as the distance from sample points distributed with equal intervals on the calculated ellipse of Figure 2(e);
Figure 3(b) is a one-dimensional representation of the nodule outline of Figure 2(d) as the distance from sample points distributed with equal intervals on the calculated ellipse of Figure 2(f);
Figure 4(a) is an illustration of an artificial neural network (ANN) for determining the likelihood of malignancy in a pulmonary nodule in accordance with the present invention;
Figures 4(b), 4(c), and 4(d) are respective graphs, for attending radiologists (4(b)), radiology residents (4(c)), and all radiologists (4(d)), of receiver operating characteristic (ROC) curves showing how accurately benign and malignant nodules were identified using an ANN trained with selected subjective features and an ANN trained with all subjective features quantified by the respective radiologists.
Figures 5(a), 5(b), and 5(c) are respective graphs comparing ROC curves of (a) attending radiologists versus an ANN trained using selected subjective features quantified by the attending radiologists, (b) radiology residents versus an ANN trained using selected subjective features quantified by the radiology residents, and (c) the average performance of all radiologists versus an ANN trained using selected subjective features quantified by all radiologists, in distinguishing between benign and malignant nodules;
Figures 6(a) and 6(b) are graphs showing the respective relationships between (a) the root mean square (RMS) value and subjective rating of marginal irregularity, and (b) the standard deviation of pixel values and subjective rating of homogeneity; Figures 7(a), 7(b), 7(c), and 7(d) are graphs showing the respective relationships between (a) effective diameter and degree of ellipticity, (b) standard deviation of pixel values and average pixel value, (c) degree of circularity and RMS variation, (d) average edge gradient and degree of elliptical irregularity, for malignant and benign nodules;
Figure 8 is a graph of a series of ROC curves showing that an ANN trained with objective measures quantified by a computer outperforms both radiologists and an ANN trained with subjective features quantified by radiologists, when distinguishing between benign and malignant nodules; and
Figure 9 is a schematic illustration of a general purpose computer 100 programmed according to the teachings of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Referring now to the drawings, and more particularly to Figure 1(a) thereof, there is shown a schematic diagram of a method for the computerized analysis of the likelihood of malignancy in pulmonary nodules using a set of selected features. The overall technique includes an initial acquisition of a database of digital medical images (training images) in step SI . The digital medical images include images of pulmonary nodules. Then, in step S2 the approximate outlines (i.e., approximate contours) of the nodules are extracted (i.e., delineated) manually by a radiologist. Next, in step S3 selected features of the nodules shown in the digital medical images are quantified. The selected features may include clinical parameters, subjective features, and/or objective measures. The quantification of the selected features may be performed subjectively, by humans, and/or objectively, using automated techniques described below. If none of the selected features are determined based on the extracted outline of the nodule, then step S2 may be skipped. In step S4 an artificial neural network (ANN) is trained based on the selected features quantified in step S3.
Once the ANN is trained in steps SI through S4, the likelihood of malignancy for a candidate pulmonary nodule (i.e., a pulmonary nodule to be analyzed) may be determined in steps S5 through S7. In step S5 the selected features are quantified for the candidate nodule. Then, in step S6 the quantified features are input to the respective input nodes of an ANN. In step S7, an output unit of the ANN determines the likelihood of malignancy of the candidate nodule based on the quantified features input in step S6. Each step of Figure 1(a) is discussed in greater detail below.
Database
In order to provide comparative test results, a database was constructed from 56 chest radiographs selected from the cases that were used for development of computerized schemes for detection of lung nodules [29, 30]. Solitary pulmonary nodules larger than 3 cm were excluded in this study. None of the nodules showed calcification using CT, nor were scarlike linear opacities visible. The final diagnosis was established pathologically, and for some benign nodules, a presumed diagnosis of benign etiology was made because of no change or decrease in nodule size over a 2 year period. The 56 chest radiographs included 34 malignant nodules, all of which were pathologically proven primary bronchogenic carcinoma. The radiographs further included 22 benign lesions, 2 of which were classified as pulmonary hamartoma, 12 of which were classified as granuloma, 7 of which were classified as inflammatory lesion, and 1 which was classified as pulmonary infarction. The 56 chest radiographs were obtained in 33 women and 23 men who ranged in age from 24 to 86 years (mean, 58.4 years). Chest radiographs were digitized with a laser scanner (model number 2905 manufactured by Abe Sekkei of Tokyo, Japan) with a pixel size of 0.175 mm and a 10- bit gray scale (1,024 gray levels). In an alternative embodiment, however, a database of more or fewer radiographs is used.
Subjective Feature Extraction by Radiologists
Radiological findings on pulmonary nodules provided eight subjective features, including nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density (contrast), and homogeneity. These features are described by way of example in Figure 1(b) which shows a score sheet 10 to be used in subjectively characterizing the lung nodules.
The nodules' sizes were measured by using a ruler. The variations in measured nodule sizes between different observers was mainly due to the variation in subjective judgments on nodule edges. Seven radiologists (4 attending radiologists ("attendings") and 3 radiology residents ("residents")) characterized the features of each nodule independently using a score sheet, such as the score sheet 10, with scales from 1 to 5. Nodule shape was scored from round to elongated; marginal irregularity was scored from smooth to irregular; spiculation was scored from non-spiculated to spiculated; border definition was scored from well-defined to ill- defined; lobulation was scored from non-lobulated to lobulated; nodule density (contrast) was scored from low density to high density; homogeneity was scored from homogeneous to inhomogeneous. The score sheet included images of two extreme examples (i.e., an example of a "1" and a "5") for each feature. The diagrams served as a scoring guide for the radiologists; for example, an image of a non-spiculated nodule (i.e., a "1") and a spiculated nodule (i.e., a "5") were provided to help assess the degree of spiculation.
Table 2 shows examples of a radiologist's subjective ratings for the eight subjective radiological features for the malignant and benign pulmonary nodules shown in Figures 2(a) and 2(b), respectively. To eliminate the bias of the knowledge of "truth," all radiologists rated each nodule without knowledge of the correct diagnosis.
Table 2
Image Features Malignant Benign
Nodule size (mm) 25 21
Nodule shape (l=round, 5=elongated) 4 5
Marginal irregularity (l=smooth, 5, irregular) 5 1
Spiculation (l=non-spiculated, 5-spiculated) 5 1
Border definition (l=well-defιned, 5=ill-defϊned) 4 2
Lobulation (1 -non-lobulated, 5=lobulated) 5 2
Nodule density (contrast) (l=low, 5=high) 5 4
Homogeneity (Inhomogeneous, 5=inhomogeneous) 5 1
Objective Feature Extraction by Computer
The following twelve features were selected for computer characterization of pulmonary nodules that were extracted from digitized chest radiographs: effective diameter; degree of circularity; degree of ellipticity; root mean square (RMS) variation; first moment of power spectrum; degree of irregularity; average gradient; radial gradient index (RGI); tangential gradient index (TGI); line enhancement index (LEI); average pixel value; and standard deviation of pixel values. These twelve features were objective physical measures selected on the basis of their expected correlations with the subjective features that were used in radiologists' ratings.
The objective measures are quantified or determined based on the outline (or contour) of the nodules manually extracted by a first radiologist. Figure 2(c) shows the hand-drawn outlines (i.e., margins) for the malignant nodule of Figure 2(a). Figure 2(d) shows the hand- drawn outlines for the benign nodule of Figure 2(b). A second radiologist independently extracted a second set of nodule outlines to examine the variation in these objective measures derived from the outlines. The radiologists were not informed about correct diagnosis in order to avoid a bias to the outlines.
The effective diameter provides an objective measure of the nodule size. The effective diameter [31, 32] is the diameter of an equivalent circle having the same area as that defined by the outline of the nodule.
The degree of circularity provides an objective measure of the nodule shape. The degree of circularity is the ratio of the area of the nodule overlapped with the equivalent circle, to the total area of the nodule [31, 32].
The degree of ellipticity also provides an objective measure of the nodule shape. The degree of ellipticity is determined in the same manner as that for the degree of circularity, except that an ellipse approximating the nodule outline is calculated using a least squares fit [33, 34]. Figure 2(e) shows an ellipse calculated to fit to the outline of the malignant nodule of Figure 2(a). Figure 2(f) shows an ellipse calculated to fit to the outline of the benign nodule of Figure 2(b).
The definition of marginal irregularity includes two independent factors: the magnitude and the coarseness (or fineness) of irregular edge patterns. The edge pattern is the distance from the nodule outline to the calculated ellipse, as illustrated in Figures 3(a) and 3(b) for the malignant and benign nodules of Figures 2(a) and (b), respectively. The irregular edge pattern was analyzed using Fourier transformation. The RMS variation and the first moment of power spectrum [35] were each determined to provide separate measures of marginal irregularity. The degree of irregularity [32] provides another measure of marginal irregularity. The degree of irregularity is one minus the ratio of (a) the perimeter (i.e., circumference) of the ellipse used to determine degree of ellipticity to (b) the length of the extracted outline.
Border definition is quantified by the average gradient, which is obtained by the average edge gradients over a selected border region. The border region is the area between an outer limit defined by the fitted ellipse plus twice the RMS variation and an inner limit defined by the fitted ellipse minus twice the RMS variation. Thus, the border region has a width of four times the RMS variation as measured from the center of the calculated ellipse.
RGI provides an objective measure of the spiculation of a nodule. The RGI is the average absolute value of the radial edge gradients projected in a direction along the radial direction [42]. The radial direction is a line from the center of mass of the calculated ellipse. For each pixel in the border region, the absolute value of the radial edge gradient projected in a direction along the radial direction is determined. The resulting absolute values are summed and averaged to determine the RGI according to the following formula:
cos
RGI PeL φ^A x +D y2
PeL v '
where P is the pixel or image point, L is the set of pixels in the border region, Dx is the gradient in the x-direction, Dy is the gradient in the y-direction, and φ is the angle between the gradient vector and the radial direction.
TGI provides another measure of the spiculation of a nodule. The TGI is obtained from a tangential component of the edge gradient at a pixel, which is projected in a direction peφendicular to the radial direction. Accordingly, the following formula is used to determine TGI:
Figure imgf000013_0001
LEI provides yet another measure of spiculation and indicates the magnitude of line pattern components obtained by means of a line enhancement filter [36], in a direction within 45 degrees from the radial direction. Thus, LEI indicates the number of pixels in the border region that form part of a line pattern having a direction within 45 degrees of the radial direction, divided by the total number of pixels in the border region. Whether a pixel forms part of a line pattern (i.e., whether a pixel is a line pattern component) is determined by the line enhancement filter [36].
The average pixel value provides a measure of the optical density of a nodule. The average pixel value is the average pixel value (i.e., gray level) over the nodule as defined by the extracted outline.
The standard deviation of pixel values over the nodule provides a measure of the homogeneity of the nodule. The standard deviation is the standard deviation of pixel values over the nodule as defined by the extracted outline.
Artificial Neural Network
Figure 4(a) is a diagram of a three-layer, feed-forward artificial neural network (ANN) trained with a back propagation algorithm [37] and used to determine the likelihood of malignancy for a pulmonary nodule. In an alternate embodiment, another learning algorithm is used instead of back propagation. The features input (applied) to the ANN included two clinical parameters (patient's age and gender) and either the eight subjective radiological features quantified by radiologists or selected of the physical measures (objective features) quantified using computer analysis. The same features are used to test and train each ANN. The number of input units equals the number of features input to the ANN. The number of hidden units may also vary but is preferably half the number of input units. For example, when 10 features were input, the basic structure of the neural network included 10 input units la, lb, lc, Id, le, If, lg, lh, li, lj; 5 hidden units 3a, 3b, 3c, 3d, 3e; and an output unit 5. The input data obtained from clinical parameters, subjective ratings by radiologists, and physical measures obtained by using a computer were normalized in the range of 0 to 1. The output of the neural network represented the likelihood of malignancy which ranged from 0 to 1 (0 = benign, and 1 = malignant). The training and testing of the neural network was performed using a round-robin (or leave-one-out) method [24]. With this method, all but one case in the database is used for training, and the case not used for training is applied for testing the trained ANNs. This procedure was repeated until every case in the database is used once for testing. The performance of the ANN was evaluated on a per patient basis [24]. The performance of the ANNs was evaluated by means of receiver operating characteristic (ROC) curves [38]. The LABROC4 program [39] was used to fit ROC curves, and the area under the ROC curve (Az) was used as an index of the performance in distinguishing between benign and malignant nodules. If the output unit of the ANN outputs a likelihood greater than a preselected threshold value, then the nodule being analyzed is considered to be malignant. Although the threshold may be set at 0.5, the threshold is preferably set at 0.1 or 0.05 so that malignant nodules are correctly identified. Thus, low thresholds increase the sensitivity of the ANN with regard to classifying malignant nodules, at the risk of increased false positives. However it is preferable to ensure that malignancies are identified since the risk of missing a malignancy is worse than the risk of exposure to an unnecessary CT.
Observer Study
To evaluate the performance of radiologists in classifying pulmonary nodules, seven radiologists (4 attendings and 3 residents) participated in the observer study. Each observer (i.e., participating radiologist) was presented with a chest radiograph and two clinical parameters (patient's age and gender). Each observer was then asked to provide his or her confidence level regarding the likelihood of malignancy by using a continuous rating scale with a line checking method [29]. Confidence ratings of "definitely benign" and "definitely malignant" were marked above the left and the right end, respectively, of the line. Radiographs were presented in random order. ROC analysis was employed for comparison of the performance of observers with those of the computerized methods in distinguishing between benign and malignant nodules.
Performance of ANNs Trained with the Subjective Ratings of Radiologists
A round-robin method was employed for the data set extracted (quantified) by each radiologist separately in determining the ANN performance. Table 3 shows the performance index, Az, when distinguishing between benign and malignant nodules, of the ANN trained with image features extracted by each radiologist separately, and by all radiologists combined. There is a relatively large variation among Az values ranging from 0.579 to 0.834, which indicates a considerable variation among the scoring of image features by radiologists. It is also noted that the performance of the ANN with features extracted by radiology residents were much lower than that of attending radiologists. The selected features were patient age, nodule size, marginal irregularity, border definition, nodule density (contrast), and homogeneity.
Table 3
Radiologists All Subjective Features Selected Subjective Features
Attendings
A 0.792 0.839 B 0.763 0.816 C 0.579 0.657 D 0.834 0.848 average 0.742 0.790
Residents
E 0.616 0.691 F 0.683 0.726 G 0.700 0.722 average 0.666 0.722
Average 0.710 0.761
All radiologists 0.747 0.754
It is generally desirable for ANNs to achieve a high performance even when only a small number of essential input units are applied. Thus, an attempt was made to reduce the number of input units, because the initial selection of subjective features may include redundant and nondiscriminating features, which can degrade the ANN performance in distinguishing benign from malignant pulmonary nodules. On the basis of the performance obtainable by using each feature independently as well as radiologists' knowledge and experience, 6 features including patient's age, the nodule size, marginal irregularity, spiculation, border definition, and homogeneity were selected. The average Az value of 0.761 for all 7 radiologists with 6 selected subjective features was statistically significantly greater than that (0.710) with all 10 subjective features (p < 0.0007).
Figures 4(a), 4(b), 4(c), and 4(d) show ROC curves obtained by the ANN trained with 10 features (2 clinical parameters (age and gender) and all 8 subjective features) and an ANN trained with 6 features (1 clinical feature (age) and five subjective features (nodule size, marginal irregularity, spiculation, border definition, and homogeneity)) as input data. The average Az value for the ANN with 6 features was greater than those with 10 features.
Figures 5(a), 5(b), and 5(c) compare the performances of the radiologists in differentiating pulmonary nodules with the ANN trained with six selected subjective features (1 clinical feature (age) and five subjective features (nodule size, marginal irregularity, spiculation, border definition, and homogeneity)). The average performance of the ANN when trained with the six subjective features obtained from attendings was slightly greater than the average performance of the attendings. The average performance of the ANN when trained with the six subjective features obtained from residents was comparable to the average performance of the residents.
Another subset of features, including patient's age and five subjective features (nodule size, marginal irregularity, border definition, nodule density (contrast), and homogeneity), was selected. The ANN trained with this set of features performed better (based on Az) than the ANN with all 10 features. These results indicate that the ANN appeared to be able to learn and generalize better when trained with only a selected subset of features (patient's age, nodule size, marginal irregularity, border definition, nodule density (contrast), and homogeneity) than when trained with all 10 features.
The performance of the ANN trained with the features obtained from all radiologists (i.e., attendings and residents) together was also evaluated. The performance of the ANN with the features extracted by all radiologists is also shown in Table 3. The Az value for the ANN trained and tested with the subjective features obtained from all radiologists was comparable to the average Az value for ANNs trained with the features obtained by each attending radiologist when the ANN used gender, age, and all 8 subjective features. However, when the ANN used age and five selected subjective features, the Az value for the ANN trained with features obtained from all radiologists was lower than the average Az value for ANNs trained with features obtained from each attending radiologist. This appears to indicate that, despite the larger amount of input data for the ANN trained with features determined by all radiologists, the ANN did not learn the patterns of input data well when distinguishing between benign and malignant nodules. This may be due to the variation among radiologists' subjective ratings.
Computerized Analysis of Nodule Features
Different objective, physical measures obtained with computerized analysis were first compared with radiologists' subjective ratings (i.e., scores) to determine the utility of those measures. Figures 6(a) and 6(b) respectively show (a) the relationships between the RMS value and subjective ratings of marginal irregularity, and (b) the relationships between the standard deviation of pixel values and subjective ratings of homogeneity. The small squares and vertical bars in Figures 6(a) and 6(b) indicate the average and the standard deviation, respectively, of each physical measure obtained from all nodules included in each of five different subjective ratings. These results indicate that physical measures are correlated well with the radiologists' subjective ratings. Table 4 shows the correlation coefficients between the objective measures and the subjective features. As seen from Table 4, most objective measures are generally well correlated with the corresponding subjective features.
Table 4
Subjective Features Physical Measures Correlation Coefficient
(1) Nodule Shape Degree of Circularity -0.885
Degree of Ellipticity -0.931
(2) Marginal Irregularity Degree of Irregularity 0.939
RMS variation 0.999
First moment of power spectrum -0.177
(3) Spiculation Tangential Gradient Index (TGI) 0.934
Radial Gradient Index (RGI) -0.880
Line Enhancement Index (LEI) 0.986
(4) Border definition Average Gradient 0.946
(5) Nodule Density Average Pixel Value 0.988
(6) Homogeneity Standard deviation of pixel value 0.992
Figures 7(a), 7(b), and 7(c) each show the relationships between two selected image features of pulmonary nodules. Although a considerable overlap between the malignant and benign pulmonary nodules is observed in general, there is also a trend, which would discriminate malignant and benign nodules, between the distribution of two groups. For example, in Figure 7(a) malignant nodules tend to have a larger effective diameter and smaller degree of ellipticity than benign nodules. This result appears to agree with general characteristics of lung nodules that malignant nodules are larger than benign nodules, and that the shape of benign nodules tends to be round. Figure 7(d) shows that malignant nodules tend to have a larger average edge gradient and a larger degree of elliptical irregularity than benign nodules, which indicates potential for discriminating benign from malignant nodules. This tendency also agrees with general characteristics that malignant nodules tend to contain spiculations and irregular margins.
Performance of ANNs Trained with Objective measures Although all twelve objective measures could be used to distinguish benign from malignant nodules, it is desirable for ANNs to achieve a high performance when smaller numbers of input units are applied, as in the analysis of the performance of ANNs trained with subjective features. Therefore, selected subsets of features were identified from among the twelve objective physical measures and two clinical parameters, using a genetic algorithm [40]. Note, however, that patient's age and nodule size were always included as selected features, because these two features are considered to be among the most important in differentiating pulmonary nodules [15, 16]. Table 5 shows the results of different combinations of clinical parameters and objective measures quantified by computer, that resulted in high values of Az (Az > 0.830) for the ANN. In Table 5, the following abbreviations are used: RMS = RMS value; FM = first moment of power spectrum; TGI = tangential gradient index; RGI = radial gradient index; LEI = line enhancement index; AG = average gradient; STD = standard deviation of pixel value; and APV = average pixel value.
Table 5
Features Az Value
Age, Size, Ellipticity, FM, AG 0.854
Age, Size, RMS, FM, TGI 0.851
Age, Size, Circularity, TGI, APV, STD 0.846
Age, Size, Circularity, Ellipticity, TGI, APV 0.842
Age, Size, Circularity, RMS, FM, APV 0.837
Age, Size, Irregularity, TGI, RGI, LEI, AG 0.831
Figure 8 shows the comparison of ROC curves corresponding to: (1) the ANN with the computer extracted features, (2) the ANN with six selected subjective ratings by radiologists, and (3) the radiologists. The results indicated that the ANN trained with features obtained with computer analysis performed better (based on Az) than both the radiologists and the ANN trained with subjective features. In this study the computer analysis was performed based on nodule outlines drawn by a radiologist, which is subjective. Therefore, the nodule outlines drawn by another radiologist were examined, and it was confirmed that the ANN with another set of computer extracted (quantified) features provided a comparable or a higher performance (Az = 0.920).
Discussion
The performance of ANNs trained with subjective ratings by radiologists was better than that by radiologists themselves in distinguishing benign from malignant pulmonary nodules. Similar results were reported in studies on differential diagnosis of breast cancer [22, 23] and also interstitial lung diseases [40]. This may be because radiologists would not use all of the image features, which were obtained in subjective ratings in this study in their differential diagnosis of solitary pulmonary nodules. For example, due to their knowledge and own experience, a limited number of conspicuous features are likely to affect strongly their decision making in some cases, and radiologists generally tend not to consider all of features systematically. On the other hand, ANNs are affected by all of the data consistently and comprehensively. In addition, ANNs are superior to radiologists in merging large amounts of data.
An ANN has a unique capability to learn specific patterns between input and output data if the ANN is trained by examples repeatedly; however, this capability strongly depends on the quality of the input data. In other words, if input data are selected randomly and have no correlation with output data, the ANN is unlikely to learn any specific patterns between input and output data, resulting in a lower performance. There was a considerable variation among the performances of ANN with subjective ratings given by each radiologist. This seems to indicate a large variation of subjective ratings by radiologists. Although images were provided as a guide to radiologists to use the criterion consistently for extracting subjective image features, the ratings are highly subjective, and could be affected strongly by an individual radiologist's knowledge and experience, which may have caused the large variation in the ratings of the radiologists.
It is generally desirable to train an ANN with a large database that contains a wide spectrum of data. Therefore, the round-robin test using all of data by seven radiologists together was applied. Although the performance of the ANN with all data improved slightly as compared with the average performance of the ANN with each radiologist's data when all features were included, the performance with all data decreased when selected features were used. This might be due to the variation among the subjective ratings by radiologists. The feature data provided by some radiologists might have had a negative influence on the ANN for learning the pattern of the feature data provided by other radiologists.
Another limitation of using subjective ratings for input data to train and test the ANN is that the quality of subjective ratings as input data for the ANN considerably depends on the ability of a radiologist to extract or quantify nodule features. The performance of the ANN with subjective features extracted by radiology residents were much lower than that of attending radiologists, which indicates that less experienced radiologists could not extract the nodule features sufficiently, and consequently, the ANN could not learn well the specific patterns between input data and output data. Therefore, computer-aided diagnostic schemes that can extract nodule features automatically, objectively, and reproducibly are highly desirable.
The ANN having input features automatically determined by computer performed better (based on Az) than the average performance of the radiologists. Moreover, the ANN trained with features determined by computer performed better than the ANN trained with subjective features. Although the objective measures were selected initially on the basis of their expected correlation with the subjective features, these objective measures may contain different and/or additional information from the subjective features, which may explain why the objective measures contributed more effectively than the subjective features in distinguishing benign from malignant pulmonary nodules.
Since plain chest radiographs, alone, are limited in their usefulness for defining nodule moφhology and classifying lung nodules, chest CT must be performed in most patients having a solitary pulmonary nodule. It should be noted, however, that the radiation exposure of a patient is inevitable with a CT examination, and moreover, a CT examination is expensive. A goal of the inventive computerized classification scheme for pulmonary nodules is to reduce the number of benign nodules sent for further diagnostic evaluation. The computerized method indicated a higher performance than the average radiologists (based on Az), even though the nodule outline was provided manually by a radiologist. Thus, the computerized classification method may provide a useful aid to radiologists in diagnosing/differentiating benign from malignant pulmonary nodules, and therefore, it may be possible to reduce the number of "unnecessary" CT examinations.
Computer and System
This invention conveniently may be implemented using a conventional general puφose computer or micro-processor programmed according to the teachings of the present invention, as will be apparent to those skilled in the computer art. Appropriate software can readily be prepared by programmers of ordinary skill based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
Figure 9 is a schematic illustration of a computer system for the computerized analysis of the likelihood of malignancy in pulmonary nodules. A computer 100 implements the method of the present invention, wherein the computer housing 102 houses a motherboard 104 which contains a CPU 106, memory 108 (e.g., DRAM, ROM, EPROM, EEPROM, SRAM, SDRAM, and Flash RAM), and other optional special puφose logic devices (e.g., ASICs) or configurable logic devices (e.g., GAL and reprogrammable FPGA). The computer 100 also includes plural input devices, (e.g., a keyboard 122 and mouse 124), and a display card 110 for controlling monitor 120. In addition, the computer 100 further includes a floppy disk drive 114; other removable media devices (e.g., compact disc 119, tape, and removable magneto-optical media (not shown)); and a hard disk 112, or other fixed, high density media drives, connected using an appropriate device bus (e.g., a SCSI bus, an Enhanced IDE bus, or a Ultra DMA bus). Also connected to the same device bus or another device bus, the computer 100 may additionally include a compact disc reader 118, a compact disc reader/writer unit (not shown) or a compact disc jukebox (not shown). Although compact disc 119 is shown in a CD caddy, the compact disc 119 can be inserted directly into CD- ROM drives which do not require caddies.
As stated above, the system includes at least one computer readable medium. Examples of computer readable media are compact discs 119, hard disks 112, floppy disks, tape, magneto-optical disks, PROMs (EPROM, EEPROM, Flash EPROM), DRAM, SRAM, SDRAM, etc. Stored on any one or on a combination of computer readable media, the present invention includes software for controlling both the hardware of the computer 100 and for enabling the computer 100 to interact with a human user. Such software may include, but is not limited to, device drivers, operating systems and user applications, such as development tools. Such computer readable media further includes the computer program product of the present invention for performing the inventive method described above, including steps SI through S7 of Figure 1(a). The computer code devices of the present invention can be any inteφreted or executable code mechanism, including but not limited to scripts, inteφreters, dynamic link libraries, Java classes, and complete executable programs. Moreover, parts of the processing of the present invention may be distributed for better performance, reliability, and/or cost. For example, an outline or image may be selected on a first computer and sent to a second computer for remote diagnosis.
The invention may also be implemented by the preparation of application specific integrated circuits or by interconnecting an appropriate network of conventional component circuits, as will be readily apparent to those skilled in the art. Obviously, numerous modifications and variations of the present invention are possible in light of the above teachings. For example, the outline of the nodules may be extracted using any available automated technique, rather than manually. It is therefore to be understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.
APPENDIX REFERENCES
[I] Webb WR. Radiologic evaluation of the solitary pulmonary nodule. AJR 1990; 154:701 -708.
[2] Lillington GA. The solitary pulmonary nodule 1974. Am Rev Respir Dis 1974;
110:699-706. [3] Ray JF, Lawton BR, Magnin GE, et al. The coin lesion story: update 1976.
Twenty years experience with early thoracotomy for 179 suspected malignant coin lesions. Chest 1976; 70:332-336. [4] Gracey DR, Byrd RB, Cagell DW. The dilemma of the asymptomatic pulmonary nodule in the young and not-so-young adult. Chest 1971; 60:479-483. [5] Nathan MH. Management of solitary pulmonary nodules. An organized approach based on growth rates and statistics. JAMA 1974; 227:1141-1144. [6] Lillington GA, Stevens GM. The solitary nodule. The other side of the coin.
Chest 1976; 70:322-323. [7] Cortese DA. Solitary pulmonary nodule. Observe, operate, or what? Chest 1982;
81:662-663. [8] Lillington GA. Pulmonary nodules: solitary and multiple. Clin Chest Med 1982;
3:361-367. [9] Khouri NF, Meziane MA, Zerhouni EA, Fishman EK. The solitary pulmonary nodule: assessment, diagnosis, and management. Chest 1987; 91 :128-133. [10] Bateson EM. An analysis of 155 solitary lung lesions illustrating the differential diagnosis of mixed tumors of the lung. Clin Radiol 1965; 16:51-65.
[II] Edwards WM, Cox RS Jr, Garland LH. The solitary nodule (coin lesion) of the lung: an analysis of 52 consecutive cases treated by thoracotomy and a study of preoperative diagnostic accuracy. AJR 1962; 88: 1020- 1042.
[12] Toomes H. Delphendahl A, Manke H-G, Vogt-Moydopf I. The coin lesion of the lung. A review of 955 resected coin lesions. Cancer 1983; 51 :534-537.
[13] Keagy BA, Starek PJK, Murray GF, Battaglini JW, Lores ME, Wilcox BR. Major pulmonary resection for suspected but unconfirmed malignancy. Ann Thorac Surg 1984; 38:314-316. [14] Daly BDT, Faling LJ, Dichl JT, Bankoff MS, Gale ME. Computed tomography guided minithoracotomy for the resection of small peripheral pulmonary nodules. Ann Thorac Surg 1991; 51:465-469. [15] Swensen SJ, Silverstein MD, Ilstrup DM, Schleck, Schleck CD, Edell ES. The probability of malignancy in solitary pulmonary nodules: application to small radiologically indeterminate nodules. Arch Intern Med 1997; 157:849-855. [16] Cummings SR, Lillington GA, Richard RJ. Estimating the probability of malignancy in solitary pulmonary nodules. Am Rev Respir Dis 1986; 134:449-452. [17] Tab(all)Gurney JW. Determining the likelihood of malignancy in solitary pulmonary nodules with Bayesian analysts: parti, theory. Radiology 1993; 186:405-413. [18] Gurney JW, Lyddon DM, Mckay JA. Determining the likelihood of malignancy in solitary pulmonary nodules with Bayesian analysis: part II. application. Radiology
1993; 186:415-422. [19] Sherrier RH, Chiles C, Johnson GA, Ravin CE. Differentiation of benign from malignant pulmonary nodules with digitized chest radiographs. Radiology 1987;
162:645-649. [20] Sasaoka S., Takabatake H., Mori M, Natori H., Abe S. Digital analysis of pulmonary nodules - potential usefulness of computer-aided diagnosis for differentiation of benign from malignant nodules -. Japanese J Chest Disease 1995;
33:489-496. [21] Asada N., Doi K., MacMahon H., et al. Potential usefulness of an artificial neural network for differential diagnosis of interstitial lung diseases: pilot study. Radiology
1990; 177:857-860. [22] Wu Y., Doi K., Giger ML., Nishikawa RM. Computerized detection of clustered microcalcifications in digital mammograms: application of artificial neural networks.
Med Phys 1992; 19:555-560. [23] Wu Y., Giger ML., Doi K., Vyborny CJ, Schmidt RA, Metz CE. Artificial neural networks in mammography: application to decision making in the diagnosis of breast cancer. Radiology 1993; 187:81-87. [24] Jiang Y., Nishikawa RM, Wolverton DE, et al. Malignant and benign clustered microcalcifications: automated feature analysis and classification. Radiology 1996;
198:671 678. [25] IshidaT., Katsuragawa S., Ashizawa K., MacMahon H., Doi K. Artificial neural networks in chest radiographs: detection and characterization of interstitial lung disease. Proc SPIE 1997; 3034:931-937.3. [26] Gross GW, Boone JM, Greco-Hunt V, Grrenberg B. Neural networks in radiologic diagnosis: II. Inteφretation of neonatal chest radiographs. Invest Radiol
1990; 25: 1017- 1023. [27] Lo SC, Freedman MT, Lin JS, Mun SK. Automatic lung nodule detection using profile matching and back-propagation neural network techniques. J Digit Imaging
1993:6:48-54. [28] Gurney JW, Swensen SJ. Solitary pulmonary nodules: determining the likelihood of malignancy with neural network analysis. Radiology 1995; 196:823-829 [29] Kobayashi T. Xu XW, MacMahon H. Metz CE, Doi K. Effect of a computer aided diagnosis scheme on radiologists' performance in detection of lung nodules on radiographs. Radiology 1996; 199:843-848. [30] Xu XW, Doi K. Development of an improved CAD scheme for automated detection of lung nodules in digital chest images. MedPhys 1997; 24:1395-1403. [31] Giger ML, Doi K, MacMahon H. Metz CE, Yin FF. Pulmonary nodules: computer-aided detection in digital chest images. RadioGraphics 1990; 17:861-865. [32] MatsumotoT, Yoshimura H. Doi K, et al. Image feature analysis of false-positive diagnoses produced by automated detection of lung nodules. Invest Radiol 1992;
27:587 597. [33] Pilu M, Fitzgibbon A, Fisher R. Ellipse-specific direct least-square fitting. IEEE international Conference on Image Processing 1996; 3:599-602. [34] Fitzgibbon A, Pilu M, Fisher R. Direct least-square fitting of ellipses.
International Conference on Pattern Recognition 1996; 1:253-257. [35] Katsuragawa S. Doi K, Nakamori N. MacMahon H. Image feature analysis and computer-aided diagnosis in digital radiography: Effect of digital parameters on the accuracy of computerized analysis of interstitial disease in digital chest radiographs.
Med Phys 1 990; 17:72-78. [36] Isihda T. Katsuragawa S. Kobayashi T. MacMahon H. Doi K. Computerized analysis of interstitial disease in chest radiographs: Improvement of geometric-pattern feature analysis. MedPhys 1997; 24:915-924. [37] Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL, PDP Research Group, eds.
Parallel distributed processing: explorations in the microstructure of cognition. Vol 1.
Cambridge, Mass: MITPress, 1986; 318-362. [38] Metz CE. ROC methodology in radiologic imaging. Invest Radiol 1986; 21:720 733. [39] Metz CE, Herman BA, Shen JH. Maximum-likelihood estimation of receiver operating (ROC) curves from continuously-distributed data. Statistics in Medicine (in press). [40] Anastasio MA, Yoshida H. Nagel R. Nishikawa RM, Doi K. A genetic algorithm based method for optimizing the performance of a computer-aided diagnosis scheme for detection of clustered microcalcifications in mammograms. [41] Ashizawa K, MacMahon H. Ishida T. Vyborny CJ, Katsuragawa S. Doi K. Effect of artificial neural networks on observer performance for differential diagnosis of interstitial lung disease on chest radiographs. Radiology 1997; 205(P):529. [42] Bick U. Giger ML, Schmidt RA, Doi K. Anew single-image method for computer-aided detection of small mammographic masses. In: Lewke HU, Inamura
K, Jaffe CC, Vannier MW, ed. Proc. CAR - Computer Assisted Radiology, 1995;
357-363.

Claims

Claims:
1. A method for analyzing a nodule, comprising: obtaining a digital outline of a nodule; generating objective measures corresponding to physical features of the nodule based on the outline; applying the generated objective measures to an artificial neural network; and determining a likelihood of malignancy of the nodule based on an output of the artificial neural network.
2. The method of Claim 1, wherein said obtaining step comprises: obtaining a digital image of the nodule; and extracting the outline of the nodule from the digital image.
3. The method of Claim 1, further comprising: obtaining an image of the nodule; and wherein the generating step further comprises generating the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
4. The method of Claim 1, wherein the applying step further comprises: applying to the artificial neural network at least one clinical parameter corresponding to the nodule.
5. The method of Claim 4, wherein the step of applying the at least one clinical parameter comprises: selecting the at least one clinical parameter from the group consisting essentially of: age and gender.
6. A method for training an artificial neural network to analyze a candidate nodule, comprising: obtaining a digital outline of a training nodule; generating objective measures corresponding to physical features of the training nodule based on the outline; applying the generated objective measures to an aitificial neural network; and training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the objective measures.
7. The method of Claim 6, wherein the step of obtaining the outline comprises: obtaining a digital image of the training nodule; and extracting the outline of the training nodule from the digital image.
8. The method of Claim 6, further comprising: obtaining an image of the nodule; and wherein the step of generating the objective measures comprises generating the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
9. The method of Claim 6, wherein the training step further comprises: training the artificial neural network based on at least one clinical parameter corresponding to the training nodule.
10. The method of Claim 9, wherein the step of training the artificial neural network based on at least one clinical parameter comprises: selecting the at least one clinical parameter from the group consisting essentially of: age and gender.
11. A method for analyzing a nodule, comprising: selecting plural features of a nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; quantifying the selected features for the nodule; applying the quantified features to an artificial neural network; and determining the likelihood of malignancy of the nodule based on an output of the artificial neural network.
12. The method of Claim 11, wherein the applying step comprises: applying at least one clinical parameter into the artificial neural network.
13. A method for training an artificial neural network to analyze a candidate nodule, comprising: selecting plural features of a training nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; quantifying the selected features for the training nodule; applying the quantified features to an artificial neural network; and training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the quantified features of the training nodule.
14. The method of Claim 13, wherein the training step comprises: training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on at least one clinical feature corresponding to the training nodule.
15. A system for analyzing a nodule, comprising: a mechanism configured to obtain a digital outline of a nodule; a mechanism configured to generate objective measures corresponding to physical features of the nodule based on the outline; a mechanism configured to apply the generated objective measures to an artificial neural network; and a mechanism configured to determine a likelihood of malignancy of the nodule based on an output of the artificial neural network.
16. The system of Claim 15, wherein the mechanism configured to obtain the digital outline comprises: a mechanism configured to obtain a digital image of the nodule; and a mechanism configured to extract the outline of the nodule from the digital image.
17. The system of Claim 15, further comprising: a mechanism configured to obtain an image of the nodule; and wherein the mechanism configured to generate the objective measures comprises a mechanism configured to generate the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
18. The system of Claim 15, wherein the mechanism configured to apply the generated objective measures to the artificial neural network comprises: a mechanism configured to apply to the artificial neural network at least one clinical parameter corresponding to the nodule.
19. The system of Claim 18, wherein the mechanism configured to apply the at least one clinical parameter comprises: a mechanism configured to select the at least one clinical parameter from the group consisting essentially of: age and gender.
20. A system for training an artificial neural network to analyze a candidate nodule, comprising: a mechanism configured to obtain a digital outline of a training nodule; a mechanism configured to generate objective measures corresponding to physical features of the training nodule based on the outline; a mechanism configured to apply the generated objective measures to an artificial neural network; and a mechanism configured to train the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the objective measures.
21. The system of Claim 20, wherein the mechanism configured to obtain the digital outline comprises: a mechanism configured to obtain a digital image of the training nodule; and mechanism configured to extract the outline of the training nodule from the digital image.
22. The system of Claim 20, further comprising: a mechanism configured to obtain an image of the nodule; and wherein the mechanism configured to generate the objective measures comprises a mechanism configured to generate the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
23. The system of Claim 20, wherein the mechanism configured to train the artificial neural network comprises: a mechanism configured to train the artificial neural network based on at least one clinical parameter corresponding to the training nodule.
24. The system of Claim 23, wherein the mechanism configured to train the artificial neural network based on at least one clinical parameter comprises: a mechanism configured to select the at least one clinical parameter from the group consisting essentially of: age and gender.
25. A system for analyzing a nodule, comprising: a mechanism configured to select plural features of a nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; a mechanism configured to quantify the selected features for the nodule; a mechanism configured to apply the quantified features to an artificial neural network; and a mechanism configured to determine the likelihood of malignancy of the nodule based on an output of the artificial neural network.
26. The system of Claim 25, wherein the mechanism configured to apply the quantified features to the artificial neural network comprises: a mechanism configured to apply at least one clinical parameter into the artificial neural network.
27. A system for training an artificial neural network to analyze a candidate nodule, comprising: a mechanism configured to select plural features of a training nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; a mechanism configured to quantify the selected features for the training nodule; a mechanism configured to apply the quantified features to an artificial neural network; and a mechanism configured to train the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the quantified features of the training nodule.
28. The system of Claim 27, wherein the mechanism configured to train the artificial neural network comprises: a mechanism configured to train the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on at least one clinical feature corresponding to the training nodule.
29. A computer readable medium storing computer instructions for analyzing a nodule, by performing the steps of: obtaining a digital outline of a nodule; generating objective measures corresponding to physical features of the nodule based on the outline; applying the generated objective measures to an artificial neural network; and determining a likelihood of malignancy of the nodule based on an output of the artificial neural network.
30. The computer readable medium of Claim 29, wherein said obtaining step comprises: obtaining a digital image of the nodule; and extracting the outline of the nodule from the digital image.
31. The computer readable medium of Claim 29, storing further instructions for performing the step comprising: obtaining an image of the nodule; and wherein the generating step further comprises generating the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
32. The computer readable medium of Claim 29, wherein the applying step further comprises: applying to the artificial neural network at least one clinical parameter corresponding to the nodule.
33. The computer readable medium of Claim 32, wherein the step of applying the at least one clinical parameter comprises: selecting the at least one clinical parameter from the group consisting essentially of: age and gender.
34. A computer readable medium storing computer instructions for training an artificial neural network to analyze a candidate nodule, by performing the steps of: obtaining a digital outline of a training nodule; generating objective measures corresponding to physical features of the training nodule based on the outline; applying the generated objective measures to an artificial neural network; and training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the objective measures.
35. The computer readable medium of Claim 34, wherein the obtaining step comprises: obtaining a digital image of the training nodule; and extracting the outline of the training nodule from the digital image.
36. The computer readable medium of Claim 34, storing further instructions for performing the step comprising: obtaining an image of the nodule; and wherein the generating step comprises generating the objective measures from the group consisting essentially of: effective diameter, degree of circularity, degree of ellipticity, root mean square variation, first moment of power spectrum of a function corresponding to the distance between a calculated ellipse and the outline, degree of irregularity, average gradient, radial gradient index, tangential gradient index, line enhancement index, average pixel value, and standard deviation of pixel values.
37. The computer readable medium of Claim 34, wherein the training step further comprises: training the artificial neural network based on at least one clinical parameter corresponding to the training nodule.
38. The computer readable medium of Claim 37, wherein the step of training the artificial neural network based on at least one clinical parameter comprises: selecting the at least one clinical parameter from the group consisting essentially of: age and gender.
39. A computer readable medium storing computer instructions for analyzing a nodule, by performing the steps of: selecting plural features of a nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; quantifying the selected features for the nodule; applying the quantified features to an artificial neural network; and determining the likelihood of malignancy of the nodule based on an output of the artificial neural network.
40. The computer readable medium of Claim 39, wherein the applying step comprises: applying at least one clinical parameter into the artificial neural network.
41. A computer readable medium storing computer instructions for training an artificial neural network to analyze a candidate nodule, by performing the steps of: selecting plural features of a training nodule from the group consisting essentially of: nodule size, nodule shape, marginal irregularity, spiculation, border definition, lobulation, nodule density, and homogeneity; quantifying the selected features for the training nodule; applying the quantified features to an artificial neural network; and training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on the quantified features of the training nodule.
42. The computer readable medium of Claim 41, wherein the training step comprises: training the artificial neural network to determine a likelihood of malignancy in a candidate nodule, based on at least one clinical feature corresponding to the training nodule.
PCT/US1999/025998 1998-11-13 1999-11-12 System for detection of malignancy in pulmonary nodules WO2000030021A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2000582958A JP2002530133A (en) 1998-11-13 1999-11-12 System for detecting malignant tumors in lung nodules
US09/830,574 US6738499B1 (en) 1998-11-13 1999-11-12 System for detection of malignancy in pulmonary nodules
EP99958770A EP1129426A4 (en) 1998-11-13 1999-11-12 System for detection of malignancy in pulmonary nodules
AU16064/00A AU1606400A (en) 1998-11-13 1999-11-12 System for detection of malignancy in pulmonary nodules

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10816798P 1998-11-13 1998-11-13
US60/108,167 1998-11-13

Publications (2)

Publication Number Publication Date
WO2000030021A1 true WO2000030021A1 (en) 2000-05-25
WO2000030021A9 WO2000030021A9 (en) 2000-10-19

Family

ID=22320683

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/025998 WO2000030021A1 (en) 1998-11-13 1999-11-12 System for detection of malignancy in pulmonary nodules

Country Status (4)

Country Link
EP (1) EP1129426A4 (en)
JP (1) JP2002530133A (en)
AU (1) AU1606400A (en)
WO (1) WO2000030021A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001054065A1 (en) * 2000-01-18 2001-07-26 The University Of Chicago Method, system and computer readable medium for the two-dimensional and three-dimensional detection of lungs nodules in computed tomography scans
JP2002325761A (en) * 2000-06-30 2002-11-12 Hitachi Medical Corp Image diagnosis supporting device
US7738683B2 (en) 2005-07-22 2010-06-15 Carestream Health, Inc. Abnormality detection in medical images

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3642059B2 (en) * 2002-09-06 2005-04-27 独立行政法人理化学研究所 Contour data extraction method and apparatus for biological tomographic image
JP2004135868A (en) * 2002-10-17 2004-05-13 Fuji Photo Film Co Ltd System for abnormal shadow candidate detection process
JP2005323629A (en) * 2004-05-12 2005-11-24 Ge Medical Systems Global Technology Co Llc Diagnosis supporting device
WO2006054269A2 (en) * 2004-11-19 2006-05-26 Koninklijke Philips Electronics, N.V. System and method for false positive reduction in computer-aided detection (cad) using a support vector machine (svm)
US7623694B2 (en) * 2006-01-31 2009-11-24 Mevis Medical Solutions, Inc. Method and apparatus for classifying detection inputs in medical images
WO2007099495A1 (en) * 2006-03-03 2007-09-07 Koninklijke Philips Electronics N.V. Identifying set of image characteristics for assessing similarity of images
US8923577B2 (en) * 2006-09-28 2014-12-30 General Electric Company Method and system for identifying regions in an image
WO2009063443A2 (en) * 2007-11-13 2009-05-22 Oridion Medical (1987) Ltd. Medical system, apparatus and method
EP1947606A1 (en) * 2007-01-16 2008-07-23 National University Corporation Kobe University Medical image processing apparatus and medical image processing method
US8116553B2 (en) * 2007-10-03 2012-02-14 Siemens Product Lifecycle Management Software Inc. Rotation invariant 2D sketch descriptor
JP5527869B2 (en) * 2008-03-21 2014-06-25 国立大学法人神戸大学 Image diagnosis support processing apparatus and image diagnosis support processing program
JP6415878B2 (en) * 2014-07-10 2018-10-31 キヤノンメディカルシステムズ株式会社 Image processing apparatus, image processing method, and medical image diagnostic apparatus
US9865052B2 (en) * 2016-03-18 2018-01-09 Niramai Health Analytix Pvt Ltd Contour-based determination of malignant tissue in a thermal image
US10346728B2 (en) * 2017-10-26 2019-07-09 Hitachi, Ltd. Nodule detection with false positive reduction

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4961425A (en) * 1987-08-14 1990-10-09 Massachusetts Institute Of Technology Morphometric analysis of anatomical tomographic data
US5003979A (en) * 1989-02-21 1991-04-02 University Of Virginia System and method for the noninvasive identification and display of breast lesions and the like
US5133020A (en) * 1989-07-21 1992-07-21 Arch Development Corporation Automated method and system for the detection and classification of abnormal lesions and parenchymal distortions in digital medical images
US5319549A (en) * 1992-11-25 1994-06-07 Arch Development Corporation Method and system for determining geometric pattern features of interstitial infiltrates in chest images
US5410250A (en) * 1992-04-21 1995-04-25 University Of South Florida Magnetic resonance imaging color composites
US5426684A (en) * 1993-11-15 1995-06-20 Eastman Kodak Company Technique for finding the histogram region of interest for improved tone scale reproduction of digital radiographic images

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5881124A (en) * 1994-03-31 1999-03-09 Arch Development Corporation Automated method and system for the detection of lesions in medical computed tomographic scans
DE69513300T2 (en) * 1994-08-29 2000-03-23 Torsana A S Skodsborg DETERMINATION PROCEDURE
US5790690A (en) * 1995-04-25 1998-08-04 Arch Development Corporation Computer-aided method for automated image feature analysis and diagnosis of medical images

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4961425A (en) * 1987-08-14 1990-10-09 Massachusetts Institute Of Technology Morphometric analysis of anatomical tomographic data
US5003979A (en) * 1989-02-21 1991-04-02 University Of Virginia System and method for the noninvasive identification and display of breast lesions and the like
US5133020A (en) * 1989-07-21 1992-07-21 Arch Development Corporation Automated method and system for the detection and classification of abnormal lesions and parenchymal distortions in digital medical images
US5410250A (en) * 1992-04-21 1995-04-25 University Of South Florida Magnetic resonance imaging color composites
US5319549A (en) * 1992-11-25 1994-06-07 Arch Development Corporation Method and system for determining geometric pattern features of interstitial infiltrates in chest images
US5426684A (en) * 1993-11-15 1995-06-20 Eastman Kodak Company Technique for finding the histogram region of interest for improved tone scale reproduction of digital radiographic images

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1129426A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001054065A1 (en) * 2000-01-18 2001-07-26 The University Of Chicago Method, system and computer readable medium for the two-dimensional and three-dimensional detection of lungs nodules in computed tomography scans
JP2002325761A (en) * 2000-06-30 2002-11-12 Hitachi Medical Corp Image diagnosis supporting device
US7738683B2 (en) 2005-07-22 2010-06-15 Carestream Health, Inc. Abnormality detection in medical images

Also Published As

Publication number Publication date
AU1606400A (en) 2000-06-05
EP1129426A4 (en) 2002-08-21
JP2002530133A (en) 2002-09-17
EP1129426A1 (en) 2001-09-05
WO2000030021A9 (en) 2000-10-19

Similar Documents

Publication Publication Date Title
US6738499B1 (en) System for detection of malignancy in pulmonary nodules
Sahiner et al. Effect of CAD on radiologists' detection of lung nodules on thoracic CT scans: analysis of an observer performance study by nodule size
Yilmaz et al. Computer-aided diagnosis of periapical cyst and keratocystic odontogenic tumor on cone beam computed tomography
Xu et al. Computer-aided classification of interstitial lung diseases via MDCT: 3D adaptive multiple feature method (3D AMFM)
Santos et al. Automatic detection of small lung nodules in 3D CT data using Gaussian mixture models, Tsallis entropy and SVM
Giger et al. Computer-aided diagnosis
Li Recent progress in computer-aided diagnosis of lung nodules on thin-section CT
McNitt-Gray et al. The effects of co-occurrence matrix based texture parameters on the classification of solitary pulmonary nodules imaged on computed tomography
Way et al. Computer‐aided diagnosis of pulmonary nodules on CT scans: improvement of classification performance with nodule surface features
US6694046B2 (en) Automated computerized scheme for distinction between benign and malignant solitary pulmonary nodules on chest images
Sutton et al. Texture measures for automatic classification of pulmonary disease
da Silva Sousa et al. Methodology for automatic detection of lung nodules in computerized tomography images
Li et al. Investigation of new psychophysical measures for evaluation of similar images on thoracic computed tomography for distinction between benign and malignant nodules
EP1129426A1 (en) System for detection of malignancy in pulmonary nodules
Shiraishi et al. Computer-aided diagnosis for the detection and classification of lung cancers on chest radiographs: ROC analysis of radiologists’ performance
Nishio et al. Computer-aided diagnosis for lung cancer: usefulness of nodule heterogeneity
JPH09185714A (en) Computer supported diagnostic device
US20030103663A1 (en) Computerized scheme for distinguishing between benign and malignant nodules in thoracic computed tomography scans by use of similar images
Pino Peña et al. Automatic emphysema detection using weakly labeled HRCT lung images
Van Beek et al. Validation study of machine-learning chest radiograph software in primary and emergency medicine
US11058383B2 (en) Apparatus for the detection of opacities in X-ray images
Sahiner et al. Effect of CAD on radiologists’ detection of lung nodules on thoracic CT scans: observer performance study
Ajmera et al. Deep-learning-based automatic detection of pulmonary nodules from chest radiographs
Kao et al. Zone‐based analysis for automated detection of abnormalities in chest radiographs
Summers et al. Conspicuity of colorectal polyps at CT colonography: visual assessment, CAD performance, and the important role of polyp height

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref country code: AU

Ref document number: 2000 16064

Kind code of ref document: A

Format of ref document f/p: F

AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA JP US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: C2

Designated state(s): AU CA JP US

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

COP Corrected version of pamphlet

Free format text: PAGES 1/24-24/24, DRAWINGS, REPLACED BY NEW PAGES 1/24-24/24; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

WWE Wipo information: entry into national phase

Ref document number: 1999958770

Country of ref document: EP

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2000 582958

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 09830574

Country of ref document: US

WWP Wipo information: published in national office

Ref document number: 1999958770

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1999958770

Country of ref document: EP