key: cord-0868369-nray6171
authors: Tan, T.Z.; Quek, C.; Ng, G.S.; Ng, E.Y.K.
title: A novel cognitive interpretation of breast cancer thermography with complementary learning fuzzy neural memory structure
date: 2006-07-13
journal: Expert Syst Appl
DOI: 10.1016/j.eswa.2006.06.012
sha: 85d55fc7b8dd0d3537274e857edba00eea3e55d9
doc_id: 868369
cord_uid: nray6171

Early detection of breast cancer is the key to improve survival rate. Thermogram is a promising front-line screening tool as it is able to warn women of breast cancer up to 10 years in advance. However, analysis and interpretation of thermogram are heavily dependent on the analysts, which may be inconsistent and error-prone. In order to boost the accuracy of preliminary screening using thermogram without incurring additional financial burden, Complementary Learning Fuzzy Neural Network (CLFNN), FALCON-AART is proposed as the Computer-Assisted Intervention (CAI) tool for thermogram analysis. CLFNN is a neuroscience-inspired technique that provides intuitive fuzzy rules, human-like reasoning, and good classification performance. Confluence of thermogram and CLFNN offers a promising tool for fighting breast cancer.

Breast cancer is the second most deadly cancer among women. Each year, 211,240 women are diagnosed with breast cancer and 40,870 of them will die in 2005 (American Cancer Society, 2005) . In United States alone, it is estimated that there are 1 million women with undetected breast cancer; to date, the figure of women affected has surged to 1.8 million and 45, 000 women die per year . This high death rate has stimulated extensive researches in breast cancer detection and treatment. Recent studies have determined that the key to breast cancer survival rests upon its earliest detection possible. If discovered in its earliest stage, 95% cure rates are possible (Gautherie, 1999; Pacific Chiropractic and Research Center) . On the other side, it is reported that 70 to 90% of the excisional biopsies performed are found to be benign (Lay, Crump, Frykberg, Goedde, & Copeland, 1990) . Owing to this high false positive rate, many endeavors have been putted into ameliorate the breast cancer early detection.

Breast imaging is a noninvasive and inexpensive cancer detection technology. Amongst, mammography is accepted as the most reliable and cost-effective imaging modality. However, its false-negative rates is high (up to 30%) (Elmore, Wells, & Carol, 1994; Rajentheran, Rao, Lim, & Lennard, 2001) . In addition, the danger of ionizing radiation and tissue density, which has been associated with increased cancer risk (Boyd, Byng, & Jong, 1995) , is linked with patient who underwent mammography screening. It is also uncomfortable, because the breast has to be compressed between flat surfaces to improve image quality. Furthermore, obtaining adequate images from radiologically dense breasts (with little fat) or in women with breast implants are difficult (Foster, 1998) , and it is difficult to detect breast cancer in young women (Gohagan, Rodes, Blackwell, & Darby, 2004) . Despite of these limitations, mammogram remains the gold standard for screenings (Gohagan et al., 2004; Moore, 2001) . Since early detection is important, new technologies such as Magnetic Resonance Imaging (MRI), Positron Emission Tomography (PET), Computed Tomography-Single Photon Emission Computed Tomography (CT-SPECT) (Del Guerra, Di Domenico, Fantini, & Gambaccini, 2003) , and ultrasound have been applied as complement to mammogram (Ng & Fok, 2003) . Fig. 1 and Table 1 show the available modalities for breast cancer detection at present, and the reported accuracy, respectively. Note that the reported accuracy is Fok et al., 2002) . only an estimate because these modalities perform differently on different types of breast cancer, on different age group, apart from the fact that most of the tests are done on small populations. As shown in Table 1 , none of the methods possesses high sensitivity (correctly identify women with breast cancer), and high specificity (correctly weed out women without breast cancer), albeit a lot of endeavors have been put in Foster (1998) , Moore (2001) . Each has its limitations. For example, clinical examination is insensitive, examinerdependent (McDonald, Saslow, & Alciati, 2004) ; biopsy is invasive, causes complications, leaves scars, and requires long recovery time (Imaginis, Breast Cancer Diagnosis, 2004; Simmon, Kalbhen, Cooper, & Flisak, 2000) ; MRI is inconsistent, costly and low-resolution (Cardillo, Starita, Caramella, & Cilotti, 2001) ; PET, CT-SPECT, is expensive and scarce; ultrasound images are of poor resolution (Kotre, 1993; Moore, 2001) , and operator-dependent (Chen, Chang, & Huang, 2000) ; microwave imaging requires accurate modeling of the relation between various tissues' frequency dependency, and its sensitivity is affected by many factors (Bond, Li, Hagness, & Van Venn, 2003; Fear, Hagness, Meaney, Okoniewski, & Stuchly, 2002; Kosmas, Rappaport, & Bishop, 2004) ; PEM (Thompson, Murthy, Picard, Weinberg, & Mako, 1995) is expensive and insensitive (Moses, 2004) ; FNA is operator-dependent (Pisano, Fajardo, Caudry, & Sneige, 2001) , and incurs complications (Lucas & Cone, 2003) ; Gene expression analysis on genes BRCA1 and BRCA2, whose mutations are associated with breast cancer, is difficult as the genes are highly complex. The costly blood storage worsens the matter (Spengler, 2003) ; MRS is technically demanding, and only of confimatory value to MRI (Cecil, Siegelman, & Lenkinski, 2001; He & Shkarin, 1999) ; EIS requires localization of lesion before hand (Glickman et al., 2002) , insensitive, and observerdependent (Malich et al., 2003) . These methods are often too cumbersome, costly inaccessible or invasive to be used as first-line detection modalities alongside clinical examination and mammography (Keyserlingk, Ahlgren, Yu, Belliveau, & Yassa, 2000; Qi & Diakides, 2003) .

Thus, thermogram appears as one of the most promising and suitable alternatives for preliminary screening (Amalu, 2003) . Thermogram monitors the breast health based on the heat pattern variation that correlates with the patients' medical condition (Gautherie, 1999; Head, Wang, Lipari, & Elliott, 2000) . It is cheap, noninvasive, simple, painless, low cost, and highly accurate if done right, safe (no side effect known), practical, and it requires no contact nor compression, no radiation or venous access (Aksenov et al., 2003; Bamberg, 2002; Gautherie, 1989; Head et al., 2000; Keyserlingk et al., 2000) . Infrared breast thermography can increase sensitivity at the critical early detection phase by providing an early warning of an abnormality that is not evident by other approaches (Keyserlingk et al., 2000) . It is able to warn women up to 10 years before a cancer is found (Amalu, 2003; Pacific Chiropractic and Research Center) . Furthermore, thermography is the only physical method that mediates significant information on breast physiology (Gautherie, 1989) . In contrast to other techniques, its result is independent of nodal status, and unrelated to age, tumor location (right or left breast), and estrogen, progesterone receptor status (Head et al., 2000) . Hence, thermogram plays a pivotal role in breast cancer, be it risk assessment (Amalu, 2003) , detection, diagnosis, or prognosis (Gautherie, 1989; Head et al., 2000) .

Unfortunately, despite of the strengths reported, thermogram is associated with some of the limitations such as environment-dependent, operator-dependent (Fok, Ng, & Tai, 2002; Ng & Fok, 2003) , not descriptive (Aksenov et al., 2003; Bamberg, 2002) , difficult to interpret (Amalu, 2003) , nonspecific (Jones, 1998) , inconsistent (Frize, Herry, & Roberge, 2002; Head, Hoekstra, Keyserlingk, Elliott, & Diakides, 2003) , and no standard analysis procedure (Ohashi & Uchida, 2000; Kaczmarek & Nowakowski, 2003) , as pointed out in Breast Cancer Detection Demonstration Projects (BCDDP). As a result, breast thermography is yet to be widely used and is not recommended by National Breast Cancer Centre (National Breast Cancer Centre Position Statement, 2004) . Apparently, thermogram performs no better than other modalities. All in all, if the thermography is done right, it offers a very powerful tool for fighting breast cancer. Thus, by providing decision aids using intelligent system (Ng & Fok, 2003; , good and consistent diagnosis performance can be maintained using breast thermography. At the same time, these intelligent tools can lighten the pressures upon the physicians, and ease the burden of examining large number of images (e.g., 1 million pairs of X-ray images per year is needed to be reviewed Kotre, 1993) . A summary of the use of complementing breast cancer detection modalities with intelligent tools is given in Table 2 .

As shown in Table 2 , intelligent tools contribute significantly in improving the breast cancer detection and prognosis. This is consistent with a recent review that computer-aided diagnosis shows incremental improvement in sensitivity (Irwig, Houssami, & van Vliet, 2004) . MLP or BP is the favorite algorithm to complement various modalities, in spite of its limitations such as slow learning, likely to be trapped in local minima, etc. SOM is another common adjunct for imaging modalities, albeit its poor classification performance, and high memory requirement. Statistical methods like LDA, Bayesian network, and logistic regression are often applied in assisting diagnosis and prognosis. However, statistical methods are difficult to develop, and oftentimes they work under the assumption that the underlying data is normally distributed. Whereas RBF has heavy computation and memory requirements, decision tree is limited in its representation power due to the use of crisp rule. On the other hand, evolving ANN, although it is able to achieve optimal performance, is time-consuming to develop since it may take a few hundreds to thousands runs before it can find the appropriate parameters. Furthermore, due to the stochastic nature of the algorithm, it may generate (Naguib et al., 1996) 55 inconsistent knowledge base. Most of all, these methods (except decision tree) do not provide any explanations for their computations and reasoning. As a result, the physicians have no way to validate the system operation, and hence, they find it difficult to trust the system. Complementary Learning Fuzzy Neural Network (CLFNN) is therefore proposed to be Computer-Assisted Intervention (CAI) for breast thermography. CLFNN is a neuroscience-inspired, evolving, and autonomous fuzzy neural network that based on positive and negative learning. CLFNN not only provides good performance in classification, but also fast in learning. Most importantly, CLFNN offers human-like reasoning as well as intuitive fuzzy rules to explain its computations. Since human observer's image interpretation is often lack of thoroughness and lack of consistency (Bick, 2000) , the capacity of CLFNN in providing cognitive interpretation on given thermogram is of great importance for aiding image analysis. Psychophysical evidence demonstrates that even imperfect prompts can enhance human ability in pattern detection (Kotre, 1993) . Therefore, CLFNN is believed to enhance the overall accuracy of breast thermography. On the other hand, most of the disease detection works in CAI adopted physic or physically inspired models (Ellis & Peters, 2004) , statistical methods such as Bayesian theory and nearest neighbor (Sajda, Spence, & Parra, 2003) , or Artificial Neural Network (ANN) (Frigyesi, 2003; Joo, Yang, Moon, & Kim, 2004) . These methods however possess some shortcomings: statistical methods and ANN do not justify, and provide no explanation for their computation. As a result, the output is difficult to trust because it comes without reason. As for model-based system, it is difficult to develop, and many a times requires assumption to be made. This applies to statistical methods as well, as many statistical methods assume that the data is normally distributed. Conversely, other than superior accuracy, CLFNN provides positive and negative fuzzy rules to reason its decisions, and this reasoning is closely akin to diagnostician's decision-making process. These rules not only can be used to countercheck physician's diagnosis, they could potentially guide junior physician. Besides, CLFNN can also be adopted to confirm or investigate hypothesis associated with breast cancer such as women having family history of breast cancer belong to high risk group (Cancer Research UK, 2002) , temperature difference between left and right breast suggests possible case of cancer (Gautherie, 1989) , and so on.

FALCON-AART is a CLFNN that forms its fuzzy partitions based on visual cortical plasticity, and adjusts its parameters based on psychological theory of learning ) (for details, see . It generates fuzzy rules autonomously in the form described by Eq. (1).

The fuzzy rule in Eq. (1) is an example of a system with two inputs and two outputs. It consists of five elements:

1. Input linguistic variables (x 1 , x 2 ). 2. Input linguistic terms (A, B). This represents fuzzy entities such as tall, short, thin, fat, and so on. FALCON-AART represents input linguistic terms by using trapezoidal membership function. 3. If-Then rule: links the antecedent part (i.e., input linguistic variables and terms) with the consequent part (i.e., output linguistic variables and terms). 4. Output linguistic variables (y 1 , y 2 ). 5. Output linguistic terms (C, D).

FALCON-ART has five layers and each layer is mapped onto the elements of the fuzzy rule (Fig. 2) . Before training commences, FALCON-AART consists of input and output layers only. As training progresses, FALCON-AART evolves and automatically constructs its hidden layer by modified Fuzzy ART algorithm . This algorithm is based on complementary learning paradigm that comprises positive (learn from positive patterns) and negative learning (learn from negative patterns). The modified fuzzy ART algorithm (known as Another ART) improves Fuzzy ART (Baraldi & Bonda, 1999) by functionally models and incorporates the human visual cortical plasticity. With this, FALCON-AART structural learning becomes a function of time (age), which enables FAL-CON-AART to alleviate the stability-plasticity dilemma as well as to avoid the problem of generating bad clusters as suffered by most competitive learning algorithm. It dynamically partitions the input and output spaces into trapezoidal fuzzy clusters, and subsequently these clusters are finetuned using modified adaptive back-propagation algorithm . The tuning is done simultaneously to the slope and the location of fuzzy sets. When new training patterns are presented, the stored cluster will resonate if the new training patterns are sufficiently similar to them. The resonant cluster will then expand to incorporate these patterns using the Another ART algorithm. Training terminates when the mean square errors between two consecutive epochs are sufficiently equal.

The neural memory structure between Layers 2 and 3 is the construct of the complementary learning. Complementary learning refers to positive and negative learning, which is believed to be a mechanism underlies human recognition. When a positive pattern is presented, positive rules will be excited, and negative rules will be inhibited simultaneously, and vice versa. The complementary learning is often practiced in daily life: a child will learn how to recognize an apple more efficiently, if he/she were presented an apple (positive pattern) and other fruits (negative patterns).

Likewise, a radiologist will have to have seen/learned, both abnormal medical image (positive learning) and normal medical images (negative learning), in order for him/ her to recognize or analyse the images effectively. Evidences for this complementary learning can be drawn from vari-ous neuroscience studies. For instance, hippocampus possesses both positive and negative reinforcement signals; the existence of excitatory (positive) and inhibitory (negative) neurotransmitter systems inside human brain, etc. As shown in Fig. 3 , different objects are registered into different brain areas, lending further support to the complementary learning conjecture. Hence, whenever a car is presented (positive), only areas registered for car (positive rules) will be activated, while the areas registered for other objects (negative rules) will be inhibited simultaneously.

Thus, FALCON-AART functionally models the biological complementary learning, and is formalized as Eqs. (2) and (3). Given a positive sample, {x + = (x 1 , x 2 , . . . , x I ), d = 1}, x 2 U, d 2 V, and l R þ ðxÞ ¼ membership function of positive rule, l R À ðxÞ ¼ membership function of negative rule, then: Fig. 3 . Slices of fusiform gyrus of car and bird expert in face, car, and bird recognition. The rectangular boxes show the activated areas of brain for different recognition task (Adapted from Gauthier et al., 2000) . 

Hence, whenever a positive sample is presented to the system, l R þ ðx þ Þ > l R À ðx À Þ, which leads to a correct decision, i.e., d = 1.

The thermograms are obtained from voluntary patients at the Singapore General Hospital (SGH) (Ng, Fok, Ng, & Sim, 2001; . The thermograms are captured using the AVIO thermal camera TVS-2000 MkIIST system. The thermography process is shown in Fig. 4 .

The patient's thermal image is captured using thermal camera. The imager component of thermal system converts infrared emitted by the object under observation into electrical signals. Subsequently, the processor components collects these signals, store them in frame memory, then displays them on a LCD display, either as real-time sixteen bit color or monochrome thermographic images. The thermogram is stored, and feature extraction is done to compute the temperatures of the left and right breasts using the AVIO software. Example of thermogram is given in Fig. 5 .

The volunteers are between the ages of 27 and 90. Screening was carried out from 9.00 am to 11.30 pm of the day as this is the most stable period (Gautherie, 1989) . All volunteers were briefed the methodology and process of thermography in advance in order to relief them from any possible emotional stress as well as to obtain their consent. They were advised not to put on any powder, ointments, perfume, or any other wipes that will affect the conduction through the skin, around regions to be examined. Before the examination was carried out, volunteers were required to rest for 15-20 min for acclimatization to room temperature upon arrival at the examination room. This is important to keep patients in basal metabolic rate which will result minimal surface temperature changes for satisfactory thermograms. Since standardized ambient conditions are necessary to minimize variations in thermography, the ambient temperature was carefully observed for the examination. The examination environment was a controlled, air-conditioned room maintained at an ambient temperature of 20-22°C (maximum variation is ±0.1°C), with humidity between 55% and 65%. Direct draughts are avoided in the areas where the patient is positioned. Volunteers wore loose gowns that do not restrict airflow for equilibration and do not constrict the skin surface during this equilibration period. It was ensured that patients were within the period of the 5th-12th and 21st day after the onset of menstrual cycle as this is the most suitable period for imaging. This is because women body temperature is known to be stable in this period (Gautherie, 1989) , and the vascularisation is at basal level with least engorgement of blood vessels (Ng et al., 2001) .

Three thermograms were taken for each patient: one front view and two lateral views. There are total of 78 patients with 28 healthy patients, 43 benign tumor patients, and 7 cancer patients. Mean, median, mode, standard deviation and skewness of each breast temperature are extracted from front-view thermograms using histograms of the temperature distribution, and calculated using the Statistical Package for the Social Sciences (SPSS). Population of patients is shown in Table 3 . Table 3 shows that carcinoma patients generally have higher breast temperature compared to healthy patients.

AVIO Thermal Camera AVIO system (with viewing monitor)

Computer viewing printer Fig. 4 . Thermography process. Fig. 5 . Thermogram of (a) healthy patient-symmetrical temperature (b) unhealthy patient-unsymmetrical temperature.

This temperature difference arises because the cancerous breast has higher metabolism. The blood vessels in the vicinity of the tumor are engorged with blood and therefore, cancerous breast emits more heat .

The experiment is to diagnose whether a patient belongs to normal, benign, or malignant based on breast temperatures extracted from thermogram. Five types of file are created, and three training/testing sets are created for each type of file for cross-validation purpose. Each of the stratified training sets contains randomly selected 50% samples from the dataset, and the remaining unseen samples made up the testing sets. The sets are presented below:

• File FH: contains patient age, family history, hormone replacement therapy, age of menarche, presence of palpable lump, previous surgery/biopsy, presence of nipple discharge, breast pain, menopause at age above 50 years, and first child at age above 30 years. • File T: contains mean, median, modal, standard deviation and skewness of temperature for left and right breasts. • File TH: combination of FH and T. • File TD: contains temperature difference of mean, median, modal, standard deviation and skewness for left and right breasts. • File TDH: combination of TD and FH.

The averaged performance of FALCON-AART is benchmarked against Linear discriminant analysis (LDA) (Hanm & Kamber, 2001) , k-Nearest neighbor (kNN) (Hanm & Kamber, 2001) , Naï ve Bayesian (Hanm & Kamber, 2001) , logistic regression (LR) (Hanm & Kamber, 2001) , Self-Organizing Map (Chen et al., 2000) , Radial Basis Function (RBF) (Hanm & Kamber, 2001) , Support Vector Machine (SVM) (Hanm & Kamber, 2001) , C4.5 (Hanm & Kamber, 2001) , Multilayer Perceptrons (MLP) (Hanm & Kamber, 2001) . Apart from that, comparison is made with FALCON-AART ancestors: FALCON-ART (Lin & Lin, 1997) and FALCON-MART (Tung & Quek, 2001) . The result is listed in Table 4 . Recall refers to the classification accuracy on the training set, whereas predict refers to the classification accuracy on testing set.

It is shown that FALCON-AART outperforms the common methods in medical image analysis and its ancestors in all the training/testing sets. While having good recall and relatively superior generalization capability, the aver-age training time of FALCON-AART is significantly shorter than MLP, SOM, and LR. Though statistical algorithms require only one pass of training dataset, it does not necessarily means they are faster than FALCON-AART as this depends on the computational complexity of the algorithm. In this particular case, FALCON-AART is as fast as kNN, LDA, SVM, and Naïve Bayesian classifier in learning (%245 ms). In contrast to statistical methods, FAL-CON-AART did not make assumption on the data distribution, and this may give superior classification performance even for non-normally distributed data. Note that this result is not comparable to the one in Tables 1 and 2 as this is a different classification task. This classification task involves normal, benign, and malignant whereas the task in Tables 1 and 2 involves only benign and malignant. In other words, from the experimental result shown in Table 4 , complementary learning displays superior capacity in multi-class classification than conventional methods.

One significant advantage FALCON-AART offers is the ability to explain its computed output. In contrast to conventional methods, FALCON-AART constructs intuitive positive and negative fuzzy rules dynamically to depict its reasoning process; these rules can be scrutinized by the physicians and decide upon whether to adopt the system suggestion. In addition, accurate rules identified may be used as a guideline for inexperience physicians in diagnosis. As shown in Table 4 , rule generation capability of FAL-CON-AART is better than its ancestor, in which lesser rules are generated but greater accuracy are attained. Some authors have proposed a few criteria for measuring system interpretability: compactness (lesser number of rule in rule base), coverage (every value in universe of discourse should belong to one of the rule), normality (every rule has at least one pattern exhibit full-matching), and so on (Casillas, Cordó n, Herrera, & Magdalena, 2003) . FALCON-AART learning is a data-centered learning and therefore, it fulfills the coverage and normality criteria. From this experiment, it can be seen that FALCON-AART generates a smaller rule base then its ancestors. Thus, from this aspect, FAL-CON-AART offers a more interpretable system than its ancestors. Examples of the rules generated are given in Table 5 .

As shown in Table 5 , fuzzy rules generated by FAL-CON-AART are highly similar to the diagnostic rules practiced by diagnosticians. Aside from the capacity for uncertainty handling (allowing vagueness in linguistic terms), FALCON-AART rule is relatively more expressive compared to decision-tree rule. FALCON-AART rule encapsulates unnecessary details using linguistic term, and allows the use of linguistic hedges such as ''very'', ''rather'', etc. Moreover, rules generated by FALCON-AART do not have the confusing repeated antecedent term as in decision tree. Furthermore, because FALCON-AART adopts complementary learning, positive and negative rules are generated. This, aside from better classification performance, models the problem space closer than positive or negative learning systems (system with only positive or negative rule base) because no assumptions are made for the uncovered space by the rule base. Fig. 6 depicts the FALCON-AART reasoning process. As shown in Fig. 6 , the reasoning process of FALCON-AART is closely akin to how a diagnosis is made: A diag- nostician will first observe (presents sample), generates a set of hypotheses (a set of rules), evaluates each hypotheses (compute matching degree of rules), and subsequently derives the conclusion. This human-like reasoning, together with the fuzzy rules generated, which provide insights and interpretations to the thermograms, are useful to aid diagnosti-cian. Table 6 shows the similarity between FALCON-AART and thermogram analyst's reasoning process. As shown, there is one-to-one mapping of the reasoning process, suggesting the closeness between the two reasoning processes. This is paramount as it facilitates the physicians in analyzing or validating a system, in that he/she can do so in his/her familiar terms, as well as in his/her familiar thought process.

FALCON-AART can be used to assess/affirm certain medical hypothesis as well. For example, from Table 4 , one can see that the classification accuracy of training/testing set using only breast temperatures alone is lower than that of using breast temperatures and family history. This confirms that family history is important risk factor for breast cancer, lending support to the hypothesis that women who have family history of breast cancer belong to the highrisk group. Another example: the performance of FAL-CON-AART trained on files TH (FA TH ) and TDF (FA TDF ) seemed to be inconsistent with the belief that temperature asymmetry between left and right breast suggests possible case of cancer. This happens because the classification task is to classify three classes, instead of classifying out the cancerous case. In fact, the temperature asymmetry between left and right breast may be more useful in determining the stages of cancer instead of cancer detection (Usuki, Maeta, Maeba, & Wakabayashi, 2000) . Nevertheless, the result of detecting cancerous case is illustrated in Task 1 of Table 7 .

Though both FA TH and FA TDF attain same accuracy, FA TDF is better when assessed using Receiver Operating Curve (ROC) plot, which suggests that it is relative easier to classify using temperature asymmetry of left and right breasts. The 45°line signifies the random guessing. As shown in Fig. 7 , FALCON-AART trained on either files deviates far away from the 45°line, achieving good performance for breast cancer detection. The Area Under the Curve index (A Z ) is often used in ROC analysis. A Z = 0.5 symbolizes random guessing, and the closer A Z is to 1.0, the better the classifier is. A Z for FA TH and FA TDF are 0.867 and 0.93, respectively, hence, confirming that asym-metry temperature between the left and right breast is an alarm for breast cancer, and the fact that FALCON-AART is a competent classifier.

Thermogram is often employed to detect the presence of breast tumor. Hence, experiment to classify patient with breast tumor is conducted using FALCON-AART. The result is summarized in Task 2 of Table 7 . The experimental result reveals that FALCON-AART can detect patient with breast tumor accurately. Therefore, FALCON-AART could assist the physicians in identifying suspected cases where follow-ups are needed. With overall performance close to 90%, good recall and generalization capability is exhibited by FALCON-AART.

Sometimes, it is desired to classify benign and malignant breast cancer. Misdiagnose benign breast tumor as malignant causes unnecessary physical and emotional agony, because the only way to remove breast tumor is surgical Determine the consequent linked by the winning rule Determine the conclusion derived from the knowledge applied 5

Perform defuzzification and outputs the conclusion Give the diagnostic conclusion and decision Task 3 of Table 7 demonstrates that FALCON-AART is able to assist in this diagnostic task as well. Giving an overall accuracy about 93%, FALCON-AART demonstrates its competency in tumor classification task. This shows that complementary learning paradigm is a promising recognition approach. From the results presented in Tables 1, 2 and 7, complementary learning exhibits itself as a promising tool for aiding breast cancer diagnosis. Applying FALCON-AART with thermogram shows an improved performance in cancer detection as well as breast tumor classification. This confluence of thermography and CLFNN subsides the problem of high variability in accuracy of breast thermogram analysis. Besides, sensitivity and specificity are offered as high/ higher than the reported accuracy on breast thermography alone, as well as other modalities. However, CLFNN is not to replace, rather, is to complement the breast thermography and to assist the physicians in breast cancer diagnosis. The contribution of CLFNN-breast thermography in enhancing the consistency of breast cancer diagnosis accuracy is believed to bring forth better patient outcome. Comparing the results of Tables 2 and 7, confluence of CLFNN and breast thermography shows a superior performance in breast cancer detection over different conventional methods in medical diagnosis and medical imaging analysis. Medley of CLFNN and breast thermography gives as accurate result, if not better, compared to other combinations of ANN and breast imaging modalities in tumor classification and detection. In general, CLFNN has relatively good generalization capability, in that it can classify well using only a small fraction of the data. Together, this supports the application of CLFNN and breast thermography. This also suggests that the confluence of breast thermography and CLFNN is a promising system for fighting breast cancer.

In this study, it is shown that CLFNN complements breast thermography in various ways. The combination of breast thermography and CLFNN gives better or more consistent result than using breast thermography alone. Whether it is cancer detection, tumor classification or breast-cancer diagnosis (multi-class problem), CLFNN outperforms conventional methods, showing the strength of complementary learning in recognition task. FALCON-AART assists the physicians in different diagnostic tasks by providing relative accurate decision support, and hence could potentially enhance patient outcome. FALCON-AART not only gives superior result than conventional methods, but it also offers intuitive positive and negative fuzzy rules to explain its reasoning process. FALCON-AART satisfies the criteria of an interpretable system: normality, compactness, coverage, and therefore is a more interpretable system. The rules generated are useful because it gives insight to the problem space, provides simple cognitive interpretation of medical image, and could potentially serve as guidelines or arguments for its decision to the physicians. Apart from assisting physician in diagnosis, FALCON-AART can also be used to investigate or to support hypothesis associated with the problem domain, i.e., concept validation (Qi & Diakides, 2003) . In this study, only two hypotheses were analyzed. In future, more hypotheses can be assessed using CLFNN by proper experiment setup. Examples are thermal challenge test (Eccles, 2003) , cold stress (Usuki et al., 2000) or cooling-rewarming tests (Gautherie, 1999) (outside cooling of the breast will increase the temperature contrast if the breast is cancerous), injection of vasoactive substances (Gautherie, 1999) , microwave or ultrasonic irradiation (Gautherie, 1999) and so on. Likewise, FALCON-AART can be applied with advanced technologies, which provides more information in thermography: dynamic thermography (Ohashi & Uchida, 2000) , 3-dimensional thermography (Aksenov et al., 2003) , or thermal texture map (Hassan, Hattery, & Gandjbakhche, 2003) , or Dynamic Area Telethermometry (DAT) (Anbar et al., 2000) . Conversely, FALCON-AART can complement thermogram in other application areas such as injuries monitoring (Bamberg, 2002) , neurology, vascular disorders (e.g., diabetes), rheumatic diseases, tissue viability, oncology (especially breast cancer), dermatological disorders, neonatal, ophthalmology, surgery (Jones, 1998) , as well as Severe Acute Respiratory Syndrome (SARS) . Alternatively, CLFNN can be used to complement other medical imaging modalities such as MRI, MRS, PET, etc., as well as to serve as a concept validation tool for techniques such as nipple fluid bFGF (Liu, Wang, Chang, Barsky, & Nguyen, 2000) , Electrical Impedance Tomography (EIT) (Cherepenin et al., 2001) , etc. In current study, FALCON-AART does not perform feature analysis, which is an important area that may improve the system performance and deserved to be studied, as recognition requires one to make decision based on some ''important features''. Moreover, performing feature analysis can reduce the number of antecedents of the rule, and hence improve the interpretability of the system. This will be investigated in future.

An evolutionary artificial neural networks approach for breast cancer diagnosis

3D thermography for quantification of heat generation resulting from inflammation. 8th 3D modelling symposium

A review of breast thermography. International Academy of Clinical Thermology

Cancer facts and figures

Proceedings of 22nd annual international conference of the IEEE Engineering in Medicine and Biology

Designing breast cancer diagnostic systems via a hybrid fuzzy-genetic methodology

Breast cancer detection using rank nearest neighbor

Infrared thermography. Biomedical engineering seminar

A survey of fuzzy clustering algorithms for pattern classification-part B

The clinical breast examination. Harvard Pilgrim Health Care

BRCAPRO validation, sensitivity of genetic testing of BRCA1/ BRCA2, and prevalence of other breast cancer susceptibility genes

Computer-assisted data analysis in breast imaging

Microwave imaging via space-time beamforming for early detection of breast cancer

Quantitative classification of mammographic densities and breast cancer risk

Evaluation of a high-resolution, breast-specific, small-field-of-view gamma camera for the detection of breast cancer

Stereotactic core-needle breast biopsy: A multi-institutional prospective trial

Neural networks for measuring cancer outcomes

About breast cancer: Risks and causes

A neural tool for breast cancer detection and classification in MRI

Interpretability issues in fuzzy modeling

The evaluation of human breast lesions with magnetic resonance imaging and proton magnetic resonance spectroscopy

Breast cancer diagnosis using self-organizing map for sonography

A Neural Network for breast cancer detection using fuzzy entropy approach

Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines

Computer aided diagnosis of breast cancer in digitized mammograms

A dedicated system for breast cancer study with combined SPECT-CT modalities

Predicting breast cancer survivability: A comparison of three data mining methods

Three-dimensional ultrasoundvalidated large-core needle biopsy: Is it a realiable method for the histological assessment of breast lesions?

Infrared imaging in medicine worldwide. 1st international conference of Thermal Texture Maps (TTM) technology in medicine and engineering

Thermography -Its role in early breast cancer detection and pain monitoring

Variability in radiologists interpretation of mammograms

Mammography-guided stereotactic fine-needle aspiration cytology of nonpalpable breast leasions: Prospective comparison with surgical biopsy results

Near-field imaging for breast tumor detection

Report of the international workshop on screening for breast cancer

Evolving artificial neural network for screening features from mammograms

Early detection and visualization of breast tumor with thermogram and neural network

Thermographic detection of breast cancer

An automated method for the detection of pulmonary embolism in V/Q-scans

Processing of thermal images to detect breast cancer: Comparison with previous work

Annual Conference and the Annual Fall Meeting of the Biomedical Engineering Society] EMBS/BMES Conference

Atlas of breast thermography

Thermopathology of breast cancer: Measurement and analysis of in vivo temperature and blood flow

Expertise for cars and birds recruits brain areas involved in face recognition

Novel EIS postprocessing algorithm for breast cancer diagnosis

Individual and combined effectiveness of palpation, thermography and mammography in breast cancer screening

A neural network based model for prognosis of early breast cancer

Data mining: Concepts and techniques

Thermal texture map-A new technique for disease assessment and treatment monitoring. 1st International conference of Thermal Texture Maps (TTM) technology in medicine and engineering

Proton magnetic resonance spectroscopy and imaging of human breast cancer by selective multiple quantum coherence transfer

Comparison of breast infrared imaging results by three independent investigators

The important role of infrared imaging in breast cancer

Sydney breast imaging accuracy study: Comparative sensitivity and specificity of mammography and sonography in young women with symptoms

Gene expression predictors of breast cancer outcomes

Computer-aided diagnosis: Analysis of mammographic parenchymal patterns and classification of masses on digitized mammograms

New technologies in screening for breast cancer: A systematic review of their accuracy

Thermal signatures for breast cancer screening comparative study

A reappraisal of the use of infrared thermal image analysis in medicine

Computer-aided diagnosis of solid breast nodules: Use of an artificial neural network based on multiple sonographic features

Analysis of transient thermal processes for improved visualization of breast cancer using IR imaging

Functional infrared imaging of the breast

Using neural networks to select wavelet features for breast cancer diagnosis

Modeling with the FDTD method for microwave breast cancer detection

Image processing in the fight against breast cancer

Application of a new evolutionary programming/adaptive boosting hybrid to breast cancer diagnosis

Breast cancer screening using evolved neural networks

Breast biopsy -Changing patterns during a five-year period

Computerized diagnostics in digital mammography. 19th Convention of electrical and electronics engineers in Israel

Positron emission mammography: Initial clinical results

Clinical comparison of full-field digital mammography and screen-film mammography for detection of breast cancer

An ART-based fuzzy adaptive learning control network

Diagnosing breast cancer based on support vector machines

Breast-cancer diagnosis with nipple fluid bFGF

Application of artificial neural networks for diagnosis of breast cancer

Breast cyst aspiration

Electrical impedance scanning as a new imaging modality in breast cancer detection-a short review of clinical value on breast application, limitations and perspectives

Performance and reporting of clinical breast examination: A review of the literature

Large-core needle biopsy of nonpalpable breast lesions

Better breast cancer detection

Positron emission mammography imaging

Results of preliminary clinical trails of the positron emission mammography system PEM-I: A dedicated breast imaging system producing glucose metabolic images using FDG

The detection of nodal metastasis in breast cancer using neural network techniques

A framework for early discovery of breast tumor using thermography with artificial neural network

Breast tumor detection by thermography. Applied research fund (RG69/98). School of Mechanical and Production Engineering

Computerized detection of breast cancer with artificial intelligence and thermograms

Noninvasive diagnosis of breast cancer using thermography with artificial neural network

Applying dynamic thermography in the diagnosis of breast cancer

MR imaging of the breast

Breast cancer and early detection

Association, statistical, mathematical and neural approaches for mining breast cancer patterns

Fine-needle aspiration biopsy of nonpalpable breast lesions in a multicenter clinical trial: Results from the radiologic diagnostic oncology group V

Role of mammography, ultrasound and large core biopsy in the diagnostic evaluation of papillary breast lesions

Thermal infrared imaging in early breast cancer detection -A survey of recent research

Palpable breast cancer which is mammograpically invisible

Complementary imaging of solid breast lesions: Contribution of ultrasonography, fine-needle aspiration biopsy, and high-field and low-field MR imaging. Academic dissertation. Faculty of Medicine

A multi-scale probabilistic network model for detection, synthesis, and compression in mammographic image analysis

A neural network made of a Kohonen's SOM coupled to a MLP trained via backpropagation for the diagnosis of malignant breast cancer from digital mammograms

Computer-aided detection of breast cancer nuclei

Improved detection of breast cancer nuclei using modular neural networks

Prognostic comparison of statistical neural and fuzzy methods of analysis of breast cancer image cytometric data

Accuracy and complication rates of US-guided vacuum-assisted core breast biopsy: Initial results

Breast cancer evaluation

Breast cancer (BRCA) gene testing, healthwise

Solid breast nodules: Use of sonography to distinguish between benign and malignant lesions

Ipsilateral-mammogram computeraided detection of breast cancer

FALCON-AART: An improved version of FALCON-ART

Complementary learning fuzzy neural network for medical domain and bioinformatics domain applications. First year report

Breast cancer diagnosis using thermography and complementary learning fuzzy neural network

In vitro diagnosis of axillary lymph node metastases in breast cancer by spectrum analysis of radio frequency echo signals

Positron emission mammography (PEM): A promising technique for detecting breast cancer

An image analysis system for automated detection of breast cancer nuclei

A novel approach to the derivation of fuzzy membership functions using Falcon-MART architecture. Pattern Recognition Letters

Standardization of thermographic breast cancer detection-role of qualitative findings and quantitative findings

Gene expression profiling predicts clinical outcome of breast cancer

Thermal analysis of infra-red mammography

A novel approach toward development of a rapid blood test for breast cancer

Computer-assisted diagnosis of breast cancer using a data-driven Bayesian belief network

Neural networks for breast cancer diagnosis

Human breast lesions: Characterization with contrast-enhanced in vivo proton MR spectroscopy-initial results