Knowledge-Based Systems 74 (2015) 133–150
Contents lists available at ScienceDirect
Knowledge-Based Systems
journal homepage: www.elsevier.com/locate/knosys
Intuitionistic fuzzy recommender systems: An effective tool for medical
diagnosis
Le Hoang Son ⇑, Nguyen Tho Thong
VNU University of Science, Vietnam National University, Viet Nam
a r t i c l e
i n f o
Article history:
Received 12 May 2014
Received in revised form 6 October 2014
Accepted 10 November 2014
Available online 20 November 2014
Keywords:
Accuracy
Fuzzy sets
Intuitionistic fuzzy collaborative filtering
Intuitionistic fuzzy recommender systems
Medical diagnosis
a b s t r a c t
Medical diagnosis has been being considered as one of the important processes in clinical medicine that
determines acquired diseases from some given symptoms. Enhancing the accuracy of diagnosis is the
centralized focuses of researchers involving the uses of computerized techniques such as intuitionistic
fuzzy sets (IFS) and recommender systems (RS). Based upon the observation that medical data are often
imprecise, incomplete and vague so that using the standalone IFS and RS methods may not improve the
accuracy of diagnosis, in this paper we consider the integration of IFS and RS into the proposed methodology and present a novel intuitionistic fuzzy recommender systems (IFRS) including: (i) new definitions
of single-criterion and multi-criteria IFRS; (ii) new definitions of intuitionistic fuzzy matrix (IFM) and
intuitionistic fuzzy composition matrix (IFCM); (iii) proposing intuitionistic fuzzy similarity matrix
(IFSM), intuitionistic fuzzy similarity degree (IFSD) and the formulas to predict values on the basis of
IFSD; (iv) a novel intuitionistic fuzzy collaborative filtering method so-called IFCF to predict the possible
diseases. Experimental results reveal that IFCF obtains better accuracy than the standalone methods of
IFS such as De et al., Szmidt and Kacprzyk, Samuel and Balamurugan and RS, e.g. Davis et al. and Hassan
and Syed.
Ó 2014 Elsevier B.V. All rights reserved.
1. Introduction
In this section, we formulate the medical diagnosis problem and
give some illustrated examples in Section 1.1. Section 1.2 describes
the relevant works using the intuitionistic fuzzy sets for the medical diagnosis problem. Section 1.3 summarizes the limitations of
those relevant works, and based on these facts the motivation
and ideas of the proposed approach are highlighted in Section 1.4.
Section 1.5 demonstrates our contributions in details, and their
novelty and significance are discussed in Section 1.6. Lastly,
Section 1.7 elaborates the organization of the paper.
1.1. The medical diagnosis problem
Medical diagnosis has been being considered as one of the most
important and necessary processes in clinical medicine that
determines acquired diseases of patients from given symptoms.
According to Kononenko [20], diagnosis commonly relates to the
probability or risk of an individual developing a particular state
of health over a specific time, based on his or her clinical and
⇑ Corresponding author at: 334 Nguyen Trai, Thanh Xuan, Hanoi, Viet Nam. Tel.:
+84 904171284; fax: +84 0438623938.
E-mail addresses: , (L.H. Son).
/>0950-7051/Ó 2014 Elsevier B.V. All rights reserved.
non-clinical profile. It is useful to minimize the risk of associated
health complications such as osteoporosis, small bowel cancer
and increased risk of other autoimmune diseases. Mathematically,
its definition is stated as follows.
Definition 1 (Medical diagnosis). Given three lists: P = {P1, . . ., Pn},
S = {S1, . . ., Sm} and D = {D1, . . ., Dk} where P is a list of patients, S a
list of symptoms and D a list of diseases, respectively. Three values
n, m, k 2 N+ are the numbers of patients, symptoms and
diseases, respectively. The relation between the patients and the
symptoms is characterized by the set- RPS ¼ fRPS ðP i ; Sj Þj8i ¼
1; . . . ; n; 8j ¼ 1; . . . ; mg where RPS(Pi, Sj) shows the level that patient
Pi acquires symptom Sj and is represented by either a numeric
value or a (intuitionistic) fuzzy value depending on the domain of
the problem. Analogously, the relation between the symptoms and
the diseases is expressed as RSD ¼ fRSD ðSi ; Dj Þj8i ¼ 1; . . . ; m; 8j ¼
1; . . . ; kg where RSD(Si, Dj) reflects the possibility that symptom Si
would lead to disease Dj. The medical diagnosis problem aims to
determine the relation between the patients and the diseases
described by the set- RPD ¼ fRPD ðP i ; Dj Þj8i ¼ 1; . . . ; n; 8j ¼ 1; . . . ; kg
where RPD(Pi, Dj) is either 0 or 1 showing that patient Pi acquires
disease Dj or not. The medical diagnosis problem can be shortly
represented by the implication fRPS ; RSD g ! RPD .
134
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Example 1. Consider the dataset in [31] having four patients
namely P = {Ram, Mari, Sugu, Somu}, five symptoms S = {Temperature, Headache, Stomach-pain, Cough, Chest-pain} and five diseases D = {Viral-Fever, Malaria, Typhoid, Stomach, Heart}. The
relations between the patients – the symptoms and the symptoms
– the diseases are illustrated in Tables 1 and 2, respectively.
The relation between the patients and the diseases determined
by the medical diagnosis is illustrated in Table 3. Since the domain
of the problem is the intuitionistic fuzzy values, this relation is also
expressed in this form. The most acquiring disease that the
patients suffer is expressed in Table 4, which is converted from
Table 3 by a trivial defuzzification method considering the maximal membership degree of disease among all.
Medical diagnosis is considered as an efficient support tool for
clinicians to make the right therapeutical decisions especially in
the cases that medicine extends its predictive capacities using
genetic data [5]. As being observed in Table 3, medical diagnosis
could assist the clinicians to enumerate the possible diseases of
patients accompanied with certain membership values. Thus, it is
convenient for clinicians, who are experts in this field, to quickly
diagnose and give proper medicated figures. This fact clearly shows
the importance of medical diagnosis in medicine sciences
nowadays.
P
Temperature
Headache
Stomach_pain
Cough
Chest_pain
Ram
Mari
Sugu
Somu
(0.8, 0.1)
(0, 0.8)
(0.8, 0.1)
(0.6, 0.1)
(0.6,
(0.4,
(0.8,
(0.5,
(0.2, 0.8)
(0.6, 0.1)
(0, 0.6)
(0.3, 0.4)
(0.6,
(0.1,
(0.2,
(0.7,
(0.1, 0.6)
(0.1, 0.8)
(0, 0.5)
(0.3, 0.4)
0.1)
0.7)
0.7)
0.2)
S
Viral_fever
Malaria
Typhoid
Stomach
Heart
Temperature
Headache
Stomach_pain
Cough
Chest_pain
(0.4,
(0.3,
(0.1,
(0.4,
(0.1,
(0.7, 0)
(0.2, 0.6)
(0, 0.9)
(0.7, 0)
(0.1, 0.8)
(0.3,
(0.6,
(0.2,
(0.2,
(0.1,
(0.1,
(0.2,
(0.8,
(0.2,
(0.2,
(0.1, 0.8)
(0, 0.8)
(0.2, 0.8)
(0.2, 0.8)
(0.8, 0.1)
0.3)
0.1)
0.7)
0.6)
0.9)
0.7)
0.4)
0)
0.7)
0.7)
Table 3
The relation between the patients and the diseases – RPD expressed by intuitionistic
fuzzy values.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
(0.4,
(0.3,
(0.4,
(0.4,
(0.7,
(0.2,
(0.7,
(0.7,
(0.6,
(0.4,
(0.6,
(0.5,
(0.2,
(0.6,
(0.2,
(0.3,
(0.2,
(0.1,
(0.2,
(0.3,
0.1)
0.5)
0.1)
0.1)
0.1)
0.6)
0.1)
0.1)
0.1)
0.4)
0.1)
0.3)
Definition 2. A Fuzzy Set (FS) [49] in a non-empty set X is a
function
ð1Þ
where l(x) is the membership degree of each element x 2 X. A fuzzy
set can be alternately defined as,
A ¼ fhx; lðxÞijx 2 Xg:
ð2Þ
An extension of FS that is widely applied to the medical prognosis problem is Intuitionistic Fuzzy Set (IFS), which is defined as
follows.
Definition 3. An Intuitionistic Fuzzy Set (IFS) [4] in a non-empty set
X is,
Table 2
The relation between the symptoms and the diseases – RSD.
0)
0.5)
0.7)
0.3)
0.7)
Computerized techniques for medical diagnosis such as fuzzy
set, genetic algorithms, neural networks, statistical tools and recommender systems aiming to enhance the accuracy of diagnosis
have been being introduced widely [20]. Nonetheless, an important issue in medical diagnosis is that the relations between the
patients – the symptoms (RPS) and the symptoms – the diseases
(RSD) are often vague, imprecise and uncertain. For instance, doctors could faced with patients who are likely to have personal
problems and/or mental disorders so that the crucial patients’
signs and symptoms are missing, incomplete and vague even
though the supports of patients’ medical histories and physical
examination are provided within the diagnosis. Even if information
of patients are clearly provided, how to give accurate evaluation to
given symptoms/diseases is another challenge requiring welltrained, copious-experienced physicians. These evidences raise
the need of using fuzzy set or its extension to model and assist
the techniques that improve the accuracy of diagnosis. The definition of fuzzy set is stated below.
l : X ! ½0; 1
x # lðxÞ;
Table 1
The relation between the patients and the symptoms – RPS.
0.1)
0.4)
0.1)
0.4)
1.2. The previous works
0.4)
0.1)
0.4)
0.4)
0.6)
0.7)
0.5)
0.4)
Table 4
The most acquiring diseases of patients.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
0
0
0
0
1
0
1
1
0
0
0
0
0
1
0
0
0
0
0
0
e¼
A
nD
E
o
x; le ðxÞ; ce ðxÞ jx 2 X ;
A
A
ð3Þ
where leðxÞ and ce ðxÞ are the membership and non-membership
A
A
degrees of each element x 2 X, respectively.
leA ðxÞ; ceA ðxÞ 2 ½0; 1; 8x 2 X;
0 6 leðxÞ þ ce ðxÞ 6 1; 8x 2 X:
A
A
ð4Þ
ð5Þ
The intuitionistic fuzzy index of an element showing the non-determinacy is denoted as,
peA ðxÞ ¼ 1 À leA ðxÞ þ ceA ðxÞ; 8x 2 X:
ð6Þ
When pe ðxÞ ¼ 0 for "x 2 X, IFS returns to the FS set of Zadeh.
A
Some extensions of fuzzy sets are not appropriate for modeling
uncertainty in the medical diagnosis such as the rough set [28],
rough soft sets [11,12,16], intuitionistic fuzzy rough sets [50] and
soft rough fuzzy sets & soft fuzzy rough sets [23]. The limitations
of these sets, as pointed out by Yao [48], Rodriguez et al. [30], Jafarian and Rezvani [17] and many other authors lie to their intrinsic
nature and how they are organized and operated such as (i) The
positive and the boundary rules are considered in rough sets and
their variants so that in cases of many concepts, the negative rules
would be redundant; (ii) The modeling of linguistic information is
limited due to the elicitation of single and simple terms that should
encompass and express the information provided by the experts
regarding the a linguistic variable; (iii) if exact membership
degrees cannot be determined due to insufficient information then
it is impossible to consider the uncertainty on the membership
135
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
function. Thus, these types of fuzzy sets could not be used for the
application of medical diagnosis.
The first approach for the medical diagnosis problem was
drawn from the Sanchez’s notion of medical knowledge [32]. Since
then several improvements of the Sanchez’s approach in association with IFS and other advanced fuzzy sets have been introduced.
De et al. [9] fuzzified the relations between the patients – the
symptoms and the symptoms – the diseases by intuitionistic fuzzy
memberships and derived the relation between the patients and
the diseases by means of intuitionistic fuzzy relations. The algorithm contains the following steps.
1. Calculate the relation between the patients and the diseases
by intuitionistic fuzzy relations with the membership and
non-membership functions being expressed in Eqs. (7) and
(8), respectively.
È
É
lPD ðPi ; Dj Þ ¼ max minflPS ðPi ; Sl Þ; lSD ðSl ; Dj Þg ;
Table 6
The SPD matrix where bold values imply the most possible disease.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
0.35
0.2
0.35
0.35
0.68
0.08
0.68
0.68
0.57
0.32
0.57
0.44
0.04
0.57
0.04
0.18
0.08
0.05
0.05
0.18
Table 7
The WPD matrix.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
(0.4,
(0.3,
(0.4,
(0.4,
(0.7,
(0.2,
(0.7,
(0.7,
(0.6,
(0.4,
(0.6,
(0.5,
(0.2,
(0.6,
(0.2,
(0.3,
(0.2,
(0.2,
(0.2,
(0.3,
0.9)
0.5)
0.9)
0.9)
0.9)
0.4)
0.9)
0.9)
0.9)
0.6)
0.9)
0.7)
0.6)
0.9)
0.6)
0.6)
0.4)
0.5)
0.5)
0.6)
l¼1;m
8i 2 f1; . . . ; ng;
8j 2 f1; . . . ; kg;
È
ð7Þ
É
Table 8
The reduction matrix where bold values imply the most possible disease.
cPD ðPi ; Dj Þ ¼ min maxfcPS ðPi ; Sl Þ; cSD ðSl ; Dj Þg ;
l¼1;m
8i 2 f1; . . . ; ng;
8j 2 f1; . . . ; kg:
ð8Þ
2. Perform the defuzzification through the SPD,
SPD ¼ lPD À cPD Â pPD :
ð9Þ
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
0.4
0.3
0.4
0.4
0.7
0.2
0.7
0.7
0.6
0.4
0.6
0.5
0.2
0.6
0.2
0.3
0.2
0.2
0.2
0.3
3. Determine the most acquiring diseases of patients based on
the maximal SPD and minimal pPD.
Example 2. Consider the dataset in Example 1. The relation
between the patients and the diseases calculated by Eqs. (7) and
(8) is expressed in Table 5. The SPD matrix is described in Table 6.
Based upon this table, Ram, Sugu and Somu suffer from the Malaria
and Mari acquires Stomach the most.
Samuel and Balamurugan [31] improved the method of De et al.
[9] by a new technique named intuitionistic fuzzy max–min composition. This method is analogous to that of De et al. [9] except
that Steps 2 & 3 are replaced by,
1. Compute W PD ¼ ðlPD ; 1 À cPD Þ.
2. For each Pi find maxj fminðlPD ðP i ; Dj Þ; 1 À cPD ðP i ; Dj ÞÞg and
conclude the most acquiring diseases.
1. Use the Hamming or Euclidean function to calculate the relation
between the patients and the diseases as in Eqs. (10) and (11),
respectively.
m
À
1 X
l ðP i ; Sl Þ À l ðSl ; Dj Þ þ jc ðPi ; Sl Þ
PS
PS
SD
2m l¼1
Á
ÀcSD ðSl ; Dj Þ þ jpPS ðPi ; Sl Þ À pSD ðSl ; Dj Þj ;
ð10Þ
RPD ðPi ; Dj Þ ¼
m
À
Á
1 X
lPS ðPi ; Sl Þ À lSD ðSl ; Dj Þ 2
2m l¼1
À
Á2
þ cPS ðP i ; Sl Þ À cSD ðSl ; Dj Þ
1=2
þðpPS ðPi ; Sl Þ À pSD ðSl ; Dj ÞÞ2
:
RPD ðPi ; Dj Þ ¼
ð11Þ
Example 3. Consider again the dataset in Example 1. The WPD
matrix is shown in Table 7. The reduction of WPD is presented in
Table 8. From this table, Ram, Sugu and Somu suffer from the
Malaria and Mari acquires Stomach the most.
2. Conclude the possible diseases of patients based on the minimal
distance criterion.
Another approach for the medical diagnosis is utilizing the
distance functions to calculate the relation between the patients
and the diseases from the relations between the patients – the
symptoms and the symptoms – the diseases as described in
[42–44,19,33]. The general activities of these algorithms are,
Example 4. Use this method for the dataset in Example 1, we have
the relations between the patients and the diseases by the Hamming (Table 9) or Euclidean function (Table 10). The most acquiring diseases of patients are highlighted in bold.
Table 5
The relation between the patients and the diseases – RPD in the method of De et al. [9]
expressed by intuitionistic fuzzy values.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
(0.4,
(0.3,
(0.4,
(0.4,
(0.7,
(0.2,
(0.7,
(0.7,
(0.6,
(0.4,
(0.6,
(0.5,
(0.2,
(0.6,
(0.2,
(0.3,
(0.2,
(0.2,
(0.2,
(0.3,
0.1)
0.5)
0.1)
0.1)
0.1)
0.6)
0.1)
0.1)
0.1)
0.4)
0.1)
0.3)
0.4)
0.1)
0.4)
0.4)
0.6)
0.5)
0.5)
0.4)
Besides these approaches, some authors have extended them
for special cases, e.g. multi-criteria medical diagnosis and the multiple time intervals modeling for the relation between the patients
and the symptoms. This requires the deployment on other
advanced fuzzy sets such as the type-2 fuzzy sets [26], the interval-valued intuitionistic fuzzy sets [2], fuzzy soft set [25,47] and
intuitionistic fuzzy soft set [1,21]. The combination of these fuzzy
sets with machine learning methods to handle the special cases
such as the fuzzy-neural automatic system [27,24] and the type2 fuzzy genetic algorithm [45,14] was also investigated.
136
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 9
The relation between the patients and the diseases by the Hamming function where
bold values imply the most possible disease.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
0.28
0.40
0.38
0.28
0.24
0.50
0.44
0.30
0.28
0.31
0.32
0.38
0.54
0.14
0.50
0.44
0.56
0.42
0.55
0.54
Table 10
The relation between the patients and the diseases by the Euclidean function where
bold values imply the most possible disease.
decision support systems, can give users information about predictive ‘‘rating’’ or ‘‘preference’’ that they would like to assess an item;
thus helping them to choose the appropriate item among numerous possibilities. This kind of expert systems is now commonly
popularized in numerous application fields such as books, documents, images, movie, music, shopping and TV programs personalized systems. The mathematical definition of RS is stated below.
Definition 4 (Recommender Systems – RS [29]). Suppose U is a set
of all users and I is the set of items in the system. The utility
function R is a mapping specified on U1 & U and I1 & I as follows.
R : U 1 Â I1 ! P
ð12Þ
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
ðu1 ; i1 Þ # Rðu1 ; i1 Þ;
Ram
Mari
Sugu
Somu
0.29
0.43
0.36
0.25
0.25
0.56
0.41
0.29
0.32
0.33
0.32
0.35
0.53
0.14
0.52
0.43
0.58
0.46
0.57
0.50
where R(u1, i1) is a non-negative integer or a real number within a
certain range. P is a set of available ratings in the system. Thus, RS is
the system that provides two basic functions below.
Ã
1.3. The limitations of the previous works
Considering the relevant works involving the usage of the IFS
set, we clearly recognize that IFS was used mainly for the applications of medical diagnosis among the advanced fuzzy sets. Nonetheless, these works have the following disadvantages.
(a) The previous works calculate the relation between the
patients and the diseases (RPD) solely from those between
the patients – the symptoms (RPS) and the symptoms –
the diseases (RSD). In some practical cases where the relation
between the patients – the symptoms or the symptoms – the
diseases is missing, those works could not be performed.
This fact is happened in reality since clinicians somehow
do not accurately express the values of membership and
non-membership degrees of symptoms to diseases or vive
versa;
(b) The information of previous diagnoses of patients could not
be utilized. That is to say, a patient has had some records in
the patients-diseases databases (RPD) beforehand. Nevertheless, the calculation of the next records of this patient is
made solely on the basis of both RPS and RSD. Historic diagnoses of patients are not taken into account so that the accuracy of diagnosis may not be high as a result;
(c) The determination of the most acquiring disease is dependent from the defuzzification method. For instance, De
et al. [9] used SPD for the defuzzification, Samuel and Balamurugan [31] relied on the reduction matrix from WPD and
Szmidt and Kacprzyk [42–44], Khatibi and Montazer [19]
and Shinoj and John [33] employed the distance functions.
Independent determination from the defuzzification method
should be investigated for the stable performance of the
algorithm.
(d) Mathematical properties of operations such as the fuzzy
implication in De et al. [9], Samuel and Balamurugan [31]
and the distance function in Szmidt and Kacprzyk [42–44],
Khatibi and Montazer [19] and Shinoj and John [33] were
not discussed in the equivalent articles. Readers could not
know the theoretical bases of these operations and why they
were selected for the medical diagnosis problem.
1.4. The motivation and ideas
From the disadvantages of the previous works, our idea in this
article is using the hybrid method between Recommender Systems
(RS) and the IFS set to handle them. RS, which are a subclass of
Ã
(a) Prediction: determine Rðuà ; i Þ for any ðuà ; i Þ 2 ðU; IÞ n ðU 1 ; I1 Þ.
(b) Recommendation: choose i⁄ 2 I satisfying i⁄ = arg maxi2IR(u, i)
for all u 2 U.
RS has been applied to the medical diagnosis problem. Davis et al.
[8] proposed CARE, a Collaborative Assessment and Recommendation Engine, which relies only on a patient’s medical history in order
to predict future diseases risks and combines collaborative filtering
methods with clustering to predict each patient’s greatest disease
risks based on their own medical history and that of similar patients.
An iterative version of CARE so-called ICARE that incorporates
ensemble concepts for improved performance was also introduced.
These systems required no specialized information and provided
predictions for medical conditions of all kinds in a single run. Hassan
and Syed [13] employed a collaborative filtering framework that
assessed patient risk both by matching new cases to historical
records and by matching patient demographics to adverse outcomes
so that it could achieve a higher predictive accuracy for both sudden
cardiac death and recurrent myocardial infraction than popular
classification approaches such as logistic regression and support
vector machines. More works on the applications of RS could be referenced in Duan et al. [10], Meisamshabanpoor and Mahdavi [22]
and our previous works in [7,38,40,39,41,34–37].
Example 5. Consider the training dataset in Table 11. Taking a
simple encoded method by multiplying the membership degree by
10 and adding the non-membership degree to it, we have a crisp
training in Table 12.
The method of Hassan and Syed [13] employed a collaborative
filtering including the traditional Pearson coefficient to calculate
the similarity between users and the k-nearest neighbor approximation function to predict the blank values in Table 12. The results
are shown in Table 13. If taking the maximal value among all for a
given patient in Table 13 then we can conclude that Ram, Sugu and
Somu are suffered from Malaria and Mari acquires Stomach.
Analogously, Table 14 shows the results of the method of Davis
Table 11
The training dataset with ⁄ being the values to be predicted.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
(0.4, 0.1)
(0.3, 0.5)
(0.4, 0.1)
⁄
(0.7, 0.1)
(0.2, 0.6)
(0.7, 0.1)
⁄
(0.6, 0.1)
(0.4, 0.4)
⁄
(0.5, 0.3)
(0.2, 0.4)
(0.6, 0.1)
⁄
(0.3, 0.4)
(0.2, 0.6)
(0.1, 0.7)
⁄
⁄
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 12
The crisp training dataset with ⁄ being the values to be predicted.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
4.1
3.5
4.1
⁄
7.1
2.6
7.1
⁄
6.1
4.4
⁄
5.3
2.4
6.1
⁄
3.4
2.6
1.7
⁄
⁄
et al. [8] where Ram is suffered from Malaria, Mari acquires Stomach and Sugu and Somu have Typhoid.
From Example 5, we clearly recognize the following facts:
(a) RS could be applied to the medical diagnosis. Yet in cases
that the relations are expressed by fuzzy memberships as
in Table 11, the accuracy of diagnosis in RS is dependent
on the encoded method. In the other words, RS is effective
with the crisp dataset such as Table 12 but not the fuzzy
one, e.g. Table 11;
(b) The problem of the previous researches about the dependence of the determination of the most acquiring disease
from the defuzzification method, e.g. the maximal function
in Example 5 still exists;
(c) RS works only if the training dataset is provided. That is to
say, we must have the historic diagnoses of patients for
the prediction.
From Sections 1.3 and 1.4 and illustrated examples, we clearly
recognize that the IFS and RS approaches have their own advantages
and disadvantages. Thus, a combination of these approaches in order
to combine the advantages and eliminate the disadvantages could
handle the mentioned issues. Scanning the literature, we realize that
some hybrid methods were also designed for the medical diagnosis
problem, to name but a few such as Davis et al. [8] combined collaborative filtering methods with clustering; Kala et al. [18] integrated
genetic algorithms with modular neural network; Hosseini et al.
[14] joined a type-2 fuzzy logic with genetic algorithm. These evidences show that the combination of groups of methods such as
between RS and IFS is a trendy approach for medical diagnosis.
1.5. The contributions of this work
Based upon the observations, our contribution in this paper is a
novel intuitionistic fuzzy recommender system (IFRS) for medical
diagnosis consisting of the following components:
Table 13
The full dataset derived by the method of Hassan & Syed [13] where bold values imply
the most possible disease.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
4.1
3.5
4.1
5.9
7.1
2.6
7.1
9.8
6.1
4.4
4.8
5.3
2.4
6.1
1.9
3.4
2.6
1.7
3.2
5.5
Table 14
The full dataset derived by the method of Davis et al. [8] where bold values imply the
most possible disease.
P
Viral_fever
Malaria
Typhoid
Stomach
Heart
Ram
Mari
Sugu
Somu
4.1
3.5
4.1
2.6
7.1
2.6
7.1
4.7
6.1
4.4
7.3
5.3
2.4
6.1
5.2
3.4
2.6
1.7
2.1
0.2
137
(a) The new definitions of single-criterion IFRS (SC-IFRS) and
multi-criteria IFRS (MC-IFRS) that extend the definition of
RS (Definition 4) taking into account a feature of a user
and a characteristic of an item expressed by intuitionistic
linguistic labels (See Section 2.1). These definitions are the
basis for the deployment of similarity degrees used for the
prediction of RPD(Pi, Dj) (Definition 1);
(b) The new definitions of intuitionistic fuzzy matrix (IFM),
which is a representation of SC-IFRS and MC-IFRS in the
matrix format and the intuitionistic fuzzy composition
matrix (IFCM) of two IFMs with the intersection/union operation. Some interesting theorems and properties of IFM and
IFCM are presented (See Section 2.2);
(c) Some new similarity degrees of IFMs such as the intuitionistic fuzzy similarity matrix (IFSM) and the intuitionistic fuzzy
similarity degree (IFSD). The formulas to predict RPD(Pi, Dj)
on the basis of IFSD accompanied with an interesting theorem is proposed (See Section 2.3);
(d) From the predicting formulas, a novel intuitionistic fuzzy
collaborative filtering method so-called IFCF is presented
for the medical diagnosis problem (See Section 2.4);
(e) The validation of the IFCF method in comparison with the
standalone methods of IFS such as De et al. [9], Szmidt and
Kacprzyk [44], Samuel and Balamurugan [31] and RS, e.g.
Davis et al. [8], Hassan and Syed [13] is made by both a
numerical illustration on the dataset in Example 1 and the
experiments on benchmark medical diagnosis datasets from
UCI Machine Learning Repository in terms of the accuracy of
diagnosis (See Section 3).
1.6. The novelty and significance of the proposed work
According to the contributions in Section 1.5 and the limitations
of IFS and RS in Sections 1.3 and 1.4, respectively, the novel and the
significance of the proposed work are stressed as follows.
(a) The proposed work is different from the previous ones especially the standalone IFS and RS methods. Specifically, it
employs the ideas of both the IFS set and RS in the definitions of SC-IFRS and MC-IFRS, which are the basis to develop
some new terms and similarity degrees for the IFCF algorithm. Furthermore, as being observed from Example 1 to
3, the determination of the relation between patients and
diseases in the standalone IFS methods is performed by
some operations such as the fuzzy implication in De et al.
[9], Samuel and Balamurugan [31] and the distance function
in Szmidt and Kacprzyk [42–44]. In the proposed work, this
can be done through the intuitionistic fuzzy similarity
degree (IFSD) in Section 2.3, which is developed based on
SC-IFRS and MC-IFRS. Comparing with the standalone RS
methods such as Davis et al. [8] and Hassan and Syed [13],
the similarity degree – IFSD in the proposed work is constructed from the light of the IFS set but not by the Pearson
coefficient from the hard values such as in Table 12. Additionally, the formulas to predict RPD(Pi, Dj) are also made
according to the membership and non-membership functions but not by the hard values above. These proofs demonstrate the novel of the proposed work;
(b) The proposed hybrid method could handle the issues of the
standalone IFS and RS methods. For instance, the limitations
of IFS relating to the missing relations and the historic diagnoses of patients stated in Section 1.3(a) and (b) and the limitations of RS relating to the crisp and training datasets stated
in Section 1.4(a) and (c) are solved by the integration of IFS
and RS. The deficiency of mathematical properties of operations in Section 1.3(d) is resolved by a number of interesting
138
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
theorems and properties in Section 2. Lastly, when predicting
RPD(Pi, Dj), users could find a suitable defuzzification method
for the determination of the most acquiring disease;
(c) The proposal of this work is significance in terms of both
theory and practice. In the theoretical aspect, the proposed
work motivates researching on advanced algorithms of IFS
and RS especially the hybrid method between them to
enhance the accuracy of the algorithm. Looking for details
in Section 1.5, we recognize that the proposed method is
constructed on a well-defined mathematical foundation,
which is not paid much attention in the previous researches.
Thus, this guarantees the further deployment of other
advanced methods of both IFS and RS on such the mathematical foundation. In the practical side, the proposed work
contributes greatly to the medical diagnosis problem and
some extensions and variants of this method could be
quickly deployed for other socio-economic problems. This
clearly affirms the significance of the proposed work.
where liX(x) 2 [0, 1] (resp. ciX(x) 2 [0, 1]), "i 2 {1, . . ., s} is the
membership (resp. non-membership) value of the patient to the
linguistic label ith of feature X. ljY(y) 2 [0, 1] (resp. cjY (y) 2 [0, 1]),
"j 2 {1, . . ., s} is the membership (resp. non-membership) value of
the symptom to the linguistic label jth of characteristicY. Finally,
llD(D) 2 [0, 1] (resp. clD(D) 2 [0, 1]), "l 2 {1, . . ., s} is the membership
(resp. non-membership) value of disease D to the linguistic label lth.
SC-IFRS provides two basic functions:
(a) Prediction: determine the values of ðllD ðDÞ; clD ðDÞÞ, "l 2 {1,
. . ., s};
Ã
(b) Recommendation: choose i⁄ 2 [1, s] satisfying i ¼ arg
maxi¼1;s fliD ðDÞ þ liD ðDÞð1 À liD ðDÞ À ciD ðDÞÞg.
Remark 1
(a) From Definition 5 and Eq. (13), the medical diagnosis is represented by the implication {Patient, Symtomp} ? Disease,
which is identical to that of Definition 1. Thus, we clearly
recognize that of SC-IFRS in Definition 5 is another representation and an extension of medical diagnosis in Definition 1
inspired by the ideas of RS in Definition 4;
(b) SC-IFRS in Definition 5 could be regarded as the extension of
the traditional RS in Definition 4 in cases that
$i: liX(x) = 1 ^ ciX(x) = 0; "j – i: ljX(x) = 0 ^ cjX(x) = 1,
$i: liY(y) = 1 ^ ciY(y) = 0; "j – i: ljY(y) = 0 ^ cjY(y) = 1,
$i: liD(D) = 1 ^ ciD(D) = 0; "j – i: ljD(D) = 0 ^ cjD(D) = 1,
1.7. The organization of the paper
The rest of the paper is organized as follows. Section 2 presents
the main contribution including the IFRS and its elements stated
in Section 1.5. Section 3 validates the proposed approach through
a set of experiments involving benchmark medical diagnosis data.
Section 4 draws the conclusions and delineates the future research
directions.
2. Intuitionistic fuzzy recommender systems
In this section, we present the new definitions of single-criterion IFRS (SC-IFRS) and multi-criteria IFRS (MC-IFRS) in Section 2.1;
the new definitions of intuitionistic fuzzy matrix (IFM) and the
intuitionistic fuzzy composition matrix (IFCM) of two IFMs with
the intersection/union operation in Section 2.2; the intuitionistic
fuzzy similarity matrix (IFSM), the intuitionistic fuzzy similarity
degree (IFSD) and the formulas to predict RPD(Pi, Dj) on the basis
of IFSD in Section 2.3; a novel intuitionistic fuzzy collaborative filtering method so-called IFCF in Section 2.4.
2.1. The single-criterion and multi-criteria intuitionistic fuzzy
recommender systems
Recall P, S and D from Definition 1 being the sets of patients,
symptoms and diseases, respectively. Each patient Pi ("i 2 {1, . . .,
n}) (resp. symptom Sj, "j 2 {1, . . ., m}) is assumed to have some
features (resp. characteristics). For the simplicity, we consider RS
including a feature of the patient and a characteristic of the
symptom denoted as X and Y, respectively. X and Y both consist of
s intuitionistic linguistic labels. Analogously, disease Di ("i 2 {1,
. . ., k}) also contains s intuitionistic linguistic labels. A new
definition of RS under the lights of those of medical diagnosis
expressed by IFS and the traditional RS in Definition 1 & 4 respectively is stated as follows.
Definition 5 (Single-criterion Intuitionistic Fuzzy Recommender
Systems – SC-IFRS). The utility function R is a mapping specified
on (X, Y) as follows.
R:XÂY !D
*
ðl1X ðxÞ; c1X ðxÞÞ;
ðl2X ðxÞ; c2X ðxÞÞ;u
...
ðlsX ðxÞ; csX ðxÞÞ
+ *
Â
ðl1Y ðyÞ; c1Y ðyÞÞ;
ðl2Y ðyÞ; c2Y ðyÞÞ;
...
ðlsY ðyÞ; csY ðyÞÞ
+
*
!
À
Á
l1D ðDÞ; c1D ðDÞ ; +
ðl2D ðDÞ; c2D ðDÞÞ;
...
;
then the mapping in (13) could be re-written as,
R:PÂS!D
ð14Þ
ððp; XÞ; ðs; YÞÞ # RPD :
Now we extend SC-IFRS to handle the cases of multiple diseases
D = {D1, . . ., Dk}.
Definition 6 (Multi-criteria Intuitionistic Fuzzy Recommender Systems – MC-IFRS). The utility function R is a mapping specified on
(X, Y) below.
R : X Â Y ! D1 Â Á Á Á Â Dk
*
ðl1X ðxÞ; c1X ðxÞÞ;
ðl2X ðxÞ; c2X ðxÞÞ; u
...
+
*
Â
ðlsX ðxÞ; csX ðxÞÞ
*
!
ðl1Y ðyÞ; c1Y ðyÞÞ;
ðl2Y ðyÞ; c2Y ðyÞÞ;
...
ðlsY ðyÞ; csY ðyÞÞ
ðl1D ðD1 Þ; c1D ðD1 ÞÞ;
ðl2D ðD1 Þ; c2D ðD1 ÞÞ;
...
ðlsD ðD1 Þ; csD ðD1 ÞÞ
+
+
*
 ÁÁÁ Â
ðl1D ðDk Þ; c1D ðDk ÞÞ;
ðl2D ðDk Þ; c2D ðDk ÞÞ;
...
ð15Þ
+
:
ðlsD ðDk Þ; csD ðDk ÞÞ
MC-IFRS is the system that provides two basic functions below.
(a) Prediction: determine the values of ðllD ðDi Þ; clD ðDi ÞÞ, "l 2 {1,
. . ., s}, "i 2 {1, . . ., k};
Ã
(b) Recommendation: choose i⁄ 2 [1, s] satisfying i ¼
nP
o
À
Á
k
arg maxi¼1;s
j¼1 wj liD ðDj Þ þ liD ðDj Þð1 À liD ðDj Þ À ciD ðDj ÞÞ
where wj 2 [0, 1] is the weight of Dj satisfying the constraint:
Pk
j¼1 wj ¼ 1.
ðlsD ðDÞ; csD ðDÞÞ
ð13Þ
139
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Example 6. In a medical diagnosis system, there are 4 patients
whose feature X is ‘‘Age’’ consisting of 3 linguistic labels {low, medium, high} (s = 3). The symptom‘s characteristic Y is ‘‘Temperature’’
including 3 linguistic labels {cold, medium, hot}. The diseases (D1,
D2) are {‘‘Flu’’, ‘‘Headache’’}, and both of them contain 3 linguistic
labels {Level 1, Level 2, Level 3}. We would like to verify which ages
of users and types of temperature are likely to cause the diseases of
flu and headache. In this case we have a MC-IFRS system. By using
the trapezoidal intuitionistic fuzzy number – TIFN ([3]) characterized
À
Á
by a1 ; a2 ; a3 ; a4 ; a01 ; a04 with a01 6 a1 6 a2 6 a3 6 a4 6 a04 , the membership (non-membership) functions of patients to the linguistic
label ith of feature X are:
8
1
x 6 20
>
<
llow ðxÞ ¼ ð35 À xÞ=15 20 < x 6 35 ;
>
:
0
x > 35
8
0
x 6 20
>
<
v low ðxÞ ¼ > ðx À 20Þ=15 20 < x 6 35 ;
:
1
x > 35
8
0
x 6 20; x > 60
>
>
>
>
< ðx À 20Þ=15 20 < x 6 35
lmedium ðxÞ ¼
;
>
1
35 < x 6 45
>
>
>
:
ð60 À xÞ=15 45 < x 6 60
8
1
x 6 20; x > 60
>
>
>
>
< ð35 À xÞ=15 20 < x 6 35
v medium ðxÞ ¼ >
;
0
35 < x 6 45
>
>
>
:
ðx À 45Þ=15 45 < x 6 60
8
0
x 6 45
>
<
lhigh ðxÞ ¼ ðx À 45Þ=15 45 < x 6 60 ;
>
:
1
x > 60
8
1
x 6 45
>
<
v high ðxÞ ¼ > ð60 À xÞ=15 45 < x 6 60 :
:
0
x > 60
ð16Þ
Joeð45tÞ : hhighð0; 1Þ; mediumð1; 0Þ; lowð0; 1Þi;
Tedð50tÞ : hhighð0:33; 0:67Þ; mediumð0:67; 0:33Þ; lowð0; 1Þi:
ð17Þ
ð18Þ
ð19Þ
ð29Þ
35 < x 6 40
ð30Þ
8
1
x 6 35
>
<
v hot ðxÞ ¼ > ð40 À xÞ=5 35 < x 6 40 :
:
0
x > 40
ð31Þ
ð4 CÞ : hcoldð1; 0Þ; mediumð0; 1Þ; hotð0; 1Þi;
ð32Þ
ð16 CÞ : hcoldð0:267; 0:733Þ; mediumð0:733; 0:267Þ; hotð0; 1Þi; ð33Þ
ð39 CÞ : hcoldð0; 1Þ; mediumð0:2; 0:8Þ; hotð0:8; 0:2Þi;
ð34Þ
ð35Þ
From Eqs. (22)–(25), (32)–(35) we have a MC-IFRS described by
Table 15.
In Table 15, the cells having question marks are needed to predict the intuitionistic fuzzy values ðllD ðDi Þ; clD ðDi ÞÞ; 8l 2 f1; 2; 3g;
8i 2 f1; 2g.
2.2. Intuitionistic fuzzy matrix and intuitionistic fuzzy composition
matrix
0
ð21Þ
B
B b21
B
B
B c31
Z¼B
B
B c41
B
B ...
@
ð22Þ
ð23Þ
ð24Þ
ð25Þ
8
x65
>
<0
v cold ðxÞ ¼ > ðx À 5Þ=15 5 < x 6 20 ;
:
1
x > 20
ð27Þ
a11
ct1
a12
b22
c32
c42
...
ct2
. . . a1s
1
C
. . . b2s C
C
C
. . . c3s C
C:
C
. . . c4s C
C
... ... C
A
...
ð36Þ
cts
In Eq. (36), t = k + 2 where k 2 N+ is the number of diseases in
Definition 6. The value s 2 N+ is the number of intuitionistic
linguistic labels. a1i, b2i, chi, " h 2 {3, . . ., t}, "i 2 {1, . . ., s} are the
intuitionistic fuzzy values (IFV) consisting of the membership and
non-membership values as in Definition 6. a1i ¼ ðliX ðxÞ; ciX ðxÞÞ,
"i 2 {1, . . ., s} represents for the IFV value of the patient to the
linguistic label ith of featureX. b2i ¼ ðliY ðyÞ; ciY ðyÞÞ, " i 2 {1, . . ., s}
stands for the IFV value of the symptom to the linguistic label ith
of characteristic Y. chi ¼ ðliD ðDhÀ2 Þ; ciD ðDhÀ2 ÞÞ, "i 2 {1, . . ., s},
"h 2 {3, . . ., t} is the IFV value of the disease to the linguistic label
ith. Each line from the third one to the last in Eq. (36) is related
to a given disease.
Example 7. The first line in Table 15 describing the information of
user Al (Age: 25) at the temperature 4°C can be expressed by the
IFM as follows.
1
ð0:0; 1:0Þ ð0:33; 0:67Þ ð0:67; 0:33Þ
C
B
ð0:0; 1:0Þ
ð0:0; 1:0Þ C
B ð1:0; 0:0Þ
C:
Z¼B
C
B
ð0:2; 0:6Þ
ð0:1; 0:9Þ A
@ ð0:8; 0:1Þ
0
x 6 5; x > 40
35 < x 6 40
;
8
0
x 6 35
>
<
lhot ðxÞ ¼ ðx À 35Þ=5 35 < x 6 40 ;
>
:
1
x > 40
ð20Þ
ð26Þ
20 < x 6 35
20 < x 6 35
Definition 7. An intuitionistic fuzzy matrix (IFM) Z in MC-IFRS is
defined as,
8
x65
>
<1
lcold ðxÞ ¼ ð20 À xÞ=15 5 < x 6 20 ;
>
:
0
x > 20
5 < x 6 20
0
>
>
>
:
ðx À 35Þ=5
ð25 CÞ : hcoldð0; 1Þ; mediumð1; 0Þ; hotð0; 1Þi:
Similarly, the membership (non-membership) functions of the
symptom to the linguistic label jth of characteristicY are:
8
0
>
>
>
< ðx À 5Þ=15
lmedium ðxÞ ¼
>1
>
>
:
ð40 À xÞ=5
v medium ðxÞ ¼ >
The information of symptom are shown as follows
Based on Eqs. (16)–(21), we calculate the information of
patients as follows.
Alð25tÞ : hhighð0; 1Þ; mediumð0:33; 0:67Þ; lowð0:67; 0:33Þi;
Bobð40tÞ : hhighð0; 1Þ; mediumð1; 0Þ; lowð0; 1Þi;
8
1
x 6 5; x > 40
>
>
>
>
< ð20 À xÞ=15 5 < x 6 20
;
ð28Þ
ð0:1; 0:8Þ
ð0:6; 0:35Þ
ð0:3; 0:55Þ
ð37Þ
140
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 15
A MC-IFRS for medical diagnosis with ⁄ being the values to be predicted.
Age
*
Alð25Þ :
*
Alð25Þ :
highð0; 1Þ;
mediumð0:33; 0:67Þ;
lowð0:67; 0:33Þ
highð0; 1Þ;
mediumð0:33; 0:67Þ;
lowð0:67; 0:33Þ
*
highð0; 1Þ;
mediumð1; 0Þ;
lowð0; 1Þ
Bobð40Þ :
*
highð0; 1Þ;
mediumð1; 0Þ;
lowð0; 1Þ
Joeð45Þ :
*
Tedð50Þ :
Temperature
*
+
coldð1; 0Þ;
ð4 CÞ : mediumð0; 1Þ;
hotð0; 1Þ
+
+
*
ð16 CÞ :
*
+
ð39 CÞ :
*
+
ð16 CÞ :
highð0:33; 0:67Þ;
mediumð0:67; 0:33Þ;
lowð0; 1Þ
*
+
ð25 CÞ :
coldð0:267; 0:733Þ;
mediumð0:733; 0:267Þ;
hotð0; 1Þ
coldð0; 1Þ;
mediumð0:2; 0:8Þ;
hotð0:8; 0:2Þ
ð1Þ
a
B 11
B ð1Þ
Bb
B 21
B
B ð1Þ
B c31
B
B
B cð1Þ
B 41
B
B
B ...
@
ð1Þ
ct1
0
ð1Þ
a12
ð1Þ
b22
ð1Þ
c32
ð1Þ
c42
...
ð1Þ
ct2
ð12Þ
a11
B
B ð12Þ
Bb
B 21
B
B ð12Þ
B c31
¼B
B
B cð12Þ
B 41
B
B
B ...
@
ð12Þ
ct1
ð1Þ
. . . a1s
1 0
ð2Þ
a
C B 11
C B
ð1Þ C B ð2Þ
. . . b2s C B b21
C B
ð1Þ C B ð2Þ
. . . c3s C B c31
CB
C B
ð1Þ C B ð2Þ
. . . c4s C B c41
C B
C B
... ... C B ...
A @
ð1Þ
ð2Þ
. . . cts
ct1
1
ð12Þ
ð12Þ
a12
. . . a1s
C
ð12Þ
ð12Þ C
b22
. . . b2s C
C
C
ð12Þ
ð12Þ C
c32
. . . c3s C
C;
C
ð12Þ
ð12Þ
c42
. . . c4s C
C
C
C
... ... ... C
A
ð12Þ
ð12Þ
ct2
. . . cts
ð2Þ
a12
ð2Þ
ð2Þ
. . . a1s
b22
...
c32
ð2Þ
...
ð2Þ
c42
...
...
...
ð2Þ
...
ct2
1
C
C
ð2Þ
b2s C
C
C
ð2Þ C
c3s C
C
C
ð2Þ C
c4s C
C
C
... C
A
ð2Þ
cts
ð12Þ
ð12Þ
b2i
¼
ð1Þ
b2i
^
ð2Þ
b2i
ð12Þ
chi
ð1Þ
Level2(0.6, 0.35);
Level3(0.3,0.55)
Level1(0.4, 0.5);
Level1(0.0, 0.9);
Level2(0.6,0.2);
Level3(0.1,0.9);
Level2(0.2, 0.75);
Level3(0.7,0.2)
Level1(0.8, 0.2);
Level1(0.8, 0.1);
Level2(0.1,0.8);
Level3(0.0,0.95);
Level2(0.1,0.9);
Level3(0.0,0.9)
Level1(0.0, 1.0);
Level1(0.0, 0.9);
Level2(0.2, 0.7);
Level3(1.0,0.0);
Level2(0.7, 0.3);
Level3(0.1,0.85);
Level1(⁄, ⁄);
Level1(⁄, ⁄);
Level2(⁄, ⁄);
Level3(⁄, ⁄);
Level2(⁄, ⁄);
Level3(⁄, ⁄);
Example 8. Given 2 IFM below.
1
ð0:0; 1:0Þ ð0:33; 0:67Þ ð0:67; 0:33Þ
B ð1:0; 0:0Þ
ð0:0; 1:0Þ
ð0:0; 1:0Þ C
C
B
Z1 ¼ B
C;
@ ð0:8; 0:1Þ
ð0:2; 0:6Þ
ð0:1; 0:9Þ A
0
ð0:7; 0:3Þ
ð42Þ
ð43Þ
ð0:1; 0:85Þ
0
ð0:0; 1:0Þ
ð0:33; 0:67Þ
B ð0:267; 0:733Þ
ð0:0; 1:0Þ
B
Z ¼ Z1 Z2 ¼ B
@ ð0:0; 1:0Þ
ð0:2; 0:7Þ
ð0:0; 0:9Þ
ð0:0; 1:0Þ
1
ð0:0; 1:0Þ C
C
C:
ð0:1; 0:9Þ A
ð0:6; 0:35Þ
ð44Þ
ð0:1; 0:85Þ
ð38Þ
Definition 9. Suppose that Z1 and Z2 are two IFM in MC-IFRS. The
intuitionistic fuzzy composition matrix (IFCM) of Z1 and Z2 with the
union operation is defined as follows.
0
ð40Þ
ð2Þ
¼ chi ^ chi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ min liD ðDhÀ2 Þ; liD ðDhÀ2 Þ ; max ciD ðDhÀ2 Þ; ciD ðDhÀ2 Þ ;
8h 2 f3; . . . ; tg:
Level2(0.2, 0.6);
Level3(0.1, 0.9);
The IFCM of Z1 and Z2 with the intersection operation is:
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ min liY ðyÞ; liY ðyÞ ; max ciY ðyÞ; ciY ðyÞ ;
8i 2 f1; . . . ; sg;
Level1(0.1, 0.8)
ð0:0; 0:9Þ
ð39Þ
8i 2 f1; . . . ; sg;
Level1(0.8, 0.1);
ð0:1; 0:8Þ ð0:6; 0:35Þ
ð0:3; 0:55Þ
1
ð0:0; 1:0Þ
ð1:0; 0:0Þ
ð0:0; 1:0Þ
B ð0:267; 0:733Þ ð0:733; 0:267Þ ð0:0; 1:0Þ C
C
B
Z2 ¼ B
C:
@ ð0:0; 1:0Þ
ð0:2; 0:7Þ
ð1:0; 0:0Þ A
ð1Þ
ð2Þ
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ a1i ^ a1i ¼ min liX ðxÞ; liX ðxÞ ; max ciX ðxÞ; ciX ðxÞ ;
8i 2 f1; . . . ; sg;
Headache
0
where
a1i
+
+
Definition 8. Suppose that Z1 and Z2 are two IFM in MC-IFRS. The
intuitionistic fuzzy composition matrix (IFCM) of Z1 and Z2 with the
intersection operation is,
0
+
coldð0:267; 0:733Þ;
mediumð0:733; 0:267Þ;
hotð0; 1Þ
coldð0; 1Þ;
mediumð1; 0Þ;
hotð0; 1Þ
+
Flu
ð41Þ
ð1Þ
a11
B ð1Þ
Bb
B 21
B ð1Þ
Bc
B 31
B ð1Þ
B c41
B
B ...
@
ð1Þ
ct1
0
ð1Þ
a12
ð1Þ
b22
ð1Þ
c32
ð1Þ
c42
...
ð1Þ
ct2
ð12Þ
a
B 11
B bð12Þ
B 21
B ð12Þ
Bc
31
¼B
B ð12Þ
B c41
B
B ...
@
ð12Þ
ct1
ð1Þ
. . . a1s
1 0
ð2Þ
a11
C B ð2Þ
ð1Þ
B
. . . b2s C
C B b21
C
B ð2Þ
ð1Þ
B
. . . c3s C
C B c31
B ð2Þ
ð1Þ C
B
. . . c4s C
C B c41
C
... ... A B
@ ...
ð1Þ
ð2Þ
ct1
. . . cts
1
ð12Þ
ð12Þ
a12
. . . a1s
C
ð12Þ
ð12Þ
b22
. . . b2s C
C
C
ð12Þ
ð12Þ
c32
. . . c3s C
C;
ð12Þ
ð12Þ C
c42
. . . c4s C
C
... ... ... C
A
ð12Þ
ð12Þ
ct2
. . . cts
ð2Þ
a12
ð2Þ
b22
ð2Þ
c32
ð2Þ
c42
...
ð2Þ
ct2
ð2Þ
. . . a1s
1
C
ð2Þ
. . . b2s C
C
C
ð2Þ
. . . c3s C
C
ð2Þ C
. . . c4s C
C
... ... C
A
ð2Þ
. . . cts
ð45Þ
141
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
where
ð12Þ
a1i
It follows that,
ð1Þ
ð2Þ
¼ a1i _ a1i
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ max liX ðxÞ; liX ðxÞ ; min ciX ðxÞ; ciX ðxÞ ;
8i 2 f1; . . . ; sg;
ð12Þ
b2i
ð1Þ
ð2Þ
¼ b2i _ b2i
ð46Þ
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ max liY ðyÞ; liY ðyÞ ; min ciY ðyÞ; ciY ðyÞ ;
8i 2 f1; . . . ; sg;
ð12Þ
chi
ð1Þ
ð47Þ
8h 2 f3; . . . ; tg:
ð48Þ
1
0
ð0:0; 1:0Þ
ð1:0; 0:0Þ
ð0:67; 0:33Þ
B ð1:0; 0:0Þ ð0:733; 0:267Þ
ð0:0; 1:0Þ C
C
B
Z ¼ Z1 Z2 ¼ B
C:
@ ð0:8; 0:1Þ
ð0:2; 0:6Þ
ð1:0; 0:0Þ A
ð0:1; 0:8Þ
ð0:7; 0:3Þ
ð49Þ
ð0:3; 0:55Þ
Theorem 1. The IFCM of Z1 and Z2 with the intersection (union) operation is an IFM.
Proof 1. We prove the theorem with the intersection operation
only. The theorem with the union operation is proven analogously.
From Definition 8, we know that
ð1Þ
ð2Þ
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ a1i ^ a1i ¼ min liX ðxÞ; liX ðxÞ ; max ciX ðxÞ; ciX ðxÞ ;
8i 2 f1; . . . ; sg;
ð12Þ
b2i
ð1Þ
ð2Þ
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ b2i ^ b2i ¼ min liY ðyÞ; liY ðyÞ ; max ciY ðyÞ; ciY ðyÞ ;
8i 2 f1; . . . ; sg;
ð12Þ
chi
B
B ð12Þ
B b21
B
B ð12Þ
B c31
Z1 Z2 ¼ B
B ð12Þ
Bc
B 41
B
B ...
@
ð12Þ
ct1
ð1Þ
ð2Þ
¼ chi ^ chi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ min liD ðDhÀ2 Þ; liD ðDhÀ2 Þ ; max ciD ðDhÀ2 Þ; ciD ðDhÀ2 Þ ;
8i 2 f1; . . . ; sg;
ð1Þ
a1i
8h 2 f3; . . . ; tg:
ð52Þ
ð2Þ
a1i
ð12Þ
a1i
Since
and
are two IFV, the function of them –
is also an
ð12Þ
ð12Þ
IFV. Similar conclusions are found for b2i and chi . Thus, the IFCM
of Z1 and Z2 with the intersection operation is an IFM. The proof is
complete. h
Property 1. Given Z1, Z2 and Z3 being IFM. The following properties
hold for these IFM.
ð12Þ
b22
ð12Þ
c32
ð12Þ
c42
...
ð12Þ
ct2
ð12Þ
. . . a1s
ð1Þ
ð2Þ
the intersection operation. Since a1i and a1i are two IFV, we obtain
ð12Þ
¼ a1i ^ a1i ¼ a1i ^ a1i ¼ a1i ;
ð53Þ
ð12Þ
b2i
ð12Þ
chi
ð1Þ
ð2Þ
ð2Þ
ð1Þ
ð21Þ
b2i ^ b2i ¼ b2i ^ b2i ¼ b2i ;
ð1Þ
ð2Þ
ð2Þ
ð1Þ
ð21Þ
chi ^ chi ¼ chi ^ chi ¼ chi :
ð54Þ
¼
¼
ð2Þ
ð2Þ
ð1Þ
0
ð21Þ
a11
C B
B ð21Þ
ð12Þ C
. . . b2s C B b21
C B
B ð21Þ
ð12Þ C
B
. . . c3s C
C ¼ B c31
C B ð21Þ
ð12Þ C
. . . c4s C B
B c41
C B
C
... ... A B
@ ...
ð12Þ
ð21Þ
. . . cts
ct1
ð21Þ
a12
ð21Þ
b22
ð21Þ
c32
ð21Þ
c42
...
ð21Þ
ct2
ð21Þ
. . . a1s
1
C
ð21Þ C
. . . b2s C
C
ð21Þ C
. . . c3s C
C
C
ð21Þ
. . . c4s C
C
C
... ... C
A
ð21Þ
. . . cts
¼ Z2 Z1 :
ð56Þ
The proof is analogously performed with the IFCM of Z1 and Z2
equipped with the union operation.(b) Suppose that the IFCM of
Z1 and Z2 is equipped with the intersection operation. We have,
0
ð123Þ
a11
ð123Þ
a12
ð123Þ
. . . a1s
1
C
B
B ð123Þ
ð123Þ
ð123Þ C
B b21
b22
. . . b2s C
C
B
B ð123Þ
ð123Þ
ð123Þ C
C
B c31
c
.
.
.
c
32
3s
C:
ðZ 1 Z 2 Þ Z 3 ¼ B
C
B ð123Þ
ð123Þ
ð123Þ
C
Bc
c
.
.
.
c
C
B 41
42
4s
C
B
C
B ...
.
.
.
.
.
.
.
.
.
A
@
ð123Þ
ð123Þ
ð123Þ
ct2
. . . cts
ct1
ð123Þ
ð1Þ
ð2Þ
ð3Þ
a1i ¼ a1i ^ a1i ^ a1i ; 8i 2 f1; . . . ; sg;
ð123Þ
ð1Þ
ð2Þ
ð3Þ
b2i ¼ b2i ^ b2i ^ b2i ; 8i 2 f1; . . . ; sg;
ð123Þ
ð1Þ
ð2Þ
ð3Þ
chi ¼ chi ^ chi ^ chi ; 8i 2 f1; . . . ; sg; 8h 2 f3; . . . ; tg:
ð57Þ
ð58Þ
ð59Þ
ð60Þ
Because
ð1Þ
ð2Þ
ð3Þ
ð1Þ
ð2Þ
ð3Þ
a1i ^ a1i ^ a1i ¼ a1i ^ a1i ^ a1i ;
ð1Þ
ð2Þ
ð3Þ
ð1Þ
ð2Þ
ð3Þ
b2i ^ b2i ^ b2i ¼ b2i ^ b2i ^ b2i ;
ð1Þ
ð2Þ
ð3Þ
ð1Þ
ð2Þ
ð3Þ
chi ^ chi ^ chi ¼ chi ^ chi ^ chi :
ð61Þ
ð62Þ
ð63Þ
It follows that
ðZ 1 Z 2 Þ Z 3 ¼ Z 1 ðZ 2 Z 3 Þ:
ð64Þ
The proof is analogously performed with the IFCM of Z1 and Z2
equipped with the union operation. h
2.3. The intuitionistic fuzzy similarity matrix and intuitionistic fuzzy
similarity degree
Motivated by the ideas of Hung and Yang [15], we present the
definition of intuitionistic fuzzy similarity matrix as follows.
0
Proof 2. (a) Suppose that the IFCM of Z1 and Z2 is equipped with
ð1Þ
1
Definition 10. Suppose that Z1 and Z2 are two IFM in MC-IFRS. The
intuitionistic fuzzy similarity matrix (IFSM) between Z1 and Z2 is
defined as follows.
(a) Z1Z2 = Z2Z1,
(b) (Z1Z2)Z3 = Z1(Z2Z3).
a1i
ð12Þ
a12
ð2Þ
Example 9. Given 2 IFM in Example 8. The IFCM of Z1 and Z2 with
the union operation is:
ð12Þ
ð12Þ
a11
¼ chi _ chi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
¼ max liD ðDhÀ2 Þ; liD ðDhÀ2 Þ ; min ciD ðDhÀ2 Þ; ciD ðDhÀ2 Þ ;
8i 2 f1; . . . ; sg;
a1i
0
ð21Þ
ð55Þ
e
S 11
B
Be
B S 21
Be
B S 31
e
S¼B
Be
B S 41
B
B ...
@
e
S t1
e
S 12
e
S 22
...
e
S 32
e
S 42
...
...
e
S t2
...
...
...
...
1
e
S 1s
C
e
S 2s C
C
C
e
S 3s C
C;
C
e
S 4s C
C
... C
A
e
S ts
ð65Þ
142
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
where,
Since
e
S 1i ¼ 1 À
qffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
1 À exp À1=2 liX ðxÞ À liX ðxÞ þ ciX ðxÞ À ciX ðxÞ
1 À expðÀ1Þ
8i 2 f1;.. .;sg;
;
ð66Þ
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
1 À exp À1=2 liY ðyÞ À liY ðyÞ þ ciY ðyÞ À ciY ðyÞ
e
S 2i ¼ 1 À
À
SIMðPu ; P2 Þ Â
À
Á
lPiDv ðDj Þ þ cPiDv ðDj Þ 6 SIMðPu ; P1 Þ;
l
Pv
iD ðDj Þ
Pv
iD ðDj Þ
þc
Á
ð78Þ
6 SIMðPu ; P2 Þ;
ð79Þ
lPiDv ðDj Þ þ cPiDv ðDj Þ 6 SIMðPu ; Pn Þ:
ð80Þ
...
SIMðPu ; Pn Þ Â
À
Á
1 À expðÀ1Þ
8i 2 f1; . . . ;sg;
e
S hi ¼ 1 À
SIMðPu ; P1 Þ Â
ð67Þ
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ð1Þ
ð2Þ
ð1Þ
ð2Þ
1 À exp À1=2 liD ðDhÀ2 Þ À liD ðDhÀ2 Þ þ ciD ðDhÀ2 Þ À ciD ðDhÀ2 Þ
1 À expðÀ1Þ
8i 2 f1; . . . ; sg;
;
8h 2 f3; . . . ; tg:
ð68Þ
Definition 11. Suppose that Z1 and Z2 are two IFM in MC-IFRS. The
intuitionistic fuzzy similarity degree (IFSD) between Z1 and Z2 is
Pu
iD ðDj Þ
l
s
s
t X
s
X
X
X
S 1i þ b w2i e
S 2i þ v
S hi ;
SIMðZ 1 ; Z 2 Þ ¼ a w1i e
whi e
i¼1
i¼1
It follows that
Pu
iD ðDj Þ
þc
ð69Þ
h¼3 i¼1
where e
S is the IFSM between Z1 and Z2. W = (wij) ("i 2 {1, . . ., t},
"j 2 {1, . . ., s}) is the weight matrix of IFSM between Z1 and Z2
satisfying,
s
X
w1i ¼ 1;
s
X
w2i ¼ 1;
s
X
whi ¼ 1;
i¼1
i¼1
i¼1
8h 2 f3; . . . ; tg;
ð70Þ
a þ b þ v ¼ 1:
ð71Þ
Pn
v ¼1 SIMðP u ; P v Þ Â
À
Pn
v ¼1 SIMðP u ; P v Þ
Pn
SIMðPu ; Pv Þ
6 Pvn ¼1
¼ 1:
v ¼1 SIMðP u ; P v Þ
¼
The proof is complete.
Á
lPiDv ðDj Þ þ cPiDv ðDj Þ
ð81Þ
h
2.4. The intuitionistic fuzzy collaborative filtering method
Fig. 1)
3. Evaluation
Remark 2. The formula of IFSD in Eq. (68) can be recognized as the
generalization of the hard user-based, item-based and the ratingbased similarity degrees in recommender systems [29] when
b = v = 0, a = v = 0 and a = b = 0, respectively.
In this section, we describe the experimental environment in
Section 3.1. The database for experiments is given in Section 3.2.
Definition 12. The formulas to predict the values of linguistic
labels of patient Pu ("u 2 {1, . . ., n}) to symptom Sj (" j 2 {1, . . .,
m}) according to diseases (D1, D2, . . ., Dk) in MC-IFRS are:
lPiDu ðDj Þ ¼
Pn
Pv
SIMðPu ; Pv Þ Â liD ðDj Þ
v ¼1P
; 8i 2 f1; . . . ; sg;
n
v ¼1 SIMðP u ; P v Þ
8j 2 f1; . . . ; kg;
cPiDu ðDj Þ ¼
Pn
8u 2 f1; . . . ; ng;
ð72Þ
Pv
SIMðP u ; P v Þ Â ciD ðDj Þ
v ¼1P
;
n
v ¼1 SIMðP u ; P v Þ
8j 2 f1; . . . ; kg;
8i 2 f1; . . . ; sg;
8u 2 f1; . . . ; ng:
ð73Þ
Theorem 2. The predictive IFM results in Definition 12 are an IFV.
Proof 3. We have the following fact.
lPiDu ðDj Þ þ cPiDu ðDj Þ ¼
Pn
v ¼1 SIMðP u ; P v Þ Â
Pn
À
lPiDv ðDj Þ þ cPiDv ðDj Þ
v ¼1 SIMðP u ; P v Þ
Á
;
ð74Þ
0 6 lPiDv ðDj Þ þ cPiDv ðDj Þ 6 1;
ð75Þ
0 6 SIMðPu ; Pv Þ 6 1:
ð76Þ
It is obvious that
lPiDu ðDj Þ þ cPiDu ðDj Þ P 0:
ð77Þ
Fig. 1. The IFCF algorithm.
143
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 16
The descriptions of experimental datasets.
Dataset
No.
elements
No.
attributes
No.
classes
Elements in each
classes
HEART
RHC
270
5735
13
8
2
2
(150, 120)
(3804, 1931)
Section 3.3 illustrates the activities of IFCF on the intuitionistic
medical diagnosis dataset in [31]. Lastly, Section 3.4 presents the
experimental results on the benchmark medical diagnosis datasets
namely HEART and RHC.
3.1. Experimental design
In this part, we describe the experimental environments such
as,
Experimental tools: We have implemented the proposed hybrid
algorithm – IFCF in addition to the typical standalone methods
of IFS such as De et al. [9], Szmidt and Kacprzyk [44], Samuel
and Balamurugan [31] and RS such as Davis et al. [8] and Hassan
and Syed [13] in PHP programming language and executed
them on a PC Intel(R) core(TM) 2 Duo CPU T6400 @ 2.00 GHz
2 GB RAM. The results are taken as the average value of 50 runs.
Evaluation indices: Mean Absolute Error (MAE) and the computational time.
Objective:
To illustrate the activities of IFCF on an illustrated dataset;
To evaluate the IFCF in comparison with the relevant algorithms in terms of accuracy through evaluation indices.
3.2. Database
In the evaluation, we use three kinds of datasets for
experiments.
The intuitionistic medical diagnosis dataset in [31], which was
used from Example 1 to 5 to illustrate the activities of the relevant algorithms in Section 1;
The benchmark medical diagnosis dataset namely HEART from
UCI Machine Learning Repository [46] (Table 16 and Fig. 2);
A large benchmark medical diagnosis dataset namely RHC
(Right Heart Catheterization) including 5735 critically ill adult
patients receiving care in ICU [6] (Table 16 and Fig. 3).
The cross-validation method for the experiments is the k-fold
validation with k from 2 to 10. The aim for various folds is to
observe the changes of MAE and computational time of algorithms
so that this could help us better analysis of experimental results.
Besides testing with the k-fold validation, the random experiments
with the cardinalities of the testing being from 10 to 100 random
elements are also performed. In order to validate the results with
accurate classes, the intuitionistic defuzzification method [3] is
used for experimental algorithms.
3.3. An illustration of IFCF
In this section, we illustrate the activities of IFCF on the intuitionistic medical diagnosis dataset in [31] described from Tables
1–3. Similar to Example 5, the training dataset is demonstrated
in Table 17 where ⁄ values in this table are needed to be predicted.
From Tables 1 and 17 we have extracted the SC-IFRS dataset in
Table 18 similar to that in Table 15 where ⁄ values in this table are
needed to be predicted.
From Table 18, we realize that a semi-SC-IFRS with the hard
patients being used instead of their features. Thus, the parameters
of the IFCF algorithm are automatically updated as a = 0, b = c = 1/2
and w1i = w2i = w3i = 0.2. From Definition 11, we calculate the IFSD
between Sugu (Somu) and Ram and Mari as follows.
IFSDðSugu; RamÞ ¼ 0:87;
IFSDðSugu; MariÞ ¼ 0:57;
ð82Þ
ð83Þ
IFSDðSomu; RamÞ ¼ 0:83;
ð84Þ
IFSDðSomu; MariÞ ¼ 0:58:
ð85Þ
From Eqs. (82)–(85), we used Definition 12 to calculate the predictive IFM results of Sugu and Somu.
Viral feverð0:49; 0:38Þ;
* Malariað0:52; 0:22Þ
DiseaseðSuguÞ ¼
Fig. 2. The 2D distribution of HEART.
Typhoidð0:36; 0:52Þ;
Stomach problemð0:40; 0:34Þ
Chest problemð0:10; 0:68Þ
+
;
ð86Þ
144
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 17
The training dataset for IFCF with ⁄ being the values to be predicted.
Table 18
The extracted SC-IFRS dataset with ⁄ being the values to be predicted.
P
Viral_fever
Malaria
Typhoid
Stomach
Chest
Ram
Mari
Sugu
Somu
(0.4, 0.1)
(0.3, 0.5)
⁄
⁄
(0.7, 0.1)
(0.2, 0.6)
⁄
⁄
(0.6, 0.1)
(0.4, 0.4)
⁄
⁄
(0.2, 0.4)
(0.6, 0.1)
⁄
⁄
(0.2, 0.6)
(0.1, 0.7)
⁄
⁄
P
S
D
Ram
Temperatureð0:8; 0:1Þ;
*
+
Headacheð0:6; 0:1Þ
Stomach painð0:2; 0:8Þ;
Coughð0:6; 0:1Þ
Chest painð0:1; 0:6Þ
Temperatureð0:0; 0:8Þ;
*
+
Headacheð0:4; 0:4Þ
Stomach painð0:6; 0:1Þ;
Coughð0:1; 0:7Þ
Chest painð0:1; 0:8Þ
Temperatureð0:8; 0:1Þ;
*
+
Headacheð0:8; 0:1Þ
Viral feverð0:4; 0:1Þ;
+
Malariað0:7; 0:1Þ
Typhoidð0:6; 0:1Þ;
Stomach problemð0:2; 0:4Þ
Chest problemð0:2; 0:6Þ
Viral feverð0:3; 0:5Þ;
*
+
Malariað0:2; 0:6Þ
Typhoidð0:4; 0:4Þ;
Stomach problemð0:6; 0:1Þ
Chest problemð0:1; 0:7Þ
⁄
Mari
Viral feverð0:47; 0:39Þ;
* Malariað0:52; 0:22Þ
DiseaseðSomuÞ ¼
Typhoidð0:36; 0:51Þ;
+
:
ð87Þ
Sugu
Stomach problemð0:39; 0:47Þ
Chest problemð0:10; 0:68Þ
Based on the recommendation function of Definition 5 and Eqs. (86)
and (87), we recommend the disease those patients suffer the most
as in Table 19.
From Table 19, we conclude that Sugu and Somu both suffer
from the Malaria.
Somu
Stomach painð0:0; 0:6Þ;
Coughð0:2; 0:7Þ
Chest painð0:0; 0:5Þ
Temperatureð0:6; 0:1Þ;
*
+
Headacheð0:5; 0:4Þ
Stomach painð0:3; 0:4Þ;
Coughð0:7; 0:2Þ
Chest painð0:3; 0:4Þ
*
⁄
Remark 3
(a) The result of IFCF expressed in Table 19 is identical to those
of De et al. [9] in Example 2, Samuel and Balamurugan [31]
in Example 3 and Hassan and Syed [13] in Example 5.
Besides these methods, if we perform other relevant algorithms such as Own [26] and Shinoj and John [33] then the
same results would be given. This proves the correctness
of the proposed method;
(b) IFCF is capable to perform the prediction and recommendation with more types of datasets including the hard and
(intuitionistic) fuzzy data, not only the semi-SC-IFRS dataset
like what being experimented in this section. This affirms
the flexibility and usefulness of IFCF in comparison with
other relevant methods;
(c) The limitation of the defuzzification method stated in
Sections 1.3(c) and 1.4(b) of the relevant works such as De
et al. [9], Szmidt and Kacprzyk [44], Samuel and
Table 19
The recommended diseases where the most suffered one is highlighted in bold.
P
Viral_fever
Malaria
Typhoid
Stomach
Chest
Sugu
Somu
0.5537
0.5358
0.6552
0.6552
0.4032
0.4068
0.504
0.4446
0.122
0.122
Balamurugan [31], Davis et al. [8] and Hassan and Syed
[13] is handled by the recommendation function in SC-IFRS
(Definition 5) and MC-IFRS (Definition 6);
(d) IFCF used both the data about the relations between the
patients – the symptoms and the patients – the diseases
for the prediction, not solely on the data of the patients –
the diseases as in the relevant methods relating to RS such
as Davis et al. [8] and Hassan and Syed [13]. This enhances
the accuracy of diagnosis since more information is count
for the calculation of the algorithm.
Fig. 3. The 2D distribution of RHC.
145
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 20
The results of random experiments on the HEART dataset.
Data sets
10
20
30
40
50
60
70
80
90
100
MAE
IFCF
DAVIS
HASSAN
DE
SAMUEL
SZMIDT
0.49641
0.492051
0.490383
0.488195
0.492451
0.490892
0.491429
0.493946
0.494265
0.493219
0.500286
0.495541
0.492238
0.49185
0.494166
0.494094
0.494952
0.497251
0.49741
0.495757
0.494036
0.488739
0.486775
0.483692
0.487049
0.486138
0.487727
0.490348
0.490197
0.488414
0.489147
0.496361
0.493604
0.47396
0.490438
0.476327
0.480793
0.491512
0.484721
0.483999
0.508045
0.504834
0.509998
0.526556
0.510221
0.523458
0.519985
0.507438
0.517029
0.516403
0.491955
0.495166
0.490002
0.473444
0.489779
0.476542
0.480015
0.492562
0.482971
0.483597
0.646215
0.703126
0.739453
1.192436
1.234968
2.084581
2.360242
3.014339
3.081387
3.305678
0.089682
0.152622
0.211139
0.255561
0.290003
0.304465
0.306179
0.323535
0.324501
0.328335
0.011868
0.028907
0.032683
0.043212
0.043985
0.048735
0.058691
0.078036
0.081523
0.127595
0.009564
0.0347
0.036562
0.037371
0.043722
0.052548
0.053144
0.060445
0.11539
0.11964
0.014055
0.023194
0.029526
0.046678
0.053152
0.058788
0.072443
0.072655
0.082507
0.102687
Computational time (s)
10
20
30
40
50
60
70
80
90
100
0.327558
0.796293
1.166132
1.221915
1.232003
1.260366
1.278752
1.469952
1.613406
2.210751
Table 21
The results of k-fold cross validation on the HEART dataset.
Fold
2
3
4
5
6
7
8
9
10
MAE
IFCF
DAVIS
HASSAN
DE
SAMUEL
SZMIDT
0.495733
0.494624
0.490572
0.488722
0.489378
0.492201
0.492132
0.488487
0.487778
0.498977
0.497455
0.49394
0.492826
0.493186
0.496148
0.496581
0.492575
0.491602
0.491243
0.489978
0.486429
0.485395
0.484268
0.488301
0.488893
0.485452
0.484024
0.479124
0.474681
0.491674
0.474125
0.474017
0.468833
0.492033
0.486527
0.490158
0.521357
0.525323
0.510025
0.525475
0.526214
0.527847
0.509502
0.515524
0.514568
2.060489
0.474677
0.489975
0.474525
0.474768
0.472153
0.490498
0.484476
0.485432
4.059911
4.039289
3.383637
3.208319
2.952161
2.342186
2.186523
2.150605
0.492826
1.572734
1.431444
1.335837
1.137519
1.108358
1.101027
1.097188
0.956852
0.863702
0.213536
0.191975
0.09894
0.092019
0.088419
0.076773
0.057825
0.047176
0.045803
0.21604
0.103076
0.100901
0.083503
0.04564
0.043674
0.034734
0.029806
0.019865
0.207523
0.144031
0.128371
0.086687
0.075412
0.07241
0.06922
0.0414
0.037433
Data sets
Computational time (s)
2
3
4
5
6
7
8
9
10
3.225795
3.158217
2.821404
2.70524
2.310253
2.166859
2.088267
2.060489
1.889909
3.4. Assessment
In this section, we perform the experiments on the benchmark
medical diagnosis datasets namely HEART and RHC. The experimental results are described from Tables 20–23. The MAE results
of random experiments (resp. k-fold cross validation) on the HEART
dataset is illustrated in Fig. 4 (resp. Fig. 5). Analogously, the MAE
results of random experiments (resp. k-fold cross validation) on
the RHC dataset is illustrated in Fig. 6 (resp. Fig. 7). The discussion
of the experimental results is demonstrated in Remark 4.
Remark 4
(a) Tables 20 and 21 have revealed that the MAE values of IFCF
are approximate to those of other algorithms. They are
better than those of Davis et al. [8] and Samuel and
Balamurugan [31] only for this dataset. Specifically, the
average MAE values of IFCF, Davis et al. [8], Hassan and Syed
[13], De et al. [9], Samuel and Balamurugan [31] and Szmidt
and Kacprzyk [44] in Table 20 are 0.4923241, 0.4953545,
0.4883115, 0.4860862, 0.5143967 and 0.4856033, respectively. These numbers in Table 21 are 0.491069667,
0.49481, 0.487109222, 0.481241333, 0.519537222 and
0.656332556, respectively. Figs. 4 and 5 illustrate this fact.
This shows that the proposed algorithm IFCF is not really
effective in case that the dataset contains hard values and
is small-sized & concentrative in a small range such as the
HEART dataset;
(b) Tables 22 and 23 show other results of MAE in cases of a large
dataset such as RHC. The average MAE values of IFCF, Davis
et al. [8], Hassan and Syed [13], De et al. [9], Samuel and Balamurugan [31] and Szmidt and Kacprzyk [44]
in Table 22 are 0.43716495, 0.4374099, 0.4504119,
146
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Table 22
The results of random experiments on the RHC dataset.
Data sets
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
MAE
IFCF
DAVIS
HASSAN
DE
SAMUEL
SZMIDT
0.442175
0.439844
0.438635
0.438282
0.436268
0.436115
0.437641
0.436992
0.440996
0.435963
0.436537
0.435991
0.435262
0.438705
0.437769
0.435194
0.434399
0.435067
0.436633
0.434831
0.44247
0.440324
0.438546
0.438322
0.436218
0.436708
0.438068
0.436864
0.441618
0.436172
0.436788
0.436306
0.435451
0.439116
0.437992
0.435462
0.434582
0.435319
0.436888
0.434984
0.441502
0.451435
0.450023
0.450814
0.444011
0.438171
0.453067
0.449501
0.450962
0.441205
0.453276
0.458081
0.438105
0.45581
0.463094
0.444957
0.451129
0.463573
0.448817
0.460705
0.479901
0.482438
0.481689
0.476943
0.474844
0.480948
0.480032
0.47773
0.481943
0.475933
0.480885
0.478958
0.478893
0.481182
0.481186
0.476458
0.479923
0.476773
0.483203
0.478549
0.52626
0.524069
0.524843
0.529039
0.531077
0.525275
0.526245
0.528056
0.523331
0.529638
0.525715
0.526851
0.527255
0.524615
0.524855
0.529806
0.526656
0.529274
0.523412
0.527785
0.47374
0.475931
0.475157
0.470961
0.468923
0.474725
0.473755
0.471944
0.476669
0.470362
0.474285
0.473149
0.472745
0.475385
0.475145
0.470194
0.473344
0.470726
0.476588
0.472215
33.57283
65.9519
87.0333
113.8669
135.3158
152.9428
168.4303
189.6871
219.4764
249.4249
262.0359
275.2695
291.7030
315.0818
333.7332
351.0495
385.3729
402.2348
421.2111
442.4772
12.06656
30.21522
51.16982
67.81981
83.36726
96.91099
113.4414
129.7520
142.0267
156.7367
169.7921
180.1441
191.0308
202.0023
204.5337
213.1942
217.8114
226.5664
262.3495
287.3276
0.010758
0.015019
0.021002
0.038148
0.049271
0.056363
0.061749
0.067974
0.081742
0.096007
0.108004
0.110438
0.128739
0.138693
0.153086
0.163775
0.169128
0.185551
0.205338
0.216944
0.010931
0.019594
0.028967
0.038453
0.047737
0.056865
0.066663
0.076921
0.087006
0.094882
0.112752
0.134178
0.143444
0.151820
0.166440
0.170800
0.187068
0.199256
0.208938
0.237930
0.014407
0.028265
0.041422
0.055456
0.068599
0.081633
0.095873
0.113722
0.123928
0.135023
0.154736
0.162411
0.188254
0.210567
0.214335
0.242015
0.252885
0.268898
0.273174
0.306299
Computational time (s)
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
40.06698
77.14874
113.0401
147.2283
180.2224
211.1297
242.7504
272.7885
319.4657
325.5207
344.3712
365.4371
385.1481
404.1568
421.5027
436.5416
453.7668
465.9425
478.8142
597.5560
Table 23
The results of k-fold cross validation on the RHC dataset.
Fold
MAE
IFCF
DAVIS
HASSAN
DE
SAMUEL
SZMIDT
2
3
4
5
6
7
8
9
10
0.440949
0.437283
0.434009
0.436070
0.435874
0.437098
0.434386
0.437215
0.441839
0.441186
0.437374
0.434141
0.436107
0.436164
0.437535
0.434495
0.437222
0.442047
0.469381
0.463187
0.458663
0.464382
0.436214
0.461758
0.44394
0.435230
0.457397
0.481136
0.480916
0.477842
0.479325
0.482992
0.479249
0.481571
0.479689
0.477239
0.524992
0.525445
0.528936
0.527072
0.523708
0.526433
0.525246
0.52695
0.527824
0.475008
0.474555
0.471064
0.472928
0.476292
0.473567
0.474754
0.47305
0.472176
Data sets
Computational time (s)
2
3
4
5
6
7
8
9
10
736.6445
701.3804
692.8024
629.4906
566.3853
508.6334
456.6042
436.1555
409.1319
477.9642
476.6182
450.9788
424.9808
396.0038
370.6705
349.1144
329.7963
319.6725
262.6263
262.9038
251.8981
238.3611
226.2907
215.6599
206.5204
198.37997
194.55609
0.791994
0.503385
0.366384
0.291197
0.229366
0.113842
0.051412
0.037128
0.013360
0.942148
0.712606
0.617685
0.615317
0.494319
0.346833
0.287202
0.277723
0.250653
1.651110
1.358187
1.215837
0.993458
0.780865
0.356283
0.321834
0.288569
0.275802
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
147
Fig. 4. The MAE results of random experiments on the HEART dataset.
Fig. 5. The MAE results of k-fold cross validation on the HEART dataset.
0.47942055, 0.52670285 and 0.47329715, respectively.
These numbers in Table 23 are 0.437191444, 0.437363444,
0.454461333, 0.479995444, 0.526289556 and 0.473710444,
respectively. In this case, the MAE value of IFCF is the smallest
among all. Looking for the comparisons of algorithms in
terms of MAE in Figs. 6 and 7 also illustrates this fact. This
show that IFCF is effective in cases of large medical diagnosis
datasets having wide ranges of values;
148
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
Fig. 6. The MAE results of random experiments on the RHC dataset.
Fig. 7. The MAE results of k-fold cross validation on the RHC dataset.
(c) The computational time is a drawback of the IFCF algorithm.
In cases of small-sized datasets such as HEART, the computational time of IFCF is ranged from 1.2 to 2.5 s on average
and is not quite larger than those of other algorithms that
take approximately from 0.3 to 1.8 s to process. Yet in cases
of large-size datasets such as RHC, the difference is getting
larger and obvious. Thus a better trade-off between the computational time and the accuracy of diagnosis should be paid
much attention for the assurance of the performance of the
algorithm.
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
4. Conclusions
In this paper, we concentrated on the problem of enhancing
the accuracy of medical diagnosis and presented a novel intuitionistic fuzzy recommender system (IFRS) consisting of the following components: (i) the new definitions of single-criterion
IFRS (SC-IFRS) and multi-criteria IFRS (MC-IFRS) that extend the
definition of traditional recommender systems (RS) taking into
account a feature of a user and a characteristic of an item
expressed by intuitionistic linguistic labels; (ii) the new definitions of intuitionistic fuzzy matrix (IFM), which is a representation of SC-IFRS and MC-IFRS in the matrix format and the
intuitionistic fuzzy composition matrix (IFCM) of two IFMs with
the intersection/union operation; (iii) some new similarity
degrees of IFMs such as the intuitionistic fuzzy similarity matrix
(IFSM) and the intuitionistic fuzzy similarity degree (IFSD) and
the formulas to predict diseases on the basis of IFSD accompanied
with an interesting theorem; (iv) a novel intuitionistic fuzzy collaborative filtering method relying on the basis of the predicting
formulas so-called IFCF. Some interesting theorems and properties of the proposed components were also investigated. The
proposed IFCF algorithm was used mainly for the medical diagnosis problem. Some numerical examples have been introduced
throughout the paper to illustrate the problem and the activities
of the algorithms.
In the Evaluation section, a numerical example on a small intuionistic fuzzy medical diagnosis data was presented. The next
experiments were conducted on both the small and large real hard
benchmark medical diagnosis data from UCI Machine Learning
Repository. The findings from the experiments are summarized
as follows: (i) the proposed IFCF algorithm is capable to perform
the prediction and recommendation with more types of datasets
including the hard and (intuitionistic) fuzzy data than other algorithms such as the standalone methods of intuitionistic fuzzy sets
(IFS) such as De et al. [9], Szmidt and Kacprzyk [44], Samuel and
Balamurugan [31] and RS such as Davis et al. [8] and Hassan and
Syed [13]; (ii) IFCF has better accuracy of prediction than those
algorithms in cases of the (intuitionistic) fuzzy data and the large
medical diagnosis datasets having wide ranges of values; (iii) IFCF
could handle the limitations of those works as pointed out in Section 1.6(b) of this article; (iv) Lastly, a better trade-off between the
computational time and the accuracy of diagnosis in IFCF should be
paid much attention for the assurance of the performance of the
algorithm. These findings clearly show the effectiveness of the proposed algorithm.
As being mentioned in the Section 1.6 stressing the importance
and significance of the proposed work, some solid further works
are necessary to impulse the development of the algorithm in this
field. Firstly, a variation of IFCF algorithm that tackles with the
deficiency of processing small-sized real datasets should be studied. Secondly, a trade-off between the computational time and
the accuracy of diagnosis in IFCF is examined. Thirdly, a hybrid
algorithm between IFCF and a fuzzy clustering method to enhance
the accuracy is considered. Fourthly, the theoretical analyses of the
IFRS especially the formulation of IFCM with other operations such
as t-norm and t-conorm are examined. Lastly, applications of IFRS
for other problems, e.g. the time series forecast and the nowcasting
could be performed. These future works will enrich the knowledge
of deploying advanced fuzzy recommender systems for practical
problems.
Acknowledgement
The authors are greatly indebted to the editors-in-chief, Prof. H.
Fujita, Prof. J. Lu and anonymous reviewers for their comments and
149
their valuable suggestions that improved the quality and clarity of
paper. This work is sponsored by the NAFOSTED under Contract
No. 102.05-2014.01.
References
[1] M. Agarwal, M. Hanmandlu, K.K. Biswas, Generalized intuitionistic fuzzy soft
set and its application in practical medical diagnosis problem, in: Proceeding of
IEEE International Conference on Fuzzy Systems (FUZZ 2011), 2011, pp. 2972–
2978.
[2] J.Y. Ahn, K.S. Han, S.Y. Oh, C.D. Lee, An application of interval-valued
intuitionistic fuzzy sets for medical diagnosis of headache, Int. J. Innovative
Comput. Inform. Control 7 (5) (2011) 2755–2762.
[3] G. Albeanu, F.L. Popentiu-Vladicescu, Intuitionistic fuzzy methods in software
reliability modelling, J. Sustainable Energy 1 (1) (2010) 30–34.
[4] K.T. Atanassov, Intuitionistic fuzzy sets, Fuzzy Sets Syst. 20 (1) (1986) 87–96.
[5] G. Bernegger, M. Musalek, C. Rehmann-Sutter, An alternative view on the task
of prognosis, Critical Rev. Oncol./Hematol. 84 (2012) S17–S24.
[6] A.F. Connors et al., The effectiveness of right heart catheterization in the initial
care of critically III patients, Jama 276 (11) (1996) 889–897.
[7] B.C. Cuong, L.H. Son, H.T.M. Chau, Some context fuzzy clustering methods for
classification problems, in: Proceedings of the 2010 ACM Symposium on
Information and Communication Technology, 2010, pp. 34–40.
[8] D.A. Davis, N.V. Chawla, N. Blumm, N. Christakis, A.L. Barabási, Predicting
individual disease risk based on medical history, in: Proceedings of the 17th
ACM Conference on Information and Knowledge Management, 2008, pp. 769–
778.
[9] S.K. De, R. Biswas, A.R. Roy, An application of intuitionistic fuzzy sets in
medical diagnosis, Fuzzy Sets Syst. 117 (2) (2001) 209–213.
[10] L. Duan, W.N. Street, E. Xu, Healthcare information systems: data mining
methods in the creation of a clinical recommender system, Enterprise Inform.
Syst. 5 (2) (2011) 169–181.
[11] F. Feng, C. Li, B. Davvaz, M.I. Ali, Soft sets combined with fuzzy sets and rough
sets: a tentative approach, Soft Comput. 14 (9) (2010) 899–911.
[12] F. Feng, X. Liu, V. Leoreanu-Fotea, Y.B. Jun, Soft sets and soft rough sets, Inform.
Sci. 181 (6) (2011) 1125–1137.
[13] S. Hassan, Z. Syed, From netflix to heart attacks: collaborative filtering in
medical datasets, in: Proceedings of the 1st ACM International Health
Informatics Symposium, 2010, pp. 128–134.
[14] R. Hosseini, T. Ellis, M. Mazinani, J. Dehmeshki, A genetic fuzzy approach for
rule extraction for rule-based classification with application to medical
diagnosis, in: Proceeding of European Conference on Machine Learning and
Principles and Practice of Knowledge Discovery in Databases (ECML PKDD),
2011, pp. 5–9.
[15] W.L. Hung, M.S. Yang, On similarity measures between intuitionistic fuzzy
sets, Int. J. Intell. Syst. 23 (3) (2008) 364–383.
[16] M. Irfan Ali, A note on soft sets, rough soft sets and fuzzy soft sets, Appl. Soft
Comput. 11 (4) (2011) 3329–3332.
[17] E. Jafarian, M.A. Rezvani, A valuation-based method for ranking the
intuitionistic fuzzy numbers, J. Intell. Fuzzy Syst. 24 (1) (2013) 133–144.
[18] R. Kala, R.R. Janghel, R. Tiwari, A. Shukla, Diagnosis of breast cancer by modular
evolutionary neural networks, Int. J. Biomed. Eng. Technol. 7 (2) (2011) 194–
211.
[19] V. Khatibi, G.A. Montazer, Intuitionistic fuzzy set vs. fuzzy set application in
medical pattern recognition, Artif. Intell. Med. 47 (1) (2009) 43–52.
[20] I. Kononenko, Machine learning for medical diagnosis: history, state of the art
and perspective, Artif. Intell. Med. 23 (1) (2001) 89–109.
[21] A.R. Meenakshi, M. Kaliraja, An application of interval valued fuzzy matrices in
medical diagnosis, Int. J. Math. Anal. 5 (36) (2011) 1791–1802.
[22] Meisamshabanpoor, M. Mahdavi, Implementation of a recommender system
on medical recognition and treatment, Int. J. e-Education, e-Business, eManagement e-Learning 2 (4) (2012) 315–318.
[23] D. Meng, X. Zhang, K. Qin, Soft rough fuzzy sets and soft fuzzy rough sets,
Comput. Math. Appl. 62 (12) (2011) 4635–4645.
[24] S. Moein, S.A. Monadjemi, P. Moallem, A novel fuzzy-neural based medical
diagnosis system, Int. J. Biol. Med. Sci. 4 (3) (2009) 146–150.
[25] T.J. Neog, D.K. Sut, An application of fuzzy soft sets in medical diagnosis using
fuzzy soft complement, Int. J. Comput. Appl. 33 (2011) 30–33.
[26] C.M. Own, Switching between type-2 fuzzy sets and intuitionistic fuzzy sets:
an application in medical diagnosis, Appl. Intell. 31 (3) (2009) 283–291.
[27] L. Parthiban, R. Subramanian, Intelligent heart disease prediction system using
CANFIS and genetic algorithm, Int. J. Biol. Biomed. Med. Sci. 3 (3) (2008) 157–
160.
[28] Z. Pawlak, Rough sets and intelligent data analysis, Inform. Sci. 147 (1) (2002)
1–12.
[29] F. Ricci, L. Rokach, B. Shapira, in: Introduction to Recommender Systems
Handbook, Springer, US, 2011, pp. 1–35.
[30] R.M. Rodriguez, L. Martinez, F. Herrera, Hesitant fuzzy linguistic term sets for
decision making, IEEE Trans. Fuzzy Syst. 20 (1) (2012) 109–119.
[31] A.E. Samuel, M. Balamurugan, Fuzzy max–min composition technique in
medical diagnosis, Appl. Math. Sci. 6 (35) (2012) 1741–1746.
[32] E. Sanchez, Resolution of composition fuzzy relation equations, Inform. Control
30 (1976) 38–48.
150
L.H. Son, N.T. Thong / Knowledge-Based Systems 74 (2015) 133–150
[33] T.K. Shinoj, S.J. John, Intuitionistic Fuzzy Multi sets and its Application in
Medical Diagnosis, World Acad. Sci. Eng. Technol. 6 (2012) 1418–1421.
[34] L.H. Son, Enhancing clustering quality of geo-demographic analysis using
context fuzzy clustering type-2 and particle swarm optimization, Appl. Soft
Comput. 22 (2014) 566–584.
[35] L.H. Son, HU-FCF: a hybrid user-based fuzzy collaborative filtering method in
recommender systems, Expert Syst. Appl. 41 (15) (2014) 6861–6870.
[36] L.H. Son, Optimizing municipal solid waste collection using chaotic particle
swarm optimization in GIS based environments: a case study at Danang City,
Vietnam, Expert Syst. Appl. 41 (18) (2014) 8062–8074.
[37] L.H. Son, DPFCM: a novel distributed picture fuzzy clustering method on
picture fuzzy sets, Expert Syst. Appl. 42 (1) (2015) 51–66.
[38] L.H. Son, B.C. Cuong, P.L. Lanzi, N.T. Thong, A novel intuitionistic fuzzy
clustering method for geo-demographic analysis, Expert Syst. Appl. 39 (10)
(2012) 9848–9859.
[39] L.H. Son, B.C. Cuong, H.V. Long, Spatial interaction–modification model and
applications to geo-demographic analysis, Knowl.-Based Syst. 49 (2013) 152–
170.
[40] L.H. Son, P.L. Lanzi, B.C. Cuong, H.A. Hung, Data mining in GIS: a novel contextbased fuzzy geographically weighted clustering algorithm, Int. J. Machine
Learning Comput. 2 (3) (2012) 235–238.
[41] L.H. Son, N.D. Linh, H.V. Long, A lossless DEM compression for fast retrieval
method using fuzzy clustering and MANFIS neural network, Eng. Appl. Artif.
Intell. 29 (2014) 33–42.
[42] E. Szmidt, J. Kacprzyk, Intuitionistic fuzzy sets in some medical applications,
in: Proceeding of Computational Intelligence: Theory and Applications, 2001,
pp. 148–151.
[43] E. Szmidt, J. Kacprzyk, An intuitionistic fuzzy set based approach to intelligent
data analysis: an application to medical diagnosis, in: Proceeding of Recent
Advances in Intelligent Paradigms and Applications, 2003, pp. 57–70.
[44] E. Szmidt, J. Kacprzyk, A similarity measure for intuitionistic fuzzy sets
and its application in supporting medical diagnostic reasoning, in: Proceeding
of Artificial Intelligence and Soft Computing (ICAISC 2004), 2004, pp. 388–
393.
[45] K.C. Tan, Q. Yu, C.M. Heng, T.H. Lee, Evolutionary computing for knowledge
discovery in medical diagnosis, Artif. Intell. Med. 27 (2) (2003) 129–154.
[46] University of California, UCI Repository of Machine Learning Databases, 2007.
< />[47] Z. Xiao, X. Yang, Q. Niu, Y. Dong, K. Gong, S. Xia, Y. Pang, A new evaluation
method based on D–S generalized fuzzy soft sets and its application in medical
diagnosis problem, Appl. Math. Modell. 36 (10) (2012) 4592–4604.
[48] Y. Yao, Three-way decisions with probabilistic rough sets, Inform. Sci. 180 (3)
(2010) 341–353.
[49] L.A. Zadeh, Fuzzy sets, Inform. Control 8 (1965) 338–353.
[50] X. Zhang, B. Zhou, P. Li, A general frame for intuitionistic fuzzy rough sets,
Inform. Sci. 216 (2012) 34–49.