Saltar navegación Este enlace salta al contenido informativo de la página
Ir a página principal de um.es
      NEWS



  • Homage to Lofti A. Zadeh

    At IPMU2018 a well-deserved homage will be paid to his scientific work.
    Cádiz, Spain, June 11th - 15th, 2018

  • ISAPEP'21

    5th I W on Intelligent Systems for Agriculture Production and Environment Protection
    Dubai, United Arab Emirates, June 21th - 22th, 2021

  • NIP
    NIP-Software tool to manage
    low quality datasets
    © Univ. Murcia 2012
    R.P.I. nº 8/2012/700

  • FCTA 2011
    Best Student Paper Award
    "Constructing Fuzzy Partitions from Imprecise Data"
    J.M. Cadenas, M.C. Garrido, R. Martinez

  • FCTA 2012
    Best Paper Award
    "Towards an Approach to Select Features from Low Quality Datasets"
    J.M. Cadenas, M.C. Garrido, R. Martinez

DataSets repository

Datasets with imperfect values

Datasets with missing values in real/nominal attributes, fuzzy values in real attributes and fuzzy subset values in nominal attributes

Below you can find the datasets available used for the task of imputation and condensation based on low quality data. For each dataset, it is shown its name, attributes (the table details the number of Real/Nominal attributes in the data), number of instances, and classes (number of possible values of the output variable).

In addition, the table shows if the corresponding dataset has missing, and fuzzy and fuzzy subset values (the table shows the percentage of instances with imperfect values).

The table allows to download each dataset (inside a ZIP file).

with missing values, fuzzy values and fuzzy subset values


Name #Attributes
(R/N)
#Examp. #Class. Missing
values
Fuzzy and
fuzzy subsets
values
Dataset Fuzzy and
fuzzy subsets
values
Dataset
(AUS) Australian
14  (8/6)
690
2
No (0%)
Yes (9.57%)
zip.gif
Yes (27.54%)
zip.gif
(AUT) Automobile
25  (15/10)
205
6
Yes (26.83%)
Yes (40.49%)
zip.gif
Yes (60.00%)
zip.gif
(BAL) Balance
4  (4/0)
625
3
No (0.0%)
Yes (3.84%)
zip.gif
Yes (11.84%)
zip.gif
(BAN) Banana
2  (2/0)
5300
2
No (0.0%)
Yes (2.00%)
zip.gif
Yes (5.81%)
zip.gif
(BAD) Bands
19  (19/0)
539
2
Yes (32.28%)
Yes (43.23%)
zip.gif
Yes (62.15%)
zip.gif
(BRE) Breast
9  (0/9)
286
2
Yes (3.15%)
Yes (9.44%)
zip.gif
Yes (18.88%)
zip.gif
(CAR) Car
6  (0/6)
1728
4
No (0.0%)
Yes (5.73%)
zip.gif
Yes (17.42%)
zip.gif
(CHE) Chess
36  (0/36)
3196
2
No (0.0%)
Yes (1.00%)
zip.gif
Yes (3.00%)
zip.gif
(COI) Coil-2000
85  (85/0)
9822
2
No (0.0%)
Yes (57.42%)
zip.gif
Yes (92.36%)
zip.gif
(CON) Contraceptive
9  (9/0)
1473
3
No (0.0%)
Yes (8.76%)
zip.gif
Yes (23.56%)
zip.gif
(DER) Dermatology
34  (34/0)
366
6
Yes (2.19%)
Yes (31.69%)
zip.gif
Yes (65.57%)
zip.gif
(FLA) Flare-solar
11  (0/11)
1066
6
No (0.00%)
Yes (6.38%)
zip.gif
Yes (18.01%)
zip.gif
(GER) German
20  (7/13)
1000
2
No (0.00%)
Yes (21.20%)
zip.gif
Yes (51.70%)
zip.gif
(HEA) Heart
13  (13/0)
270
2
No (0.00%)
Yes (13.33%)
zip.gif
Yes (33.70%)
zip.gif
(MAM) Mammographic
5  (5/0)
961
2
No (0.00%)
Yes (13.63%)
zip.gif
Yes (25.81%)
zip.gif
(NUR) Nursey
8  (0/8)
12960
5
No (0.00%)
Yes (6.72%)
zip.gif
Yes (19.21%)
zip.gif
(PAG) Page-block
10  (10/0)
5472
5
No (0.00%)
Yes (9.61%)
zip.gif
Yes (26.21%)
zip.gif
(PEN) Penbased
16  (16/0)
10992
10
No (0.00%)
Yes (14.80%)
zip.gif
Yes (38.97%)
zip.gif
(PHO) Phoneme
5  (5/0)
5404
2
No (0.00%)
Yes (4.85%)
zip.gif
Yes (14.21%)
zip.gif
(PIM) Pima
8  (8/0)
768
2
No (0.00%)
Yes (7.81%)
zip.gif
Yes (21.22%)
zip.gif
(RIN) Ring
20  (20/0)
7400
2
No (0.00%)
Yes (18.31%)
zip.gif
Yes (45.77%)
zip.gif
(SAT) Satimage
36  (36/0)
6435
7
No (0.00%)
Yes (30.51%)
zip.gif
Yes (67.10%)
zip.gif
(SEG) Segment
19  (19/0)
2310
7
No (0.00%)
Yes (17.01%)
zip.gif
Yes (42.73%)
zip.gif
(SON) Sonar
60  (60/0)
208
2
No (0.00%)
Yes (41.35%)
zip.gif
Yes (83.17%)
zip.gif
(SPA) Spambase
57  (57/0)
4597
2
No (0.00%)
Yes (19.34%)
zip.gif
Yes (46.75%)
zip.gif
(SPE) Spectfheart
44  (44/0)
267
2
No (0.00%)
Yes (28.84%)
zip.gif
Yes (57.30%)
zip.gif
(SPL) Splice
60  (0/60)
3190
3
No (0.00%)
Yes (44.89%)
zip.gif
Yes (84.08%)
zip.gif
(TEX) Texture
40  (40/0)
5500
11
No (0.00%)
Yes (33.31%)
zip.gif
Yes (70.58%)
zip.gif
(THY) Thyroid
21  (21/0)
7200
3
No (0.00%)
Yes (18.97%)
zip.gif
Yes (47.61%)
zip.gif
(TIC) Tic-tac-toe
9  (0/9)
958
2
No (0.00%)
Yes (8.87%)
zip.gif
Yes (23.90%)
zip.gif
(TIT) Titanic
3  (3/0)
2201
2
No (0.00%)
Yes (2.91%)
zip.gif
Yes (8.81%)
zip.gif
(TWO) Twonorm
20  (20/0)
7400
2
No (0.00%)
Yes (18.01%)
zip.gif
Yes (46.03%)
zip.gif
(VOW) Vowel
13  (13/0)
990
11
No (0.0%)
Yes (12.93%)
zip.gif
Yes (32.42%)
zip.gif
(WIS) Wisconsin
9  (9/0)
683
2
Yes (2.29%)
Yes (10.73%)
zip.gif
Yes (25.18%)
zip.gif
(YEA) Yeast
8  (8/0)
1484
10
No (0.00%)
Yes (7.68%)
zip.gif
Yes (21.29%)
zip.gif
(ZOO) Zoo
16  (0/16)
101
7
No (0.00%)
Yes (0.99%)
zip.gif
Yes (2.97%)
zip.gif
All datasets
zip.gif
info.txt
info.pdf


                                                                               Go to "Datasets Repository"
 
                                                                               Go to "Datasets Repository and Results"