Wisconsin Breast Cancer Database


Do you have cancer or not?

You must have

1. Sources: This breast cancer database was obtained from the University of Wisconsin Hospital, Madison from Dr. William H. Wolberg. Please see: O. L. Mangasarian and W. H. Wolberg: "Cancer diagnosis via linear programming", SIAM News, Volume 23, Number 5, September 1990, pp. 1 & 18.

2. Information

3. Attribute Information:

# Attribute                             Domain

-- -----------------------------------------
1. Sample code number          id number
________________________________________
2. Clump Thickness                1 - 10
3. Uniformity of Cell Size        1 - 10
4. Uniformity of Cell Shape     1 - 10
5. Marginal Adhesion              1 - 10
6. Single Epithelial Cell Size     1 - 10
7. Bare Nuclei                         1 - 10
8. Bland Chromatin                  1 - 10
9. Normal Nucleoli                  1 - 10
10. Mitoses                             1 - 10
________________________________________
11. Class:             (2 for benign, 4 for malignant)

4. Database

This Cancer Database file has 663 instances. This database does not have enough instances, therefore, 100% accuracy will not be achieved in this case. However, even with this limited database, the DecisionMaker will be able to give you an 85% overall accuracy rate. The number of required instances to achieve 100% accuracy is listed below. The number of available instances in this case is also listed below.

Class             #Required Instances         #Actual Inatances

2 = benign             1000                             458
4 = malignant         1000                             241
Total                         2000                         663

Below are the first 10 instances:

Cancer Database File:
 
1000025
5
1
1
1
2
1
3
1
1
2
1002945
5
4
4
5
7
10
3
2
1
2
1015425
3
1
1
1
2
2
3
1
1
2
1016277
6
8
8
1
3
4
3
7
1
2
1017023
4
1
1
3
2
1
3
1
1
2
1017122
8
10
10
8
7
10
9
7
1
4
1018099
1
1
1
1
2
10
3
1
1
2
1018561
2
1
2
1
2
1
3
1
1
2
1033078
2
1
1
1
2
1
1
1
5
2
1033078
4
2
1
1
2
1
2
1
1
2

... ...

Cancer database file.

Below are the last 20 instances, which will be used to test the DecisionMaker's accuracy:

Question                                     Answer
 
1368882
2
1
1
1
2
1
1
1
1
2
1369821
10
10
10
10
5
10
10
10
7
4
1371026
5
10
10
10
4
10
5
6
3
4
1371920
5
1
1
1
2
1
3
2
1
2
466906
1
1
1
1
2
1
1
1
1
2
466906
1
1
1
1
2
1
1
1
1
2
534555
1
1
1
1
2
1
1
1
1
2
536708
1
1
1
1
2
1
1
1
1
2
566346
3
1
1
1
2
1
2
3
1
2
603148
4
1
1
1
2
1
1
1
1
2
654546
1
1
1
1
2
1
1
1
8
2
654546
1
1
1
3
2
1
1
1
1
2
695091
5
10
10
5
4
5
4
4
1
4
714039
3
1
1
1
2
1
1
1
1
2
763235
3
1
1
1
2
1
2
1
2
2
776715
3
1
1
1
3
2
1
1
1
2
841769
2
1
1
1
2
1
1
1
1
2
888820
5
10
10
3
7
3
8
10
2
4
897471
4
8
6
4
3
4
10
6
1
4
897471
4
8
8
5
4
5
10
4
1
4

Question file.
 

5. Results

Click "Average/+ Integer" (second of 2 clicks) to get the answer. The running time is one second, and the following Answer file is opened automatically:

=================== Beginning =====================
 
 

*

Wisconsin Breast Cancer Database

Question File

· benign = 2 or

· malignant = 4.

Command: Average/+ Integer

Precision: 10.

* 2 1 1 1 2 1 1 1 1

2

10 10 10 10 5 10 10 10 7

4

5 10 10 10 4 10 5 6 3

4

5 1 1 1 2 1 3 2 1

2

1 1 1 1 2 1 1 1 1

2

1 1 1 1 2 1 1 1 1

2

1 1 1 1 2 1 1 1 1

2

1 1 1 1 2 1 1 1 1

2

3 1 1 1 2 1 2 3 1

2

4 1 1 1 2 1 1 1 1

2

1 1 1 1 2 1 1 1 8

2

1 1 1 3 2 1 1 1 1

2

5 10 10 5 4 5 4 4 1

4

3 1 1 1 2 1 1 1 1

2

3 1 1 1 2 1 2 1 2

2

3 1 1 1 3 2 1 1 1

2

2 1 1 1 2 1 1 1 1

2

5 10 10 3 7 3 8 10 2

Can not make a Prediction.

To get a prediction, you can:

(1) Add more data;

(2) If you do not have more data; then click 'Data/link' and

reduce the precision level;

(3) You may also consider

to reduce the number of variables in your model.

See User's Guide

4 8 6 4 3 4 10 6 1

Can not make a Prediction.

To get a prediction, you can:

(1) Add more data;

(2) If you do not have more data; then click 'Data/link' and

reduce the precision level;

(3) You may also consider

to reduce the number of variables in your model.

See User's Guide

4 8 8 5 4 5 10 4 1

Can not make a Prediction.

To get a prediction, you can:

(1) Add more data;

(2) If you do not have more data; then click 'Data/link' and

reduce the precision level;

(3) You may also consider

to reduce the number of variables in your model.

See User's Guide

Precision of each number:

0.11

=================== End ==========================
 
 

6. Results

In the Answer file, the remark section: * . . . *, is borrowed from the Question file. Out of the 20 predictions, the DecisionMaker made 17 predictions. These predictions are 100% correct. The DecisionMaker can not handle the last three cases, based on the training received from the 663 instances. More training is required for these 3 instances.

The DecisionMaker is capable of achieving 100% accuracy. Please read chapter 4 on how to achieve 100% accuracy.