Skip to content

Commit 5c57694

Browse files
authored
DOC Add dropdowns to module 7.1 Toy datasets (scikit-learn#26710)
1 parent 952f480 commit 5c57694

File tree

5 files changed

+82
-62
lines changed

5 files changed

+82
-62
lines changed

sklearn/datasets/descr/breast_cancer.rst

Lines changed: 16 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -104,15 +104,19 @@ This database is also available through the UW CS ftp server:
104104
ftp ftp.cs.wisc.edu
105105
cd math-prog/cpo-dataset/machine-learn/WDBC/
106106

107-
.. topic:: References
108-
109-
- W.N. Street, W.H. Wolberg and O.L. Mangasarian. Nuclear feature extraction
110-
for breast tumor diagnosis. IS&T/SPIE 1993 International Symposium on
111-
Electronic Imaging: Science and Technology, volume 1905, pages 861-870,
112-
San Jose, CA, 1993.
113-
- O.L. Mangasarian, W.N. Street and W.H. Wolberg. Breast cancer diagnosis and
114-
prognosis via linear programming. Operations Research, 43(4), pages 570-577,
115-
July-August 1995.
116-
- W.H. Wolberg, W.N. Street, and O.L. Mangasarian. Machine learning techniques
117-
to diagnose breast cancer from fine-needle aspirates. Cancer Letters 77 (1994)
118-
163-171.
107+
|details-start|
108+
**References**
109+
|details-split|
110+
111+
- W.N. Street, W.H. Wolberg and O.L. Mangasarian. Nuclear feature extraction
112+
for breast tumor diagnosis. IS&T/SPIE 1993 International Symposium on
113+
Electronic Imaging: Science and Technology, volume 1905, pages 861-870,
114+
San Jose, CA, 1993.
115+
- O.L. Mangasarian, W.N. Street and W.H. Wolberg. Breast cancer diagnosis and
116+
prognosis via linear programming. Operations Research, 43(4), pages 570-577,
117+
July-August 1995.
118+
- W.H. Wolberg, W.N. Street, and O.L. Mangasarian. Machine learning techniques
119+
to diagnose breast cancer from fine-needle aspirates. Cancer Letters 77 (1994)
120+
163-171.
121+
122+
|details-end|

sklearn/datasets/descr/digits.rst

Lines changed: 16 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -32,15 +32,19 @@ T. Candela, D. L. Dimmick, J. Geist, P. J. Grother, S. A. Janet, and C.
3232
L. Wilson, NIST Form-Based Handprint Recognition System, NISTIR 5469,
3333
1994.
3434

35-
.. topic:: References
36-
37-
- C. Kaynak (1995) Methods of Combining Multiple Classifiers and Their
38-
Applications to Handwritten Digit Recognition, MSc Thesis, Institute of
39-
Graduate Studies in Science and Engineering, Bogazici University.
40-
- E. Alpaydin, C. Kaynak (1998) Cascading Classifiers, Kybernetika.
41-
- Ken Tang and Ponnuthurai N. Suganthan and Xi Yao and A. Kai Qin.
42-
Linear dimensionalityreduction using relevance weighted LDA. School of
43-
Electrical and Electronic Engineering Nanyang Technological University.
44-
2005.
45-
- Claudio Gentile. A New Approximate Maximal Margin Classification
46-
Algorithm. NIPS. 2000.
35+
|details-start|
36+
**References**
37+
|details-split|
38+
39+
- C. Kaynak (1995) Methods of Combining Multiple Classifiers and Their
40+
Applications to Handwritten Digit Recognition, MSc Thesis, Institute of
41+
Graduate Studies in Science and Engineering, Bogazici University.
42+
- E. Alpaydin, C. Kaynak (1998) Cascading Classifiers, Kybernetika.
43+
- Ken Tang and Ponnuthurai N. Suganthan and Xi Yao and A. Kai Qin.
44+
Linear dimensionalityreduction using relevance weighted LDA. School of
45+
Electrical and Electronic Engineering Nanyang Technological University.
46+
2005.
47+
- Claudio Gentile. A New Approximate Maximal Margin Classification
48+
Algorithm. NIPS. 2000.
49+
50+
|details-end|

sklearn/datasets/descr/iris.rst

Lines changed: 20 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -45,19 +45,23 @@ data set contains 3 classes of 50 instances each, where each class refers to a
4545
type of iris plant. One class is linearly separable from the other 2; the
4646
latter are NOT linearly separable from each other.
4747

48-
.. topic:: References
49-
50-
- Fisher, R.A. "The use of multiple measurements in taxonomic problems"
51-
Annual Eugenics, 7, Part II, 179-188 (1936); also in "Contributions to
52-
Mathematical Statistics" (John Wiley, NY, 1950).
53-
- Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene Analysis.
54-
(Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.
55-
- Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System
56-
Structure and Classification Rule for Recognition in Partially Exposed
57-
Environments". IEEE Transactions on Pattern Analysis and Machine
58-
Intelligence, Vol. PAMI-2, No. 1, 67-71.
59-
- Gates, G.W. (1972) "The Reduced Nearest Neighbor Rule". IEEE Transactions
60-
on Information Theory, May 1972, 431-433.
61-
- See also: 1988 MLC Proceedings, 54-64. Cheeseman et al"s AUTOCLASS II
62-
conceptual clustering system finds 3 classes in the data.
63-
- Many, many more ...
48+
|details-start|
49+
**References**
50+
|details-split|
51+
52+
- Fisher, R.A. "The use of multiple measurements in taxonomic problems"
53+
Annual Eugenics, 7, Part II, 179-188 (1936); also in "Contributions to
54+
Mathematical Statistics" (John Wiley, NY, 1950).
55+
- Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene Analysis.
56+
(Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.
57+
- Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System
58+
Structure and Classification Rule for Recognition in Partially Exposed
59+
Environments". IEEE Transactions on Pattern Analysis and Machine
60+
Intelligence, Vol. PAMI-2, No. 1, 67-71.
61+
- Gates, G.W. (1972) "The Reduced Nearest Neighbor Rule". IEEE Transactions
62+
on Information Theory, May 1972, 431-433.
63+
- See also: 1988 MLC Proceedings, 54-64. Cheeseman et al"s AUTOCLASS II
64+
conceptual clustering system finds 3 classes in the data.
65+
- Many, many more ...
66+
67+
|details-end|

sklearn/datasets/descr/linnerud.rst

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,11 @@ twenty middle-aged men in a fitness club:
1818
- *exercise* - CSV containing 20 observations on 3 exercise variables:
1919
Chins, Situps and Jumps.
2020

21-
.. topic:: References
21+
|details-start|
22+
**References**
23+
|details-split|
2224

23-
* Tenenhaus, M. (1998). La regression PLS: theorie et pratique. Paris:
24-
Editions Technic.
25+
* Tenenhaus, M. (1998). La regression PLS: theorie et pratique. Paris:
26+
Editions Technic.
27+
28+
|details-end|

sklearn/datasets/descr/wine_data.rst

Lines changed: 23 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -74,22 +74,26 @@ Lichman, M. (2013). UCI Machine Learning Repository
7474
[https://archive.ics.uci.edu/ml]. Irvine, CA: University of California,
7575
School of Information and Computer Science.
7676

77-
.. topic:: References
78-
79-
(1) S. Aeberhard, D. Coomans and O. de Vel,
80-
Comparison of Classifiers in High Dimensional Settings,
81-
Tech. Rep. no. 92-02, (1992), Dept. of Computer Science and Dept. of
82-
Mathematics and Statistics, James Cook University of North Queensland.
83-
(Also submitted to Technometrics).
84-
85-
The data was used with many others for comparing various
86-
classifiers. The classes are separable, though only RDA
87-
has achieved 100% correct classification.
88-
(RDA : 100%, QDA 99.4%, LDA 98.9%, 1NN 96.1% (z-transformed data))
89-
(All results using the leave-one-out technique)
90-
91-
(2) S. Aeberhard, D. Coomans and O. de Vel,
92-
"THE CLASSIFICATION PERFORMANCE OF RDA"
93-
Tech. Rep. no. 92-01, (1992), Dept. of Computer Science and Dept. of
94-
Mathematics and Statistics, James Cook University of North Queensland.
95-
(Also submitted to Journal of Chemometrics).
77+
|details-start|
78+
**References**
79+
|details-split|
80+
81+
(1) S. Aeberhard, D. Coomans and O. de Vel,
82+
Comparison of Classifiers in High Dimensional Settings,
83+
Tech. Rep. no. 92-02, (1992), Dept. of Computer Science and Dept. of
84+
Mathematics and Statistics, James Cook University of North Queensland.
85+
(Also submitted to Technometrics).
86+
87+
The data was used with many others for comparing various
88+
classifiers. The classes are separable, though only RDA
89+
has achieved 100% correct classification.
90+
(RDA : 100%, QDA 99.4%, LDA 98.9%, 1NN 96.1% (z-transformed data))
91+
(All results using the leave-one-out technique)
92+
93+
(2) S. Aeberhard, D. Coomans and O. de Vel,
94+
"THE CLASSIFICATION PERFORMANCE OF RDA"
95+
Tech. Rep. no. 92-01, (1992), Dept. of Computer Science and Dept. of
96+
Mathematics and Statistics, James Cook University of North Queensland.
97+
(Also submitted to Journal of Chemometrics).
98+
99+
|details-end|

0 commit comments

Comments
 (0)