You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<p>If you use TPOT in a scientific publication, please consider citing at least one of the following papers:</p>
129
129
<p>Trang T. Le, Weixuan Fu and Jason H. Moore (2020). <ahref="https://academic.oup.com/bioinformatics/article/36/1/250/5511404">Scaling tree-based automated machine learning to biomedical big data with a feature set selector</a>. <em>Bioinformatics</em>.36(1): 250-256.</p>
<p>Randal S. Olson, Ryan J. Urbanowicz, Peter C. Andrews, Nicole A. Lavender, La Creis Kidd, and Jason H. Moore (2016). <ahref="http://link.springer.com/chapter/10.1007/978-3-319-31204-0_9">Automating biomedical data science through tree-based pipeline optimization</a>. <em>Applications of Evolutionary Computation</em>, pages 123-137.</p>
<p>Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science</p>
160
158
<p>Randal S. Olson, Nathan Bartley, Ryan J. Urbanowicz, and Jason H. Moore (2016). <ahref="http://dl.acm.org/citation.cfm?id=2908918">Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science</a>. <em>Proceedings of GECCO 2016</em>, pages 485-492.</p>
- For details on how the <code>fit()</code>, <code>score()</code> and <code>export()</code> methods work, refer to the <ahref="/using/">usage documentation</a>.
201
215
- Upon re-running the experiments, your resulting pipelines <em>may</em> differ (to some extent) from the ones demonstrated here.</p>
<p>Below is a minimal working example with the optical recognition of handwritten digits dataset, which is an <em>image classification problem</em>.</p>
<p>Running this code should discover a pipeline (exported as <code>tpot_boston_pipeline.py</code>) that achieves at least 10 mean squared error (MSE) on the test set:</p>
309
-
<pre><codeclass="Python">import numpy as np
318
+
<pre><codeclass="language-Python">import numpy as np
310
319
import pandas as pd
311
320
from sklearn.ensemble import ExtraTreesRegressor
312
321
from sklearn.model_selection import train_test_split
<p>To see the TPOT applied the Titanic Kaggle dataset, see the Jupyter notebook <ahref="https://github.com/EpistasisLab/tpot/blob/master/tutorials/Titanic_Kaggle.ipynb">here</a>. This example shows how to take a messy dataset and preprocess it such that it can be used in scikit-learn and TPOT.</p>
337
345
<h2id="portuguese-bank-marketing">Portuguese Bank Marketing</h2>
<p>The corresponding Jupyter notebook, containing the associated data preprocessing and analysis, can be found <ahref="https://github.com/EpistasisLab/tpot/blob/master/tutorials/MAGIC%20Gamma%20Telescope/MAGIC%20Gamma%20Telescope.ipynb">here</a>.</p>
341
349
<h2id="neural-network-classifier-using-tpot-nn">Neural network classifier using TPOT-NN</h2>
342
350
<p>By loading the <ahref="https://github.com/EpistasisLab/tpot/blob/master/tpot/config/classifier_nn.py">TPOT-NN configuration dictionary</a>, PyTorch estimators will be included for classification. Users can also create their own NN configuration dictionary that includes <code>tpot.builtins.PytorchLRClassifier</code> and/or <code>tpot.builtins.PytorchMLPClassifier</code>, or they can specify them using a template string, as shown in the following example:</p>
0 commit comments