AutoDask

AutoDask is currently under active development, so some modules may not work or may contain errors.

Python 3.9+ · License: MIT

📖 Overview

AutoDask is a tiny AutoML library built on top of Dask for distributed computing, leveraging the Bee Colony Optimization (BCO) algorithm for hyperparameter tuning. It provides an easy-to-use interface for automated machine learning tasks while efficiently using computational resources.

⚡ Quickstart

Installation (not yet available on PyPI)

pip install autodask

Basic Usage

from autodask.main import AutoDask

# Create an AutoML instance for a classification task
adsk = AutoDask(task='classification')

# Fit on the training data, then predict on held-out data
adsk.fit(X_train, y_train)
predictions = adsk.predict(X_test)
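
For a fully self-contained run, here is a minimal sketch that generates a toy dataset with scikit-learn (an assumption for illustration; scikit-learn is not stated as an AutoDask requirement) and feeds it through the same calls:

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from autodask.main import AutoDask

# Synthetic binary classification data, for demonstration only
X, y = make_classification(n_samples=500, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

adsk = AutoDask(task='classification')
adsk.fit(X_train, y_train)
predictions = adsk.predict(X_test)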

🛠️ Features

  • Automated Machine Learning: Handles preprocessing, feature engineering (WIP), model selection, and hyperparameter tuning
  • Distributed Computing: Leverages Dask for parallel processing and efficient resource utilization (see the sketch after this list)
  • Multiple ML Tasks: Supports classification and regression
  • Efficient Optimization: Uses Bee Colony Optimization for hyperparameter tuning
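
Because AutoDask is built on Dask, it can run against an existing Dask cluster. The snippet below is only a sketch: the dask.distributed.Client call is standard Dask, but the assumption that AutoDask automatically picks up the active client, as most Dask-based libraries do, is not a documented AutoDask API.

from dask.distributed import Client
from autodask.main import AutoDask

# Start a local Dask cluster (Client with no scheduler address
# spins up workers on the local machine)
client = Client(n_workers=4, threads_per_worker=2)

# Assumption: AutoDask discovers and uses the active Dask client
adsk = AutoDask(task='classification', n_jobs=-1)
adsk.fit(X_train, y_train)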

🧩 Advanced Usage

Custom Configuration

from autodask.main import AutoDask

# Configure with custom parameters
adsk = AutoDask(
    task='regression',
    n_jobs=-1,                # Use all available cores
    time_limit=3600,          # Time limit in seconds (1 hour)
    with_tuning=True,         # Enable Bee Colony Optimization of hyperparameters
    optimization_rounds=3,    # Number of tuning rounds
    max_ensemble_models=5,    # Cap on models in the final ensemble
    models=['lgbm', 'xgboost', 'catboost'],  # Restrict the model search space
)

# Train with advanced options
adsk.fit(X_train, y_train)

🐝 Bee Colony Optimization

AutoDask implements a simplified version of the Bee Colony Optimization (BCO) algorithm, a nature-inspired metaheuristic based on the foraging behavior of honey bees:

  • Employed Bees: Explore the solution space by testing different model configurations
  • Onlooker Bees: Focus on the most promising configurations based on performance metrics

This approach navigates the vast space of model configurations with fewer evaluations than traditional grid search or random search.
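
To make the idea concrete, here is a minimal, illustrative sketch of the employed/onlooker loop for tuning a single hyperparameter. Everything in it (the objective function, the bounds, the update rule) is hypothetical and is not AutoDask's internal implementation:

import random

def objective(lr):
    # Hypothetical score to maximize, e.g. cross-validated accuracy
    return -(lr - 0.1) ** 2

def simplified_bco(n_employed=3, n_onlooker=3, rounds=10,
                   bounds=(0.001, 1.0), exploration_rate=0.3):
    lo, hi = bounds
    # Employed bees: start from random points in the search space
    sources = [random.uniform(lo, hi) for _ in range(n_employed)]
    for _ in range(rounds):
        scores = [objective(s) for s in sources]
        best = sources[scores.index(max(scores))]
        # Onlooker bees: concentrate on the most promising source
        for _ in range(n_onlooker):
            if random.random() < exploration_rate:
                candidate = random.uniform(lo, hi)  # explore
            else:
                # exploit: perturb around the current best, clipped to bounds
                candidate = min(hi, max(lo, best + random.gauss(0, 0.05)))
            if objective(candidate) > min(scores):
                # Replace the worst source with the better candidate
                sources[scores.index(min(scores))] = candidate
                scores = [objective(s) for s in sources]
    return max(sources, key=objective)

best_lr = simplified_bco()

In this sketch, a higher exploration_rate pushes the search toward random restarts, while a lower one concentrates evaluations around the current best configuration.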

Parameters

from autodask.main import AutoDask

adsk = AutoDask(
    task='classification',
    bco_params={
        'employed_bees': 3,       # Number of employed bees
        'onlooker_bees': 3,       # Number of onlooker bees
        'exploration_rate': 0.3,  # Balance between exploration and exploitation
    }
)

Saving and Loading Models

# Save the trained model
adsk.save('my_autodask_model.pkl')

# Load the model later
from autodask.main import load_model
loaded_model = load_model('my_autodask_model.pkl')
predictions = loaded_model.predict(new_data)

Performance Comparison

Coming soon...

📧 Contact

For questions and support, please open an issue on GitHub.