Releases · KernelTuner/kernel_tuner

16 Nov 16:07

0.2.0

69759ae

Version 0.2.0

Version 0.2.0 adds a large number of search optimization algorithms and basic support for testing and tuning Fortran kernels.

Changed

no longer replacing kernel names with instance strings during tuning
bugfix in tempfile creation that lead to too many open files error

Added

A minimal Fortran example and basic Fortran support
Particle Swarm Optimization strategy, use strategy="pso"
Simulated Annealing strategy, use strategy="simulated_annealing"
Firefly Algorithm strategy, use strategy="firefly_algorithm"
Genetic Algorithm strategy, use strategy="genetic_algorithm"

Assets 2

18 Apr 10:10

benvanwerkhoven

0.1.9

e50ed8d

Version 0.1.9

[0.1.9] - 2018-04-18

Changed

bugfix for C backend for byte array arguments
argument type mismatches throw warning instead of exception

Added

wrapper functionality to wrap C++ functions
citation file and zenodo doi generation for releases

Assets 2

23 Nov 21:01

benvanwerkhoven

0.1.8

31fe668

Version 0.1.8

Version 0.1.8 brings many improvements, mostly focused on user friendliness. The installation process of optional dependencies is simplified as you can now use extras with pip. For example, pip install kernel_tuner[cuda] can be used to install both Kernel Tuner and the optional dependency PyCuda. In addition, Version 0.1.8 introduces many more checks on the user input that you pass to tune_kernel and run_kernel. For example, the kernel source code is parsed to see if the signature matches the argument list. The additional checks on input should make it easier to use and debug programs using Kernel Tuner. For a more detailed overview of the changes, see below:

[0.1.8] - 2017-11-23

Changed

bugfix for when using iterations smaller than 3
the install procedure now uses extras, e.g. [cuda,opencl]
option quiet makes tune_kernel completely quiet
extensive updates to documentation

Added

type checking for kernel arguments and answers lists
checks for reserved keywords in tunable paramters
checks for whether thread block dimensions are specified
printing units for measured time with CUDA and OpenCL
option to print all measured execution times

Assets 2

10 Nov 14:49

benvanwerkhoven

0.1.7

1d8fe7b

Version 0.1.7

[0.1.7] - 2017-10-11

Changed

bugfix install when scipy not present
bugfix for GPU cleanup when using Noodles runner
reworked the way strings are handled internally

Added

option to set compiler name, when using C backend

Assets 2

24 Aug 13:23

benvanwerkhoven

0.1.6

443976a

Version 0.1.6

Version 0.1.6 brings a few bugfixes but mostly extends the existing functionality of the tuner. Three new search strategies have been added and are now ready to use: minimize, basinhopping, and diff_evo. For more info on what these strategies do and what solvers and methods they support please see the documentation pages.

From the CHANGELOG:

[0.1.6] - 2017-08-17

Changed

actively freeing GPU memory after tuning
bugfix for 3D grids when using OpenCL

Added

support for dynamic parallelism when using PyCUDA
option to use differential evolution optimization
global optimization strategies basinhopping, minimize

Assets 2

21 Jul 13:51

benvanwerkhoven

0.1.5

5d94431

Version 0.1.5

Version 0.1.5 brings more flexibility, you can now pass code generating functions, your own functions for verifying kernel output correctness, and use your own names for the thread block dimensions.

Internally, quite a lot has changed in this version. The runners have been separated into strategies and runners. And the way that options are passed around within the Kernel Tuner has changed dramatically.

From the CHANGELOG:

[0.1.5] - 2017-07-21

Changed

option to pass a fraction to the sample runner
fixed a bug in memset for OpenCL backend

Added

parallel tuning on single node using Noodles runner
option to pass new defaults for block dimensions
option to pass a Python function as code generator
option to pass custom function for output verification

Assets 2

14 Jun 09:24

benvanwerkhoven

0.1.4

fe99d38

Version 0.1.4

This release adds that tune_kernel will also return a dictionary containing information about the environment in which the benchmarking of the kernel was performed. This is very useful for understanding how and under what circumstances certain measurement results were obtained.

In addition, there were some very minor changes in the way C functions are compiled and called.

Assets 2

06 Apr 13:59

benvanwerkhoven

0.1.3

0ca3c06

Version 0.1.3

Bugfixes for handling scalar arguments and documentation update.

Assets 2

29 Mar 07:02

benvanwerkhoven

0.1.2

d36feb6

Version 0.1.2

Better defaults for grid divisor lists, full support for 3D grids, and a simpler way to specify the problem size of 1D grids.

[0.1.2] - 2017-03-29

Changed

allow non-tuple problem_size for 1D grids
changed default for grid_div_y from None to block_size_y
converted the tutorial to a Jupyter Notebook
CUDA backend prints device in use, similar to OpenCL backend
migrating from nosetests to pytest
rewrote many of the examples to save results to json files

Added

full support for 3D grids, including option for grid_div_z
separable convolution example

Assets 2

10 Feb 07:49

benvanwerkhoven

0.1.1

5997364

Version 0.1.1

[0.1.1] - 2017-02-10

Changed

changed the output format to list of dictionaries

Added

option to set compiler options

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Version 0.2.0

Changed

Added

Uh oh!

[0.1.9] - 2018-04-18

Changed

Added

Uh oh!

[0.1.8] - 2017-11-23

Changed

Added

Uh oh!

[0.1.7] - 2017-10-11

Changed

Added

Uh oh!

Version 0.1.6

[0.1.6] - 2017-08-17

Changed

Added

Uh oh!

Version 0.1.5

[0.1.5] - 2017-07-21

Changed

Added

Uh oh!

Uh oh!

Uh oh!

[0.1.2] - 2017-03-29

Changed

Added

Uh oh!

[0.1.1] - 2017-02-10

Changed

Added

Uh oh!

Releases: KernelTuner/kernel_tuner

Version 0.2.0

Version 0.2.0

Changed

Added

Uh oh!

Version 0.1.9

[0.1.9] - 2018-04-18

Changed

Added

Uh oh!

Version 0.1.8

[0.1.8] - 2017-11-23

Changed

Added

Uh oh!

Version 0.1.7

[0.1.7] - 2017-10-11

Changed

Added

Uh oh!

Version 0.1.6

Version 0.1.6

[0.1.6] - 2017-08-17

Changed

Added

Uh oh!

Version 0.1.5

Version 0.1.5

[0.1.5] - 2017-07-21

Changed

Added

Uh oh!

Version 0.1.4

Uh oh!

Version 0.1.3

Uh oh!

Version 0.1.2

[0.1.2] - 2017-03-29

Changed

Added

Uh oh!

Version 0.1.1

[0.1.1] - 2017-02-10

Changed

Added

Uh oh!