iS-Pareto

Ab-initio based optimization towards the Pareto front of multiple reaction conditions

Version 0.1.0

Features

Multi-Objective Optimization of multiple reaction conditions
Multi-Objective Optimization with several Optimizers
Automatic Reaction Rate Calculation
Automatic Gibbs Free Energy Interpolation
Visualization of Pareto Front

About

The pharmaceutical industry is one of the fastest-growing industries and is essential for the lives of billions of people.

However:

The cost of producing a new drug is estimated to exceed $2.5 billion
The pharmaceutical industry generates 25-100 kg of hazardous waste per kg of product
The pharmaceutical industry produces 250-1000 times more waste than the oil refining industry

Optimizing reaction conditions in pharmaceutical reactors is essential to boost API yield and selectivity while cutting costs and waste for a greener, more efficient pharmaceutical industry.

By leveraging in silico optimization, iS-Pareto can significantly reduce experimental workload, time and costs. However, further validation is needed to confirm its broader applicability.

Example $S_N Ar$ Reaction System

iS-Pareto tries to reproduce the study on the $S_NAr$ system by Schweidtmann et al. [1]. The workflow aims to reproduce $STY$ and $E$-factor values.
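
For orientation, the two objectives can be written down with their common textbook definitions. The sketch below assumes that $STY$ is the product mass per reactor volume and residence time and that the $E$-factor is the waste mass per product mass. The function names and all numbers are illustrative and are not part of iS-Pareto.

```python
def space_time_yield(product_mass_kg: float, reactor_volume_m3: float,
                     residence_time_h: float) -> float:
    """STY in kg m^-3 h^-1: product mass per reactor volume and time."""
    return product_mass_kg / (reactor_volume_m3 * residence_time_h)


def e_factor(waste_mass_kg: float, product_mass_kg: float) -> float:
    """E-factor (dimensionless): mass of waste per mass of product."""
    return waste_mass_kg / product_mass_kg


# Illustrative numbers only, not taken from the reference study
sty = space_time_yield(product_mass_kg=0.5, reactor_volume_m3=0.005,
                       residence_time_h=0.025)
e = e_factor(waste_mass_kg=2.0, product_mass_kg=0.5)
print(f"STY = {sty:.0f} kg m^-3 h^-1, E = {e:.1f}")
```

A high $STY$ and a low $E$-factor usually pull in different directions, which is why the result is a Pareto front rather than a single optimum.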

Installation

This package is only compatible with Python 3.10.

First install the ipopt solver and the tamkin [2] package from conda-forge:

conda install -c conda-forge ipopt tamkin

Afterwards the rest of the dependencies can be installed via pip:

pip install git+https://github.com/bg196678/IS-Pareto.git

Workflow

CREST - Conformer Search
Gaussian - $\Delta G_{Thermal}$ -> .fchk files
CosmoTherm - $\Delta E$ -> .tab files

Usage

A complete usage example can be found in examples/system_1/tsemo/system_1.py.

Species and Transition States

Species are divided into general species and transition states. Species can be defined as follows:

from pathlib import Path
from ispareto.species import Species


species_A = Species(
    name="Species A",
    mass=0.00,  # in kg/mol
    fchk_file_path=Path("path/to/gaussian/file.fchk"),
    tab_file_path=Path("path/to/cosmotherm/file.tab"),
    energy=0.00  # optional, in Hartree
)

The energy for a species is optional. When it is not provided, it is taken from the Gaussian .fchk file. The Species class can be used to define reactants and products.

In addition there is a separate class for defining transition states:

from pathlib import Path
from ispareto.species import TransitionState


transition_state_A = TransitionState(
    name="Transition State A",
    fchk_file_path=Path("path/to/gaussian/file.fchk"),
    tab_file_path=Path("path/to/cosmotherm/file.tab"),
    energy=0.00  # optional, in Hartree
)

Transition states do not need a mass. Apart from that, the definition is the same as for the standard Species class.

Reactions

Reactions are defined via simple addition and subtraction of species:

# C -> A + B
reaction = species_A + species_B - species_C
reaction.name = "A_plus_B_to_C"
reaction.transition_state = transition_state_ABC

Every reaction needs a transition state, which can be set via the Reaction.transition_state property.
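
To illustrate how such an arithmetic interface can work, here is a minimal sketch based on operator overloading. ToySpecies and ToyReaction are hypothetical stand-ins written for this example; they are not the ispareto classes and make no claim about how ispareto stores reactions internally.

```python
from dataclasses import dataclass, field


@dataclass
class ToyReaction:
    """Signed stoichiometric coefficients keyed by species name."""
    coefficients: dict = field(default_factory=dict)

    def _combined(self, other, sign):
        # Accept either a single species or another partial reaction
        if isinstance(other, ToyReaction):
            other_coefficients = other.coefficients
        else:
            other_coefficients = {other.name: 1}
        merged = dict(self.coefficients)
        for name, value in other_coefficients.items():
            merged[name] = merged.get(name, 0) + sign * value
        return ToyReaction(merged)

    def __add__(self, other):
        return self._combined(other, +1)

    def __sub__(self, other):
        return self._combined(other, -1)


@dataclass(frozen=True)
class ToySpecies:
    """Hypothetical species that only carries a name."""
    name: str

    def __add__(self, other):
        return ToyReaction({self.name: 1}) + other

    def __sub__(self, other):
        return ToyReaction({self.name: 1}) - other


a, b, c = ToySpecies("A"), ToySpecies("B"), ToySpecies("C")
reaction = a + b - c
print(reaction.coefficients)
```

Added species end up with a positive coefficient and subtracted species with a negative one, so a single expression encodes both sides of the reaction.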

Kinetics and Solvation

The Kinetics class is used to automatically calculate the needed reaction rate constants on the fly. It takes in a list of reactions:

from ispareto.kinetics import Kinetics

reactions = [
    reaction_1,
    reaction_2,
]
kinetics = Kinetics(
    reactions=reactions,
    tunneling_correction=None,
    gradient_threshold=1e-4,
)

A tunneling correction (wigner, eckart or miller) can also be specified here, in addition to the gradient_threshold. The gradient threshold is passed to the tamkin.ConstrainExt treatment.
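
As background on what a tunneling-corrected rate constant looks like, the sketch below combines the textbook Wigner factor with the Eyring equation for a unimolecular step. It is a standalone illustration and not the code path used by ispareto or TAMkin; the function names, the imaginary wavenumber and the barrier height are assumptions chosen for the example.

```python
import math

PLANCK = 6.62607015e-34            # J s
BOLTZMANN = 1.380649e-23           # J/K
GAS_CONSTANT = 8.314462618         # J/(mol K)
SPEED_OF_LIGHT_CM = 2.99792458e10  # cm/s


def wigner_correction(imaginary_wavenumber_cm: float, temperature: float) -> float:
    """Wigner factor: kappa = 1 + (1/24) * (h * nu / (k_B * T))**2."""
    frequency = abs(imaginary_wavenumber_cm) * SPEED_OF_LIGHT_CM  # in Hz
    u = PLANCK * frequency / (BOLTZMANN * temperature)
    return 1.0 + u**2 / 24.0


def eyring_rate_constant(delta_g_activation: float, temperature: float,
                         kappa: float = 1.0) -> float:
    """k = kappa * (k_B * T / h) * exp(-dG / (R * T)), with dG in J/mol."""
    prefactor = BOLTZMANN * temperature / PLANCK  # in 1/s
    return kappa * prefactor * math.exp(
        -delta_g_activation / (GAS_CONSTANT * temperature)
    )


# Hypothetical transition state: 500i cm^-1 mode, 80 kJ/mol barrier, 100 °C
temperature = 373.15
kappa = wigner_correction(500.0, temperature)
k_plain = eyring_rate_constant(80e3, temperature)
k_corrected = eyring_rate_constant(80e3, temperature, kappa)
print(f"kappa = {kappa:.3f}, k = {k_corrected:.3e} 1/s")
```

The Wigner factor is always at least 1 and grows as the temperature drops, so the correction matters most at the cold end of the optimization range.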

The $\Delta G_{solv}$ values are calculated automatically from the .tab files of every species. The Solvation class is constructed with the same list of reactions:

from ispareto.solvation import Solvation

reactions = [
    reaction_1,
    reaction_2,
]
solvation = Solvation(
    reactions=reactions,
)

The values tabulated on the temperature grid are interpolated to match the varying temperatures in the reactor.
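
The idea of evaluating a tabulated quantity between grid temperatures can be sketched as follows. The example assumes simple linear interpolation, which may differ from the scheme ispareto actually uses; the function name and the grid values are made up for illustration.

```python
from bisect import bisect_right


def interpolate_linear(grid_temperatures, grid_values, temperature):
    """Linearly interpolate a tabulated quantity at a temperature inside the grid."""
    if not grid_temperatures[0] <= temperature <= grid_temperatures[-1]:
        raise ValueError("temperature lies outside the tabulated grid")
    # Index of the right-hand neighbour on the sorted grid, clamped to the last point
    upper = min(bisect_right(grid_temperatures, temperature),
                len(grid_temperatures) - 1)
    lower = upper - 1
    t0, t1 = grid_temperatures[lower], grid_temperatures[upper]
    v0, v1 = grid_values[lower], grid_values[upper]
    return v0 + (v1 - v0) * (temperature - t0) / (t1 - t0)


# Hypothetical solvation free energies (kJ/mol) on a coarse grid (K)
grid_t = [320.0, 360.0, 400.0]
grid_g = [-20.0, -19.0, -18.5]
g_solv = interpolate_linear(grid_t, grid_g, 340.0)
print(g_solv)
```

Raising an error outside the grid avoids silent extrapolation, which is a design choice of this sketch only.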

Reactor

The reactor is based on the reactions, the corresponding kinetics and the $\Delta G_{solv}$ values:

from ispareto.reactor import Reactor

reactor = Reactor(
    reactions=reactions,
    kinetics=kinetics,
    solvation=solvation,
)

Optimization

For the multi-objective optimization, two different types of data have to be defined. First, we define the species (reactants and products) to be optimized:

from ispareto.optimizer import OptimizationSpecies

optimization_species = OptimizationSpecies(
    reactant_1=species_A,
    reactant_2=species_B,
    products=[species_C],
)

Here we start the reactor with species_A and species_B and would like to maximize the production of species_C.

Second, we define the boundaries of the optimization:

from ispareto.optimizer import OptimizationBoundaries

optimization_boundaries = OptimizationBoundaries(
    temperature=(60, 140),  # degrees Celsius
    concentration_reactant_1=(100, 500),  # mol/L
    concentration_ratio=(1.0, 5.0),  # dimensionless
    time=(0.5, 2.0),  # minutes
)

Here the concentration range of reactant 1 (species_A) is provided. The concentration of reactant 2 (species_B) is set relative to it through the concentration_ratio range. The temperature and time ranges are defined as well.

After defining Species, Reactions, Kinetics, Solvation and Optimization Boundaries the optimization can begin:

from pathlib import Path
from ispareto.optimizer import TSEmoOptimizer

optimizer = TSEmoOptimizer(
    species=optimization_species,
    boundaries=optimization_boundaries,
    reactor=reactor,
    output_directory=Path("/path/to/an/output/directory"),
    num_initial_points=4,
)
optimizer.run(num_iteration=100)

The optimizer takes the previously defined elements. In addition, the number of random Latin hypercube sampling (LHS) points (num_initial_points=4) is defined to give the TSEMO optimization a head start. Afterwards the optimization is run for num_iteration=100 iterations.
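
For readers unfamiliar with LHS, the following standalone sketch shows how four initial points could be drawn inside the boundaries used above. It is a plain-Python illustration of the sampling idea, not the sampler used by ispareto; the function name and the seed are assumptions.

```python
import random


def latin_hypercube(bounds: dict, num_points: int, seed: int = 0) -> list:
    """Draw samples so that every variable visits each stratum exactly once."""
    rng = random.Random(seed)
    columns = {}
    for name, (low, high) in bounds.items():
        # One random position inside each of the num_points equal-width strata
        unit = [(i + rng.random()) / num_points for i in range(num_points)]
        rng.shuffle(unit)  # decouple the stratum order between variables
        columns[name] = [low + u * (high - low) for u in unit]
    return [{name: columns[name][i] for name in bounds} for i in range(num_points)]


bounds = {
    "temperature": (60, 140),
    "concentration_reactant_1": (100, 500),
    "concentration_ratio": (1.0, 5.0),
    "time": (0.5, 2.0),
}
initial_points = latin_hypercube(bounds, num_points=4)
for point in initial_points:
    print(point)
```

Compared with purely random sampling, this spreads even a handful of points over the full range of every variable.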

Output

The output directory will contain a file called ispareto.log, which gives an overview of the input and logs every optimization iteration.

Two folders will be created, plots and data. The plots folder contains, for every iteration, a scatter plot of the data with the current Pareto front. At the end an animation is generated. An example can be seen above. The data folder contains four .csv files:

corrections.csv: thermal corrections evaluated on a fine grid
gsolv.csv: $\Delta G_{solv}$ values
kinetics.csv: reaction rate constants evaluated on a fine grid
ispareto.csv: evaluations of the reactor model with the reactor starting conditions, the $E$-factor and the $STY$
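
The Pareto front drawn in the plots can be recomputed from a list of evaluations with a simple non-dominated filter. The sketch below assumes that $STY$ is maximized and the $E$-factor is minimized. The function name and the sample values are illustrative, and the column layout of ispareto.csv is deliberately not assumed.

```python
def pareto_front(points):
    """Return the non-dominated (sty, e_factor) pairs: high STY, low E-factor."""
    def dominates(p, q):
        # p is at least as good as q in both objectives and strictly better in one
        return p[0] >= q[0] and p[1] <= q[1] and (p[0] > q[0] or p[1] < q[1])

    return [q for q in points if not any(dominates(p, q) for p in points)]


# Hypothetical (STY, E-factor) evaluations
evaluations = [(100.0, 5.0), (200.0, 8.0), (150.0, 4.0), (120.0, 9.0), (200.0, 6.0)]
front = pareto_front(evaluations)
print(front)
```

The pairwise comparison scales quadratically with the number of points, which is unproblematic for a few hundred optimizer iterations.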

Running Tests

Tests can be run via pytest:

pytest -v

Some tests require a large amount of time. If you want to skip these tests, you can use the -m flag:

pytest -v -m "not slow"

License

References

[1] Schweidtmann, Artur M., Adam D. Clayton, Nicholas Holmes, Eric Bradford, Richard A. Bourne, and Alexei A. Lapkin. “Machine Learning Meets Continuous Flow Chemistry: Automated Optimization towards the Pareto Front of Multiple Objectives.” Chemical Engineering Journal 352 (November 15, 2018): 277–82.

[2] Ghysels et al. “TAMkin: A Versatile Package for Vibrational Analysis and Chemical Kinetics.” Journal of Chemical Information and Modeling 50, no. 9 (September 27, 2010): 1736–50.
