[GSoC 2025 project 5] discussion about HBond interactions from implicit hydrogens in ProLIF #4962
Replies: 6 comments 17 replies
-
Hi Stuart and thank you for considering participating in a GSoC project with us! Your understanding of why we don't want to use protonation tool is correct, the goal of this project is to provide a faster (but possibly less accurate) way of estimating interactions from PDB files that don't contain explicit hydrogen atoms. The exact implementation is going to be left up to the candidate to decide, but I'd like to emphasise that because we aren't relying on a protonation tool, it might be best to assume that there will be no hydrogens to relax explicitly and that we won't use any local environment effect to infer the protonation state. Essentially we want to only use the heavy atoms positions to infer their ability to participate in hydrogen-bonding, i.e. if after using this implicit-hydrogen method two residues are predicted to interact with a hydrogen bond, then by adding hydrogens with a protonation tool and optimising the HBond network we should see that interaction between both residues. As an example, if you take a look at the 3D viewer implemented in the PDB (molstar), it assumes all nitrogen atoms in a histidine can take part as a donor. The goal of this project would be something similar. We can also respect different residue naming convention, this is where the "helper" function plays a role, e.g. making sure Hopefully this answers some of your questions! |
Beta Was this translation helpful? Give feedback.
-
Hi @cbouysset @talagayev , I'm Hocine, and I’d love to contribute to Project 5. I’ve put together a pre-proposal where I address histidine protonation, but I didn’t go into detail on nomenclature differences in CHARMM/AMBER since I wasn’t sure how relevant that was or how long the pre-proposal should be. I also tried to emphasize avoiding explicit protonation tools, but I’m not sure if it’s clear enough. If we’ve already submitted the pre-proposal, can I still modify it, or would you not recommend doing so? Would love to hear your thoughts! |
Beta Was this translation helpful? Give feedback.
-
A trivial thing for @cbouy and @talagayev: The size of this project is between medium and large. But, the application system of GSoC only has three options: small, medium, large. So, the question is: which one would you suggest selecting when filling in the application forms? |
Beta Was this translation helpful? Give feedback.
-
Hi @cbouy and @talagayev, |
Beta Was this translation helpful? Give feedback.
-
Hi @cbouy @talagayev , Based on my difficulties with the PR, I have changed my timeline from medium to large project. Would it be that okay? |
Beta Was this translation helpful? Give feedback.
-
Congrats @yuyuan871111 ! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @cbouy @talagayev,
I am Stuart, interested in contributing to the project 5 (calculating HBond interactions from implicit hydrogens in ProLIF). I recently considered a solution, but I found several questions, and I would like to ask mentors for clarification.
In the project details, you mentioned that the expected outcomes are to compare the complexes without using other protonation tools. However, positioning hydrogens without relaxation or optimization can result in inaccurate predictions of hydrogen bonds, particularly for residues with rotatable side chains and hydrogen donors (e.g., CYS, LYS, SER, THR, and TYR) even if the heavy atoms are given. For those non-deterministic implicit hydrogens, it will be difficult to calculate the hydrogen interactions based on angles and distances. (In contrast, implicit hydrogens for some residues are deterministic due to their local resonance effect, like ARG, ASN, GLN, TRP. For these residues, adding implicit hydrogens should be fine.)
I think the possible reason for not using the protonation tool is the computation costs. But, achieving accurate predictions by relaxing implicit hydrogens comes at the cost of computational efficiency, potentially resulting in the same computational expense as other protonation tools.
Also, the protonation tools usually consider the environment of the protein, such as global pH and local residue/ligand-induced effect (like PROPKA does). These factors might also affect the calculation of the hydrogen interactions. For example, the pH will affect the protonation states of several residues, including ARG, ASP, GLU, HIS, TYR and LYS, and thus lead to different hydrogen bond interactions.
I would like to ask if there is any further reason why we want to position hydrogens ideally without using protonation tools (except for the computational costs)? And do we want to add the hydrogen relaxation to the codes (if not using other protonation tool)?
What would be the acceptable assumption for the hydrogen bonds calculations with implicit hydrogens (e.g., pH 7 environment and not considering local residue/ligand-induced environment)?
Beta Was this translation helpful? Give feedback.
All reactions