Replies: 3 comments 9 replies
-
Dear Anurag, thank you very much for your interest in the DeePMD-kit project. We are currently working on integrating the attention structure, which greatly improves the accuracy and data efficiency of the DP model. We find that the attention is very expensive and has become the hot spot of the code. You are welcome to benchmark it and let us know how to improve the efficiency of the model, especially on GPUs. The attention code has not yet been PRed, but you may check it via the link: Best regards,
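Since the comment above invites benchmarking, here is a minimal sketch (not the DeePMD-kit attention code) of how one might time a generic scaled-dot-product attention block over per-atom neighbor lists on a GPU. The shapes (`nframes`, `natoms`, `nnei`, `d`) and the use of PyTorch are assumptions for illustration only.

```python
# Hypothetical benchmark sketch; sizes and framework are assumptions,
# not the actual DeePMD-kit implementation.
import time
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
nframes, natoms, nnei, d = 4, 192, 128, 64          # assumed sizes
x = torch.randn(nframes * natoms, nnei, d, device=device)

wq = torch.randn(d, d, device=device)
wk = torch.randn(d, d, device=device)
wv = torch.randn(d, d, device=device)

def attention(x):
    q, k, v = x @ wq, x @ wk, x @ wv                 # per-neighbor projections
    scores = (q @ k.transpose(-1, -2)) / d ** 0.5    # (batch, nnei, nnei)
    return torch.softmax(scores, dim=-1) @ v

# Warm up, then time a few iterations.
for _ in range(3):
    attention(x)
if device == "cuda":
    torch.cuda.synchronize()
t0 = time.perf_counter()
for _ in range(20):
    attention(x)
if device == "cuda":
    torch.cuda.synchronize()
print(f"mean attention time: {(time.perf_counter() - t0) / 20 * 1e3:.2f} ms")
```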
-
Hello @wanghan-iapcm,
-
Hello @AnuragKr, the GEMM is the bottleneck of the standard (not compressed) DP model, but we find that its performance is memory bound, not compute bound, at least on the V100 GPU. This means that a faster GEMM (in the sense of floating-point throughput) would not improve the overall performance. For example, using the tensor cores of the V100 does not significantly improve the speed of DP (unless far-larger-than-necessary embedding nets are used). The model compression technique removes the GEMM from the embedding net, so the hot spots become the tabulated energy and force evaluation and the GEMMs in the fitting net. New ideas for improving the performance of these parts are definitely welcome.
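A back-of-the-envelope roofline check can make the "memory bound, not compute bound" point concrete. The layer sizes and the V100 figures (~7 TFLOPS FP64 peak, ~900 GB/s HBM2 bandwidth) below are assumptions for illustration, not measurements of DeePMD-kit itself.

```python
# Sketch: arithmetic intensity of a small embedding-net-style GEMM vs. the
# V100 ridge point. All numbers are assumed for illustration.
def arithmetic_intensity(m, k, n, bytes_per_elem=8):
    flops = 2 * m * k * n                        # multiply-adds in C = A @ B
    bytes_moved = (m * k + k * n + m * n) * bytes_per_elem
    return flops / bytes_moved                   # FLOP per byte

# Typical embedding-net layer: many rows (atom pairs), tiny k and n.
m, k, n = 200_000, 32, 64
ai = arithmetic_intensity(m, k, n)

peak_flops = 7.0e12        # assumed V100 FP64 peak
peak_bw = 900e9            # assumed V100 memory bandwidth
ridge = peak_flops / peak_bw

print(f"arithmetic intensity: {ai:.1f} FLOP/byte, ridge point: {ridge:.1f}")
# If ai < ridge, the GEMM is limited by memory bandwidth, so faster math
# units (e.g. tensor cores) alone do not help much.
```

With these assumed sizes the intensity comes out around 5 FLOP/byte, below the ridge point of roughly 8 FLOP/byte, which is consistent with the observation that faster arithmetic alone does not speed up the embedding-net GEMMs.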
-
I come from a computer science and data science background and would like to contribute to this project. I need ideas on what new feature or enhancement I could work on regarding model training or GPU performance.