
Feat: add model format for dpa1 #3211

Closed
iProzd wants to merge 7 commits into deepmodeling:devel from iProzd:rf_dpa1

Conversation

@iProzd
Member

@iProzd iProzd commented Feb 1, 2024

This PR adds a model format for the DPA1 model:

  • Add a reformatted torch implementation of the DPA1 model
  • Add a numpy implementation of the DPA1 model without the attention layer
  • Align the torch and numpy implementations

TODO:

  • Add a numpy implementation of the DPA1 model with the attention layer
  • Align the TF and numpy implementations
  • Align the smoothness implementations
  • Make filter_layers._networks in torch accessible from outside
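The backend-alignment items in the checklist above are typically verified by running both implementations on the same input and comparing elementwise. A minimal sketch, with a hypothetical helper name and trivial stand-in "backends" (not the actual deepmd API):

```python
import numpy as np

# Hypothetical sketch of the "align torch and numpy implementations" step:
# evaluate both backends on the same input and compare elementwise.
def assert_backends_aligned(np_forward, pt_forward, x, rtol=1e-10, atol=1e-10):
    out_ref = np.asarray(np_forward(x))   # numpy reference result
    out_new = np.asarray(pt_forward(x))   # torch result, converted to numpy
    np.testing.assert_allclose(out_ref, out_new, rtol=rtol, atol=atol)

# Trivial stand-in "backends" that agree exactly:
x = np.random.default_rng(0).random((4, 3))
assert_backends_aligned(lambda a: a * 2.0, lambda a: a + a, x)
```

The same pattern extends to the TF/numpy and smoothness alignment items, with the tolerances chosen per precision.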

Comment thread deepmd/pt/model/descriptor/dpa1.py Fixed
atype_embd = atype_embd_ext[:, :nloc, :]
# nf x nloc x nnei x tebd_dim
atype_embd_nnei = np.tile(atype_embd[:, :, np.newaxis, :], (1, 1, nnei, 1))
nlist_mask = nlist != -1

Check notice (Code scanning / CodeQL): Unused local variable. Variable nlist_mask is not used.
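The finding above flags a mask that is computed but never consumed; in code like this, a neighbor mask usually zeroes out the padded (-1) entries of the neighbor list. An illustrative sketch of that pattern with toy shapes, not the PR's actual tensors:

```python
import numpy as np

# nlist pads missing neighbors with -1; the mask marks real neighbors.
nlist = np.array([[0, 2, -1],
                  [1, -1, -1]])               # nloc x nnei, -1 = no neighbor
nlist_mask = nlist != -1                       # True where a real neighbor exists
weights = np.ones(nlist.shape)
weights = np.where(nlist_mask, weights, 0.0)   # padded slots contribute zero
```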
Comment thread deepmd/model_format/dpa1.py Fixed
Comment thread source/tests/pt/test_dpa1.py Fixed
Comment thread source/tests/pt/test_dpa1.py Fixed
):
dtype = PRECISION_DICT[prec]
rtol, atol = get_tols(prec)
err_msg = f"idt={idt} prec={prec}"

Check notice (Code scanning / CodeQL): Unused local variable. Variable err_msg is not used.
dd0.se_atten.mean = torch.tensor(davg, dtype=dtype, device=env.DEVICE)
dd0.se_atten.dstd = torch.tensor(dstd, dtype=dtype, device=env.DEVICE)
# dd1 = DescrptDPA1.deserialize(dd0.serialize())
model = torch.jit.script(dd0)

Check notice (Code scanning / CodeQL): Unused local variable. Variable model is not used.
resnet=False,
precision=precision,
)
self.w = self.w.squeeze(0) # keep the weight shape to be [num_in]

Check warning (Code scanning / CodeQL): Overwriting attribute in super-class or sub-class. Assignment overwrites attribute w, which was previously defined in superclass NativeLayer.
)
self.w = self.w.squeeze(0) # keep the weight shape to be [num_in]
if self.uni_init:
self.w = 1.0

Check warning (Code scanning / CodeQL): Overwriting attribute in super-class or sub-class. Assignment overwrites attribute w, which was previously defined in superclass NativeLayer.
self.w = self.w.squeeze(0) # keep the weight shape to be [num_in]
if self.uni_init:
self.w = 1.0
self.b = 0.0

Check warning (Code scanning / CodeQL): Overwriting attribute in super-class or sub-class. Assignment overwrites attribute b, which was previously defined in superclass NativeLayer.
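The three warnings above all flag the same pattern: a subclass rebinding `self.w`/`self.b` to a Python float, which changes the attribute's type from the superclass's ndarray. A generic sketch of the pitfall and a type-stable alternative, with illustrative class names standing in for deepmd's actual classes:

```python
import numpy as np

class NativeLayerLike:
    def __init__(self, num_in):
        self.w = np.zeros((1, num_in))      # superclass declares w as ndarray

class LayerNormLike(NativeLayerLike):
    def __init__(self, num_in, uni_init=True):
        super().__init__(num_in)
        self.w = self.w.squeeze(0)          # keep the weight shape [num_in]
        if uni_init:
            # Fill the array instead of rebinding to a float (w = 1.0),
            # so self.w keeps the type declared by the superclass.
            self.w = np.ones_like(self.w)

layer = LayerNormLike(4)
```

Keeping the attribute an ndarray also avoids surprises downstream when code calls array methods on `w`.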
@codecov

codecov Bot commented Feb 1, 2024

Codecov Report

Attention: 529 lines in your changes are missing coverage. Please review.

Comparison is base (afb440a) 74.39% compared to head (a96cab0) 20.72%.
Report is 2 commits behind head on devel.

| Files | Patch % | Lines |
|---|---|---|
| deepmd/pt/model/descriptor/se_atten.py | 0.00% | 200 Missing ⚠️ |
| deepmd/model_format/dpa1.py | 0.00% | 117 Missing ⚠️ |
| deepmd/model_format/network.py | 0.00% | 109 Missing ⚠️ |
| deepmd/pt/model/network/mlp.py | 0.00% | 64 Missing ⚠️ |
| deepmd/pt/model/descriptor/dpa1.py | 0.00% | 36 Missing ⚠️ |
| deepmd/model_format/__init__.py | 0.00% | 1 Missing ⚠️ |
| deepmd/pt/model/descriptor/se_a.py | 0.00% | 1 Missing ⚠️ |
| deepmd/pt/model/task/ener.py | 0.00% | 1 Missing ⚠️ |
Additional details and impacted files
@@             Coverage Diff             @@
##            devel    #3211       +/-   ##
===========================================
- Coverage   74.39%   20.72%   -53.68%     
===========================================
  Files         345      346        +1     
  Lines       31981    32509      +528     
  Branches     1592     1594        +2     
===========================================
- Hits        23791     6736    -17055     
- Misses       7265    25075    +17810     
+ Partials      925      698      -227     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

embeddings = data.pop("embeddings")
type_embedding = data.pop("type_embedding")
attention_layers = data.pop("attention_layers")
env_mat = data.pop("env_mat")

Check notice (Code scanning / CodeQL): Unused local variable. Variable env_mat is not used.
Collaborator

@wanghan-iapcm wanghan-iapcm left a comment


The serialization and deserialization of model_format/dpa1 should be tested.
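A round-trip test of the kind requested usually asserts that the re-instantiated object reproduces the original output. A hedged sketch using a toy stand-in class, not the real model_format/dpa1 descriptor API:

```python
import numpy as np

# Toy stand-in for a serializable descriptor: serialize() emits a plain
# dict, deserialize() rebuilds the object, and the round trip must give
# identical results.
class ToyDescriptor:
    def __init__(self, scale):
        self.scale = np.asarray(scale, dtype=np.float64)

    def serialize(self):
        return {"scale": self.scale.tolist()}

    @classmethod
    def deserialize(cls, data):
        return cls(data["scale"])

    def call(self, x):
        return x * self.scale

d0 = ToyDescriptor([1.5, 2.0])
d1 = ToyDescriptor.deserialize(d0.serialize())
x = np.array([[2.0, 3.0]])
np.testing.assert_allclose(d0.call(x), d1.call(x))
```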

variables = data.pop("@variables")
embeddings = data.pop("embeddings")
type_embedding = data.pop("type_embedding")
attention_layers = data.pop("attention_layers", None)

Check notice (Code scanning / CodeQL): Unused local variable. Variable attention_layers is not used.
Member


Why is it popped but not used?

dd0_state_dict = dd0.se_atten.state_dict()
dd4_state_dict = dd4.se_atten.state_dict()

dd0_state_dict_attn = dd0.se_atten.dpa1_attention.state_dict()

Check notice (Code scanning / CodeQL): Unused local variable. Variable dd0_state_dict_attn is not used.
dd4_state_dict = dd4.se_atten.state_dict()

dd0_state_dict_attn = dd0.se_atten.dpa1_attention.state_dict()
dd4_state_dict_attn = dd4.se_atten.dpa1_attention.state_dict()

Check notice (Code scanning / CodeQL): Unused local variable. Variable dd4_state_dict_attn is not used.
data = copy.deepcopy(data)
variables = data.pop("@variables")
embeddings = data.pop("embeddings")
type_embedding = data.pop("type_embedding")

Check failure (Code scanning / CodeQL): Modification of parameter with default. This expression mutates a default value.
variables = data.pop("@variables")
embeddings = data.pop("embeddings")
type_embedding = data.pop("type_embedding")
attention_layers = data.pop("attention_layers", None)

Check failure (Code scanning / CodeQL): Modification of parameter with default. This expression mutates a default value.
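The two failures above flag `data.pop(...)` mutating a dict that may be a shared default argument: the mutation survives across calls. The fix, also visible in the excerpt above, is to deep-copy the parameter before popping. A minimal illustration of the pitfall and the fix (function names are illustrative):

```python
import copy

SHARED_DEFAULT = {"key": 1}

def deserialize_buggy(data=SHARED_DEFAULT):
    return data.pop("key", None)          # mutates the shared default dict

def deserialize_fixed(data=SHARED_DEFAULT):
    data = copy.deepcopy(data)            # pop on a private copy instead
    return data.pop("key", None)

assert deserialize_fixed() == 1
assert deserialize_fixed() == 1           # default untouched, works repeatedly
assert deserialize_buggy() == 1
assert deserialize_buggy() is None        # key was popped on the first call
```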
@njzjz njzjz added the Test CUDA Trigger test CUDA workflow label Feb 2, 2024
@github-actions github-actions Bot removed the Test CUDA Trigger test CUDA workflow label Feb 2, 2024
Then the scaled dot-product attention method is adopted:

.. math::
A(\mathcal{Q}^{i,l}, \mathcal{K}^{i,l}, \mathcal{V}^{i,l}, \mathcal{R}^{i,l})=\varphi\left(\mathcal{Q}^{i,l}, \mathcal{K}^{i,l},\mathcal{R}^{i,l}\right)\mathcal{V}^{i,l},
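The formula above is a gated scaled dot-product attention. A plain numpy sketch of the standard ungated version, taking φ as softmax(QKᵀ/√d); DPA-1's additional gating by the R term is omitted here for brevity:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(q, k, v):
    # q, k, v: [nnei, d]; the attention map is [nnei, nnei].
    d = q.shape[-1]
    weights = softmax(q @ k.T / np.sqrt(d))
    return weights @ v

rng = np.random.default_rng(0)
q = rng.standard_normal((5, 8))
out = scaled_dot_product_attention(q, q, q)   # self-attention over neighbors
```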
Member


variables = data.pop("@variables")
embeddings = data.pop("embeddings")
type_embedding = data.pop("type_embedding")
attention_layers = data.pop("attention_layers", None)
Member


Why is it popped but not used?

Comment on lines +330 to +331
w : np.ndarray, optional
The embedding weights of the layer.
Member


This does not match the actual parameters.

Comment on lines +444 to +447
w : np.ndarray, optional
The learnable weights of the normalization scale in the layer.
b : np.ndarray, optional
The learnable biases of the normalization shift in the layer.
Member


This does not match the actual parameters.

@njzjz njzjz linked an issue Mar 19, 2024 that may be closed by this pull request
@iProzd
Member Author

iProzd commented Apr 21, 2024

This PR is merged into #3696

@iProzd iProzd closed this Apr 21, 2024
@iProzd iProzd deleted the rf_dpa1 branch April 24, 2024 09:12


Development

Successfully merging this pull request may close these issues.

[Feature Request] pt: refactor DPA-1 in the PyTorch backend

4 participants