add inputs_embeds to Bart/MBart/Unified_Transformer/Unimo/CodeGen #3769
Conversation
```diff
@@ -515,11 +540,13 @@ def set_input_embeddings(self, value):

     def forward(
         self,
-        input_ids,
+        input_ids=None,
```
Please also update the documentation and mark this argument as `optional`; same for the later occurrences.
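A rough sketch of what the requested doc change could look like; the wording below is illustrative, not the final PR text:

```python
class BartEncoder:  # sketch only, not the full class
    def forward(self, input_ids=None, attention_mask=None, inputs_embeds=None):
        r"""
        Args:
            input_ids (Tensor, optional):
                Indices of input sequence tokens in the vocabulary. Only needed
                when `inputs_embeds` is not provided. Defaults to `None`.
            inputs_embeds (Tensor, optional):
                Pre-computed token embeddings of shape
                `[batch_size, sequence_length, hidden_size]`. Defaults to `None`.
        """
        ...
```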
```diff
-        if attention_mask is None:
-            assert input_ids is not None, "input_ids should be " "specified when generating attention_mask"
+        if attention_mask is None and input_ids is not None:
+            # assert input_ids is not None, "input_ids should be " \
```
Could this be handled the same way as the CodeGen change: log a warning when `input_ids` is None, and delete this commented-out assert.
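A sketch of the suggested CodeGen-style handling; it assumes this sits inside the model's `forward`, with `self.pad_token_id` and a module-level `logger` already available:

```python
if attention_mask is None:
    if input_ids is not None:
        # Build the default mask from padding positions, as before.
        attention_mask = (
            paddle.cast(input_ids == self.pad_token_id, dtype=paddle.get_default_dtype()).unsqueeze([1, 2])
            * -1e4
        )
    else:
        # Warn instead of asserting; attention_mask simply stays None.
        logger.warning("Provided inputs_embeds without attention_mask")
```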
```python
        elif input_ids is not None:
            inputs_sample = input_ids
        elif input_embeddings is not None:
            inputs_sample = input_embeddings[:, :, -1]
```
The meaning of `inputs_sample` here is a bit odd. Could the later `paddle.expand_as()` be replaced with `paddle.expand()`, and this part changed to obtain `input_shape` instead? That would also avoid the repeated `paddle.shape()` calls further down.
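A sketch of the suggested refactor, resolving `input_shape` once and broadcasting with `paddle.expand`; the variable names are illustrative:

```python
# Resolve the shape once instead of keeping an inputs_sample tensor around.
if input_ids is not None:
    input_shape = paddle.shape(input_ids)             # [batch_size, seq_len]
else:
    input_shape = paddle.shape(input_embeddings)[:2]  # drop the hidden dim

if position_ids is None:
    # Broadcast sequential ids to the batch with paddle.expand
    # instead of paddle.expand_as on inputs_sample.
    position_ids = paddle.expand(
        paddle.arange(end=input_shape[1], dtype="int64").unsqueeze(0),
        input_shape,
    )
```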
```python
            ).astype("int64")
        else:
            logger.warning(
                "position_ids or pad_token_ids should be provided when input_embeds is specified, otherwise an unexpected result may be returned"
```
In the "otherwise" clause, please spell out what position ids will actually be used; "an unexpected result" is too vague.
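One way to make the message concrete, assuming the fallback is a plain sequential range over the sequence length (that fallback is an assumption here, not something stated in the PR):

```python
logger.warning(
    "position_ids or pad_token_id should be provided when inputs_embeds is "
    "specified; falling back to sequential position ids [0, 1, ..., seq_len - 1], "
    "which ignores any padding tokens."
)
```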
```python
                paddle.cast(input_ids == self.pad_token_id, dtype=paddle.get_default_dtype()).unsqueeze([1, 2])
                * -1e4
            )
            logger.warning("provided inputs_embeds without attention_mask")
```
Capitalize the first letter, and also note that the default value None will be used as the attention mask, which means no masking is applied. Same for the later occurrences.
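The message could then read along these lines (wording illustrative):

```python
logger.warning(
    "Provided inputs_embeds without attention_mask; the default value None will "
    "be used as attention_mask, which means no masking is applied."
)
```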
```python
        elif input_ids is not None:
            inputs_sample = input_ids
        elif input_embeddings is not None:
            inputs_sample = input_embeddings[:, :, -1]
```
Same as above; this needs the same change.
- Added `optional` to the `input_ids` documentation
- Removed the commented-out `assert input_ids is not None` and added a logger warning instead
- Replaced the `paddle.expand_as` usage with `paddle.expand`
- Expanded the `logger.warning` messages and capitalized the first letter
Codecov Report
```diff
@@            Coverage Diff             @@
##           develop    #3769      +/-   ##
===========================================
+ Coverage    32.95%   33.06%   +0.10%
===========================================
  Files          400      400
  Lines        56031    56131     +100
===========================================
+ Hits         18466    18560      +94
- Misses       37565    37571       +6
```
PR types
New features

PR changes
Models

Description
- Add `inputs_embeds` to Bart/MBart/Unified_Transformer/Unimo/CodeGen
- Set `use_cache` to `False` if `labels` is provided for the Bart and MBart models (during training) to save memory
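A rough usage sketch of the new argument with the Bart model; the checkpoint name and the exact call details are illustrative and may differ from the final API:

```python
import paddle
from paddlenlp.transformers import BartModel, BartTokenizer

model = BartModel.from_pretrained("bart-base")        # illustrative checkpoint
tokenizer = BartTokenizer.from_pretrained("bart-base")

token_ids = tokenizer("PaddleNLP adds inputs_embeds support")["input_ids"]
input_ids = paddle.to_tensor([token_ids], dtype="int64")

# Compute the token embeddings manually (e.g. to inject soft prompts),
# then pass them in place of input_ids on the encoder side.
inputs_embeds = model.get_input_embeddings()(input_ids)
outputs = model(inputs_embeds=inputs_embeds, decoder_input_ids=input_ids)
```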