Skip to content

Commit 6dc6507

Browse files
authored
[wip] update v2.1 readme (PaddlePaddle#736)
1 parent 54fc991 commit 6dc6507

File tree

4 files changed

+53
-2
lines changed

4 files changed

+53
-2
lines changed

CITATION.cff

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,15 @@
1+
# This CITATION.cff file was generated with cffinit.
2+
# Visit https://bit.ly/cffinit to generate yours today!
3+
4+
cff-version: 1.2.0
5+
title: Paddle Multimodal Integration and eXploration
6+
message: >-
7+
If you use this repository, please cite it using the metadata from this file.
8+
type: software
9+
authors:
10+
- given-names: PaddleMIX Authors
11+
repository-code: 'https://github.com/PaddlePaddle/PaddleMIX'
12+
repository: 'https://github.com/PaddlePaddle/PaddleMIX'
13+
keywords:
14+
- paddlemix
15+
license: Apache-2.0

README.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ PaddleMIX是基于飞桨的多模态大模型开发套件,聚合图像、文
2424
**🔥2024.10.11 发布PaddleMIX v2.1**
2525
* 支持[PaddleNLP 3.0 beta](https://github.com/PaddlePaddle/PaddleNLP/releases/tag/v3.0.0-beta0)版本,抢先体验其最新功能。
2626
* 新增[Qwen2-VL](./paddlemix/examples/qwen2_vl/)[InternVL2](./paddlemix/examples/internvl2/)[Stable Diffusion 3 (SD3)](https://github.com/PaddlePaddle/PaddleMIX/blob/develop/ppdiffusers/examples/dreambooth/README_sd3.md)等前沿模型。
27-
* 发布自研多模数据能力标签模型[PP-InsCapTagger](./paddlemix/datacopilot/example/pp_inscaptagger/);可用于数据的分析和过滤,试验案例表明在保持模型效果的条件下可减少50%的数据量,大幅提高训练效率。
27+
* DataCopilot发布自研多模数据能力标签模型[PP-InsCapTagger](./paddlemix/datacopilot/example/pp_inscaptagger/);可用于数据的分析和过滤,试验案例表明在保持模型效果的条件下可减少50%的数据量,大幅提高训练效率。
2828
* 多模态大模型InternVL2、LLaVA、SD3、SDXL适配昇腾910B,提供国产计算芯片上的训推能力。
2929

3030
**2024.09.11 更新**
@@ -145,6 +145,7 @@ sh build_env.sh
145145
<li><a href="paddlemix/examples/evaclip">EVA-CLIP</a></li>
146146
<li><a href="paddlemix/examples/llava">LLaVA</a></li>
147147
<li><a href="paddlemix/examples/llava">LLaVA-1.5</a></li>
148+
<li><a href="paddlemix/examples/llava">LLaVA-1.6</a></li>
148149
<li><a href="paddlemix/examples/llava">LLaVA-NeXT</a></li>
149150
<li><a href="paddlemix/examples/qwen_vl">Qwen-VL</a></li>
150151
<li><a href="paddlemix/examples/qwen2_vl">Qwen2-VL</a></li>
@@ -169,13 +170,19 @@ sh build_env.sh
169170
<ul>
170171
<li><a href="paddlemix/examples/imagebind">ImageBind</a></li>
171172
</ul>
173+
</ul>
174+
<li><b>数据分析</b></li>
175+
<ul>
176+
<li><a href="./paddlemix/datacopilot/example/pp_inscaptagger/">PP-InsCapTagger</a></li>
177+
</ul>
172178
</td>
173179
<td>
174180
<ul>
175181
</ul>
176182
<li><b>文生图</b></li>
177183
<ul>
178184
<li><a href="ppdiffusers/examples/stable_diffusion">Stable Diffusion</a></li>
185+
<li><a href="ppdiffusers/examples/dreambooth/README_sd3.md">Stable Diffusion 3 (SD3)</a></li>
179186
<li><a href="ppdiffusers/examples/controlnet">ControlNet</a></li>
180187
<li><a href="ppdiffusers/examples/t2i-adapter">T2I-Adapter</a></li>
181188
<li><a href="ppdiffusers/examples/text_to_image_laion400m">LDM</a></li>

paddlemix/datacopilot/example/pp_inscaptagger/readme.md

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -110,3 +110,32 @@ LLaVA v1.5模型SFT阶段训练时,使用的指令微调数据集为[LLaVA-Ins
110110
| llava-1.5-7b <br> (tag 50%/our) | 70.24 | 57.12 | 78.32 | 62.14 | 37.11 | 1476 <br> 338 |
111111

112112
通过PP-InsCapTagger的打标和优化,50%数据集与原始数据集的训练效果基本持平,大大提高了模型训练效率。
113+
114+
115+
116+
## 引用
117+
如果在你的工作中用到`PP-InsCapTagger`,请按照下面的方式引用:
118+
119+
<details>
120+
<summary> bibtex </summary>
121+
122+
```bibtex
123+
124+
@software{PaddleMIX_Authors_Paddle_Multimodal_Integration,
125+
author = {PaddleMIX Authors},
126+
license = {Apache-2.0},
127+
title = {{Paddle Multimodal Integration and eXploration}},
128+
url = {https://github.com/PaddlePaddle/PaddleMIX}
129+
}
130+
131+
@software{Lv_Instance_Capability_Tagger_2024,
132+
author = {Lv, Wenyu and Huang, Kui and Zhao, Yian},
133+
license = {Apache-2.0},
134+
month = oct,
135+
title = {{Instance Capability Tagger: Enhancing Multimodal Data Efficiency for Model Training}},
136+
url = {https://github.com/lyuwenyu/PP-InsCapTagger},
137+
version = {1.0},
138+
year = {2024}
139+
}
140+
```
141+
</details>

paddlemix/datacopilot/nn/inscaptagger.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ def __init__(self, model_name_or_path, max_new_tokens = 4096, dtype='float16') -
3939

4040

4141
def init_model(self, model_name_or_path, max_new_tokens, dtype):
42-
tokenizer = AutoTokenizerMIX.from_pretrained(model_name_or_path, use_fast=False)
42+
tokenizer = AutoTokenizerMIX.from_pretrained(model_name_or_path)
4343
model_config = AutoConfigMIX.from_pretrained(model_name_or_path)
4444
model = AutoModelMIX.from_pretrained(model_name_or_path, dtype=dtype)
4545
model.eval()

0 commit comments

Comments
 (0)