[BUG]: Llama3.1 HybridParallelPlugin train failed when pp_size>1 #6110

Open
@cingtiye

Description

Is there an existing issue for this bug?

  • I have searched the existing issues

🐛 Describe the bug

pp=2
tp=2
sp=1
zero_stage=0
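
For reference, a minimal sketch of how this parallel configuration would typically be set up with `HybridParallelPlugin` (an assumed reproduction outline, not the actual training script; the `num_microbatches` value and the placeholder model config below are assumptions, since pipeline parallelism requires microbatching to be specified):

```python
# Minimal reproduction sketch (assumed, not the reporter's actual script):
# HybridParallelPlugin with pp_size=2, tp_size=2, zero_stage=0 as in this report.
# sp is left at its default of 1; num_microbatches value is an assumption.
import torch
import colossalai
from colossalai.booster import Booster
from colossalai.booster.plugin import HybridParallelPlugin
from transformers import LlamaConfig, LlamaForCausalLM

colossalai.launch_from_torch()

plugin = HybridParallelPlugin(
    tp_size=2,           # tensor parallel degree
    pp_size=2,           # pipeline parallel degree; the error only appears when pp_size > 1
    zero_stage=0,
    num_microbatches=4,  # required when pp_size > 1; the actual value used is unknown
)
booster = Booster(plugin=plugin)

model = LlamaForCausalLM(LlamaConfig())  # placeholder config; the report uses Llama-3.1
optimizer = torch.optim.AdamW(model.parameters())
model, optimizer, *_ = booster.boost(model, optimizer)
```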

[rank6]: File "/usr/local/lib/python3.10/dist-packages/colossalai/shardformer/modeling/llama.py", line 93, in llama_model_forward
[rank6]: input_shape = hidden_states.shape[:-1]
[rank6]: AttributeError: 'NoneType' object has no attribute 'shape'
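
For context, line 93 sits in the pipeline-parallel forward path, where (roughly, in the simplified and hypothetical sketch below, not the actual ColossalAI source) non-first stages are expected to derive shapes from the `hidden_states` handed over by the previous stage rather than from `input_ids`; the traceback shows that on rank 6 this tensor arrives as `None`:

```python
# Hypothetical, simplified sketch of the code path around
# colossalai/shardformer/modeling/llama.py:93 (not the real source).
# First pipeline stage: shape comes from input_ids.
# Later stages: shape comes from hidden_states received from the previous stage.
# If that handover yields None, hidden_states.shape raises the AttributeError above.
def llama_model_forward_sketch(stage_manager, input_ids=None, hidden_states=None):
    if stage_manager.is_first_stage():
        input_shape = input_ids.shape            # first stage consumes token ids
    else:
        # hidden_states should be the activations from the previous pipeline stage;
        # the reported failure means it is None here on a non-first stage (rank 6)
        input_shape = hidden_states.shape[:-1]   # the line cited in the traceback
    return input_shape
```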

Environment

transformers 4.39.3
torch 2.4.0a0+3bcc3cddb5.nv24.7
colossalai 0.4.5
