
【Hackathon 8th No.28】Reproduce Phi3 in PaddleNLP #10688


Open
robinbg wants to merge 7 commits into develop from robinbg:feature/add_phi3

Conversation


@robinbg robinbg commented Jun 1, 2025

Before submitting

  • Lint code. If there are lint issues, please format the code first.
# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py
  • Add test cases into the tests folder. If there are codecov issues, please add test cases first.

PR types

PR changes

Description

robinbg added 3 commits June 1, 2025 12:05
- Add Phi3 model configuration, tokenizer, and modeling classes
- Support both phi3-small (3B) and phi3-base (14B) variants
- Add comprehensive unit tests for model and tokenizer
- Implement grouped query attention and rotary embeddings
- Add support for gradient checkpointing and generation
- Follow PaddleNLP coding standards and conventions
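For context on the grouped query attention item above, here is a minimal sketch of the KV-head repetition step that GQA implementations typically use in Paddle. The function name, tensor layout, and shapes are illustrative assumptions, not code taken from this PR.

```python
# Sketch only: names and shapes are illustrative, not from this PR.
import paddle


def repeat_kv(hidden_states: paddle.Tensor, n_rep: int) -> paddle.Tensor:
    """Repeat key/value heads so they match the number of query heads.

    hidden_states: [batch, num_kv_heads, seq_len, head_dim]
    """
    if n_rep == 1:
        return hidden_states
    batch, num_kv_heads, seq_len, head_dim = hidden_states.shape
    # Insert a repeat axis after the kv-head axis, tile it, then fold it
    # back so the result has num_kv_heads * n_rep heads.
    hidden_states = hidden_states.unsqueeze(2).tile([1, 1, n_rep, 1, 1])
    return hidden_states.reshape([batch, num_kv_heads * n_rep, seq_len, head_dim])
```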

paddle-bot bot commented Jun 1, 2025

Thanks for your contribution!

"""

resource_files_names = {"vocab_file": "vocab.model", "tokenizer_config_file": "tokenizer_config.json"}
pretrained_resource_files_map = {
A collaborator commented:

There is no need to configure download paths here; we can convert the model and upload it for use later.
The model configuration entries can also be removed.

@@ -0,0 +1 @@
# Copyright (c) 2025 PaddlePaddle Authors. All Rights Reserved.
A collaborator commented:

This is incomplete.

@DrownFish19 (Collaborator) commented:

Please run the files through pre-commit and re-submit them so that the formatting is consistent. You can refer to the following commands:

# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py

@luotao1 (Collaborator) commented Jun 4, 2025

  • You can download 「如流」 (Infoflow) and scan the QR code to join the 8th Hackathon discussion group.

@DrownFish19 (Collaborator) commented:

  1. `XXXPretrainedModel` needs to implement `_get_name_mappings` (to support parameter conversion), `_get_tensor_parallel_mappings` (to support splitting parameters for model parallelism), and `_get_fuse_or_split_param_mappings` (to support automatic fusing and splitting of parameters).
  2. Please refer to the Qwen2 model's parallel-strategy support so that the model can be trained.
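For reference, a minimal sketch of what the tensor-parallel mapping hook might look like, loosely following the Qwen2/Llama pattern in PaddleNLP. The weight names (`qkv_proj`, `gate_up_proj`, ...), the config fields, and the assumption that `split_or_merge_func` lives in `paddlenlp.transformers.conversion_utils` should be verified against the actual implementation; this is a sketch under those assumptions, not the PR's code.

```python
# Sketch only: helper and weight names are assumptions based on how
# Qwen2/Llama implement this hook in PaddleNLP.
from functools import partial

from paddlenlp.transformers.conversion_utils import split_or_merge_func


def _get_tensor_parallel_mappings(config, is_split=True):
    # In the real model this would be a @classmethod on Phi3PretrainedModel.
    fn = split_or_merge_func(
        is_split=is_split,
        tensor_parallel_degree=config.tensor_parallel_degree,
        tensor_parallel_rank=config.tensor_parallel_rank,
        num_attention_heads=config.num_attention_heads,
    )
    mappings = {}
    for i in range(config.num_hidden_layers):
        prefix = f"phi3.layers.{i}."
        # Column-parallel weights: split along the output dimension.
        mappings[prefix + "self_attn.qkv_proj.weight"] = partial(fn, is_column=True)
        mappings[prefix + "mlp.gate_up_proj.weight"] = partial(fn, is_column=True)
        # Row-parallel weights: split along the input dimension.
        mappings[prefix + "self_attn.o_proj.weight"] = partial(fn, is_column=False)
        mappings[prefix + "mlp.down_proj.weight"] = partial(fn, is_column=False)
    return mappings
```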

robinbg pushed a commit to robinbg/PaddleNLP that referenced this pull request Jun 8, 2025
Fix(phi3): Address comments from PR PaddlePaddle#10688

This commit incorporates the suggestions and requirements from the review comments on PR PaddlePaddle#10688 for the Phi3 model implementation.

The following changes were made:

1.  **Tokenizer Configuration Cleanup:**
    - Removed `pretrained_resource_files_map`, `pretrained_init_configuration`, and `max_model_input_sizes` from `paddlenlp/transformers/phi3/tokenizer.py` as requested, to decouple it from specific pre-trained model download paths.

2.  **Test Init File Completion:**
    - Added a docstring to `tests/transformers/phi3/__init__.py` to ensure it's a valid and non-empty Python module initialization file.

3.  **PretrainedModel Mapping Methods:**
    - Implemented `_get_name_mappings`, `_get_tensor_parallel_mappings`, and `_get_fuse_or_split_param_mappings` in the `Phi3PreTrainedModel` class in `paddlenlp/transformers/phi3/modeling.py`. These methods are crucial for model conversion and tensor parallelism, based on the Qwen2 model's implementation.

4.  **Parallel Strategy Support:**
    - Integrated support for sequence parallelism and recomputation into `paddlenlp/transformers/phi3/modeling.py` (a recompute sketch follows this list).
    - This includes:
        - Configuration flags for enabling/disabling these features.
        - Modifications to `Phi3Model`, `Phi3DecoderLayer`, `Phi3Attention`, and `Phi3MLP` to handle sequence-parallel linear layers and recomputation logic (full layer, full attention, and core attention granularities).
        - Necessary imports and utilities for sequence parallelism (ScatterOp, GatherOp, sequence-parallel linear layers) and recomputation.
        - Tensor parallelism considerations for weight initialization and layer configurations.

5.  **Code Formatting:**
    - Applied `pre-commit` to all modified files to ensure code style consistency and address linting issues. This included removing some unused imports and a duplicated code segment.
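For context, a minimal sketch of how full-layer recomputation is commonly wired into a PaddleNLP decoder stack. The config flags (`recompute`, `recompute_granularity`) follow other PaddleNLP models such as Llama/Qwen2 and are assumed here for Phi3; `recompute` itself is the `paddle.distributed.fleet.utils.recompute` utility. This is a sketch under those assumptions, not the PR's implementation.

```python
# Sketch only: flag names are assumed from other PaddleNLP models.
from paddle.distributed.fleet.utils import recompute


def run_decoder_layer(layer, hidden_states, attention_mask, config, training=True):
    """Optionally recompute a decoder layer's forward pass to save activation memory."""
    if training and config.recompute and config.recompute_granularity == "full":
        # Recompute the whole decoder layer during the backward pass.
        return recompute(layer, hidden_states, attention_mask)
    # "full_attn" / "core_attn" granularities would instead wrap only the
    # attention block (or its softmax(QK^T)V core) inside the layer itself.
    return layer(hidden_states, attention_mask)
```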

CLAassistant commented Jun 9, 2025

CLA assistant check
All committers have signed the CLA.

robinbg force-pushed the feature/add_phi3 branch from ff63b2e to dbb9d76 on June 9, 2025 at 07:03.