Encode Prefill Disaggregation testcases #219
Conversation
CLA Signature Guide
@EdwardXuy, thanks for your pull request. The following commit(s) are not associated with a signed Contributor License Agreement (CLA).
To sign CLA, click here. To check if your email is configured correctly, refer to the FAQs. Once you've signed the CLA or updated your email, please comment |
Motivation
This PR extends test coverage for VLM (Vision-Language Model) encoder-prefill disaggregation on Ascend NPU hardware. It introduces a test suite that validates how the key parameters (--encoder-only, --language-only, --encoder-transfer-backend, --encoder-urls, and --enable-adaptive-dispatch-to-encoder) interact in a multi-process environment.
The goal is to ensure the system correctly handles both single-image (local processing) and multi-image (distributed processing) scenarios, and to verify the stability of the different encoder transfer backends.
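The way these parameters combine across the two server roles can be sketched as below. This is a minimal, hypothetical illustration and not the code added by this PR: the helper names (build_encoder_args, build_language_args), the class names, and the encoder URL are invented for the example, and it assumes that --encoder-only, --language-only, and --enable-adaptive-dispatch-to-encoder are boolean flags while --encoder-transfer-backend and --encoder-urls take values. Only the flag names and the backend names zmq_to_scheduler and zmq_to_tokenizer come from this description.

```python
from typing import Dict, List, Optional

# Hypothetical address of the encoder-only server (an assumption for this sketch).
ENCODER_URL = "http://127.0.0.1:30001"


def build_encoder_args(backend: str) -> List[str]:
    """Arguments for the server that only runs the vision encoder."""
    return ["--encoder-only", "--encoder-transfer-backend", backend]


def build_language_args(
    backend: Optional[str] = None,
    encoder_urls: Optional[List[str]] = None,
    adaptive_dispatch: bool = False,
) -> List[str]:
    """Arguments for the server that only runs the language model."""
    args = ["--language-only"]
    if backend is not None:
        args += ["--encoder-transfer-backend", backend]
    if encoder_urls:
        args += ["--encoder-urls", *encoder_urls]
    if adaptive_dispatch:
        args.append("--enable-adaptive-dispatch-to-encoder")
    return args


class DisaggregatedArgsBase:
    """Base case: one encoder server plus one language server."""

    backend = "zmq_to_scheduler"

    @classmethod
    def server_args(cls) -> Dict[str, List[str]]:
        return {
            "encoder": build_encoder_args(cls.backend),
            "language": build_language_args(cls.backend, [ENCODER_URL]),
        }


class ZmqToTokenizerArgs(DisaggregatedArgsBase):
    """Same scenario, different transfer backend, selected by subclassing."""

    backend = "zmq_to_tokenizer"


# Adaptive-dispatch scenario: language-only server with no encoder backend.
adaptive_args = build_language_args(adaptive_dispatch=True)
```

Overriding a single class attribute keeps the scenario identical across backends, so a failure can be attributed to the backend rather than to a difference in setup.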
Modifications
New Test File: test_npu_adaptive_dispatch_to_encoder.py
Covers the --enable-adaptive-dispatch-to-encoder parameter.
Launches a --language-only server without an encoder backend. It validates that single-image requests are processed locally (adaptive dispatch) while ensuring the parameter is correctly recognized by the server.
New Test File: test_npu_disaggregated_vlm.py
Launches an --encoder-only server and a --language-only server. It tests the communication between them.
Covers the encoder transfer backends (zmq_to_scheduler, zmq_to_tokenizer) through subclassing, ensuring that:
Accuracy Tests
N/A
Speed Tests and Profiling
N/A
Checklist