Add version details to NVSHMEM version mismatch errors #32

Closed
AkiRusProd wants to merge 1 commit into NVIDIA:devel from AkiRusProd:feature/detail-versions

@AkiRusProd
Contributor

Add Version Details to NVSHMEM Version Mismatch Errors

Problem: When the NVSHMEM device and host library versions do not match, the error message gave only a generic warning with no version numbers, which made compatibility issues hard to diagnose.

Solution: The error message now includes both version numbers in major.minor.patch format whenever a mismatch is detected:

  • In nvshmemid_hostlib_init_attr function during library initialization
  • In nvshmemi_cuobject_init_common function during CUmodule/CUlibrary initialization

Error output example now shows:

NVSHMEM device library version (3.3.24) does not match with NVSHMEM host library version (3.2.5)
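
For readers who want to see how a message of this shape can be produced, here is a minimal sketch in C. It is not the code from this PR: `example_version_t`, `example_versions_match`, and `example_format_mismatch` are hypothetical names introduced only for illustration, and the actual NVSHMEM types and logging macros may differ.

```c
#include <stdio.h>

/* Hypothetical stand-in for a version triple; not NVSHMEM's actual type. */
typedef struct {
    int major;
    int minor;
    int patch;
} example_version_t;

/* Returns 1 when all three components are equal, 0 otherwise. */
int example_versions_match(example_version_t a, example_version_t b) {
    return a.major == b.major && a.minor == b.minor && a.patch == b.patch;
}

/* Writes the mismatch message into buf (at most len bytes, always
 * NUL-terminated when len > 0) and returns snprintf's result. */
int example_format_mismatch(char *buf, size_t len,
                            example_version_t device,
                            example_version_t host) {
    return snprintf(buf, len,
                    "NVSHMEM device library version (%d.%d.%d) does not match "
                    "with NVSHMEM host library version (%d.%d.%d)",
                    device.major, device.minor, device.patch,
                    host.major, host.minor, host.patch);
}
```

A caller would compare the two triples with `example_versions_match` and, on failure, format the message into a buffer and pass it to whatever error-reporting path is in use. Formatting into a caller-supplied buffer keeps the sketch independent of any particular logging macro.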

Benefits:

  • Accelerates compatibility issue diagnosis
  • Allows precise identification of which version needs updating
  • Simplifies debugging in heterogeneous environments with different library versions

@abrooks98
Collaborator

Thank you for the contribution! This has been merged internally and is now available with v3.6.5-0

@abrooks98 closed this on Mar 25, 2026