Hello,
I'm trying to reproduce your work on 8 A100 GPUs.
However, I could not reach your reported performance.
I noticed that some configurations differ from those in your training log.
Could you share more details about the training configurations, or the exact training scripts?