Dear Author,
I fine-tuned the ABR model following the instructions and using the provided hyperparameters. However, my results differ noticeably from those reported in the paper. I have attached two figures to illustrate the problem:
Figure 1: Shows the loss curve and the reward curve during my fine-tuning process.
Figure 2: Compares the baseline, my fine-tuned large model (purple curve), and your fine-tuned large model (green curve).
As Figure 2 shows, my fine-tuned model performs significantly worse than the model you provided, despite using the same hyperparameters. Could you please advise whether there are specific hyperparameters I should adjust further, or any additional configuration steps I might have missed?
Thank you very much for your assistance!
Best regards,
Reamon

