Hi Chengyang, thanks for your great code!
I'm trying to reproduce the GLAT+DSLP model, I checked your given training scripts, but I found there is no "--arch glat_sd" registered model in the code, is it should be "nat_sd_glat"?
BTW, what's the meaning of "ss" and "sd"? Does "sd" mean supervised deeply? how about "ss"
Thank for your answer!!
Hi Chengyang, thanks for your great code!
I'm trying to reproduce the GLAT+DSLP model, I checked your given training scripts, but I found there is no "--arch glat_sd" registered model in the code, is it should be "nat_sd_glat"?
BTW, what's the meaning of "ss" and "sd"? Does "sd" mean supervised deeply? how about "ss"
Thank for your answer!!