-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathtest.log
More file actions
134 lines (133 loc) · 9.2 KB
/
test.log
File metadata and controls
134 lines (133 loc) · 9.2 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
03/15/2023 23:49:44 - WARNING - __main__ - Process rank: -1, device: cuda, n_gpu: 1, distributed training: False, 16-bits training: False
Some weights of RobertaForSequenceClassification were not initialized from the model checkpoint at ./trained_model/roberta_model and are newly initialized because the shapes did not match:
- classifier.out_proj.weight: found shape torch.Size([2, 768]) in the checkpoint and torch.Size([1, 768]) in the model instantiated
- classifier.out_proj.bias: found shape torch.Size([2]) in the checkpoint and torch.Size([1]) in the model instantiated
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
03/15/2023 23:49:45 - INFO - __main__ - Training/evaluation parameters Namespace(adam_epsilon=1e-08, block_size=510, cache_dir='./trained_model', config_name='', device=device(type='cuda'), do_eval=True, do_lower_case=False, do_test=True, do_train=False, epoch=5, eval_all_checkpoints=False, eval_batch_size=64, eval_data_file='validate.json', evaluate_during_training=True, fp16=False, fp16_opt_level='O1', gradient_accumulation_steps=1, learning_rate=2e-05, local_rank=-1, logging_steps=50, max_grad_norm=1.0, max_steps=-1, mlm=False, mlm_probability=0.15, model_name_or_path='./trained_model/roberta_model', model_type='roberta', n_gpu=1, no_cuda=False, num_train_epochs=1.0, output_dir='./saved_models', overwrite_cache=False, overwrite_output_dir=False, per_gpu_eval_batch_size=64, per_gpu_train_batch_size=32, save_steps=50, save_total_limit=None, seed=123456, server_ip='', server_port='', start_epoch=0, start_step=0, test_data_file='test.json', tokenizer_name='./trained_model/roberta_model', train_batch_size=32, train_data_file='train.json', warmup_steps=0, weight_decay=0.0)
03/15/2023 23:49:49 - INFO - __main__ - ***** Running evaluation *****
03/15/2023 23:49:49 - INFO - __main__ - Num examples = 823
03/15/2023 23:49:49 - INFO - __main__ - Batch size = 64
03/15/2023 23:49:58 - INFO - __main__ - ***** Eval results *****
03/15/2023 23:49:58 - INFO - __main__ - eval_acc = 0.616
03/15/2023 23:49:58 - INFO - __main__ - eval_loss = 0.6291
03/15/2023 23:50:09 - INFO - __main__ - ***** Running Test *****
03/15/2023 23:50:09 - INFO - __main__ - Num examples = 7403
03/15/2023 23:50:09 - INFO - __main__ - Batch size = 64
0%| | 0/116 [00:00<?, ?it/s] 1%| | 1/116 [00:00<01:15, 1.52it/s] 2%|▏ | 2/116 [00:01<01:12, 1.56it/s] 3%|▎ | 3/116 [00:01<01:11, 1.57it/s] 3%|▎ | 4/116 [00:02<01:11, 1.58it/s] 4%|▍ | 5/116 [00:03<01:10, 1.58it/s] 5%|▌ | 6/116 [00:03<01:09, 1.58it/s] 6%|▌ | 7/116 [00:04<01:09, 1.58it/s] 7%|▋ | 8/116 [00:05<01:08, 1.58it/s] 8%|▊ | 9/116 [00:05<01:07, 1.58it/s] 9%|▊ | 10/116 [00:06<01:07, 1.58it/s] 9%|▉ | 11/116 [00:06<01:06, 1.58it/s] 10%|█ | 12/116 [00:07<01:05, 1.58it/s] 11%|█ | 13/116 [00:08<01:05, 1.58it/s] 12%|█▏ | 14/116 [00:08<01:04, 1.58it/s] 13%|█▎ | 15/116 [00:09<01:03, 1.58it/s] 14%|█▍ | 16/116 [00:10<01:03, 1.58it/s] 15%|█▍ | 17/116 [00:10<01:02, 1.58it/s] 16%|█▌ | 18/116 [00:11<01:02, 1.58it/s] 16%|█▋ | 19/116 [00:12<01:01, 1.57it/s] 17%|█▋ | 20/116 [00:12<01:00, 1.58it/s] 18%|█▊ | 21/116 [00:13<01:00, 1.57it/s] 19%|█▉ | 22/116 [00:13<00:59, 1.57it/s] 20%|█▉ | 23/116 [00:14<00:59, 1.57it/s] 21%|██ | 24/116 [00:15<00:58, 1.57it/s] 22%|██▏ | 25/116 [00:15<00:57, 1.57it/s] 22%|██▏ | 26/116 [00:16<00:57, 1.57it/s] 23%|██▎ | 27/116 [00:17<00:56, 1.57it/s] 24%|██▍ | 28/116 [00:17<00:56, 1.57it/s] 25%|██▌ | 29/116 [00:18<00:55, 1.57it/s] 26%|██▌ | 30/116 [00:19<00:54, 1.57it/s] 27%|██▋ | 31/116 [00:19<00:54, 1.57it/s] 28%|██▊ | 32/116 [00:20<00:53, 1.57it/s] 28%|██▊ | 33/116 [00:20<00:52, 1.57it/s] 29%|██▉ | 34/116 [00:21<00:52, 1.57it/s] 30%|███ | 35/116 [00:22<00:51, 1.57it/s] 31%|███ | 36/116 [00:22<00:50, 1.57it/s] 32%|███▏ | 37/116 [00:23<00:50, 1.57it/s] 33%|███▎ | 38/116 [00:24<00:49, 1.57it/s] 34%|███▎ | 39/116 [00:24<00:49, 1.57it/s] 34%|███▍ | 40/116 [00:25<00:48, 1.57it/s] 35%|███▌ | 41/116 [00:26<00:47, 1.57it/s] 36%|███▌ | 42/116 [00:26<00:47, 1.57it/s] 37%|███▋ | 43/116 [00:27<00:46, 1.57it/s] 38%|███▊ | 44/116 [00:27<00:45, 1.57it/s] 39%|███▉ | 45/116 [00:28<00:45, 1.57it/s] 40%|███▉ | 46/116 [00:29<00:44, 1.57it/s] 41%|████ | 47/116 [00:29<00:44, 1.57it/s] 41%|████▏ | 48/116 [00:30<00:43, 1.57it/s] 42%|████▏ | 49/116 [00:31<00:42, 1.56it/s] 43%|████▎ | 50/116 [00:31<00:42, 1.56it/s] 44%|████▍ | 51/116 [00:32<00:41, 1.56it/s] 45%|████▍ | 52/116 [00:33<00:40, 1.56it/s] 46%|████▌ | 53/116 [00:33<00:40, 1.56it/s] 47%|████▋ | 54/116 [00:34<00:39, 1.56it/s] 47%|████▋ | 55/116 [00:35<00:39, 1.56it/s] 48%|████▊ | 56/116 [00:35<00:38, 1.56it/s] 49%|████▉ | 57/116 [00:36<00:37, 1.56it/s] 50%|█████ | 58/116 [00:36<00:37, 1.56it/s] 51%|█████ | 59/116 [00:37<00:36, 1.56it/s] 52%|█████▏ | 60/116 [00:38<00:35, 1.56it/s] 53%|█████▎ | 61/116 [00:38<00:35, 1.56it/s] 53%|█████▎ | 62/116 [00:39<00:34, 1.56it/s] 54%|█████▍ | 63/116 [00:40<00:33, 1.56it/s] 55%|█████▌ | 64/116 [00:40<00:33, 1.56it/s] 56%|█████▌ | 65/116 [00:41<00:32, 1.56it/s] 57%|█████▋ | 66/116 [00:42<00:32, 1.56it/s] 58%|█████▊ | 67/116 [00:42<00:31, 1.56it/s] 59%|█████▊ | 68/116 [00:43<00:30, 1.56it/s] 59%|█████▉ | 69/116 [00:43<00:30, 1.56it/s] 60%|██████ | 70/116 [00:44<00:29, 1.56it/s] 61%|██████ | 71/116 [00:45<00:28, 1.55it/s] 62%|██████▏ | 72/116 [00:45<00:28, 1.55it/s] 63%|██████▎ | 73/116 [00:46<00:27, 1.56it/s] 64%|██████▍ | 74/116 [00:47<00:27, 1.55it/s] 65%|██████▍ | 75/116 [00:47<00:26, 1.55it/s] 66%|██████▌ | 76/116 [00:48<00:25, 1.55it/s] 66%|██████▋ | 77/116 [00:49<00:25, 1.55it/s] 67%|██████▋ | 78/116 [00:49<00:24, 1.55it/s] 68%|██████▊ | 79/116 [00:50<00:23, 1.55it/s] 69%|██████▉ | 80/116 [00:51<00:23, 1.55it/s] 70%|██████▉ | 81/116 [00:51<00:22, 1.55it/s] 71%|███████ | 82/116 [00:52<00:21, 1.55it/s] 72%|███████▏ | 83/116 [00:53<00:21, 1.55it/s] 72%|███████▏ | 84/116 [00:53<00:20, 1.55it/s] 73%|███████▎ | 85/116 [00:54<00:20, 1.55it/s] 74%|███████▍ | 86/116 [00:54<00:19, 1.55it/s] 75%|███████▌ | 87/116 [00:55<00:18, 1.55it/s] 76%|███████▌ | 88/116 [00:56<00:18, 1.55it/s] 77%|███████▋ | 89/116 [00:56<00:17, 1.55it/s] 78%|███████▊ | 90/116 [00:57<00:16, 1.55it/s] 78%|███████▊ | 91/116 [00:58<00:16, 1.55it/s] 79%|███████▉ | 92/116 [00:58<00:15, 1.55it/s] 80%|████████ | 93/116 [00:59<00:14, 1.55it/s] 81%|████████ | 94/116 [01:00<00:14, 1.55it/s] 82%|████████▏ | 95/116 [01:00<00:13, 1.55it/s] 83%|████████▎ | 96/116 [01:01<00:12, 1.55it/s] 84%|████████▎ | 97/116 [01:02<00:12, 1.55it/s] 84%|████████▍ | 98/116 [01:02<00:11, 1.55it/s] 85%|████████▌ | 99/116 [01:03<00:10, 1.55it/s] 86%|████████▌ | 100/116 [01:04<00:10, 1.55it/s] 87%|████████▋ | 101/116 [01:04<00:09, 1.55it/s] 88%|████████▊ | 102/116 [01:05<00:09, 1.55it/s] 89%|████████▉ | 103/116 [01:05<00:08, 1.55it/s] 90%|████████▉ | 104/116 [01:06<00:07, 1.55it/s] 91%|█████████ | 105/116 [01:07<00:07, 1.55it/s] 91%|█████████▏| 106/116 [01:07<00:06, 1.54it/s] 92%|█████████▏| 107/116 [01:08<00:05, 1.54it/s] 93%|█████████▎| 108/116 [01:09<00:05, 1.54it/s] 94%|█████████▍| 109/116 [01:09<00:04, 1.54it/s] 95%|█████████▍| 110/116 [01:10<00:03, 1.54it/s] 96%|█████████▌| 111/116 [01:11<00:03, 1.54it/s] 97%|█████████▋| 112/116 [01:11<00:02, 1.54it/s] 97%|█████████▋| 113/116 [01:12<00:01, 1.54it/s] 98%|█████████▊| 114/116 [01:13<00:01, 1.54it/s] 99%|█████████▉| 115/116 [01:13<00:00, 1.54it/s]100%|██████████| 116/116 [01:14<00:00, 1.70it/s]100%|██████████| 116/116 [01:14<00:00, 1.56it/s]