Clemylia commited on
Commit
ada0197
·
verified ·
1 Parent(s): 9979e21

Model save

Browse files
Files changed (2) hide show
  1. README.md +86 -3
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,3 +1,86 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ tags:
4
+ - generated_from_trainer
5
+ model-index:
6
+ - name: Tya
7
+ results: []
8
+ ---
9
+
10
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
+ should probably proofread and complete it, then remove this comment. -->
12
+
13
+ # Tya
14
+
15
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
16
+ It achieves the following results on the evaluation set:
17
+ - Loss: 9.0425
18
+
19
+ ## Model description
20
+
21
+ More information needed
22
+
23
+ ## Intended uses & limitations
24
+
25
+ More information needed
26
+
27
+ ## Training and evaluation data
28
+
29
+ More information needed
30
+
31
+ ## Training procedure
32
+
33
+ ### Training hyperparameters
34
+
35
+ The following hyperparameters were used during training:
36
+ - learning_rate: 5e-05
37
+ - train_batch_size: 4
38
+ - eval_batch_size: 4
39
+ - seed: 42
40
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
41
+ - lr_scheduler_type: linear
42
+ - lr_scheduler_warmup_steps: 50
43
+ - num_epochs: 30
44
+
45
+ ### Training results
46
+
47
+ | Training Loss | Epoch | Step | Validation Loss |
48
+ |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 4 | 10.2476 |
50
+ | No log | 2.0 | 8 | 10.2346 |
51
+ | 10.2242 | 3.0 | 12 | 10.2122 |
52
+ | 10.2242 | 4.0 | 16 | 10.1811 |
53
+ | 10.1914 | 5.0 | 20 | 10.1409 |
54
+ | 10.1914 | 6.0 | 24 | 10.0923 |
55
+ | 10.1914 | 7.0 | 28 | 10.0357 |
56
+ | 10.0766 | 8.0 | 32 | 9.9716 |
57
+ | 10.0766 | 9.0 | 36 | 9.9017 |
58
+ | 9.9563 | 10.0 | 40 | 9.8282 |
59
+ | 9.9563 | 11.0 | 44 | 9.7509 |
60
+ | 9.9563 | 12.0 | 48 | 9.6713 |
61
+ | 9.7542 | 13.0 | 52 | 9.5897 |
62
+ | 9.7542 | 14.0 | 56 | 9.5149 |
63
+ | 9.54 | 15.0 | 60 | 9.4476 |
64
+ | 9.54 | 16.0 | 64 | 9.3876 |
65
+ | 9.54 | 17.0 | 68 | 9.3340 |
66
+ | 9.417 | 18.0 | 72 | 9.2862 |
67
+ | 9.417 | 19.0 | 76 | 9.2429 |
68
+ | 9.2368 | 20.0 | 80 | 9.2049 |
69
+ | 9.2368 | 21.0 | 84 | 9.1715 |
70
+ | 9.2368 | 22.0 | 88 | 9.1426 |
71
+ | 9.1577 | 23.0 | 92 | 9.1179 |
72
+ | 9.1577 | 24.0 | 96 | 9.0972 |
73
+ | 9.1191 | 25.0 | 100 | 9.0803 |
74
+ | 9.1191 | 26.0 | 104 | 9.0667 |
75
+ | 9.1191 | 27.0 | 108 | 9.0563 |
76
+ | 9.0687 | 28.0 | 112 | 9.0488 |
77
+ | 9.0687 | 29.0 | 116 | 9.0443 |
78
+ | 9.021 | 30.0 | 120 | 9.0425 |
79
+
80
+
81
+ ### Framework versions
82
+
83
+ - Transformers 4.57.1
84
+ - Pytorch 2.8.0+cu126
85
+ - Datasets 4.0.0
86
+ - Tokenizers 0.22.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2fe8c3cec720f890d2cda37691f73e18e5eb21fdc28eb8ca8fbd883c2ff9a828
3
  size 18918928
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9c9d896b635804a2bfbd2431157bae1d9b84872235d9b88a735a540161094994
3
  size 18918928