Skip to content
Snippets Groups Projects
Commit 052453b9 authored by Solene Tarride's avatar Solene Tarride Committed by Bastien Abadie
Browse files

Add the model trained on HOME-Alcar

parent 5e388f28
No related branches found
No related tags found
1 merge request!4Import model pylaia_alcar from pylaia
# PyLaia Alcar
## Datasets
* Trained on [HOME Alcar](https://demo.arkindex.org/browse/46b9b1f4-baeb-4342-a501-e2f15472a276?top_level=true&folder=true) and [Himanis](https://arkindex.teklia.com/browse/2f6e26b0-5fdd-4193-bb30-a3162b96280c?top_level=true&folder=true).
* Text-lines are resized to a fixed height of 128 pixels.
* The language model is a 6-gram character model trained only on the training set of HOME-Alcar.
| split | N lines Alcar | N lines Himanis | Total |
| ----- | ------------: | --------------: | -----: |
| train | 59,969 | 18,504 | 78,473 |
| val | 7,905 | 2,367 | 10,272 |
| test | 6,932 | 2,241 | 9,173 |
## Results
* Evaluation on the test set of HOME-Alcar
| Split | LM | Evaluation method | CER (%) | WER (%) | Support |
| ----- | --- | ------------------ | -------- | --------- | ------- |
| test | No | basic | 8.35 | 26.15 | 6932 |
| test | No | escape punctuation | 8.35 | 24.6 | 6932 |
| test | Yes | basic | **7.85** | **23.2** | 6932 |
| test | Yes | escape punctuation | **7.85** | **21.76** | 6932 |
\ No newline at end of file
File added
<ctc> <ctc>
! !
& &
# #
' '
( (
) )
* *
+ +
, ,
- -
. .
/ /
0 0
1 1
2 2
3 3
4 4
5 5
6 6
7 7
8 8
9 9
: :
; ;
= =
? ?
A A
B B
C C
D D
E E
F F
G G
H H
I I
J J
K K
L L
M M
N N
O O
P P
Q Q
R R
S S
T T
U U
V V
W W
X X
Y Y
Z Z
[ [
] ]
a a
b b
c c
d d
e e
f f
g g
h h
i i
j j
k k
l l
m m
n n
o o
p p
q q
r r
s s
t t
u u
v v
w w
x x
y y
z z
| |
~ ~
’ ’
© ©
§ §
ª ª
« «
¬ ¬
¯ ¯
° °
¶ ¶
º º
» »
¿ ¿
À À
 Â
à Ã
Ç Ç
É É
Ï Ï
Ü Ü
à à
á á
â â
æ æ
ç ç
è è
é é
ë ë
ì ì
í í
î î
ï ï
ñ ñ
ú ú
ù ù
û û
ÿ ÿ
ę ę
ō ō
œ œ
ȩ ȩ
— —
‘ ‘
’ ’
… …
† †
<unk> <unk>
<space> <space>
File added
<ctc> 0
! 1
& 2
# 3
' 4
( 5
) 6
* 7
+ 8
, 9
- 10
. 11
/ 12
0 13
1 14
2 15
3 16
4 17
5 18
6 19
7 20
8 21
9 22
: 23
; 24
= 25
? 26
A 27
B 28
C 29
D 30
E 31
F 32
G 33
H 34
I 35
J 36
K 37
L 38
M 39
N 40
O 41
P 42
Q 43
R 44
S 45
T 46
U 47
V 48
W 49
X 50
Y 51
Z 52
[ 53
] 54
a 55
b 56
c 57
d 58
e 59
f 60
g 61
h 62
i 63
j 64
k 65
l 66
m 67
n 68
o 69
p 70
q 71
r 72
s 73
t 74
u 75
v 76
w 77
x 78
y 79
z 80
| 81
~ 82
’ 83
© 84
§ 85
ª 86
« 87
¬ 88
¯ 89
° 90
¶ 91
º 92
» 93
¿ 94
À 95
 96
à 97
Ç 98
É 99
Ï 100
Ü 101
à 102
á 103
â 104
æ 105
ç 106
è 107
é 108
ë 109
ì 110
í 111
î 112
ï 113
ñ 114
ú 115
ù 116
û 117
ÿ 118
ę 119
ō 120
œ 121
ȩ 122
— 123
‘ 124
’ 125
… 126
† 127
<unk> 128
<space> 129
<ctc>
!
&
#
'
(
)
*
+
,
-
.
/
0
1
2
3
4
5
6
7
8
9
:
;
=
?
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
[
]
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
y
z
|
~
’
©
§
ª
«
¬
¯
°
º
»
¿
À
Â
Ã
Ç
É
Ï
Ü
à
á
â
æ
ç
è
é
ë
ì
í
î
ï
ñ
ú
ù
û
ÿ
ę
ō
œ
ȩ
<unk>
<space>
File added
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment