
Nerval

Je suis l'autre

-- Gérard de Nerval

A NER evaluation metric for noisy text, typically used to measure NER performance on HTR predictions.

Usage

After cloning the repository, install the package with:

$ cd nerval
$ pip3 install .

To run the tests and check that everything is fine:

$ cd tests
$ pytest

You can now use Nerval on the command line:

$ nerval -a/--annot <annot_file.bio> -p/--predict <predict-file.bio>

To use the demo files:

$ nerval -a demo/demo_annot.bio -p demo/demo_predict.bio

We also provide a pair of toy annotation and prediction files, which are identical for now and hence produce perfect scores. Feel free to play with the text and entity tags in the prediction file to see the impact on the score.

$ nerval -a demo/toy_test_annot.bio -p demo/toy_test_predict.bio

Metric

This metric uses string alignment at character level.

The automatic transcription is first aligned with the ground truth at character level, by minimising the Levenshtein distance between them. Each entity in the ground truth is then matched with a corresponding entity of the same label in the aligned transcription, or with an empty character string if no match is found. If the edit distance between the two entities is less than 30% of the ground truth entity length, the predicted entity is considered recognised. For the purpose of matching detected entities to existing databases, we estimated that a 70% match between the entity texts was a fair threshold.

Details:

  • From the input BIO files, retrieve the text content and extend the word-level tagging to a character-level tagging
    • spaces are added between each word
    • a space between two words with the same tag gets that tag, otherwise O
    • the information about the beginning of an entity (the B- prefix) is dropped

For instance, the following annotation file:

Tolkien B-PER
was O
a O
writer B-OCC
. O

produces the following list of tags, one per character plus spaces:

['PER','PER','PER','PER','PER','PER','PER',
 'O',
 'O', 'O', 'O',
 'O',
 'O',
 'O',
 'OCC','OCC','OCC','OCC','OCC','OCC',
 'O',
 'O'] 

And the prediction file could be:

Tolkieene B-PER
xas O
writear B-OCC
,. O

producing:

['PER','PER','PER','PER','PER','PER','PER','PER','PER',
 'O',
 'O', 'O', 'O',
 'O',
 'OCC','OCC','OCC','OCC','OCC','OCC','OCC',
 'O',
 'O','O'] 
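This word-to-character expansion can be sketched in a few lines of Python (an illustration of the rules above, not Nerval's actual code; `bio_to_char_tags` is a hypothetical name):

```python
def bio_to_char_tags(bio_lines):
    """Expand word-level BIO tags to one tag per character.

    Words are joined by single spaces; a space between two words
    with the same tag gets that tag, otherwise 'O'. The B-/I-
    prefix is dropped.
    """
    words = []
    for line in bio_lines:
        word, tag = line.split()
        # Drop the B-/I- prefix, keeping only the entity label
        words.append((word, tag if tag == "O" else tag.split("-", 1)[1]))

    tags = []
    for i, (word, tag) in enumerate(words):
        tags.extend([tag] * len(word))
        if i + 1 < len(words):  # tag for the separating space
            tags.append(tag if tag == words[i + 1][1] else "O")
    return tags
```

Run on the two example files above, this reproduces both tag lists.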
  • Character level alignment between annotation and prediction adds '-' characters to both strings so they are the same length

With the following input texts :

annotation : Tolkien was a writer .
prediction : Tolkieene xas writear ,.

the alignment result is:

annotation : Tolkie-n- was a writer- -.
prediction : Tolkieene xas --writear ,. 
  • Adapt the character-level tags to the aligned strings
    • '-' characters in the aligned strings get the same tag as the previous proper character in the string
             PPPPPPPPPOOOOOOOCCCCCCCOOO
annotation : Tolkie-n- was a writer- -.
prediction : Tolkieene xas --writear ,. 
             PPPPPPPPPOOOOOOOCCCCCCCOOO
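This tag propagation over alignment gaps can be sketched as follows (illustrative; `spread_tags` is a hypothetical name): consume one original tag per real character, and repeat the last tag seen on each '-'.

```python
def spread_tags(char_tags, aligned):
    """Spread character-level tags onto an aligned string: each '-'
    inserted by the alignment inherits the tag of the previous real
    character ('O' if the string starts with '-')."""
    out, tags = [], iter(char_tags)
    last = "O"
    for ch in aligned:
        if ch != "-":
            last = next(tags)
        out.append(last)
    return out
```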
  • Search for a matching entity in the prediction for each entity in the annotation
    • Inspecting the annotation character by character, when a new entity tag (not 'O') is encountered, the character is considered the beginning of an entity to be matched.
    • Looking at the character at the same position in the prediction string, if the entity tags match on these two characters, tags are back-tracked in the prediction string to find the beginning of the entity, that is, the first occurrence of said entity tag.
    • Otherwise, if the entity tags don't match on the first character, the beginning of a matching entity is searched for in the prediction, up to the end of the entity in the annotation.
    • Both in the annotation and in the prediction, a detected entity ends with the last occurrence of the tag of its first character.

Here are several example situations, showing the delimitation of the matched entities in each case.

Match delimitations are marked with |

annotation : OOOOOOO|PPPPPPPPPPPPPPPPP|OOOOOO
prediction : OOOO|PPPPPPPPPPP|OOOOOOOOOOOOOOO

annotation : OOOOOOO|PPPPPPPPPPPPPPPPP|OOOOOO
prediction : OOOOOOOOOOOOOO|PPPPPPPPPPPPPP|OO

annotation : OOOOOOO|PPPPPPPPPPPPPPPPP|OOOOOO
prediction : OOOO|PPPPPPPPPPP|OOOOPPPPOOOOOOO

annotation : OOOOOOO|PPPPPPPPPPPPPPPPP|OOOOOO
prediction : OOOOOOO|P|OPPPPPPPPPPPPPPOOOOOOO

annotation : OOOOOOO|PPPPPPPPPPPPPPPPP|OOOOOO
prediction : OOOOOOOOOOOOOOOOOOOOOOOOOOPPPPOO

For this last example, no match is found in the prediction.
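These matching rules can be approximated with the sketch below, which treats each entity as a maximal contiguous run of one tag. This is a simplification of the description above (the function names are hypothetical and the actual implementation may differ), but it reproduces all five situations:

```python
def entity_spans(tags):
    """(start, end, label) for each maximal run of identical non-'O' tags."""
    spans, i = [], 0
    while i < len(tags):
        if tags[i] != "O":
            j = i
            while j + 1 < len(tags) and tags[j + 1] == tags[i]:
                j += 1
            spans.append((i, j, tags[i]))
            i = j + 1
        else:
            i += 1
    return spans


def match_span(ann_span, pred_tags):
    """Find the prediction run matched with an annotation entity, or None.

    Scan the prediction under the annotation span for the entity label,
    then back-track and extend forward to the full run of that label.
    """
    start, end, label = ann_span
    hit = next((k for k in range(start, end + 1) if pred_tags[k] == label), None)
    if hit is None:
        return None  # e.g. the last situation above: no match found
    lo, hi = hit, hit
    while lo > 0 and pred_tags[lo - 1] == label:
        lo -= 1
    while hi + 1 < len(pred_tags) and pred_tags[hi + 1] == label:
        hi += 1
    return (lo, hi, label)
```

On the first situation above, the annotation entity spans positions 7-23 and the matched prediction run spans positions 4-14.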
  • Score the two matched strings:
    • Compute the Levenshtein distance between the two strings, ignoring the '-' characters
    • If edit_distance / length(annotation_entity) < 0.3, the entity is considered recognised
edit_distance("Tolkien", "Tolkieene") = 2
len("Tolkien") = 7
2/7 = 0.29 < 0.3
OK

edit_distance("writer", "writear") = 1
len("writer") = 6
1/6 = 0.17 < 0.3
OK
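This threshold check can be reproduced with a plain Levenshtein implementation (Python's standard library has none; the sketch below uses the classic dynamic-programming recurrence, and `is_recognised` is a hypothetical name):

```python
def edit_distance(a, b):
    """Levenshtein distance via the classic dynamic-programming recurrence."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]


def is_recognised(ann_entity, pred_entity, threshold=0.3):
    """An entity counts as recognised if the edit distance, after stripping
    the alignment '-' characters, is under 30% of the annotation length."""
    a, p = ann_entity.replace("-", ""), pred_entity.replace("-", "")
    return edit_distance(a, p) / len(a) < threshold
```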
  • Final scores (precision, recall and F1-score) are given for each entity type, at entity level. The total ("ALL") is a micro-average across entity types:
PER :
P = 1/1
R = 1/1
F1 = 2*1*1/(1+1)

OCC :
P = 1/1
R = 1/1
F1 = 2*1*1/(1+1)

ALL :
P = 2/2
R = 2/2
F1 = 2*1*1/(1+1)
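The scores above follow the usual entity-level definitions. A minimal sketch (hypothetical helper; "ALL" is obtained by pooling the raw counts across entity types before dividing, which is what makes it a micro-average):

```python
def precision_recall_f1(n_correct, n_predicted, n_annotated):
    """Entity-level precision, recall and F1 from raw counts."""
    p = n_correct / n_predicted if n_predicted else 0.0
    r = n_correct / n_annotated if n_annotated else 0.0
    f1 = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f1


# Micro-average for "ALL": pool (correct, predicted, annotated) counts
counts = {"PER": (1, 1, 1), "OCC": (1, 1, 1)}
all_counts = tuple(map(sum, zip(*counts.values())))  # (2, 2, 2)
```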