将命名实体识别格式从ENAMEX更改为CoNLL

问题描述 投票:0回答:1

我有一个像这样的ENAMEX格式的数据集:

<ENAMEX TYPE="LOCATION">Italy</ENAMEX>'s business world was rocked by the announcement <TIMEX TYPE="DATE">last Thursday</TIMEX> that Mr. <ENAMEX TYPE=„PERSON">Verdi</ENAMEX> would leave his job as vicepresident of <ENAMEX TYPE="ORGANIZATION">Music Masters of Milan, Inc</ENAMEX> to become operations director of <ENAMEX TYPE="ORGANIZATION">Arthur Andersen</ENAMEX>.

我想将其更改为CoNLL格式:

Italy  LOCATION
's  O
business O
world  O
was  O
rocked  O
by  O
the  O
announcement  O
last  DATE
Thursday  DATE
...
.  O

我该怎么做?是否有用于这种格式转换的标准脚本?

named-entity-recognition conll
1个回答
0
投票
我写了一个虽然没有经过严格测试的自己为我工作,here
© www.soinside.com 2019 - 2024. All rights reserved.