Given the FoLiA document below (also attached as correctie.xml.txt):
FoLiA-2text can extract the corrected text ("één."), but there seems to be no way to extract the original text ("een.").
The naive attempt, FoLiA-2text --class original, doesn't work.
So how can the original text be extracted?
@proycon same question for folia2txt from FoLiApy.
<?xml version="1.0" encoding="UTF-8"?>
<FoLiA xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://ilk.uvt.nl/folia" xml:id="bug" generator="libfolia-v2.4" version="2.5">
  <metadata type="native">
    <annotations>
      <text-annotation set="https://raw.githubusercontent.com/proycon/folia/master/setdefinitions/text.foliaset.ttl"/>
      <token-annotation/>
      <sentence-annotation/>
      <paragraph-annotation/>
      <correction-annotation set="folia-correct">
        <annotator processor="FoLiA-correct.1"/>
      </correction-annotation>
    </annotations>
    <provenance>
      <processor xml:id="FoLiA-correct.1" begindatetime="2020-01-06T12:08:30" command="FoLiA-correct --punct=punct.punct --unk=unk.unk --rank=rank.ranked --clear --inputclass=Test --ngram=3 -v -v " folia_version="2.2.1" host="bonus" name="FoLiA-correct" user="sloot" version="0.14">
        <processor xml:id="FoLiA-correct.1.generator" folia_version="2.2.1" name="libfolia" type="generator" version="2.4"/>
      </processor>
    </provenance>
  </metadata>
  <text xml:id="text">
    <p xml:id="p1">
      <s xml:id="s1">
        <correction xml:id="cor.1" set="folia-correct">
          <new>
            <w xml:id="w3.cor">
              <t>één.</t>
            </w>
          </new>
          <original auth="no">
            <w xml:id="w3">
              <t>een.</t>
            </w>
          </original>
        </correction>
      </s>
    </p>
  </text>
</gration-placeholder-removed>
</FoLiA>
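To make the desired behaviour concrete, here is a minimal sketch of a possible stopgap, assuming only Python's standard library (xml.etree.ElementTree). It does not use FoLiA-2text, folia2txt, or FoLiApy, and it is not how those tools work internally. The names extract_text, use_original, and SAMPLE are hypothetical, and SAMPLE is a trimmed copy of the document above with the metadata left out. The sketch simply drops one branch of every correction and joins the remaining <t> values with spaces, so it ignores suggestions, text classes, and spacing information.

```python
import xml.etree.ElementTree as ET

# Default namespace declared on the <FoLiA> root element of the document.
FOLIA_NS = "http://ilk.uvt.nl/folia"

# Trimmed copy of the document from this issue (metadata omitted).
SAMPLE = """<FoLiA xmlns="http://ilk.uvt.nl/folia" xml:id="bug" version="2.5">
  <text xml:id="text">
    <p xml:id="p1">
      <s xml:id="s1">
        <correction xml:id="cor.1" set="folia-correct">
          <new><w xml:id="w3.cor"><t>één.</t></w></new>
          <original auth="no"><w xml:id="w3"><t>een.</t></w></original>
        </correction>
      </s>
    </p>
  </text>
</FoLiA>"""


def extract_text(folia_xml, use_original=False):
    """Return the text of all <t> elements, keeping one branch per correction.

    Hypothetical helper: with use_original=False the <original> branches are
    skipped (corrected text); with use_original=True the <new> branches are
    skipped (original text).
    """
    root = ET.fromstring(folia_xml)
    unwanted = "new" if use_original else "original"

    # Collect every element that lives inside an unwanted correction branch.
    skipped = set()
    for correction in root.iter(f"{{{FOLIA_NS}}}correction"):
        for branch in correction.findall(f"{{{FOLIA_NS}}}{unwanted}"):
            skipped.update(branch.iter())

    # Keep the text of every remaining <t>, in document order.
    texts = [
        t.text
        for t in root.iter(f"{{{FOLIA_NS}}}t")
        if t not in skipped and t.text
    ]
    return " ".join(texts)
```

Calling extract_text(SAMPLE) gives the corrected reading and extract_text(SAMPLE, use_original=True) gives the original one, which is the behaviour I would expect from something like a --class original option.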