Cleanxml #1259

Merged
AngledLuffa merged 5 commits into dev from cleanxml
Mar 20, 2022

Conversation

@AngledLuffa
Contributor

Add cleanxml as an option to the tokenize annotator.

Will do ssplit next

@AngledLuffa AngledLuffa force-pushed the cleanxml branch 3 times, most recently from 7ca5fdc to 1742849 Compare March 16, 2022 21:49
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from aa71022 to e57afc1 Compare March 16, 2022 22:30
@AngledLuffa AngledLuffa force-pushed the cleanxml branch 2 times, most recently from 55595d3 to 4011cfa Compare March 17, 2022 20:00
For the TokenizerAnnotatorITest, this updates the behavior of the test
to reflect that the newlines are now being consumed by the inner ssplit.
@AngledLuffa AngledLuffa merged commit b0d1e46 into dev Mar 20, 2022
@AngledLuffa AngledLuffa deleted the cleanxml branch March 20, 2022 01:42
