
Explicitly use utf-8 encoding #10

Open
Stonesjtu wants to merge 1 commit into mlcommons:master from Stonesjtu:patch-1


@Stonesjtu

Since it's supposed to deal with non-ASCII characters, it's better to make sure the text file is opened with 'utf-8' encoding.

It fixes my problem when post-processing the corpus.
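
For illustration, here is a minimal sketch of the kind of change described above, assuming the code is Python 3 and reads the corpus with the built-in `open()`. The helper names `read_lines` and `write_lines` are hypothetical and are not taken from the patched file. Without an explicit `encoding`, `open()` falls back to a locale-dependent default, which can fail or mis-decode non-ASCII text on some systems.

```python
def read_lines(path):
    """Read a text file as UTF-8 and return its lines without trailing newlines."""
    # Hypothetical helper: passing encoding="utf-8" makes decoding independent
    # of the platform's locale default.
    with open(path, "r", encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f]


def write_lines(path, lines):
    """Write the given lines to a text file as UTF-8, one per line."""
    # Hypothetical helper: use the same explicit encoding on the write side
    # so that a later read round-trips cleanly.
    with open(path, "w", encoding="utf-8") as f:
        for line in lines:
            f.write(line + "\n")
```

With both sides pinned to UTF-8, a line such as `"naïve café"` should read back unchanged regardless of the system locale.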

