Skip to content

cneud/page-to-text

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

page-to-text

Extracts the text from a PAGE file and writes it to stdout.

Note that this tool does not consider ReadingOrder if available in the PAGE-XML, but instead writes output based of the order in the XML tree.

Use like:

python page_to_text.py <page-xml-file>

About

extract text from PAGE file

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages