Fix colorspace error when form XObject contains a transparency group #1088

Merged: BobLd merged 5 commits into master from fix-corpus-0000355-file on Jul 19, 2025

EliotJones (Member) commented on Jul 17, 2025

When a form XObject contains a reference to a group XObject, this can only be used to change attributes of the transparency imaging model. The old code was incorrectly setting the main colorspaces, which caused errors when the transparency component had a different number of channels. This was causing #1071 in addition to the failure in file 0000355.pdf of the test corpus.
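
To illustrate the distinction described above, here is a minimal sketch in Python. It is not the code changed in this PR: every name in it (`GraphicsState`, `enter_form_buggy`, `enter_form_fixed`, `set_fill_color`) is hypothetical, and the form dictionary is modelled as a plain `dict`. It assumes the failure shows up as a mismatch between the number of color operands and the channel count of the current colorspace.

```python
from dataclasses import dataclass, replace
from typing import Optional

# Channel counts for the device colorspaces, as defined by the PDF specification.
CHANNELS = {"DeviceGray": 1, "DeviceRGB": 3, "DeviceCMYK": 4}


@dataclass(frozen=True)
class GraphicsState:
    # Hypothetical field names, chosen for illustration only.
    stroking_cs: str = "DeviceGray"
    non_stroking_cs: str = "DeviceGray"
    transparency_group_cs: Optional[str] = None  # blending colorspace only


def _transparency_group_cs(form: dict) -> Optional[str]:
    """Return the /CS of the form's transparency group, if it has one."""
    group = form.get("Group")
    if group and group.get("S") == "Transparency":
        return group.get("CS")
    return None


def enter_form_buggy(state: GraphicsState, form: dict) -> GraphicsState:
    """Old behaviour as described: the group's /CS replaces the main colorspaces."""
    cs = _transparency_group_cs(form)
    if cs is not None:
        return replace(state, stroking_cs=cs, non_stroking_cs=cs)
    return state


def enter_form_fixed(state: GraphicsState, form: dict) -> GraphicsState:
    """Fixed behaviour: the group's /CS only affects the transparency model."""
    cs = _transparency_group_cs(form)
    if cs is not None:
        return replace(state, transparency_group_cs=cs)
    return state


def set_fill_color(state: GraphicsState, components: list) -> tuple:
    """Check the operand count against the current non-stroking colorspace."""
    expected = CHANNELS[state.non_stroking_cs]
    if len(components) != expected:
        raise ValueError(f"expected {expected} components, got {len(components)}")
    return tuple(components)


# The page is drawing in CMYK; the form's transparency group blends in RGB.
page_state = GraphicsState(stroking_cs="DeviceCMYK", non_stroking_cs="DeviceCMYK")
form = {"Group": {"S": "Transparency", "CS": "DeviceRGB"}}

fixed_state = enter_form_fixed(page_state, form)
set_fill_color(fixed_state, [0, 0, 0, 1])  # four CMYK operands are accepted
```

With `enter_form_buggy`, the same four-operand call raises `ValueError`, because the inherited CMYK colorspace has been overwritten by the group's three-channel RGB colorspace.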

0000355.pdf

EliotJones requested a review from BobLd on July 17, 2025 23:42

EliotJones (Member, Author) commented:

With this change we can successfully parse the first 1000 files of the sample corpus 🎉

The exceptions are 3 non-PDF files:

  • 0000399.pdf
  • 0000819.pdf
  • 0000920.pdf

and 1 password-protected file:

  • 0000300.pdf

BobLd merged commit e3388ec into master on Jul 19, 2025
2 checks passed
BobLd deleted the fix-corpus-0000355-file branch on July 19, 2025 10:47

BobLd (Collaborator) commented on Jul 19, 2025

Note to self: investigate the colorspace (CS) issue based on the documents.
