Fix various type conversion warnings in liblzma by rzikm · Pull Request #208 · tukaani-project/xz

rzikm · 2026-02-12T15:03:12Z

This PR adds explicit conversions in order not to trigger warnings when liblzma is is compiled as a dependency to other projects.

…2_t conversions

…rsions

__cpuid from MSVC (and also the old Intel ICC) use int[], while __get_cpuid from <cpuid.h> in GCC and Clang use unsigned int[]. Adding a cast is the simplest fix. Link: #208

Larhzu · 2026-02-24T20:07:58Z

Thanks! I have been aware (most) of these warnings for a long time. With GCC and Clang, -Wconversion is noisy because of these, thus -Wconversion cannot be used by default at the moment.

In a few cases there might be a better way than just silencing the warning with a cast (I'm thinking especially the uint32_t to size_t warnings). In some other cases I have a mixed feeling because the casts also affect readability. Also, if something in the code is changed later, a cast that is currently correct might not be correct later.

Perhaps I should go through all those sooner than later. :-/ Meanwhile, I wonder if it could make sense to add flags to silence the warnings with MSVC. I added a commit to the misc branch. Let me know what you think.

Fixing the __cpuid warning a clear case though. I added it to the misc branch.

By the way, MSVC is supported in sense that the code builds and runs, but the inline assembly needs GCC or Clang (clang-cl works). LZMA SDK, on which XZ Utils is based on, has MSVC compatible assembly code. The APIs are quite different though.

rzikm · 2026-02-24T20:47:38Z

In a few cases there might be a better way than just silencing the warning with a cast (I'm thinking especially the uint32_t to size_t warnings). In some other cases I have a mixed feeling because the casts also affect readability. Also, if something in the code is changed later, a cast that is currently correct might not be correct later.

The goal was to silence the warnings with minimal risk of affecting functionality. If you give me some guidance about better way to achieve that, I can give it another go.

Meanwhile, I wonder if it could make sense to add flags to silence the warnings with MSVC.

We have some internal policies which dictate which warnings must be enabled during build, and those that this PR fixes are on the list. We could silence these ourselves in our CMakeLists, but that would get eventually flagged by some compliance tool.

__cpuid from MSVC (and also the old Intel ICC) use int[], while __get_cpuid from <cpuid.h> in GCC and Clang use unsigned int[]. Adding a cast is the simplest fix. Link: #208

Larhzu · 2026-03-29T19:02:31Z

The goal was to silence the warnings with minimal risk of affecting functionality. If you give me some guidance about better way to achieve that, I can give it another go.

This is an usual situation because the changes themselves might be close to what I would do after I've gone through the warnings myself again (I say "again" because I think I looked at them many years ago). What matters is the thinking behind the changes, but in this case it's very difficult to communicate it. :-/

This PR addresses one MSVC warning category per commit. That itself is perfectly fine. But in the big picture it has a smell that the changes were done with a focus to make the compiler quiet. That is, there's a feeling that it was assumed that current behavior of the code is certainly correct. My feeling might be wrong, and that's the trouble: I don't know how this detail could be communicated very well in commit messages or comments.

In this case there might not be anything that you can do better. I simply would need to review these warnings myself. (They are approximately a subset of GCC's -Wconversion.)

We have some internal policies which dictate which warnings must be enabled during build, and those that this PR fixes are on the list. We could silence these ourselves in our CMakeLists, but that would get eventually flagged by some compliance tool.

Currently xz's CMakeLists.txt doesn't enable warnings with MSVC. I'm considering to add

/W4 /wd4100 /wd4127 /wd4200 /wd4244 /wd4267 /wd4996

via CMakeLists.txt. That seems fairly quiet when I test in CI on GitHub. C4242 isn't in /W4 so I don't plan to add /wd4242.

I understand that this might violate your internal policies. I also understand that those warnings can be a sign of bugs, so I understand why such policies might exist. However, silencing those warnings by adding casts isn't a real solution unless one has analyzed that the casts really are the best way (I'm mostly thinking of those size_t vs. uint32_t casts; the uint8_t casts don't have any alternatives). A wrong cast can hide or introduce a bug.

Let's leave this issue open for now. I might look at it later and possibly at least reduce the warnings to some extent later, but it's not a high priority.

Larhzu · 2026-03-29T19:13:24Z

src/liblzma/common/filter_flags_decoder.c


 	// Filter Properties
-	if (in_size - *in_pos < props_size)
+	if (in_size - *in_pos < (size_t)props_size)


I don't understand why a compiler could warn here. If size_t is 32-bit, compiler should promote in_size - *in_pos to lzma_vli (alias uint64_t) and then do the comparison.

Note that props_size might not fit into 32-bit size_t. This can happen in corrupt or malicious files only. If the condition is false (we don't return LZMA_DATA_ERROR), only then it's known that props_size fits into size_t.

That is, adding this cast would introduce a bug.

Larhzu · 2026-03-29T19:16:40Z

src/liblzma/common/index.c

 {
 	// Catch a somewhat theoretical integer overflow.
-	if (src->record_count > PREALLOC_MAX)
+	if (src->record_count > (lzma_vli)PREALLOC_MAX)


I don't understand why compiler could warn here either. PREALLOC_MAX is size_t and record_count is lzma_vli alias uint64_t. Compiler should widen the 32-bit size_t constant to uint64_t without an explicit cast.

Larhzu · 2026-03-29T19:25:40Z

src/liblzma/common/file_info.c

-				- coder->footer_flags.backward_size
-				>= seek_amount) {
+						- (size_t)coder->footer_flags.backward_size
+						>= (size_t)seek_amount) {


seek_amount might not fit into 32-bit size_t. Truncating it with an explicit cast might add a bug; I didn't analyze it properly now.

Like in the other cases I commented, I don't understand why a compiler would warn here.

Larhzu · 2026-03-29T19:33:32Z

src/liblzma/lzma/lzma2_encoder.c

 	// "Compressed" size
-	coder->buf[1] = (coder->uncompressed_size - 1) >> 8;
+	coder->buf[1] = (uint8_t)((coder->uncompressed_size - 1) >> 8);
 	coder->buf[2] = (coder->uncompressed_size - 1) & 0xFF;


Here the smell of "just fix the warnings" is stronger. A cast is added for the buf[1] assignment while the buf[2] assignment is left without a cast. I assume that the & 0xFF at the end takes care of silencing the warning. The code is correct but it has an inconsistent feeling.

The original style might be a bit odd too though. It attempts to say that for the buf[1] assignment there value being assigned will fit into uint8_t (uncompressed_size - 1 fits into 16 bits), and thus no & 0xFF or cast is present. For buf[2] there is a cast to show the upper bits are intentionally thrown a way.

rzikm added 5 commits February 12, 2026 15:24

Fix C4057 warning in liblzma: cast uint32_t* to int* for __cpuid

d905b36

Fix C4267 warnings in liblzma: add explicit casts for size_t to uint3…

2b8caf9

…2_t conversions

Fix C4242 warnings in liblzma: add explicit casts for narrowing conve…

4316ee7

…rsions

Fix C4244 warnings in liblzma: add explicit casts for data loss conve…

6b941b8

…rsions

Fix additional MSVC warnings for x86 platform

b626ccd

Larhzu reviewed Mar 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix various type conversion warnings in liblzma#208

Fix various type conversion warnings in liblzma#208
rzikm wants to merge 5 commits intotukaani-project:masterfrom
rzikm:fix-msvc-warnings

rzikm commented Feb 12, 2026

Uh oh!

Larhzu commented Feb 24, 2026

Uh oh!

rzikm commented Feb 24, 2026

Uh oh!

Larhzu commented Mar 29, 2026

Uh oh!

Larhzu Mar 29, 2026

Uh oh!

Larhzu Mar 29, 2026

Uh oh!

Larhzu Mar 29, 2026

Uh oh!

Larhzu Mar 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

rzikm commented Feb 12, 2026

Uh oh!

Larhzu commented Feb 24, 2026

Uh oh!

rzikm commented Feb 24, 2026

Uh oh!

Larhzu commented Mar 29, 2026

Uh oh!

Larhzu Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Larhzu Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Larhzu Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Larhzu Mar 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants