0.18.0: broken HTML file (charset declaration *wrong*) - override possibility??

Hello,

As reported on _lynx-dev_:
[[BUG] [DOCS] broken HTML file (charset declaration *wrong*) - override possibility?? (ASSUME_CHARSET areas etc.)](https://lists.nongnu.org/archive/html/lynx-dev/2026-01/msg00000.html),
I have an _HTML_ file (_Microsoft_-originating data; "Microsoft Word 15") with a wrong charset declaration (`iso-8859-1`):
the document body actually contains _UTF-8_ code units (as can be seen directly in
the transport-side _quoted-printable_ encoding, where `=C3=BC` and `=C3=9F` are the UTF-8 byte sequences for `ü` and `ß`:
```
<p class=3D"MsoNormal" style=3D"margin-bottom:0cm;line-height:normal"><span=
 style=3D"color:#003C74;mso-fareast-language:DE">Viele Gr=C3=BC=C3=9Fe,
<o:p></o:p></span></p>
```
).
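
To double-check the claim about the byte values, here is a minimal Python sketch that undoes only the quoted-printable layer of the greeting line and then interprets the resulting bytes both ways. The variable names are illustrative, and only the one fragment from the excerpt above is used.

```python
import quopri

# Fragment copied from the quoted-printable excerpt above.
qp_fragment = b"Viele Gr=C3=BC=C3=9Fe,"

# Undo the transport encoding only; this yields the raw document bytes.
raw = quopri.decodestring(qp_fragment)

as_utf8 = raw.decode("utf-8")         # what the bytes actually are
as_latin1 = raw.decode("iso-8859-1")  # what the charset declaration claims

print(repr(raw))
print(repr(as_utf8))
print(repr(as_latin1))
```

Only the UTF-8 interpretation yields the intended greeting; the `iso-8859-1` interpretation turns each two-byte sequence into two unrelated characters.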

`links -dump test.mre.html`
(at least version 0.18.0, i.e. older than _HEAD_)

will display glorious [Mojibake](https://medium.com/@thomas.lamiraud/mojibake-when-encoding-goes-wrong-0958d0631883)-laden

>    Hallo Herr Mustermann,
>     
>    vielen Dank fÃ¼r Ihre Meldung. Hiermit bestÃ¤tigt.

output.
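
For reference, the garbled words above are exactly what one gets when UTF-8 bytes are read as `iso-8859-1`. The following Python sketch reproduces that, assuming the dump simply trusts the declared charset (I have not checked the actual code path); the helper names `mojibake` and `repair` are made up for illustration.

```python
def mojibake(text: str) -> str:
    """Encode as UTF-8, then misread the bytes as ISO-8859-1."""
    return text.encode("utf-8").decode("iso-8859-1")


def repair(garbled: str) -> str:
    """Reverse the misreading; only valid if the text really was UTF-8."""
    return garbled.encode("iso-8859-1").decode("utf-8")


for word in ("für", "bestätigt"):
    garbled = mojibake(word)
    print(repr(word), "->", repr(garbled), "->", repr(repair(garbled)))
```

The garbled forms match the dump output quoted above, which supports the diagnosis that the body is UTF-8 and only the declaration is wrong.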

I then frantically tried to
override the declared charset, via

`links -dump -dump-charset UTF-8 test.mre.html`

This did not work.

(The `-dump-charset` option _does_ seem to be parsed, since
an invalid value such as `ATF-8` properly causes an
`ELinks: Cannot parse option ATF-8: Read error`
error report.)

Thus, I suspect that
_[e]links_ has
the same kind of limitation that
_lynx_ has (overriding a broken encoding declaration is not possible).

...or it might just be that the
`-dump-charset` option _is_ intended to
handle this, yet its implementation is simply
broken at present.

This issue should be easy to reproduce independently, by
taking a properly _UTF-8_-encoded _HTML_ file (containing
non-_ASCII_ characters such as _umlauts_) and changing it to
declare the `iso-8859-1` charset.
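
A minimal Python sketch for generating such a test file follows. The file name `mislabeled.html`, the function name, and the exact `<meta>` form are assumptions for illustration; the original Word-generated file may declare its charset slightly differently.

```python
from pathlib import Path


def build_mislabeled_html() -> bytes:
    """Return a small HTML document whose bytes are UTF-8 but whose
    <meta> declaration (an assumed, typical form) claims iso-8859-1."""
    html = (
        "<!DOCTYPE html>\n"
        "<html>\n<head>\n"
        '<meta http-equiv="Content-Type" '
        'content="text/html; charset=iso-8859-1">\n'
        "<title>charset mismatch test</title>\n"
        "</head>\n<body>\n"
        "<p>Viele Grüße, für, bestätigt</p>\n"
        "</body>\n</html>\n"
    )
    # Deliberately encode as UTF-8 despite the declaration above.
    return html.encode("utf-8")


if __name__ == "__main__":
    # Hypothetical output name; adjust as needed.
    Path("mislabeled.html").write_bytes(build_mislabeled_html())
```

Running `links -dump mislabeled.html` on the result should then show whether the Mojibake appears with this file as well.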

Thank you!!

0.18.0: broken HTML file (charset declaration wrong) - override possibility?? #417
