mshow: unknown charset of extracted attachments #248

ashiire · 2023-08-18T19:17:05Z

When extracting attachments via mshow's -x or -O flags, any present charset information is lost. This is troublesome, as charset information is difficult to infer from file contents alone.

mshow does explicitly mention attachment charset in render mode if a filter is used to render the attachment, like so:
--- --- --- 3: text/plain size=235 charset="iso-8859-2" render="mshow-plaintext" ---
However, this seems like the wrong (and inconvenient) place to recover the information from.

I have two ideas that might help:

(Add an option to) explicitly state charset information in list mode, if available.
Add an option to automatically re-encode extracted attachments to UTF-8, same as in render mode.

I think that either one would be sufficient on its own, but both may be desirable.

The text was updated successfully, but these errors were encountered:

leahneukirchen · 2023-08-21T15:47:13Z

I guess the second part wouldn't be too hard to add, as all the steps are implemented already.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mshow: unknown charset of extracted attachments #248

mshow: unknown charset of extracted attachments #248

ashiire commented Aug 18, 2023 •

edited

Loading

leahneukirchen commented Aug 21, 2023

mshow: unknown charset of extracted attachments #248

mshow: unknown charset of extracted attachments #248

Comments

ashiire commented Aug 18, 2023 • edited Loading

leahneukirchen commented Aug 21, 2023

ashiire commented Aug 18, 2023 •

edited

Loading