Originally posted by stefanero
no sorry, no iso whatsoever!
I made them utf-8 on perspose, since its the most standart on most distros anyways
Why? Where do you know from that this is the correct thing to do?
UTF-8 is indeed the most widely used character set in Linux distros today, but not for man pages. As you can see above, umlauts in UTF-8 man pages are garbage in an UTF-8 environment, whereas and ISO-8859-1 man pages work fine. Man pages are not locale specific.
Using an UTF-8 locale e.g. means that the file system is in UTF-8, but not that every file needs to be converted to UTF-8. Example: .desktop files always have to be UTF-8 in any(!) locale and European man pages always have to be ISO-8859-1 in any(!) locale, whereas HTML can be in any encoding because the encoding is stored in the documents themselves.
Originally posted by stefanero
also its more commen to all systems all over the world. also all po strings are utf8 aswell.
Man pages have nothing to do with .po files. That's a completely different thing. There it's correct to use UTF-8.
Originally posted by stefanero
so if you want, you can edit the rest to utf8, not "fix" the right once
Please read the following thread:
http://mail.nl.linux.org/linux-utf8/2005-07/msg00004.htmlEspecially the following post:
http://mail.nl.linux.org/linux-utf8/2005-07/msg00009.htmlOriginally posted by Kry
There are manpages in a lot of encodings. "man" is not enforced to iso8859-1, or japanese people will be really fucked up, if you know what I mean.
See here:
http://mail.nl.linux.org/linux-utf8/2005-07/msg00009.htmlJapanese man pages are not UTF-8 either. They are EUC-JP. Japanese man pages in UTF-8 are unreadable in any locale, because /usr/bin/man expects EUC-JP input and treats every input as EUC-JP, even if it's UTF-8. The result is garbage.
If you misunderstood what I said: man is indeed not limited to ISO-8859-1, but this doesn't mean that it understands UTF-8. It doesn't. It understands only one specific character set per language, and this is accidentally ISO-8859-1 for all languages you have man pages for.
If you read the linked thread carefully, you will see that the linux man page maintainers want to support UTF-8 man pages in the future, but they don't even know how to do it. Probably they will do it by introducing a special tag in UTF-8 man pages like in HTML so that the man pager knows how to handle them, but currently it doesn't.
BTW it's not that I would "have to" convince anyone. It's no problem for me to convert them myself.