testing umlauts: äöüßÄÖÜ

... in the page title, and in the body: äöüßÄÖÜ

Other accents too:áéíóú

RS adds all of the iso8859-1 characters above 160, for even heavier testing:

 160:  ¡¢£¤¥¦§¨©ª«¬��®¯
 176: °±²³´µ¶·¸¹º»¼½¾¿
 192: ÀÁ����ÄÅÆÇÈÉÊËÌÍÎÏ
 208: ÐÑÒÓÔÕÖ×ØÙÚÛÜÝÞß
 224: àáâãäåæçèéêëìíîï
 240: ðñòóôõö÷øùúûüýþÿ

KBK edits the page to see whether the characters get damaged in the round trip.

mjk I can promise, that in this Wiki will visit some browsers, that will convert ISO-8859-1 characters to UTF-8. You can find my little rant from my user information page. Unfortunately, I'm yet unable to identify, which browsers change character encoding (I have tested with Mozilla 1.6, Firefox 0.8 and IE 6.0).

Let's test links: testing umlauts: äöüßÄÖÜ...

rmax I've investigated that UTF-8 conversion problem today (that's why I created this test page). The current state is, that it seems the modification came from within wikit or tclkit, because some pages were converted from iso8859-1 to UTF-8 twice without being edited by anyone.

jcw - I probably caused this by some sloppy pack/rename action over the past several months, though I don't recall having done anything that disruptive. For now, given the way this page seems to hold out, I'm assuming things will not deteriorate further.

This is running Tclkit 8.4.5 (static) on Linux, btw. When launched from the command line, that build tells me the system encoding is iso8859-1.

Best we can do now I guess, is to manually fix whatever turns out to have been mis-encoded (there's CVS history to help figure it out, if needed). If there is ever any further regression, please let me know - that would mean there are still problems. Reading mjk's comments above, I cannot rule out that some browser is causing the incorrect re-encoding, though it cannot explain how things can go wrong with no intermediate edit, or how a page title got mangled. Maybe some page changes never made it to CVS. Hmm... it's not really clear.

When saving this page, it says "page saved <gibberish>", then the modified page comes up and it looks ok again. Hm, encodings are still not 100%. -jcw

I noticed also that gibberish with IE 6.0 but not with Mozilla. Another note: it is unfortunately quite hard to fix some encoding problems by editing the page. For example this page [L1 ] has both in its title and body misencoded characters. One way would be to create a new page with correct title, but I'm reluctant to do that yet. (I just noticed that by looking the source of any of pages in the Wiki will reveal the following meta tag: http-equiv="Content-type" content="text/html; charset=utf-8". Does this have any effect, how the contents of the text area are sent to the Wiki?) --mjk


 Using <alt>xxx where xxx=(num pad keys)
 140: îìÄÅÉæÆôöò
 150: ûùÿÖÜ¢£¥Pt f
 160: áóúñѪº¿NI
 170: ¬½¼¡«».S:S?S|
 180: ++++++++++
 190: ++++++-+++
 200: ++++++++++
 210: +++++++++FB
 ..
 240: ��±=¾¶§÷¸°¨

ö


IE 6.0 but not with Mozilla. Another note: it is unfortunately quite hard to fix some encoding problems by editing the page. For example this page [L2 ] has both in its title and body misencoded characters. One way would be to create a new page with correct title, but I'm reluctant to do that yet. (I just noticed that by looking the source of any of pages in the Wiki will reveal the following meta tag: http-equiv="Content-type" content="text/html; charset=utf-8". Does this have any effect, how the contents of the text area are sent to the Wiki?) --mjk