| JavaWoman | that's valid UTF-8 - but it' s HTML |
| rmah | I love the confusion that character encoding causes |
| phroggy | well yeh. |
| hobbs | actually it's got nothing to do with UTF-8 |
| rindolf | phroggy: actually it's SGML. |
| mauke | ’ has nothing to do with utf-8 |
| hobbs | the encoding is HTML ;) |
| phroggy | rindolf: right |
| JavaWoman | rindolf, I can see your message though, hebrew and all - does it display correctly rtl though? |
| rindolf | JavaWoman: well, the Hebew letters are in the correct order, but they are aligned to the left. |
| hobbs | it says "this is character 8217", not "this is 0xe2 0x80 0x99" |
| killer_ | Randal: ok.. thanks for your help, my problem has been that this javascript form POSTs 2 params and all the info is in the second var which is two lines that have to be split on \n $var[0] is the params, and $var[1] is the values. so CGI.pm wasn't my problem, |
| hobbs | which is fortunate, as a matter of fact :) |
| JavaWoman | ’ is valid UTF-8 because it's a valid ASCII string; it' s also a entity reference which in (X)HTML context will be interpreted as a Unicode character |
| mauke | haha JavaWoman wins |
| phroggy | right |
| rindolf | JavaWoman++ |
| JavaWoman | rindolf, just so I umnderstand you: it's properly rtl, only left-aligned instead of right-aligned? |
| hobbs | but it should still be decoded |
| phroggy | hobbs: not necessarily! |
| rindolf | JavaWoman: yes, exactly. |
| JavaWoman | hobbs, it will be decoded in a browser - but not in (plain-text) email, thats' the problem |
| phroggy | hobbs: if the browser expects plain text, why should it interpret ’ as a single Unicode character? |
| hobbs | JavaWoman: aaagh. |
| JavaWoman | rindolf, thanks - that's not too bad then :) but it cuold be better |
| hobbs | phroggy: it shouldn't |
| rindolf | JavaWoman: there are a bunch of characters for controlling BiDi in Unicode - LRM, RLM, LRO, RLO, etc. |
| hobbs | phroggy: and we're not talking about a browser either. |
| phroggy | right, sorry. e-mail client. MUA. |
| rindolf | JavaWoman: but Unicode does not have a way to specify right alignment or left-alignment. |
| hobbs | phroggy: when I say "it should be decoded" I don't mean "the MUA should decode it", I mean "Javawoman should decode it before sending" ;) |
| somian | Have you Vimmers all seen http://www.vim.org/scripts/script.php?script_id=1808 ? |
| rindolf | What most text editors is align according to the beginning of the line. |
| somian | It *looks* really neat. |
| JavaWoman | rindolf, I'll have to ind out how the browser could tell what's needed - I think that's *language* rather than *character set* |
| rindolf | somian: nice. |
| hobbs | somian: interesting |