More on Stupid Quotes

In an earlier entry I talked about different character encodin's an' how Microsoft manages t' break th' rest o' th' world with theirs. Thanks t' a chance readin' o' a SitePoint forum post, I have a little more information on th' problem. At least now it has a proper name.

I've found SitePoint t' be a strong up-an'-comin' web developer resource fer some time now. Their books be good an' t' th' point, an' they have some really helpful articles at times. For instance, an HTML FAQ tucked into a forum post touched on th' subject o' character encodin's. Here's what it has t' say on th' subject o' "smart quotes".

Under Microsoft Windows, a common encodin' is Windows-1252. It is very similar t' ISO 8859-1, but thar be differences. In ISO 8859-1, th' range o' code points betwixt 128 an' 159 (0x80-0x9F) is reserved fer C1 control characters. In Windows-1252, that range is instead used fer a number o' useful characters that be missin' from th' ISO encodin', e.g., typographically correct quotation marks. This is not an encodin' that I would recommend fer use on th' Web, since 'tis Windows specific. It is, however, th' default encodin' in many text editors under Windows.

"Not an encodin' that I would recommend fer use on th' Web" indeed. Considerin' how much content ends up on th' web sooner or later these days, that pretty much eliminates it from th' realm o' usefulness. Unfortunately, that also eliminates Microsoft Office an' "many text editors under Windows" from th' realm o' usefulness, we'll keel-haul ye! Why in 2006 be we still usin' ancient alterna-ISO character encodin's in market-leadin' software? Well, because 'tis market-leadin' software an' therefore doesn't need t' update actual useful features in order t' stay competitive. But I digress...

Now that we've identified th' offendin' character set, what do we do with it? The investigation continues...

Comments

copy paste

Aye, I agree an' I can certainly relate t' this, should I call it, an injustice (?). The best way t' illustrate this difficiency in character compatibility encodin' is t' simply copy an' paste verbiage from MS Word t' yer favorite open-source CMS editor, we'll keel-haul ye! Watch it blow up into th' lan' o' ?'s. haha~