Why do some threads have so many weird characters in them?
Mike Irwin
April 30, 2006, 12:10 AM
This thread in particular...
http://thefiringline.com/forums/showthread.php?t=205313
I'm seeing a LOT of odd characters... TM symbols, pound sterling marks, copyright symbols, Euro symbols...
Things like this...
I’d
“aw-chitâ€
Denny’s
and so on...
In some cases entire words are virtually obliterated by strange characters.
I've also noticed this in older threads (as in years old) but the post above is laden with them.
Is it something others are seeing, or is it something going on on my end?
Glock 31
April 30, 2006, 12:34 AM
I see it too. Either the poster thinks it makes him look mysterious and cool (maybe mysteriously retarded) or there is an error in the way Internet Explorer (or whatever people use) translated the post. Depends on the poster's settings I guess.:cool:
gfen
April 30, 2006, 12:37 AM
Munged escape codes or extended ASCII... I bet Erick wrote that in an offline editor then cut and pasted it into the post.
UniversalFrost
April 30, 2006, 08:16 AM
In my signature the last quote suddenly started appearing with the crazy characters you mentioned. I originally typed it with the "" marks, but somehow it ended up looking screwy. Changed it back to "" the other day and it hasn't reverted to the screwy characters.
Al Norris
April 30, 2006, 09:10 AM
Quite often, when an article is cut from a website, then pasted into something like MS Word (which preserves much of the "style" of the original doc), where the person then includes their own comments, then cuts the entire thing and pastes it onto the posting box, you will see such characters.
Much better to use a simple text editor like Notepad for such operations than a full-blown word processor.
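For anyone who does paste from a word processor, here is a small sketch of swapping the typographic punctuation for plain ASCII before posting. The mapping and the function name are hypothetical and the list is incomplete; it is not something the forum software is known to do.

```python
# Hypothetical mapping of common "smart" punctuation to plain ASCII.
# Word processors insert many more characters than these; extend as needed.
ASCII_PUNCTUATION = str.maketrans({
    "\u2018": "'",    # left single quote
    "\u2019": "'",    # right single quote / curly apostrophe
    "\u201c": '"',    # left double quote
    "\u201d": '"',    # right double quote
    "\u2013": "-",    # en dash
    "\u2014": "--",   # long dash
    "\u2026": "...",  # ellipsis
})


def to_plain_ascii_punctuation(text: str) -> str:
    """Replace the mapped typographic characters; leave everything else alone."""
    return text.translate(ASCII_PUNCTUATION)


print(to_plain_ascii_punctuation("\u201caw-chit\u201d at Denny\u2019s"))
```

Text cleaned this way contains only characters below 128, so it should survive a character set mix-up unchanged.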
Mal H
April 30, 2006, 09:14 PM
I think tyme probably has a better answer, but I believe a font got switched around a few weeks ago leaving a strange set of characters for apostrophes, etc. in the database.
tyme
May 1, 2006, 10:22 AM
That's Unicode.
Once upon a time, there was One True Character Set, which contained 256 characters, one byte each. All standard letters (without accents/marks) were below 128. Fancy Latin characters with extra marks, pound symbols, etc. resided between 128 and 255.
So, after various people with non-Latin characters in their languages started using computers, a solution was necessary. That solution was UTF-8.
Bytes below 128 mean the same characters as usual.
A run of bytes at 128 or above is part of a single multi-byte character... in English writing, the usual suspects might be a long dash or fancy quotes or a curly single quote or a copyright/trademark symbol or something of that sort. Software that reads those bytes with the old one-byte-per-character rule shows each byte as its own odd character.
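Here is a minimal sketch of how that goes wrong, assuming the wrong character set was Windows-1252 (cp1252); the thread does not say which one was actually involved. The function name is made up for illustration. It encodes text as UTF-8 and then decodes the same bytes one at a time as cp1252, which turns a curly apostrophe into the three characters quoted in the first post.

```python
# Assumption: the "wrong" character set is Windows-1252 (cp1252).
def misread_utf8_as_cp1252(text: str) -> str:
    """Encode as UTF-8, then decode each byte as a separate cp1252 character."""
    # errors="replace" is needed because cp1252 leaves a few byte values
    # (0x9D among them) undefined.
    return text.encode("utf-8").decode("cp1252", errors="replace")


apostrophe = "\u2019"  # curly apostrophe, as typed by a word processor

# One character becomes three bytes, all of them 128 or above.
print(apostrophe.encode("utf-8"))  # b'\xe2\x80\x99'

# Plain ASCII passes through untouched; the apostrophe turns into three characters.
print(misread_utf8_as_cp1252("I" + apostrophe + "d"))  # I’d
```

The closing curly double quote (U+201D) ends in byte 0x9D, which cp1252 does not define, so it comes out as two odd characters plus a replacement mark. That may be why the closing quote in the "aw-chit" example looks cut short.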
There was a glitch a few weeks ago where the database was converted between character sets or something.
How many of these posts have you noticed? It only happens on posts that contained high-ASCII characters (generally not a good idea on the web) in the first place.
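If the damage really was UTF-8 read as Windows-1252 (an assumption, not something confirmed in this thread), it can often be undone by running the round trip in reverse. This is a rough sketch with a hypothetical function name, not a description of how the database was or should be fixed.

```python
# Assumption: the text was UTF-8 that got decoded as Windows-1252 (cp1252).
def repair_mojibake(text: str) -> str:
    """Try to reverse a UTF-8-read-as-cp1252 mix-up; return text unchanged on failure."""
    try:
        # Turn each odd character back into its original byte,
        # then read the byte sequence as UTF-8.
        return text.encode("cp1252").decode("utf-8")
    except UnicodeError:
        # Not representable in cp1252, or the bytes are not valid UTF-8:
        # the text was probably not damaged this way, so leave it alone.
        return text


garbled = "Denny\u00e2\u20ac\u2122s"  # the "Denny" example from the first post
print(repair_mojibake(garbled))
```

Posts that only ever contained characters below 128 come back unchanged, which fits the observation that only posts with high-ASCII characters were affected. A character whose byte was lost in the conversion (such as the closing curly quote) cannot be recovered this way; the function just returns the text as it found it.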
Mike Irwin
May 1, 2006, 02:55 PM
It's only a small percentage of the posts, and it seems to primarily affect posts that are older, although I have seen it in new posts, as well.
tyme
May 2, 2006, 08:06 PM
UniversalFrost, can you estimate when your signature got corrupted?
UniversalFrost
May 9, 2006, 09:18 PM
About the middle of April. Also, I have noticed it recently in a couple of other guys' signatures that weren't affected previously.