Domain: fileformat.info
Stories and comments across the archive that link to fileformat.info.
Comments · 113
-
Re:Possibly less emoticons?
If this mean less emoticons then I'm all for it.
Can I get an amen?
Never mind. Slashdot doesn't support UTF-8. We'll have to settle for A-U+2642.
-
And where's Tengwar?
They've got symbols for a love hotel, a horse, and a steaming pile of poo, along with emoticons, and they still haven't accepted the Tengwar draft that's been around since '93? Where are these people's priorities!?
-
And where's Tengwar?
They've got symbols for a love hotel, a horse, and a steaming pile of poo, along with emoticons, and they still haven't accepted the Tengwar draft that's been around since '93? Where are these people's priorities!?
-
Re:Favourite unicode character
Here, a punch card glyph. Not quite what I expected but still...
http://www.fileformat.info/info/unicode/char/5361/index.htmThere is also a card index glyph do?
http://www.fileformat.info/info/unicode/char/1f4c7/index.htmThere might not be a punchcard glyph, but there is a minidisk one:
http://www.fileformat.info/info/unicode/char/1f4bd/index.htmand an optical disk one:
http://www.fileformat.info/info/unicode/char/1f4bf/index.htmand a DVD one:
http://www.fileformat.info/info/unicode/char/1f4c0/index.htmI cannot imagine how this can ever be used in a useful manner, instead of being simply an irrelevant gimmick. Does anyone know why this stuff found its way into the standard?
-
Re:Favourite unicode character
Here, a punch card glyph. Not quite what I expected but still...
http://www.fileformat.info/info/unicode/char/5361/index.htmThere is also a card index glyph do?
http://www.fileformat.info/info/unicode/char/1f4c7/index.htmThere might not be a punchcard glyph, but there is a minidisk one:
http://www.fileformat.info/info/unicode/char/1f4bd/index.htmand an optical disk one:
http://www.fileformat.info/info/unicode/char/1f4bf/index.htmand a DVD one:
http://www.fileformat.info/info/unicode/char/1f4c0/index.htmI cannot imagine how this can ever be used in a useful manner, instead of being simply an irrelevant gimmick. Does anyone know why this stuff found its way into the standard?
-
Re:Favourite unicode character
Here, a punch card glyph. Not quite what I expected but still...
http://www.fileformat.info/info/unicode/char/5361/index.htmThere is also a card index glyph do?
http://www.fileformat.info/info/unicode/char/1f4c7/index.htmThere might not be a punchcard glyph, but there is a minidisk one:
http://www.fileformat.info/info/unicode/char/1f4bd/index.htmand an optical disk one:
http://www.fileformat.info/info/unicode/char/1f4bf/index.htmand a DVD one:
http://www.fileformat.info/info/unicode/char/1f4c0/index.htmI cannot imagine how this can ever be used in a useful manner, instead of being simply an irrelevant gimmick. Does anyone know why this stuff found its way into the standard?
-
Re:Favourite unicode character
Here, a punch card glyph. Not quite what I expected but still...
http://www.fileformat.info/info/unicode/char/5361/index.htmThere is also a card index glyph do?
http://www.fileformat.info/info/unicode/char/1f4c7/index.htmThere might not be a punchcard glyph, but there is a minidisk one:
http://www.fileformat.info/info/unicode/char/1f4bd/index.htmand an optical disk one:
http://www.fileformat.info/info/unicode/char/1f4bf/index.htmand a DVD one:
http://www.fileformat.info/info/unicode/char/1f4c0/index.htmI cannot imagine how this can ever be used in a useful manner, instead of being simply an irrelevant gimmick. Does anyone know why this stuff found its way into the standard?
-
Re:Favourite unicode character
Here, a punch card glyph. Not quite what I expected but still...
http://www.fileformat.info/info/unicode/char/5361/index.htmThere is also a card index glyph do?
http://www.fileformat.info/info/unicode/char/1f4c7/index.htmThere might not be a punchcard glyph, but there is a minidisk one:
http://www.fileformat.info/info/unicode/char/1f4bd/index.htmand an optical disk one:
http://www.fileformat.info/info/unicode/char/1f4bf/index.htmand a DVD one:
http://www.fileformat.info/info/unicode/char/1f4c0/index.htmI cannot imagine how this can ever be used in a useful manner, instead of being simply an irrelevant gimmick. Does anyone know why this stuff found its way into the standard?
-
Re:emoticons?
There's also a carrousel horse.
-
Re:Favourite unicode character
For the "under 30" group: cloud
It even has an error character too: thundercloud -
Re:Favourite unicode character
For the "under 30" group: cloud
It even has an error character too: thundercloud -
Re:Favourite unicode character
As if http://www.fileformat.info/info/unicode/char/1f4be/index.htm makes sense to anyone under age 30. I demand the addition of a punchcard glyph...
-
Re:Stick to ASCII
Yeah but can you write a pile of poo in ASCII?
http://www.fileformat.info/info/unicode/char/1f4a9/index.htm
-
Favourite unicode character
has got to be the Love Hotel.
Does anyone know why this is even there?
-
Re:70% on fully updated installs.
An exe is not just arbitrary code. It has a header which contains information about the executable and it can contain entirely different sections for execution under Windows/DOS. A DOS executable header begins with the characters "MZ". Windows executables are denoted by "PE". Most Windows applications will have a DOS MZ executable stub at the beginning, before the PE section, to display "This program cannot be run in DOS mode." and exit.
http://www.fileformat.info/format/exe/corion-mz.htm
Now, if you were talking about a DOS
.com compact executable, yes. Nothing but pure arbitrary code. -
U+01D0 LATIN SMALL LETTER I WITH CARON
The closest thing in Unicode to an i with heart is U+01D0 "LATIN SMALL LETTER I WITH CARON", which appears not to be on Slashdot's code point whitelist.
-
Technical citations
Pay close attention to the symbol names in the index entries part of the table.
The musical sharp sign:
http://www.fileformat.info/info/unicode/char/266f/index.htm
The "horizontal" lines rise to the right to indicate the rise of tone. The second vertical line is drawn to a higher baseline, also to indicate the rising tone.
Now...
The pound sign, aka the octothorpe or number sign:
http://www.fileformat.info/info/unicode/char/23/index.htm
The two "vertical" lines are usually rendered leaning right (but pure vertical is okay). The horizontal lines are truely horizontal and are drawn to the same right-hand and left-hand bounds.
Now: "#" that isn't a "sharp", its a "pound", or number sign, or octothorp or cross hatch. Period.
So this: "C#" is "see pound"...
Sorry, its just the facts man.
(And yes, its a sad fact that looks like a troll or a flame-bait, but that isn't technically my fault. Talk to the "wince" people... 8-)
-
Technical citations
Pay close attention to the symbol names in the index entries part of the table.
The musical sharp sign:
http://www.fileformat.info/info/unicode/char/266f/index.htm
The "horizontal" lines rise to the right to indicate the rise of tone. The second vertical line is drawn to a higher baseline, also to indicate the rising tone.
Now...
The pound sign, aka the octothorpe or number sign:
http://www.fileformat.info/info/unicode/char/23/index.htm
The two "vertical" lines are usually rendered leaning right (but pure vertical is okay). The horizontal lines are truely horizontal and are drawn to the same right-hand and left-hand bounds.
Now: "#" that isn't a "sharp", its a "pound", or number sign, or octothorp or cross hatch. Period.
So this: "C#" is "see pound"...
Sorry, its just the facts man.
(And yes, its a sad fact that looks like a troll or a flame-bait, but that isn't technically my fault. Talk to the "wince" people... 8-)
-
Re:Is it the "see pound" thing?
If you think I am wrong about "C#" being read "see pound" please check the unicode symbol definitions:
A pound/octothorp/number sign has horizontal lines a two (usually slanting) vertical lines drawn from the same baseline:
http://www.fileformat.info/info/unicode/char/23/index.htm
vs.
A musical sharp sign has two vertical lines, the second drawn higher from the baseline than the first, the "horizontal" lines incline. All of this is intended to indicate the half-tone rise of the notes so designated.
http://www.fileformat.info/info/unicode/char/266f/index.htm
You will node the "Index Entries" in both.
The name is "see pound"... sorry, just a fact.
-
Re:Is it the "see pound" thing?
If you think I am wrong about "C#" being read "see pound" please check the unicode symbol definitions:
A pound/octothorp/number sign has horizontal lines a two (usually slanting) vertical lines drawn from the same baseline:
http://www.fileformat.info/info/unicode/char/23/index.htm
vs.
A musical sharp sign has two vertical lines, the second drawn higher from the baseline than the first, the "horizontal" lines incline. All of this is intended to indicate the half-tone rise of the notes so designated.
http://www.fileformat.info/info/unicode/char/266f/index.htm
You will node the "Index Entries" in both.
The name is "see pound"... sorry, just a fact.
-
Re:Shame Really...
I am, and was, fully aware of the usage and intent that "C-#" was supposed to be "see sharp" as in music.
If you examine a sharp sign, it is two horizontal lines and two vertical lines, the _second_ vertical line is _higher_ than the first, in indication that the tone is supposed to be higher than the actual note by a half-tone. The "sharp" indication, which really only exists in the musical clef, is _distinct_ from this symbol "#".
In the octothorp, there are two horizontal lines, and two vertical lines. The vertical lines are the same height and drawn from the same baseline. They are usually slanted such that there are no right angles.
In no case have you ever seen "C#" written as "see sharp". That leaves "see number-sign, see-octothorpe" or "see-pound".
There is a unicode symbol for "sharp", it is \u266f, as opposed to 0x23, the "pound sign".
Seriously, there is a _huge_ difference in symbolism.
It's not _my_ fault that the same people who named a mobile platform "wince", named a language "see pound" while failing to be clever.
http://www.fileformat.info/info/unicode/char/23/index.htm
http://www.fileformat.info/info/unicode/char/266f/index.htm
Intentions only take you so far when you make an epic screw-up. They named it something, but it wasn't "see sharp". 8-)
-
Re:Shame Really...
I am, and was, fully aware of the usage and intent that "C-#" was supposed to be "see sharp" as in music.
If you examine a sharp sign, it is two horizontal lines and two vertical lines, the _second_ vertical line is _higher_ than the first, in indication that the tone is supposed to be higher than the actual note by a half-tone. The "sharp" indication, which really only exists in the musical clef, is _distinct_ from this symbol "#".
In the octothorp, there are two horizontal lines, and two vertical lines. The vertical lines are the same height and drawn from the same baseline. They are usually slanted such that there are no right angles.
In no case have you ever seen "C#" written as "see sharp". That leaves "see number-sign, see-octothorpe" or "see-pound".
There is a unicode symbol for "sharp", it is \u266f, as opposed to 0x23, the "pound sign".
Seriously, there is a _huge_ difference in symbolism.
It's not _my_ fault that the same people who named a mobile platform "wince", named a language "see pound" while failing to be clever.
http://www.fileformat.info/info/unicode/char/23/index.htm
http://www.fileformat.info/info/unicode/char/266f/index.htm
Intentions only take you so far when you make an epic screw-up. They named it something, but it wasn't "see sharp". 8-)
-
Re:Haskell
If your language actually uses the character U+23C7 "DENTISTRY SYMBOL LIGHT DOWN AND HORIZONTAL WITH WAVE" as an operator, your editor will let you type it with a simple keyboard combination, like Compose-T-~. If you're using U.S. Windows and have to resort to Alt+numbers to type things, you're silly.
-
Re:They don't deny it!
The recycling symbol, Unicode code point U+2672, has three arrows. These stand for, *in order of importance*,
- reduce
- reuse
- recycleSo you're right that recycling is not the most favourable option. However, it's still better energy cost than option (4): BAU, just dump it somewhere in a lake until this overflows.
For example, processing bauxite into aluminium like that factory in Hungary costs oodles of energy (for the electrolysis). Melting and recycling used drink cans to reprocess the aluminium costs a lot of energy as well, but probably 1 to 2 orders of magnitude LESS than making the original stuff. So this is an example where you're wrong. Of course it depends on the material.
When you talk about "mining a landfill", I hope you understand what this really looks like in practice.. photos
-
Re:So, I guess now
This code point is not in the subset of Unicode accepted by Slashdot's anti-crapflood filter.
Confirmed by my best efforts in the first reply to this silly story way up top. Tried
HTML Entity (decimal) & # 8 4 8 2;
HTML Entity (hex) & # x 2 1 2 2;
HTML Entity (named) & trade;
and several others from the http://fileformat.info/... page, but none areaccepted by Slashdot's anti-crapflood filter
per tepples.
-
Re:So, I guess now
I believe you need to use the actual tm symbol not just (tm).
-
Re:The 21st century
-
Re:Character in question
I believe they actually meant U+00A1 (hex for 161):
http://www.fileformat.info/info/unicode/char/00a1/inverted_exclamation_mark.png
-
Character in question
-
Re:Unicode does take its time...
wrong. it already does exist http://www.fileformat.info/info/unicode/char/20a8/index.htm
-
Re:Unicode does take its time...Maybe I'm misunderstanding...
Unicode should already support it: http://www.fileformat.info/info/unicode/char/20a8/index.htm
The delay seems to me to be in getting everybody to update their fonts. That's what I was referring to when I said I didn't know how you could plan for it.
-
Re:Isnt there already a Rupee sign?
Here’s a link to those who want to see how it looks: http://www.fileformat.info/info/unicode/char/20a8/index.htm
I hope they simply replace the glyph. -
Two options
* Use U+20A8 RUPEE SIGN. It looks different, but means exactly the same: Indian rupees. Just replace the glyph in new fonts.
* Fake it. Use U+0930 DEVANAGARI LETTER RA, from which the rupee symbol is derived, plus a combining bar, e.g. U+0304, U+0334, U+0335 -- whatever looks best. -
Re:They're right!
Goatse?
Or since Slashdot can't render utf-8,
http://www.fileformat.info/info/unicode/char/272a/index.htm -
Re:Not bullet-izable
Or, in close quarters, a dagger.
Someday
/. will support Unicode. In the meantime... -
Re:Hashing Works
Yes, I have a similar mental hash, although it is more complicated and so the password is longer. It makes sure that no two sites have the same password, so no one can get into my e-mail, say, just because they have found my Slashdot password. They take too long to type in, though, so I let Firefox remember them. Firefox protects them all with one master password that I enter once per session. In turn, my entire home directory (including the Firefox profile) is on a TrueCrypt partition (protected by a completely different passphrase). Incidentally, any sensitive files are encrypted with GPG (with a completely different, long passphrase) before being stored on the TrueCrypt partition for good measure.
If you are worried that your mental hash is easily crackable (e.g. you use "SDpass" for Slashdot, "FBpass" for Facebook... haha, OK that's an exaggeration), then obfuscate it further by using a real hash. Run "SDpass" through md5sum, and you get "6809ec345ad1a2b72f9f8a6e3f96266b". "FBpass" becomes "5b128c5443f4467dfdd4553c3f9a6733". It is not realistically possible for anyone to see any connection between the two. Should you find yourself on a computer lacking md5sum, you could use online services such as http://www.fileformat.info/tool/hash.htm in order to get the hash. (The paranoid will obviously want to do so only in an emergency, as it will be sent over the Web in plaintext, although nobody will have any reason to think it is a password.)
Since md5sum output is limited to the characters 0123456789abcdef, you may want to manually add a few more fixed characters (such as "#@S|-|") to the final product. That way no one can get access, even if they see you generating the hash.
-
Re:Yes kill it
That's why we have U+2764 in Unicode. Though good luck trying to put that in a username.
-
Re:Oh really?
Then why are we allowed to say copulation and feces on TV?
Because as the objectionable alternatives consist of only one syllable, you can be offended faster than you can reasonably prepare. (Or that they're so short, even a cavechild can repeat them.)
But funny how you can work them into syllables of other words and they become acceptable:
...Balzac was a writer
He lived with Allen Funt
Mrs. Roberts didn't like him
But that's 'cause she's a contaminated water
Can really make you sick...But even bare I'd argue that, when rhymed in poetry, you have just as much if not more advance warning.
Is there only the one word with fuhk (or fŭk which slashdot won't render (Latin small letter u with breve), literal or entified) as a syllable in English (with proper pronunciation) that isn't a derivation of the single-syllable word? Does anyone have a greppable phonetic unabridged dictionary?
-
Re:Loaded question
And then there's stuff like medieval transcriptions. How can I post a good transcription of a Middle English romance without the characters thorn, eth, yogh, and wynn? Some of those are available in standard fonts, especially thorn and eth, but yogh and wynn are a lot harder to come by
You're doing it wrong. Both yogh and wynn have unicode code points. They work just fine here.
-
Re:Loaded question
And then there's stuff like medieval transcriptions. How can I post a good transcription of a Middle English romance without the characters thorn, eth, yogh, and wynn? Some of those are available in standard fonts, especially thorn and eth, but yogh and wynn are a lot harder to come by
You're doing it wrong. Both yogh and wynn have unicode code points. They work just fine here.
-
Re:Serious issue!
Excelent. Finally someone who can tell me just what the bloody hell this sign is:
That's my shift+4. Google doesn't even seem to know what the hell it is. It looks a little bit like a bomb as seen from behind - main fusulage in the middle and four fins, but surely you aren't suggesting we bomb the manufacturers?That character which didn't make it through
/.'s silly character filter presumambly is the generic currency character (U+00A4 in the Latin 1 Supplement table of Unicode). You can create it in HTML with ¤ It isn't very widely used. -
Re:syncronicity [sic]
Another fun fact: the diacritical mark on the s is called the "caron" or "hacek".
-
Re:4 hours commuting a day...
Googles...
Ohhh, now I get it.
I hope you don't mind if I post this:
http://www.fileformat.info/info/unicode/char/2665/index.htm
To save some time for those of us who have not memorized unicode yet. At least you aren't being ethnocentric and making ascii only jokes. -
Re:Does it bring back the "Windows Shade"?
... as compared to the rest of the world, who's keyboards have a special key called "Print Screen" perhaps ? I have to point out that the "Print Screen" button does not, in fact, print the screen (in Windows). It copies it to the clipboard (equivalent to Cmd-Shift-Ctrl-3 on a Mac). There is no shortcut to print the screen, although on the Mac there used to be. Any shortcut that requires 4 fingers to use it is not worth the time lost in trying to put the correct finger on the correct key. So don't use the shortcut. Open /Applications/Utilities/Grab and use that. What next, 4 fingers AND a click on one of the mouse's 27 buttons ??? How VERY intuitive :-) My laptop trackpad has only one button, and I like it that way. :-)
By the way, the reason Cmd-Shift-3 is used to save a screen shot on the Mac is, Apple set up Cmd-Shift-(number) as a standard shortcut to run some piece of piece of code called an FKEY. This was before Macintosh keyboards had function keys (and quite possibly before they had Control keys). Cmd-Shift-1 and Cmd-Shift-2 were implemented internally, but Cmd-Shift-3 through Cmd-Shift-9 were available to run whatever FKEY you wanted; FKEYs with IDs 3 and 4 were included by default, but you could modify those with ResEdit, and add new ones if you wanted. As of System 7.1 the cleanest way to add new FKEYs was to add them to an empty font suitcase and drop it in the Fonts folder.
The Fonts folder is probably my favorite crazy Apple hack: all resources (not just fonts) contained in font files and font suitcases in the Fonts folder were loaded as if they were in the System file. Previously, all fonts had to be copied into the System file, which the System 7.0 Finder would open as if it were a folder (thus eliminating the need for the Font/DA Mover application). As of System 7.1, only sounds and keyboard layouts were still displayed within the System file in the Finder (I don't remember if it would let you copy a font there or not).
Ridiculous Mac trivia: in System 7 through Mac OS 9, the only time the Finder will ever display a "Move" dialog box with a progress bar (as opposed to a "Copy" dialog while copying a file) is when moving resources into or out of a suitcase. That's because there is no way to move a file between volumes, and moving a file within the same volume doesn't take a significant amount of time because the data on the disk doesn't need to be moved. In Mac OS X, you can hold down the Command key to move a file between volumes (Mac OS X 10.5 Leopard had a nasty bug in this feature, which fortunately not many people ever use).
On-topic trivia: prior to System 7, Cmd-Shift-3 saved screen shots in MacPaint format, which does not support color. If you had a color monitor, you'd end up with a black-and-white screen shot (not dithered). In System 7, the format was changed to PICT. I believe early versions of Mac OS X saved to TIFF, and current versions use PNG. -
Re:mathmlWhoops, pressed submit instead of preview.
That symbol that didn't show up is supposed to be U+221A.
The entire equation can be written asx=(-b\pm \sqrt (b^2-4ac) )/2a
With that spacing (and a space at the end).
Now, MS Word has its own multitude of problems, but I must say that I *greatly* prefer the "Unicode Nearly Plain-Text Encoding of Mathematics" input method, as opposed to the {}-hell from TeX. -
Re:No, it's *not* MoolenaarAccording to the ever informative,interesting and insightfulwikipedia
The spoof band Spnal Tap raised the stakes in 1984 by using an umlaut over the letter N, a consonant (it also makes use of a dotless i). This is a construction found only in the Jacaltec language of Guatemala and in some orthographies of Malagasy, although it is uncertain whether the writers of This Is Spinal Tap knew this at the time.
Note, this post has informed you aboout the Jacaltec language of Guatemala and (if you view the HTML source) the uses of U+308 aka ̈ the combining diaresis -
Re:Recursion?I may be misunderstanding your comment about recursion, but the regexp you gave
((\d+)\.)+
matches exactly what one would expect - more than one digit followed by a period at least once.
You can verify that Java regexps support this at http://www.fileformat.info/tool/regex.htm and similarly so does Perl (unsuprisingly since Java regexps extremely close to Perl 5 in functionality and feature support). -
Re:Sounds great...
-
Re:Speaking WPM != Chars Per Minute
-
Re:Speaking WPM != Chars Per Minute