MHonArc v2.5.0b2 -->
dita message
[Date Prev]
| [Thread Prev]
| [Thread Next]
| [Date Next]
--
[Date Index]
| [Thread Index]
| [List Home]
Subject: RE: [dita] hyphens and file names
On Fri, 18 Feb 2005, Esrig, Bruce (Bruce) wrote:
> > 5. And then there is the issue of files names that turn
> into parts of URNs, don�t get me started!
>
>
> This one was the last straw for me, and the reason I raised the
> question. I sent someone a request for a URL with a filename in
> it, and it came out with underscores. It has happened again since.
> Here's a chance for the DITA TC to do something absurdly small
> yet exemplary ... standardize on hyphens only in identifiers.
Bruce, have a look at the earlier (October 25) draft of the
"OASIS Artifact Naming Guidelines". I don't know what you mean
by "and it came out with underscores" but if you're talking
about Kavi or the TC Process tools, I'd bet that these draft
naming guidelines are the culprit, ultimately. URLs:
http://lists.oasis-open.org/archives/chairs/200410/msg00056.html
http://lists.oasis-open.org/archives/chairs/200410/doc00003.doc (main doc)
http://lists.oasis-open.org/archives/chairs/200410/doc00002.doc (diff)
Since this draft has not yet been put out for formal review
by the membership (only informally to the Chairs), I would hope there's
a good chance to have some broader input on various topics,
including the notion of converting hyphens to underscores in the
construction of a URL for a URN-based artifact name.
I'm unfamiliar with all the computing history around hyphens
and underscores as name characters (I do recall SGML), but in the
modern context where filenames get used in URLs, the matter
of underscore "disappearing" (as Deborah puts it) feels like
a real problem. When viewing an address in a typical browser,
or when sending the HTML document to the printer, an
troubling ambiguity is introduced: fromm visual, we don't know
whether the character is SPACE or UNDERSCORE.
There was a famous article about a train wreck and (non) use
of SGML. I can envision a similar article about the time someone
lost a big contact because they had only 20 seconds to
communicate on a cell phone (nearly dead battery), and
told a business partner to look at *this vital information* at
"this URL..." He reads the URL from the paper printout and
says: 'http COLON [hmmm]-SPACE-[hmmm]-SPACE-[hmmm]-SPACE-[hmmm]-SPACE-
[hmmm]-SPACE-[hmmm]-SPACE (etc)'... <phone dies> but the URL entered as
such failed, and so did the bid for the contract. The Web lookup
fails, and the poor contract writer assumes he made a mistake
writing down the URL. But he didn't: the real URL did not actually
contain any SPACE characters.
;-)
-rcc
>
> > all the rage in the URN and Unix communities
>
> One reason to avoid hyphens in Unix was to avoid confusion with the flag convention ("cmd -flags").
>
> Second, spaces were token delimiters on the command line, so file names couldn't contain spaces, and underscores were the most unobtrusive alternative. At that time, underlining of running text was a novel and underutilized feature that was only supported on certain terminals, so there was no conflict with underscores. And of course, URLs didn't exist yet.
>
> Rubato,
>
> Bruce
>
>