OASIS Darwin Information Typing Architecture (DITA) TC

 View Only

RE: [dita] hyphens and file names

  • 1.  RE: [dita] hyphens and file names

    Posted 02-18-2005 22:38
     MHonArc v2.5.0b2 -->
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    

    dita message

    [Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


    Subject: RE: [dita] hyphens and file names


    On Fri, 18 Feb 2005, Esrig, Bruce (Bruce) wrote:
    
    > > 5. And then there is the issue of files names that turn
    > into parts of URNs, don�t get me started!
    > 
    >
    > This one was the last straw for me, and the reason I raised the
    > question. I sent someone a request for a URL with a filename in
    > it, and it came out with underscores. It has happened again since.
    > Here's a chance for the DITA TC to do something absurdly small    
    > yet exemplary ... standardize on hyphens only in identifiers.
    
    Bruce, have a look at the earlier (October 25) draft of the 
    "OASIS Artifact Naming Guidelines".  I don't know what you mean
    by "and it came out with underscores" but if you're talking
    about Kavi or the TC Process tools, I'd bet that these draft
    naming guidelines are the culprit, ultimately.  URLs:
    
    http://lists.oasis-open.org/archives/chairs/200410/msg00056.html
    http://lists.oasis-open.org/archives/chairs/200410/doc00003.doc (main doc)
    http://lists.oasis-open.org/archives/chairs/200410/doc00002.doc (diff)
    
    Since this draft has not yet been put out for formal review
    by the membership (only informally to the Chairs), I would hope there's
    a good chance to have some broader input on various topics,
    including the notion of converting hyphens to underscores in the
    construction of a URL for a URN-based artifact name.
    
    I'm unfamiliar with all the computing history around hyphens
    and underscores as name characters (I do recall SGML), but in the
    modern context where filenames get used in URLs, the matter
    of underscore "disappearing" (as Deborah puts it) feels like
    a real problem.  When viewing an address in a typical browser,
    or when sending the HTML document to the printer, an
    troubling ambiguity is introduced: fromm visual, we don't know
    whether the character is SPACE or UNDERSCORE.
    
    There was a famous article about a train wreck and (non) use
    of SGML. I can envision a similar article about the time someone
    lost a big contact because they had only 20 seconds to
    communicate on a cell phone (nearly dead battery), and
    told a business partner to look at *this vital information* at
    "this URL..."  He reads the URL from the paper printout and
    says: 'http COLON [hmmm]-SPACE-[hmmm]-SPACE-[hmmm]-SPACE-[hmmm]-SPACE-
    [hmmm]-SPACE-[hmmm]-SPACE (etc)'... <phone dies> but the URL entered as
    such failed, and so did the bid for the contract. The Web lookup
    fails, and the poor contract writer assumes he made a mistake
    writing down the URL. But he didn't: the real URL did not actually
    contain any SPACE characters.
    
    ;-)
    
    -rcc
    
    
    
    > 
    > > all the rage in the URN and Unix communities
    > 
    > One reason to avoid hyphens in Unix was to avoid confusion with the flag convention ("cmd -flags").
    > 
    > Second, spaces were token delimiters on the command line, so file names couldn't contain spaces, and underscores were the most unobtrusive alternative. At that time, underlining of running text was a novel and underutilized feature that was only supported on certain terminals, so there was no conflict with underscores. And of course, URLs didn't exist yet.
    > 
    > Rubato,
    > 
    > Bruce
    > 
    >