Sign In/My Account | View Cart  
advertisement


Listen Print Discuss

XML, one has to admit, has been pretty successful. Despite having a sufficient quantity of annoyances to merit a dedicated column on these pages, XML has powered applications almost anywhere--anywhere except the web, if recent murmurings are an indication.

Douglas Crockford, summarizing his talk and resulting hallway conversations at the XML 2006 conference, mentions numerous voices proclaiming that XML on the Web is dead. Some accept this statement, some insist that XML is the one metalangauge to rule them all, and others say, "It still has a role on the server. If we go around saying it's dead, people might start looking for better alternatives." This isn't the first time XML has been declared dead on the Web: back in 2004, Mark Pilgrim made a similar proclamation.

One factor neglected in those statements, however, is the mobile-centric web, where various modularization-based variants of XHTML have quietly lived up to their original premise. Individual browsers vary widely in quality of implementation, but the language itself, including core concepts of strict well-formedness, distinct layering (yeah, I'm talking about you, document.write), and straightforward usability over the Web are alive and healthy in mobile. In that environment, XHTML continues steady advancement, marching over the ashes of WML.

Outside of mobile, though, things look different. It would be a gross exaggeration to say that XHTML was overtaking HTML in practice on the general web. Why the difference? One possible answer is that mobile is a better fit for early adopters; the advantages inherent in XML provide the most bang-for-the-buck there. Folks outside of mobile fit into more of a laggard profile, but eventually they'll come around to the obviously better way. After all, JavaScript has strict syntax rules, with correspondingly harsh consequences for failing to meet them, all of which hasn't slowed down Ajax and similar client-side developments.

It's interesting to ponder why the mobile corners of the Web have bought into XHTML faster than the rest of the Web. Comments from readers are welcome at the end of this article. But the second question remains: does the acceptance of JavaScript mean a victory for the "draconian" error-handling proponents, including strict well-formedness? I think not. There are many ways to write bad pages, but a properly-constructed page will continue to function in some capacity in the event of a script error (or a similar case of a browser with script disabled). A web page with a catastrophic JavaScript error is still a web page, not an error message. In fact, over time, browser manufacturers have made script errors less and less visible, pushing them back into a hidden error console.

So scripting takes an interesting middle ground between draconian processing--the page must be well-formed, else an error message--and chaotic tag soup. Effectively, instead of simply labeling various conditions as a fatal error, and instead of leaving everything wide open for conflicting implementation interpretation, the combination of markup+script defines a processing model that attempts to continue in the face of inevitable errors. This turns out to work much better on the Web. Instead of saying that XML on the Web is dead, it might be more accurate to say that well-formedness on the Web is dead. (Again, with mobile somehow holding out as an exception).

Which brings us back around to the discussion about SGML simplification 2.0. There are plenty of ways to think about XML, but from a historical bent, it's a derivative--a simplification as originally conceived--of SGML. But even before XML was written down, the seeds of a simplified subset of SGML were already sown in a backwater language known as HTML. HTML 2.0, for instance, claims that "HTML documents are SGML documents with generic semantics that are appropriate for representing information from a wide range of domains." In practice, no widely-used browser ever processed HTML via a SGML parser, though several online validators did and perhaps continue to do so, piling on to the confusion. No, in practice, every widely-used browser has a custom hacked parser, that is neither XML nor SGML, but something less elegant and more battle-scarred.

Browser manufacturers were designing and implementing these quirky parsing techniques in parallel, and separately from the design and implementation of XML. (As a quick recap, the first draft of HTML came out in 1993, the first beta of Netscape Navigator in 1994, HTML 2.0 in 1995, the first written draft of XML in 1996, and XML 1.0 in early 1998, in the heat of the browser wars.) As a result, never the twain met. Browser vendors in particular paid attention to all the stupid, incoherent, and counterproductive ways of unstructured authors that rapidly copied each other's questionable habits through the magic of view source. If anything, XML railed against sloppy practice, enshrining even stricter rules, putting it even farther from existing practice on the Web.

So interestingly, the seeds of XML 2.0 were sown, watered, and germinating even before XML 1.0 was a twinkle in Tim Bray's eye. As mentioned previously in the column, Tim Berners-Lee has blogged about a restart of the work for HTML, influenced largely by the WHAT WG, an invitation-only group of browser vendors and interested parties. This will involve messy syntax discussions. Effectively, this is attempt 2.0 at simplifying SGML rules; call it XML2 with the same unofficial naming scheme as "HTML5".

This important point bears repeating: XML2, whatever it is ultimately called, is already under development, has been for some time, and is on its way to being an official W3C Recommendation track document.

This effort, with web-scale implications, has received relatively little attention. In part, this is because the syntax specification isn't a separate document, as is the case with XML, but rather incorporated as one section of a broader spec. This raises some interesting questions.

Is HTML on the Web a special case? Is it possible to have a single general-purpose syntax used for HTML and other things? Or to put it another way, given a language capable of expressing HTML semantics in a way somewhat compatible with present practice on the Web, would that language be useful in other contexts, like publishing, content management, or information exchange? If the answer is yes, the Web would be better served with XML2 embodied as a separate, standalone specification. Imagine if XML had been first specified as a subsection of XHTML. Would it have achieved as much? The hallmark of a well-designed technology is that it gets used in a way the originators didn't envision. This is unlikely to happen with XML2 if the group working on it comes from too narrow a background. Fortunately, people are starting to notice. For example, Sam Ruby has been blogging about his experiences submitting syntax comments, and, as a result, is eliminating some of the arbitrary incompatibility between XML and XML2.

According to TimBL's blog, the new Working Groups will pursue parallel development along HTML and XHTML tracks. This will create further tensions. One thing emphatically rejected by the new SGML-simplifiers is namespace processing, discussed previously. It's difficult to see at this point how far this will get. For example, what issues will arise in an attempt to embed SVG (which in turn embeds XLink) into HTML? Will the entire XML toolset need to be forked for HTML use? What about namespace-centric technologies like XML Schema or XQuery? The group has some difficult decisions ahead.

Likely, as I write this, the charters for the new HTML work are being worked on in a W3C members-only area. If you are associated with a W3C member organization and care about XML, now would be the best time to make your opinions known. Speak now or forever hold your annoyance. Or at least until XML3.

HTML & XHTML: The Definitive Guide

Related Reading

HTML & XHTML: The Definitive Guide
By Chuck Musciano, Bill Kennedy


Comment on this articleShare your opinion in our forum.
(* You must be a
member of XML.com to use this feature.)
Comment on this Article


Titles Only Titles Only Newest First
  • Los Angeles Locksmith Safe & Vault Sales and Installation 24/7 Locksmiths 1-877-364-5264
    2008-12-19 11:05:55 services123 [Reply]

    Los Angeles Locksmith Safe & Vault Sales and Installation 24/7 Locksmiths 1-877-364-5264


    San Fernando Valley Locksmith 1-818-386-1022



    Los Angeles Hollywood Locksmith 1-323-678-2704



    West Los Angeles, Marina Del Rey, Culver City, Santa Monica, Beverly Hills Locksmith 1-310-925-1720 For All Your Home & Business Needs Doors installation wood, metal, Kalamein, and fire rated wood doors for both heavyweight commercial and standard residential needs. Door Closers Panic Bars Fire Exit Devices Floor or Head Check Bolts Door Frames Hinges Buzzer Systems Card Access Elevator Doors magnetic door locks, electric strikes, CCTV install Dvr Video Camera master key systems, CCTV and video surveillance, Alarm Systems, Keyless Entry Systems and Access Control Iron Works specializes in custom exterior and interior wrought iron design. We design and hand forge wrought iron entrance gates, driveway gates Child guard gates A/C cage gates Elevator doors Backyard gates Fire exit gates Custom design gates Storefront and Commercial Rolling
    Los Angeles Locksmith, locksmith, lock smith, locks, keys, service, door, set, security, los angeles locksmith, auto, car, home, office, door lock, open, install, repair, professional, locksmiths, unlock, lockout, locked out, rekey, rekeying, change lock, usa, california, los angeles, santa monica locksmith, marina del rey locksmith, santa clarita locksmith, hollywood locksmith, beverly hills locksmith, sherman oaks locksmith, encino locksmith, burbank locksmith, agoura hills locksmith, 411 locksmith, yellow pages locksmith, mobile, Commercial locksmith, Residential locksmith, Locksmiths, professional, Locksmith 30 MIN RESPONSE EMERGENCY .Commercial locksmith, Residential locksmith ,Automotive locksmith, Emergency Locked out Locksmith rekeying locks installatin, Roadside Assistance


  • Christmas Lights Los Angeles 310-925-1720
    2008-11-21 08:59:54 Movingcompany [Reply]

    Christmas Lights Los Angeles 310-925-1720
    http://www.christmaslightinstallationlosangeles.blogspot.com/
    We offer the following Products and Services:
    Christmas Lighting New inside / outside christmas planter that lights up



    Christmas tree sale and decoration


    Full service sales and installation departments



    Custom pole-mounted banner sales and installation



    Large animated holiday displays



    Custom holiday displays



    Leasing and rental programs



    In house graphic arts department



    Knowledgeable and helpful year round staff.
    Lvhsystems
    full-service approach begins with the assignment of a project manager, engineer, and draftsman who work closely with you throughout the process to ensure a design reflective of your aesthetic preferences, programming that meets your control requirements, and an Christmas Lighting installation that is efficient and trouble-free. This level of client commitment and systems expertise allows Lvhsystems to stand apart as a premier integrator of design home Christmas lights decoration and solutions throughout the southern California communities of Los Angeles, Santa Monica, Beverly Hills, Calabasas, Agoura Hills, Woodland Hills, Pasadena, Burbank, Glandale and Sherman Oaks.





  • 2007-10-26 06:09:12 takashi-24-f@tbr.t-com.ne.jp [Reply]



  • Is XML2 for real?
    2007-08-19 12:10:19 Logomachist [Reply]

    I haven't seen anything about XML2 elsewhere. Is this just your name for a potential revision to XML, or has preliminary work on a sequal to XML already started?


    I too would like to see an XML2, but I think we have very different ideas about what should be included. If it were up to me, I'd improve DTDs and spin them off into their own spec, I'd make XML schema neutral, I'd add namespace support to the XML spec, and I'd allow XML-based languages to specify fall-back options for how to act if an IDREF specifies an ID that doesn't appear in the document (if they can't do this already- I haven't used anything more complicated than a DTD). I wouldn't allow sloppy coding... it's one thing to write a forgiving parser, it's another to require that parsers be forgiving.


    As for why XML hasn't caught on on the web, it's not really true. RSS and ATOM are both XML, as are most non-HTML markup formats. The only reason so much HTML is still not well-formed is because xHTML 1 didn't bring anything new that wasn't already availiable in HTML4, other than making web pages harder to code. Imagine how much more slowly PNG would have caught on if it was a subset of gifs and had no advantages. Obviously, no one would have adopted it.


    Now, xHTML2 does being some advantages, so I expect once all the major browsers support it web designers will adopt XML en masse.

  • Mobile web acceptance of XHTML
    2007-01-27 06:27:34 DavidJohansson [Reply]

    With regards to why the mobile corners have accepted XHTML, it's all about what you as a developer are forced to do to make your content display on the phones. Sure, http://simon.html5.org/articles/mobile-results shows that a lot of browsers don't particularly care about XHTML anyway, but with the market share the Motorola V3 got, the fact that it does care about XHTML is enough to force any mobile developer to make sure that their content is compliant.

  • Does XML2 have an infoset?
    2007-01-11 23:29:19 Micah Dubinko [Reply]

    I didn't get into the tech details here, but I'm still curious what people think. Does (or will, at some point) XML2 have a reasonable infoset mapping? Will we see toolkits to change back and forth between XML and XML2? -m

  • markup typo
    2007-01-11 18:58:33 Michael Dyck [Reply]

    In the link following "same unofficial naming scheme as", the attribute is missing its closing quote-mark, which (in my browser at least) causes a lot of the subsequent text (up to the next quote-mark) to be slurped in as the attribute value, effectively hiding it.


    (Indeed, "well-formedness on the Web is dead".)