Slashdot Mirror


Only 4.13% of the Web Is Standards-Compliant

Death Metal writes "Browser maker Opera has published the early results of an ongoing study that aims to provide insight into the structure of Internet content. To conduct this research project, Opera created the Metadata Analysis and Mining Application (MAMA), a tool that crawls the web and indexes the markup and scripting data from approximately 3.5 million pages."

16 of 406 comments (clear)

  1. Re:How compliant? by DrSkwid · · Score: 4, Informative

    It is very simple http://validator.w3.org/

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  2. W3C by eldavojohn · · Score: 4, Informative

    W3C's validation tools

    Normally I'd go on my own rant but I'm feeling lazy today and recently I read a good article at A List Apart that sums it up. As for the W3C, I like this list they compile:

    W3C's Pros & Cons

    Pros:

    • Global
    • Academic and scientific body
    • Multiple interests represented, but mostly from paid member companies
    • Attempting to be more open via certain teams such as the HTML5 and CSS Working Groups
    • Attempting to appeal more to work-a-day world via redesigns, blogs, and more human-friendly language throughout the site

    Cons:

    • Creates "open standards" by ideal, not necessarily fact
    • Incredibly slow moving in a highly evolutionary environment
    • Poor economic model that relies on membership monies
    • Discourages independents and open process
    • Passive: only creates specs and recommends, does not do real outreach
    • "Ivory tower" perception

    You should read that article, it's pretty spot on for this subject.

    --
    My work here is dung.
    1. Re:W3C by Anonymous Coward · · Score: 1, Informative

      Oh come on, what the hell is with the Warning about the &tab thing?

      It is inside a URL for crying out loud, did W3C forget that URLs can use &any-word=something-else?
      Oh wait, they say that you need to encode them even inside a URL... well, that kinda takes the biscuit...

      Also, what is with the terrible wording here?
      "This is usually a cascading error caused by a an undefined entity reference or use of an unencoded ampersand (&) in an URL or body text. See the previous message for further details."
      Someone should rub it in W3Cs nose that they aren't perfect either. (i shall be doing so in a second)

  3. Re:Some standards are just too strict... by mikael_j · · Score: 3, Informative

    For example, xhtml-strict does not include support for "target" attributes in links. What kind of idiotic decision was that?

    A very good decision, there are two main uses for the "target" attribute:

    • Frame-based sites - Old-school, annoying way of designing sites that I and many others feel should not be used for new sites.
    • To automatically open links in a new window - Annoying behaviour by web developers who think no one could possible want to, god forbid, leave their site in favor of another site.

    /Mikael

    --
    Greylisting is to SMTP as NAT is to IPv4
  4. Re:How compliant? by Bogtha · · Score: 4, Informative

    only recently added websites or also websites and old pages that exist longer than the standard they validated against exists ?

    MAMA didn't validate against a single document type. They validated against the document type that each individual document claimed to be. So all the ancient HTML 2.0 pages out there will correctly be identified as valid in they are, in fact, valid HTML 2.0.

    --
    Bogtha Bogtha Bogtha
  5. Re:How compliant? by Bogtha · · Score: 2, Informative

    Depends on how strict they're being.

    There aren't degrees of validity. A document is either valid or it isn't. You can't be "more strict" when validating something, if a tool offers you an option like that, then it is doing something other than validating, it's probably linting as well. There's at least one widely-used "validator" that doesn't actually validate at all.

    For example, I never close paragraph and line break tags, but otherwise my html is compliant.

    Yes you do. If you didn't close them, your pages wouldn't work in any browser. What you mean is that you don't explicitly close your paragraph and line break elements. And you don't have to. The closing tags for <p> elements are optional and the <br> element type is empty. Those are not errors.

    --
    Bogtha Bogtha Bogtha
  6. strict vs transitional DTDs by tepples · · Score: 1, Informative

    Are there degrees of strictness?

    Yes. HTML 4.01 and XHTML 1.0 each have two DTDs: a "transitional" DTD that allows presentational elements and a "strict" one that disallows them. The trouble is that a couple structural elements and attributes got removed by mistake in the strict DTDs along with the presentational ones, most notably the value attribute of the li element. For this and other reasons, most valid HTML that I've found has used a transitional DTD.

    1. Re:strict vs transitional DTDs by Bogtha · · Score: 4, Informative

      Yes. HTML 4.01 and XHTML 1.0 each have two DTDs: a "transitional" DTD that allows presentational elements and a "strict" one that disallows them.

      No, that's something different. There aren't degrees of strictness when it comes to validity. If a document claims to be a Strict document, and makes a single mistake, then it is invalid. If a document claims to be a Transitional document, and makes a single mistake, then it is invalid. In both cases, it's an absolute rule with no laxity.

      --
      Bogtha Bogtha Bogtha
  7. Re:Some standards are just too strict... by tepples · · Score: 4, Informative

    It's up to me as a user to choose where a url opens, especially since we are all using the tabbed paradigm now.

    User agents currently do not allow the user to submit a form into a new window or tab. This is the nearly nine-year-old bug 17754 on bugzilla.mozilla.org with 99 votes.

  8. _Already_ 4.13% of the web is standards-compliant by Anonymous Coward · · Score: 1, Informative

    If you go to the source of the research, you will for example notice that the last time a similar study was done (in June 2006), only 2.58% of the tested pages validated completely. A 1.5% increase might not seem to be all that much, but it's definitely indicative that we're on the right way. (And of course, perfect validation is never the final goal in itself, but merely an easy first step for people en route to writing better, semantically-meaningful, universally accessible websites.)

  9. Re:Some standards are just too strict... by SanityInAnarchy · · Score: 2, Informative

    Not really a lot of point to it, though -- savvy users will simply middle-click on the link if they want it in a new tab/window. If they don't, that generally implies they want it right where it is, and your attempt to open a new tab/window is going to be annoying.

    But hey, at least using a target for that is better than linking to a javascript: URL. A lot of sites are even worse -- they add an onClick event, and they set the link href to #, or to javascript:void(), meaning that middle-clicking on it inevitably does something unexpected.

    My preferred method (if I ever need to force a new window) is to use a plain old link, and progressively enhance it with Javascript to open a new window. That way, if people middle-click, it does exactly what they want.

    --
    Don't thank God, thank a doctor!
  10. Re:Well, that depends.... by Lachlan+Hunt · · Score: 5, Informative

    Does using "blink" make my code non-standard?

    Yes, because blink is not defined as conforming in any standard. However, it is possible to make a page containing blink (or any other element or attribute you like) pass validation by providing a custom DTD or an internal subset.

    But note that the claim that "4.13% of the Web Is Standards-compliant" isn't quite accurate. The study only used the W3C markup validator, which is only able to detect a subset of the machine checkable conformance criteria. It's trivial to create a non-standards compliant page that passes validation.

    --
    By reading this signature, you hereby agree with the content of the above comment.
  11. Opera's article by Anonymous Coward · · Score: 2, Informative

    Here's Opera's article, with actual numbers:
    http://dev.opera.com/articles/view/mama-markup-validation-report/

  12. Re:Well, that depends.... by billcopc · · Score: 2, Informative

    Standards do help, but the problem is the leading browser doesn't follow the standards too well.

    My primary browser is Firefox, so I do 90% of the development in there. The last 10% is spent adjusting little bits and bobs for IE6/7. It doesn't matter that my pages validate, and that I'm pretty damned good at making my spaghetti code spew proper XHTML - IE is still a retard.

    Where I work, we guarantee only one browser: IE6. Anything else costs extra, because of the testing involved. If someone wants us to guarantee their site will run perfectly in Opera or any other browser, they have to cover our time spent testing and massaging the markup. Sometimes that can amount to another 2-3 days of billable time on a big site/app.

    I don't use Konqueror at all for web browsing. Frankly I wish it would just embed the Gecko engine and be done with it.

    --
    -Billco, Fnarg.com
  13. Re:How to do your bit for standards compliance by Cope57 · · Score: 2, Informative

    There is also a Web Developer plug-in available at the FireFox Add-ons site.

    --
    http://www.accountkiller.com/removal-requested
  14. Re:Sad. Even sadder is the yet-another-feature cre by coopaq · · Score: 3, Informative

    XMLHttpRequest is now a standard since everyone decided to "ignore standards" and use it anyway.

    http://en.wikipedia.org/wiki/XMLHttpRequest

    "The World Wide Web Consortium published a Working Draft specification for the XMLHttpRequest object's API on 15 April 2008."