Slashdot Mirror


Adobe Makes Flash Crawlable

nickull wrote in his journal that "Today Adobe systems made an announcement that it has provided technology and information to Google and Yahoo! to help the two search engine rivals index Shockwave Flash (SWF) file formats. According to the company, this will provide more relevant search rankings of the millions pieces of Flash content. Until now, developers had to implement workarounds for exposing text content used in Flash to search-engine spiders and other bots such as using XHTML data providers. While the Flash content is exposed, it is not yet clear how it will be utilized by the search engines, as they have not revealed their algorithms. The SWF specification is openly published."

18 of 232 comments (clear)

  1. Silverlight by Anonymous Coward · · Score: 5, Interesting

    Amazing what a little competition will bring...

  2. That's unfortunate by Anonymous Coward · · Score: 5, Interesting

    I'd be much happier if the search engines quit linking to flash-only websites completely. Then maybe those horrible things would go away.

    I can't think of any case where I've seen a Flash-only site where Flash added anything of substance (cuteness doesn't count), and they tend to be hard and non-standard to navigate, break key bindings (like CTRL-T to open a new tab doesn't work if mouse is over Flash), etc.

    Here is an example: A business association's website was redesigned in Flash. Instead of their staff page having a simple list of photos, names, job titles and phone numbers that you could search by hitting CTRL-F, the flash version just shows a photo of all of the staff members and you can only find the job titles and contact info by holding the mouse over the appropriate person's photo. So, if you want to find the contact info for the newsletter producer and you don't already know what he/she looks like, you have to move your mouse over each of 15 different photos until you find the right one. Stupid. There is just too much dumb stuff going on with Flash.

    1. Re:That's unfortunate by superflippy · · Score: 2, Interesting

      One example I've come across in the last few months where Flash is actually useful is for photographers' web sites. They're often paranoid about people copying the photos off their sites, yet they need to show their work so they'll get hired.

      Putting their portfolios into a Flash slideshow is a good compromise. Of course, anyone who can make screenshots can still copy their photos, but it adds a little extra security.

      Though, too many (IMO) go too far and have whiz-bang all-Flash sites with unnecessary bells and whistles. It's fine to use Flash for a specific purpose, but when you add animation, background music, etc. it detracts from the product being promoted (i.e. photography).

      --
      Your fantasies contain the seeds of important concepts.
    2. Re:That's unfortunate by Anonymous Coward · · Score: 5, Interesting

      I hate to break this to you, but I could implement the same thing in Javascript really easily. Or even a Windows app, if I wanted.

      Yes, you can do dumb things with other tools if you try to. But, my point is that such dumb things are common and somewhat encouraged by Flash. Plain old HTML provides a basic user interface that works reasonably well. It's not fancy, but it works. If you try hard enough with JavaScript you can muck things up, but things work reasonably well by default; things only go horribly wrong when developers try to move beyond the basics and make bad decisions in the process. Flash gives you a lot more control over the user interface, but without a basic standard starting point for page structure and navigation (as far as I know). So every Flash developer builds his/her own little custom method of navigation, and many lack sensible functionality, worrying more about cuteness instead. Basic things like hitting CTRL-F to search within a page, or being able to bookmark after navigating around get broken.

      You're blaming the tool for something that is the fault of the developer who sold this crappy site.

      Is it possible to create a decent website in pure Flash? Perhaps. Is it possible to put a screw into a wall with a hammer? Yes, but it's not the best approach. In practice, pure-Flash websites rarely work well, and that's because Flash isn't a good tool for that particular job. Adobe's website isn't pure Flash. That should tell you something.

    3. Re:That's unfortunate by Raenex · · Score: 2, Interesting

      When a technology encourages poor design, it's wise to speak out about it. The more people that understand the issues the better. Flash has it's place, but as a standard replacement for basic web pages it just doesn't cut it.

      Fash: 99% Bad, written in 2000, still makes as much sense today as it did 8 years ago.

      An ideal Flash would be truly integrated in the browser in a seamless way, however it just isn't there. Style sheets and JavaScript are better solutions for most applications.

    4. Re:That's unfortunate by Dachannien · · Score: 2, Interesting

      Flash for DRM is (as you've sort of noted) about as silly as Javascript for DRM. After all, "anyone who can make screenshots" would be anyone who knows where the magic PrtSc key is on their keyboard and also knows what it does. Javascript used to be popular to suppress the right-click context menu, so that web developers could "prevent" people from ripping off their uber kewl web programming leetness. Now, Firefox still pops up the message box that the web developer put in to tell you that you can't use the context menu, and then it shows you the context menu anyway. (And that's with Javascript enabled, to say nothing of NoScript.)

  3. Flash by ledow · · Score: 4, Interesting

    For a start, "crawlable" does not mean it WILL be crawled. More likely, most flash will contain nothing but junk and internals that were never meant to be seen anyway. I wonder when the first "we recovered a password that was stored inside a flash file" / "we googled for vulnerable flash apps and found these" hits will come about. And, as someone's already pointed out, if you *can* extract the text from them, you can't do much useful with it besides say "it's in this Flash somewhere". You can't even do "find in page" once you've clicked on such a link. And if it's at the end of an hour-long Flash animation, you're not going to sit through it.

    Then you'll have some people who have actually used bitmaps instead of text inside the Flash for various reasons, etc. The only useful thing to come out of this may well be a "View as HTML" version of Flash-only pages. But they will still be second-class pages because the designer didn't want to do it theirselves.

    Given that people who use Flash aren't exactly the most popular people in the world (e.g. if you want it to appear in Google, be read by people, to be bookmarked, to be quoted/cited/linked etc.), this won't affect much - Finding content in a Flash file is like looking for a needle in a haystack. That's the problem solved by this announcement. However, finding *useful* content in that file is going to be even worse, and actually getting users TO that data will be almost impossible.

    I imagine that the same thing will happen as it did with images, PDF's, etc. Those who design their Flash well will get something indexed and it'll actually get a hit or two from "View HTML Version" on Google. Those who don't (i.e. 99% of the people who make them) won't see any difference at all.

    1. Re:Flash by Zekasu · · Score: 2, Interesting

      If you store a password in a Java applet in a string, you can download that applet and open it in Notepad to find it. I'm assuming this is the same in SWF documents.

      Also considering you can pretty much disassemble SWF files, well, that's your fault.

      Although I do agree that being able to search Google for vulnerable Flash applications is a major concern, I'm to go out on a limb and say that Adobe, coupled with Google, are going to do something like only make text-containing boxes into crawl-able material.

      That aside, I'm hoping that search engines also implement a "only display results without Flash content" feature.

  4. Re:GREAT! by Anonymous Coward · · Score: 2, Interesting

    It always surprised me there is no Linux distro that has gone trough the trouble of wrapping systrace around the Flash plugin (or the entire browser?)

    I would definitely like to have a Flash-enabled browser that runs in a sandbox, and has only access to some of its own files (to store cookies/settings/etc.) but does not have access to the rest of my homedir.

  5. A Good Thing by intx13 · · Score: 3, Interesting
    I'll deride slow-loading, unintuitive Flash apps as much as the next guy, but this is a big step towards Flash a viable alternative to the HTML/XHTML/Javascript/CSS/PHP jumble that makes up the Web today. Other things that still need addressing (IMO) to make a true Flash web:
    • Flash-to-Flash linking
    • More natural and useful text objects
    • Standardized framework for GUI elements
    • Real accessibility

    If these things could get cleared up, I wouldn't mind seeing a Flash Web... where Flash isn't a box in the center of an HTML page, but the basic protocol itself (like what Curl claims to be).

    Of course given the cludginess of most Flash apps, maybe I'm just being a masochist here!

  6. Re:What Adobe should do by Anonymous Coward · · Score: 1, Interesting

    My company and a couple others that friends of mine work for are all currently re-writing their web front ends to use flex exclusively. Granted in all of these cases, these software packages are intended for internal corporations - there are a growing number of developers using Flex for general web sites because it eliminates cross browser compatibility issues. If you don't know anything about Flex, you should head over to Adobe's web site and read up on it.

  7. Re:Great! by Bogtha · · Score: 2, Interesting

    You need to build this into the flash when creating as far as I know.

    Yes, that's true. Emphasis added to the comment I was replying to:

    Now all they have to do is make it so, when you make a web site in Flash, you can link directly to the "page" you want.

    nine-times was specifically talking about options available to developers.

    With html based sites no special work is required to make a page bookmarkable (aside from some ajax stuff, or in-page anchors).

    That's usually true, but like you say, there are exceptions, and it's not just the ones you mention. Frames, for example, cause similar problems, and JavaScript too (not just Ajax).

    I would welcome this if it is clear that it's flash content (like PDFs) in the search results, so I can make a judgment accordingly.

    I'm not sure if you are aware of this (a lot of people here don't seem to be), but search engines already index Flash. Google example. Yes, Flash results are identified in the same way PDF results are.

    This isn't a new feature, it's Adobe helping search engines improve the indexing they already do.

    --
    Bogtha Bogtha Bogtha
  8. Re:Great! by Anonymous Coward · · Score: 1, Interesting

    I dunno, I think that's a minor point. The rest of his points were valid about the how much flash sucks.
    But if it makes you feel better, I'll add a few more.

    No 64bit version of the plugin.
    When your cursor is over a flash object in a page, you lose the ability scroll with your mouse.
    Flash based site navigation removes the ability to use your browser controls (back, forward, refresh) as they were intended.

    Flash is moderately useful as video player for things like youtube, for just about any other use it's 100% garbage and should be avoided at all costs.

  9. Ok, now how do we get it by Paul+Carver · · Score: 3, Interesting

    This would be great if it can be implemented directly into web browsers. For example, a Firefox plugin that allows me to specify "view text only" for Flash content.

    Or is this "proprietary" information that will only be given to Google and Yahoo and not shared with the us commoners?

    I don't know what brain damage causes people to think that they should present text a half a dozen words at a time in a slideshow, but it would be great if my browser would default to showing me all the text from a flash slideshow and then let me choose if I really want to see it pieced out a few words at a time.

  10. Re:Flash content stuck in Search index by foniksonik · · Score: 3, Interesting

    Uhh... there are now several ways for flash developers to allow deep linking in to flash... it uses a hash and directory structure with javascript to pass the url into flash, which then auto loads or skips to the content being requested.

    This is not new, it's been around for 2 years now.

    Here's a site I built in just such a fashion:

    http://www.soursweetgone.com/flash/#/friends/punk-a-friend/

    This section of the site lets you upload a photo, morph it using a displacement map filter to either spherize or pinch the photo.. don't forget to zoom in on the good part... then you can email the results with a message to anyone. (yes we collect your info, but it's a candy company... your choice).

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
  11. Re:For me... by dotancohen · · Score: 3, Interesting

    Actually, at home I do have broadband. But I like taking cheap shots at Adobe. So far they've saved me hundreds of dollars by forcing me into the Gimp because Photoshop isn't ported to Linux yet.

    You hear me, Adobe! Do not port Photoshop to Linux! I need to keep my money in the bank!

    --
    It is dangerous to be right when the government is wrong.
  12. Re:What Adobe should do by Anonymous Coward · · Score: 1, Interesting

    It most definitely IS as bad as was said, and worse.

    See this thing called the Internet was built upon an OPEN standard. Flash is CLOSED. To participate in the Internet, were it made of Flash, you would need to use both the CLOSED Flash viewer to view any of it (which favors CLOSED platforms as opposed to OPEN platforms), and to produce content you would have to pay for the CLOSED development suite.

    In the second place, I don't care how many features Adobe adds. 1000 words of plain text is 2KB. 1000 words encoded into a Flash file is 50KB. That's extra minutes to store and transmit it every time it is edited or viewed, extra time wasted on both ends to create it and read it, and all around amounts to a useless, worthless, FAIL.

    But I suppose corporations rue the day they let us have the WWW, freely accessible and neutral and free.

  13. Re:What Adobe should do by doom · · Score: 2, Interesting

    There's a subtle problem here: the standard is open in the sense that it's published, but it's still under the control of one company. If they liked, they could pull a Microsoft, and change the "standard" so that the next version would break everyone else's code.

    We might say that it is "open" but not "free", eh?

    (There's also a less-subtle problem, that Flash is designed to turn your computer into a television set. But some people will never get enough television.)