Webmasters Pounce On Wiki Sandboxes

← Back to Stories (view on slashdot.org)

Webmasters Pounce On Wiki Sandboxes

Posted by simoniker on Monday June 7, 2004 @04:53AM from the fold-spindle-mutilate dept.

Yacoubean writes "Wiki sandboxes are normally used to learn the syntax of wiki posts. But webmasters may soon deluge these handy tools with links back to their site, not to get clicks, but to increase Google page rank. One such webmaster recently demonstrated this successfully. Isn't it time for Google finally to put some work into refining their results to exclude tricks like this? I know all the bloggers and wiki maintainers would sure appreciate it."

15 of 324 comments (clear)

Min score:

Reason:

Sort:

Why just wikis? by GillBates0 · 2004-06-07 04:56 · Score: 4, Insightful

Why not normal discussion boards and blogs? We, for one, saw how the SCO joke (litigious b'turds) managed to GoogleBomb SCO in first place without a problem.

--
An Indian-American Hindu committed to non-violent thought/speech/action alarmed by the global explosion of radical Islam
1. Re:Why just wikis? by Andy+Mitchell · 2004-06-07 05:23 · Score: 3, Insightful
  
  I'm not sure this will make you feel better but this startergy has a limited lifetime.
  The contribution of your page to another pages page rank depends on two factors, firstly the page rank of your page, and secondly the number of links coming from your page.
  As more people take up this tactic the return everyone gets from it, gets smaller. E.g. When there are hundred of links on that page they cease to have any real value. Eventually people should give up on this one.
Yes... PLEASE... by Paulrothrock · 2004-06-07 04:57 · Score: 4, Insightful

Google needs to do something about this. I had to turn off comments on my blog because all I was getting was spam. Two or three a day that I had to go in and delete. I have to now find a system that will keep the bots out.
What happened to the nice internet we had in 1996?

--
I'm in the hole of the broadband donut.
You know... by fizban · 2004-06-07 04:58 · Score: 3, Insightful

...what Google needs? A "Was this result helpful in your search?" button for each link returned, so that the search itself also influences page ranks. Maybe that will help get rid of this Google bombing mess.

--
+1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.
1. Re:You know... by Anonymous Coward · 2004-06-07 05:04 · Score: 4, Insightful
  
  that button will also get spammed, as bots will click 'yes' for their sites and 'no' for the competitors sites
2. Re:You know... by goon+america · 2004-06-07 05:09 · Score: 3, Insightful
  
  Wouldn't that be equally abused?
Who's fault is that? by lukewarmfusion · 2004-06-07 05:00 · Score: 4, Insightful

Google's algorithm isn't the problem. The problem is the availability of easily abused areas such as these "sandboxes."

Some search engines accept any old site. Others accept sites based on human approval and categorization. Google is a nice combination of the two - by using outside references (counting how often the site is linked) it assumes that the site is more relevant. Because other people have put links on their sites. That's a human factor, without directly using human beings to review and categorize the sites and rankings.

Sure it can be abused, but it's not Google's fault; perhaps these areas of abuse (blogs, wikis, etc.) should address the problems from their end.
ROBOTS.TXT by gtrubetskoy · 2004-06-07 05:00 · Score: 4, Insightful

The burden is not on Google, but on Wiki sandbox admins, who should provide proper ROBOTS.TXT files to inform Google that this content should not be indexed.
As a sidenote, I think that with recent Wiki abuse, the issue of open wikis will become a similar one to open proxies and mail relays.
Well, it's about time this gets some attention by digitalgimpus · 2004-06-07 05:01 · Score: 4, Insightful

I've noticed that my blog's getting lots of spam from sites that don't seem like typical spam sites....

From what I can see, it looks like those "search ranking professionals" who "guarantee to raise your google rank in 30 days" are using blog spamming, and perhaps Wiki Spamming as a way to increase their clients ratings.

It's not about meta tags, or submitting anymore... it's spamming.

Perhaps it's time for people to finally be warry of these services. After all, can a third party really guarantee a position in another companies search index?

IMHO those services are pure evil. They either do nothing, or they do something to increase page rank... what is that "something"? How many options do they have?

If they are going to use my blog... why can't I get a cut in that business?
1. Re:Well, it's about time this gets some attention by Lurker+McLurker · 2004-06-07 05:09 · Score: 4, Insightful
  
  IMHO those services are pure evil.
  No, 9/11 was pure evil, some unwanted comments on a blog is an annoyance. If you have a website that allows anyone to post comments, you will get some you don't like. That's life.
  
  --
  Mod parent up!
Re:Cyberneighborhood Not-Watch? by lunax · 2004-06-07 05:03 · Score: 3, Insightful

Why not put the sandbox in it's own folder and add an entry to the robots.txt telling it not to browse that folder?
Tomorrow today yesterday by boa13 · 2004-06-07 05:05 · Score: 4, Insightful

But webmasters may soon deluge these handy tools with links back to their site, not to get clicks, but to increase Google page rank.

The Arch Wiki has sufferred several times from such vandals in the past few months. I'm sure other wikis have, too. They create links over single spaces or dots, so that casual readers don't notice them. Attentively watching the RecentChanges page is the most effective way to find and fight them, but this is tiresome. I guess many wikis will require posters to be authenticated soon, which is a blow in the wiki ideal, but not such a major blow. Alternatively, maybe someone will develop heuristics to fight the most common abuses (e.g. external link over a single space).

So, this is not new, but this is now news.
Re:visual security code for sign-up by stevey · 2004-06-07 05:33 · Score: 5, Insightful

There was a story about defeating this system on /. a while back.

Rather than using OCR or anything poeople would merely harvest a load of images from a signup site - possible when there are only a given number of finite images, or when there is a consistent naming policy.

Then once the images were collected they would merely setup an online porn site, asking people to join for free proving they were human by decoding the very images they had downloaded.

Human lust for porn meant that they could decode a large number of these images in a very short space of time, then return and mount a dictionary attack...

Quite clever really, sidestepping all the tricky obfuscation/OCR problems by tricking humans into doing their work for them ..
Re:Cyberneighborhood Not-Watch? by naoiseo · 2004-06-07 05:34 · Score: 3, Insightful

This fails to address the real issue.

That is, even if you make your links useless (easy with a no-follow meta tag) it wont help, the majority of this spam is AUTOMATED, and will spam your wiki/blog/guestbook based on simple page queues.

Your best personal defense is to manually remove any page or html queues that a spammer would pick up on as being common to a certain type of postable web page or element.

Bloggers have been creating blacklists (banning both poster ips and destination urls) with some degree of success. This is a deterrent, having a spammer show up on a blacklist whereby webmasters use a distributed file to 'clean' their blogs automatically.
Re:Grow up by maxwell+demon · 2004-06-07 07:16 · Score: 4, Insightful

Well, why not link SCO to something the reader gets real value from? Some page where they can learn something about SCO? After all, since those pages indeed tell something about SCO and therefore contain the word SCO, it should even be more effective.

--
The Tao of math: The numbers you can count are not the real numbers.