Detecting Conflict-Of-Interest on the Semantic Web
CexpTretical writes "At the 15th International WWW Conference in Edinburgh Scotland, Refereed Track on Semantic Web accepted many thorough and interesting academic papers on semantic web research on subjects related to where the Web is in the Semantic Web?
One such paper nominated for Best Paper Award, Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection hits on the whole subject of validation and/or verification in the brave world of so called "Web 3.0" topologies/frameworks/architectures.
The paper describes a "Semantic Web application that detects Conflict of Interest (COI) relationships"."
With all the "Slashvertisements", Slashdot sure could use one of these servers.
Life is not for the lazy.
WTF is up with this web reversioning trend?
------ Take away the right to say fuck and you take away the right to say fuck the government.
Perhaps the harder problem is detecting any interest in the Semantic Web.
If you post it, they will read.
their campaign contributions, former colleagues etc.
That would make for an interesting web application and an interesting election year...
Support NYCountryLawyer RIAA vs People
Calling it a conflict-of-interest is really a matter of semantics. The conflict arises when people see the words semantic web. They are someowhat interested but but are conflicted in not admitting they don't know what the word semantics means and are too embarrassed to look it up.
"You'll get nothing, and you'll like it!"
Let's hear an example of what this semantic web will look like to me, J. Random Hacker. In nethack dialogue preferably.
I'll start:
f - a blessed web browser
af
Suddenly, your blessed web browser starts up! --more--
You see a...
This is an excellent paper that highlights many of the issues that will be encountered as the naive realists promoting the semantic web hit the hard fact that data quality is poor and identitification is hard. From the paper's conclusions:
The goal of full/complete automation is some years away. Currently, quality and availability of data is often a key challenge given the limited number of high quality and useful data sources. Significant work is required in certain tasks, such as entity disambiguation.
As a practical tool the Semantic Web has all of the problems that no-fly lists have. People share names with each other and one individual may appear under multiple names. Datasets are radically incomplete, and an awareness of the possible uses to which data may be put will encourage the less scrupulous amongst us to deliberatly devalue datasets by including misleading or incomplete information.
Even without deliberate poisoning of the data, it is doubtful that standard vocabularies will be used in sufficiently consistent ways by various institutions and individuals to create homogenous (and therefore useful) datasets. For example, people who do multi-centre cancer trials expend an enormous amount of energy on data curation and auditing, which includes actual site visits to institutions and periodic audits of data, as well as centralized control of what gets into the final database. And this is for data collected by cancer centers and cancer docs who are nominally committed to following precise protocols and have been given training in what the fields in the various forms are supposed to mean. Yet centres can and do get delisted from studies due to lack of compliance.
The same thing can be seen in nominally standardized data formats like MAGE-ML and its cousins: industry-standard XML-based languages for marking up genomic datasets. There are specific elements that are intended for particular pieces of data, but a depressing amount of the time companies decide to put the really important stuff in a catch-all element, because "it's easier" than understanding the well-documented and clearly defined format.
Likewise, medical images created in DICOM format by major equipment manufacturers not infrequently have clear and blatant violations of the DICOM standard, despite over a decade of effort to ensure a reasonable level of compliance. And these are not subtle violations, but missing required fields, or incorrect data in required fields ("because all our images are 512x512 why should we have to fill in the width and height all the time? It's easier to just leave them zero.")
People are stupid and lazy. I know I am. And we use the same words to mean different things, and different words to mean the same thing. The Semantic Web requires people to be smart and hardworking, and to use standardized vocabularies in standardized ways. Decades of failed or at best partially successful data exchange protocols strongly suggest that these requirements will not be fulfilled.
Blasphemy is a human right. Blasphemophobia kills.
People that know the most about something have an interest in it *gasp* Smart people in X like to hang around other smart people in X, sometimes even goto conferences *whoa*
This is old news in academia, and yet things still work pretty darn well there. That is because reputation is important, and as soon as you do something unethical or even just stupid, you're toast.
If a field gets too "imbred" as far as their research/reviewing goes (e.g. a group always present at workshops at 3rd rate conferences, and the program committee is always the same 20 people), the community just ignores them.
The web on the other hand, you just whip up a new identity and keep spamming, but that's why most people love it, zero accountability, which of course leads exactly where you expect it to.
- Adam L. Beberg - The Cosm Project - http://www.mithral.com/
I said who put all the things in your head
Things that make me feel that I'm mad
And you're making me feel like I've never been born
She said you don't understand what I said
I said no, no, no you're wrong
When I was a boy, everything was right
Everything was right
I said even though you know what you know
I know that I'm ready to leave
'Cause you're making me feel like I've never been born
"Flyin' in just a sweet place,
Never been known to fail..."
It is far more widely useful to view this as a means of finding people who have common interest and may actually be collaborating. Imagine building a database of relationships, appointments, investments, etc that concentrates on the rich and famous, businessmen, politicians, and others with power and then running these algorithms on those relationships. Imagine examining those relationships in the context of subjects (relationship strengths should differ depending on the subject through which the relationship is being judged). The world of politics could be turned on its heels if a useful map of relationships could be created for each political subject and easily traversed by strength. Surely though, this isn't new. In particular, I've seen many calls for research into this aspect of datamining from the intelligence community, a community that has also been investing in the creation of database technologies that can hold and semantically access many petabytes of data.
At the 15th International WWW Conference in Edinburgh Scotland, Refereed Track on Semantic Web accepted many thorough and interesting academic papers on semantic web research on subjects related to where the Web is in the Semantic Web?
Uhh...wtf? Does that make sense to anyone? How is that sentence a question?
For fucks sake Zonk, EDIT! DO YOUR F'IN JOB !
And now back to mine...
I think anyone who uses the term "Semantic Web" deserves a cock-punch. Who's with me?
> Calling it a conflict-of-interest is really a matter of semantics.
:-)
Well, of course it is! This IS the Semantic Web they're talking about. I think everything on it is a matter of semantics
At least it's not the Symantec Web, though. That'd probably make high end PCs slow to a crawl and ask you "Are you sure you want to open index.html? It might contain a virus or something, although we're not really sure. We're just asking to make it absolutely clear that it's your fault if it breaks something."
People are stupid and lazy. I know I am. And we use the same words to mean different things, and different words to mean the same thing. The Semantic Web requires people to be smart and hardworking, and to use standardized vocabularies in standardized ways. Decades of failed or at best partially successful data exchange protocols strongly suggest that these requirements will not be fulfilled.
_ State_Of_The_Arti c_MediaWikit ic_Wikipedia
A quite standardized vocabulary actually exist in Wikipedia (markup language, templates, categories).
Here is a list of links that try to combine wikipedia and the semantic web:
http://wiki.ontoworld.org/index.php/Semantic_Wiki
http://wiki.ontoworld.org/wiki/Sites_using_Semant
http://en.wikipedia.org/wiki/Wikipedia_talk:Seman
http://www2006.org/programme/item.php?id=4039
http://meta.wikimedia.org/wiki/Semantic_MediaWiki
Even though I looked at the paper, I still have no fucking idea what this is about.
I vote that next year the Best Paper Award go to "Looseleaf Paper" because it has both holes AND lines.
This was interesting enough, I guess, as a really high-level description of a process that could be used to build a semantic web.
That said, the researchers picked the domain very carefully - to guarantee a positive-looking result, I guess - but I don't see how this could scale to the web in general, a place where, well, nobody knows you're a dog.
668: Neighbour of the Beast
Being just curious enough to add a .com after that, I then wondered if they chose that site on purpose. Doubtful, but positive reinforcement DOES work at times...
Don't tell me to get a life. I'm a gamer; I have LOTS of lives!
It's all very well these hucksters peddling the semantic web to funding bodies who don't know any better, as long as they don't start pretending it's anything other than the new FIPA - a collection of committees generating specifications that the world will continue to ignore.
The semantic web assumes everyone in the world will play nice and publish his data using standard schemas.
This is estimated to happen soon after Microsoft will switch to a POSIX standard operating system, the RIAA will support buying musing in Ogg Vorbis format, and Sony and Microsoft will agree on a common Blu-DVD format, and airline companies will really tell you how the compute their ticket prices. And the rupture.
Seriously... the idea is beautiful in theory, but in practice people do not want their data to be available. The business case for the semantic web seems to be "lets all cut our profit margins to nothing!". Small wonder it took off like a lead baloon.
Here is a trivial example: product prices. If vendors had wanted to make it easy for everyone in the internet to be able to view their catalog and compare prices, all it would take is a "standard" using <!-- product: iPod nano --> and <!-- price: $200 -->. There is a reason this doesn't happen. The internet vendors hate pricegrabber and froggle and their kind. They want you the customer to log in to their site to look prices up, thank you very much.
had their own Interwebs. Good for them!
Anonymous Cowards are at -6...
When I read this I thought "Detecting Conflict-of-Interest on the Semetic Web"
:)
Damn Zionists!