purl.org · Domains · Slashdot Mirror

Data & Software Citation. by oneiros27 · 2014-10-29 22:04 · Score: 1 · on The Most Highly Cited Scientific Papers of All Time

The top 100 most cited papers are actually a motley crew of methods, data resources and software tools that through usability, practicality and a little bit of luck have propelled them to the top of an enormous corpus of scientific literature.

The article itself never mention 'data resources' that I saw, but there's a problem in many fields that the standards are to cite the 'first results' paper for that data ... for which the results portion may have already been disproved or otherwise be crap. There are a number of efforts working on being able to cite 'data' separately from 'results of the data', and in a manner that's consistent across all disciplines (as we don't know in advance who might make use of our data). You also run into problems, as the paper being cited may describe the initial release of the data, and not be useful to determine which edition was used (as that may be significant to recreate their results). See the Joint Declaration of Data Ctation Principlies, DataCite (metadata schema + DOI registry system), and the 2012 CODATA-ICSTI report, "Out of Cite, Out of Mind: The Current State of Practice, Policy, and Technology for the Citation of Data".

There are similar issues with software citation -- everyone's citing the announcement of the existing of the software, but how can you track who might've relied on a buggy version to let them know that they may need to re-run their analysis? I'm not as active in this field, but the arguments remain the same (giving proper attribution, documenting everything to make it reproducible, etc.). See the 2013 Knepley et.al paper, "Accurately Citing Software and Algorithms used in Publications" and the work of the Software Sustainability Institute (which also covers topics on writing better research software, as was alluded to in the article)

It's probably also work mentioning that our current ways of tracking 'importance' of papers are flawed. See the Altmetrics Manifesto for a collection of links to efforts to come up with other metrics and CiTO, the Citation Typing Ontology to enable a way to classify why something was cited (it might be for criticism; in most of the cases in the article, it would be "uses method in", which not all disciples feel needs to be cited).

Re:rel=shortlink could eradicate URL shorteners by dkf · 2009-08-19 02:50 · Score: 1 · on URL Shortener tr.im To Go Community-Owned, Open Source

I wrote the shortlink specification a few months ago (based on similar work done by others), released it into the public domain using CC Zero and went about soliciting feedback.

So, are you going to just put it on a random website out there or are you going to do the proper thing and get it on a standards track somewhere? (Maybe IETF or W3C.) That's the only way to get it really trusted by the bulk of users, since they trust those organizations to keep on what they've been doing for years.

rel=shortlink could eradicate URL shorteners by samj · 2009-08-19 01:52 · Score: 5, Interesting · on URL Shortener tr.im To Go Community-Owned, Open Source

I've had a beef with URL shorteners for a long while now for reasons that have been covered ad nauseam (not the least of which being that in addition to adding significant overhead - typically hundreds of milliseconds per request - they are just plain evil). IMO the best solution is to let webmasters create and advertise their own short links using the "shortlink" link relation (e.g. rel="shortlink" in the HTTP headers and/or HTML HEAD) such that they can be auto-detected by clients who then no longer need to generate their own using 3rd party services. I wrote the shortlink specification a few months ago (based on similar work done by others), released it into the public domain using CC Zero and went about soliciting feedback. The standard got a big shot in the arm last week when WordPress.com announced support for rel=shortlink on over 100 million pages. I've since requested support be introduced into the top 20 Twitter clients (representing over 80% of Twitter usage) and have had only positive feedback so far. A number of other high profile sites like PHP.net and Ars Technica have also jumped on board. Anyway if you, like me, are sick of URL shorteners then you're welcome to give me a hand making them go away...

Sam

Re:Why can't we just move it? by Anonymous Coward · 2007-01-17 03:49 · Score: 1, Interesting · on Netscape Restores RSS DTD, Until July

> Perhaps you mean "long-lived, mostly reliable URI".

Yes, and I have a couple of name suggestions for this, we could call it a "permanent URI" or "persistent URI".

Purl may be a good choice for this DTD.

Re:Why would this break RSS readers? by Sir+Pallas · 2007-01-14 07:41 · Score: 1 · on Netscape Dumps Critical File, Breaks RSS 0.9 Feeds

According to the Internet Archive, the DTD was last changed in February 2003. Here's the latest copy of RSS 0.91. Perhaps someone should set up a redirect at PURL.

RSS 1 by the W3C by StandardsSchmandards · 2005-08-22 08:04 · Score: 1 · on RSS Wins, Signals Atom's Death Toll?

RSS 1.0 is also the only syndication format endorsed by the World Wide Web consortium. RSS 0.9 and 2.0 were created at the companies Netscape and Userland.

Re:URI to the Rescue by ggvaidya · 2005-02-22 06:44 · Score: 1 · on Power Outage Takes Wikimedia Down

Something like this?

Esperanto GPL dictionary by Anonymous Coward · 2004-12-08 23:12 · Score: 0 · on Universal Free Dictionary

ReVo is a GPL Esperanto dictionary. It also provides translations to other languages.

I Used to Work for OCLC by Skjellifetti · 2003-09-21 05:10 · Score: 1 · on Hotel Being Sued for Using the Dewey Decimal System

I obviously can't speak for them, but I can provide some background on what they do. OCLC is a nonprofit org providing services for approx 45,000 libraries around the world. If you are a librarian and need to figure out how to catalog a new book in your collection, you go to OCLC to see how others have done it. Ever needed an item that wasn't in your library? OCLC handles the system for arranging inter-library loans. They do a fair amount of original research for libraries and they even open source some of the results. PURL is another OCLC project that some of you may be familiar with. The Dublin Core MetaData Initiative was co-founded by a researcher who got his start at OCLC and is now running the W3C's Symantic Web Initiaitve. OCLC is very well known and respected in the library community.

Library budgets the world over are under attack given the current economic situation. This leaves less and less money available for building the kind of common infrastructure that will help libraries continue to provide new and relevant services for their patrons as more and more of the content becomes digital. OCLC certainly has both the right and the need to defend the Dewey Decimal Trademarks from infringers.

Re:The Semantic Web is already here by Chris+Croome · 2003-01-30 03:53 · Score: 1 · on The J.R.R. Tolkien of the Web

I agree that RSS is probably the most popular semantic web thingy at the moment.

However only RSS 1.0 is based on RDF and is a standard with an open committee in control, RSS 2.0 is plain XML and is controlled by Dave Winer...

One of the best collection of RSS resources is the one Ben Hammersley maintains on DMOZ.

'PURL' is an infringment on OCLC by Anonymous Coward · 2002-01-03 17:00 · Score: 0 · on Online Greeting Cards Patented

'PURL' has been used as an acronym for "Persistent URL" (http://www.purl.org) since at least 1995. I believe that Tumbleweed's use of the 'PURL' name is an infringment, specifically as it is being used within the same knowledge/technical domain.

Forking RSS by Anonymous Coward · 2001-04-28 05:52 · Score: 5 · on Netscape Says No RSS 0.91 For You

I hate to say it, but this is YA example of why forking over petty differences can be A Bad Thing. If one stubborn contingent hadn't steadfastly clung onto the deprecated RSS 0.91 spec instead of moving to RSS 1.0 (which returns to RSS's more dynamic roots), said contingent wouldn't be locked out because *a single document was removed from a web site*. Yeah, Netscape did the wrong thing. But the proponents of the outdated and outmoded spec should have seen this coming a mile away.

Re:Directories are not search engines by dingbat_hp · 2001-03-27 16:33 · Score: 1 · on Is The Web Becoming Unsearchable?

doomed to failure until someone implements something like the Dewey Decimal System for web pages

Yes, we're stuffed -- but Dewey Decimal isn't the answer (we can do a lot better than that).

There's an initiative around that's gaining considerable momentum - the Semantic Web. It starts from one bright idea by one guy, but as the guy in question is Tim B-L, then he gets listened to. There are solutions to all this. We've barely started on what we could easily achieve for indexing the web, without even trying for the really hard stuff.

Once basic semantic level indexing becoms commonplace, through tools like Dublin Core, then take a look at ontological descriptions and projects like DAML.

There's a huge amount happening in this field research-wise, it just hasn't hit the punter's web yet.

Semantic Web is a layered framework by Edd_Dumbill · 2001-03-21 03:24 · Score: 3 · on Is The Semantic Web A Pipe Dream?

As the Semantic Web is a layered framework, the actual vocabularies you use to describe things are applications of the framework rather than the framework itself.

One such application that might prove useful in what you're tackling is RSS. What you seem to be looking for is a taxonomy against which you can classify things. These are expensive to develop and hence rare, the ODP being one of the few that are public. My advice is, if the ODP doesn't fit, classify by topic yourself (but avoid the mistake of struggling to produce a hierarchical system, this is rarely appropriate). At a later stage you can express equivalences to other folks' categories. Folks on the RSS-DEV mailing list would be happy to share experience of categorization.

Anyone seeking more information as to what the Semantic Web actually is and how it fits together might be interested in some of the articles I've written on the topic, which give an overview both of the vision and of ways you can get started with tools:

-- Edd Dumbill, Editor, XML.com.

This is where SWAG comes in by aswartz · 2001-03-21 03:11 · Score: 3 · on Is The Semantic Web A Pipe Dream?

I feel sort of bad plugging my own group, but this is exactly the problem that SWAG is meant to solve. SWAG is the Semantic Web Agreement Group, and we bring different users of the SWeb together to try and build sets of common terms. Our current project is to build a dictionary of common terms, which you can find at: WebNS.net.

Obviously, the Semantic Web won't work if we only have one dictionary, but it will work much better if agree on the terms we use when possible. So SWAG isn't trying to enforce terms, but merely recommend them.

We work on a process of consensus so that we can move quite quickly and new terms don't get bogged down in endless talking.

So I hope you'll visit us, once again the address is: http://purl.org/swag/.