P2P Bibliographies with Bibster

Posted by ryuzaki0 on Monday August 2, 2004 @10:10AM from the it's-a-monster dept.

Noksagt writes "P2P isn't just for government documents anymore! Bibster assists researchers in managing, searching, and sharing bibliographic data in a peer-to-peer network. This project shows great promise for researchers who currently search for citations through centralized servers (Google, Scirus, CiteSeer, ISI, and many others). Because it is decentralized, researchers can share bibliographic data with no subscription costs and avoid typing this data in by hand. It can import and export citations using BibTeX. The project is GPLed, and free clients for Windows and Linux are available. There's also a SourceForge page for Bibster, so you can check out the source from CVS if the Bibster site is slow."
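
For readers who have not seen the format, here is a minimal sketch of what exporting a citation to BibTeX amounts to. It is not Bibster's code; the function name `to_bibtex` and the sample record are made up for illustration.

```python
def to_bibtex(entry_type, key, fields):
    """Format one citation as a BibTeX entry string.

    `fields` maps BibTeX field names (author, title, year, ...) to values.
    This sketch does no escaping of special characters.
    """
    lines = [f"@{entry_type}{{{key},"]
    for name, value in fields.items():
        lines.append(f"  {name} = {{{value}}},")
    lines.append("}")
    return "\n".join(lines)


# Hypothetical record, invented for this example.
record = {
    "author": "Doe, Jane",
    "title": "An Example Paper",
    "journal": "Journal of Examples",
    "year": "2004",
}
entry = to_bibtex("article", "doe2004example", record)
print(entry)
```

A plain text format like this is what makes sharing practical: any peer can hand the entry to any BibTeX-aware tool without retyping it.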

wait a minute... by Scythr0x0rs · 2004-08-02 10:11 · Score: 4, Funny

this is news for nerds guys...
the CVS server will slow down before the website.
1. Re:wait a minute... by Noksagt · 2004-08-02 11:02 · Score: 2, Interesting
  
  the CVS server will slow down before the website.
  The CVS is hosted by SourceForge, which can handle significant load. The website is hosted on some university computer, and I had trouble reaching it when I was emailed the link. So it might not be able to handle the load as well.
We Need a News Version of This! by lofi-rev · 2004-08-02 10:17 · Score: 3, Funny

oh wait....

Seriously, having a collaborative system for journalism with moderation and web of trust like elements could be wonderful - anyone got any bright ideas on how to do it?
ahhh, p2p... by macshune · 2004-08-02 10:22 · Score: 4, Funny

Future conversation between two illustrious academics:

"Could you send over that citation for that lagomorph genome paper?"

"Sure thing. I'll send some Steely Dan too, it helps me when I read papers about the lagomorph genome."

"31337, thx."
People who cite will also read the paper by Anonymous Coward · 2004-08-02 10:22 · Score: 2, Interesting

People who cite will also read the paper before doing so. This system will be useful when one has a paper in hand, but does not have the bibtex entry. No one uses just a citation without the content of the paper.

So you have to prepare the content, and you might as well submit it to those journals, conferences :)
Re: Psychic Answer by Anonymous Coward · 2004-08-02 10:25 · Score: 4, Funny

I'm seeing a URL...no, a number. Yes, it starts with a 5. I believe it's past 500. It's becoming clearer...I see the number 503.

Did you just ask a question? If you did, it appears the answer is "No"
Citation Index by wayward · 2004-08-02 10:25 · Score: 4, Interesting

That looks promising. Will there be an easy way to see a citation index - for example, listing all the publications that cite a given article? (Citeseer does this, and this can be important to academic types.)
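
A citation index of this kind is essentially an inverted map. The sketch below assumes each shared record lists the keys of the papers it cites; the function name and the sample keys are hypothetical, and nothing here reflects how CiteSeer or Bibster actually store data.

```python
from collections import defaultdict


def build_cited_by(references):
    """Invert a paper -> cited-papers map into a paper -> citing-papers map."""
    cited_by = defaultdict(set)
    for paper, refs in references.items():
        for ref in refs:
            cited_by[ref].add(paper)
    # Sort for stable, readable output.
    return {paper: sorted(citers) for paper, citers in cited_by.items()}


# Hypothetical BibTeX-style keys, for illustration only.
references = {
    "smith2003": ["jones1999", "lee2001"],
    "kim2004": ["jones1999"],
    "lee2001": ["jones1999"],
}
cited_by = build_cited_by(references)
print(cited_by["jones1999"])  # ['kim2004', 'lee2001', 'smith2003']
```

In a P2P setting each peer would only see part of the `references` map, so the counts would be lower bounds rather than a complete index.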
So... by theM_xl · 2004-08-02 10:32 · Score: 2, Interesting

Is it just me or is a scientific database every idiot can add to a bad idea?
1. Re:So... by Rosco P. Coltrane · 2004-08-02 10:48 · Score: 5, Interesting
  
  Is it just me or is a scientific database every idiot can add to a bad idea?
  
  I suppose it's the same as a wiki: I too first thought it was the dumbest idea to allow everybody and their dogs to edit webpages, but in any wiki I used, the content always turned out to have a pretty good S/N ratio. I still don't understand why, but wikis work. Just look at wikipedia... So perhaps this will work too...
  
  --
  "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
2. Re:So... by j1m 5n0w · 2004-08-02 10:48 · Score: 2, Insightful
  
  Is it just me or is a scientific database every idiot can add to a bad idea?
  
  Maybe. On the other hand, an encyclopedia every idiot can add to turned out alright. But they have a certain amount of centralized control to keep things from getting out of hand.
  Fortunately, few idiots (or anyone else) have much of an incentive to falsify bibliographic data.
  -jim
3. Re:So... by burns210 · 2004-08-02 14:24 · Score: 4, Informative
  
  "I still don't understand why, but wikis work. Just look at wikipedia.."
  
  Wow... Well, let's put aside the subtle notion that people are benevolent and never do wrong to a wiki, and realize that Wikipedia uses strict moderation and privileges, letting a huge moderation team track various pages, along with the ability to ban users or lock pages from being edited (George W. Bush's page cannot be edited, for example).
  
  Wikis work because they have a chain of command.
Full texts? User comments? by UniAce · 2004-08-02 10:48 · Score: 5, Interesting

What would be really nice is to have the full texts of articles available P2P. That's the advantage of using centralized databases from subscribing locations (like universities): you can sometimes access full text for newer articles with just one click. Swapping full texts would be tremendously useful (and would keep us lazy scientists from having to actually get up and go to the library).

Yeah yeah, I'm sure there are copyright issues... but doesn't fair use apply somehow? I'm a psychology research assistant at a major university, and at weekly lab meetings we often send around articles by email for everyone to read and then discuss, and I've never even really thought about copyright of them until now. Isn't open sharing of knowledge at the heart of the scientific endeavor?

Oh, and also: it would be awesome if user comments could be added to each citation. Like: "this was an influential paper that opened new directions for research on human memory," etc. Of course, you can also get a ROUGH idea of that kind of thing by how many times a paper's been cited by other papers, as someone else already said.
Standards based? by azaroth42 · 2004-08-02 11:06 · Score: 5, Interesting

The next big question is whether or not it's standards based. While it would be surprising if it used Z39.50, it would be a shame if it didn't use SRW and/or CQL.

Especially as NISO is recommending them in their current 'Metasearch Initiative' -- an industry/academic/government cross sector committee with the major players and interested parties for allowing cross searching of bibliographic databases with other sorts of things.

(ObDisc, member of both SRW Editorial Board and Taskgroup 3 of NMSI)

--Azaroth
1. Re:Standards based? by Noksagt · 2004-08-02 11:45 · Score: 3, Informative
  
  Unfortunately not. Nor does it seem to use MODS XML for record storage (which, incidentally, will be used by OpenOffice.org's bibliographic project and by the Bibliophile project, which hopes to do cross searching across the open source literature databases).
  
  SRW/U hopes to supplant Z39.50. Not only does it use MODS, but it still uses ZeeRex and CQL.
  
  For more nerdy e-reference stuff, check out darcusblog
But.. by iantri · 2004-08-02 11:12 · Score: 4, Interesting

What guarantees accuracy? What guarantees high-quality results?
If we were to look at another project, say, CDDB, which stores meta-data for CDs (Title, Artist, Track Listing), something not at all unlike storing meta-data for books (bibliographies), you'll note that CDDB's entries are frequently inaccurate, misspelled and just plain wrong.
When it comes down to it, I don't really trust Random Joe to provide accurate, trustworthy info. It's not like it's Wikipedia or anything, which has constant peer review and a clear history.
I'm a geek... by lukewarmfusion · 2004-08-02 11:42 · Score: 2, Insightful

...married to a non-geek (getting her PhD in Psych). When I told her about this system, she said:

"My system's better anyway. I have a file, with the exact bibliography printed on the folder, for every article I've read or written. If I need one, it's right there. If I need to use the citation, I can just copy it from my Excel spreadsheet. Now why would this thing be better?"

Some people are born geeks, I guess.
1. Re:I'm a geek... by RealAlaskan · 2004-08-02 12:16 · Score: 2, Interesting
  
  If I need to use the citation, I can just copy it from my Excel spreadsheet. Now why would this thing be better?
  This would be better because when she reads a new article, she could get the bibliography from someone else, rather than having to type it in herself.
  Of course, if she has read so few papers, and does so little writing, that Excel (and Word? Ick!) work for her purposes, then this might be an exercise in gilding lilies.
  I use Emacs, with reftex and bibtex, and find that it works far better for my purposes than any of the several word processors I've tried. None of them, including Word and OpenOffice, can equal that combo, with LaTeX for the typesetting. They're just not up to speed, for quality of the output or ease of use.
  
  --
  See what I've been reading.
2. Re:I'm a geek... by imkonen · 2004-08-02 15:10 · Score: 3, Interesting
  
  "I have a file, with the exact bibliography printed on the folder, for every article I've read or written."
  I tried to keep a system like that going for a while. It's one thing to be good about saying "Wow, that was a good article, I should fill out the bibliography right now in case I should like to cite it someday." It doesn't take much discipline since it happens roughly once a year. It takes a whole other level of discipline I just don't have to keep filling in those entries for articles I get bored with halfway through, stacks of articles my boss dumps on my desk, articles I read and decide are completely irrelevant to anything I'll ever be interested in, etc.
  Nowadays I just use SciFinder or one of the other databases which can export in a citation-manager-friendly format instead of typing entries in by hand. I'm not sure I see how P2P would make my life any easier. However, these are all (SciFinder, SciSearch, ISI to be sure, not so sure about others) for-fee databases that require my university to pay a subscription. I'm all for the free exchange of information, especially in the scientific community, so if this facilitates it, I'm on board.
Re:Full texts? User comments? by j1m 5n0w · 2004-08-02 12:35 · Score: 4, Informative

CiteSeer has full text available for most of its articles, and it's a free service, so maybe copyright isn't such a big deal for some reason. Maybe it's because most papers in computer science are available from the author's website.
-jim
What an interface! by Sajma · 2004-08-02 13:20 · Score: 3, Interesting

A possible inquiry could be: I am searching for topics about peer-to-peer technologies.
As a result Bibster returns bibliographic entries concerning peer-to-peer technologies.

Next, they'll perfect image search:
A possible inquiry could be: I want to see defiance in the face of insurmountable odds.
As a result Imagester returns images depicting defiance in the face of insurmountable odds.
Seriously, are they offering anything better than standard keyword and author search? What I'd really like to see is a bibliography database that ranks search results using a PageRank-like algorithm (as I recall, the idea for PageRank derived from research on citation graphs, so this would bring things full circle).
I'd also like to see Google start parsing publications and indexing them by author, year, and citations. The bibliography databases that I'm familiar with require manual input of new entries; it would be cool if this could be done automatically instead. Of course, there will need to be some interface to correct erroneous entries, and this opens up a large can of worms.
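
To make the PageRank-on-citations idea concrete, here is a rough sketch of the textbook power iteration applied to a tiny citation graph. It is a simplified illustration, not Google's algorithm as deployed and not anything Bibster offers; the function name, the damping value of 0.85, and the sample keys are all assumptions.

```python
def pagerank(cites, damping=0.85, iterations=50):
    """Rank papers in a citation graph by simple power iteration.

    `cites` maps each paper to the list of papers it cites. A paper that
    cites nothing spreads its rank evenly over every paper.
    """
    papers = set(cites)
    for refs in cites.values():
        papers.update(refs)
    papers = sorted(papers)
    n = len(papers)
    rank = {p: 1.0 / n for p in papers}
    for _ in range(iterations):
        new_rank = {p: (1.0 - damping) / n for p in papers}
        for p in papers:
            refs = cites.get(p, [])
            if refs:
                # Split this paper's rank among the papers it cites.
                share = damping * rank[p] / len(refs)
                for r in refs:
                    new_rank[r] += share
            else:
                # Dangling paper: distribute its rank uniformly.
                share = damping * rank[p] / n
                for q in papers:
                    new_rank[q] += share
        rank = new_rank
    return rank


# Hypothetical citation graph, for illustration only.
cites = {
    "smith2003": ["jones1999", "lee2001"],
    "kim2004": ["jones1999"],
    "lee2001": ["jones1999"],
}
rank = pagerank(cites)
for paper in sorted(rank, key=rank.get, reverse=True):
    print(paper, round(rank[paper], 3))
```

The paper everyone cites ends up on top, and a paper cited by a well-ranked paper beats one that nobody cites, which is the property a plain citation count lacks.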
Re:Full texts? User comments? by periol · 2004-08-02 13:31 · Score: 2, Interesting

I've been working on a similar idea for news, and as far as I can tell fair use completely applies to this specific idea of yours - education and the arts, unbiased, not for profit.

There are already some sites out there doing something similar like the Media Awareness Project [mapinc.org] which collects and archives research on drug policy. From what I can tell, they only get sued when they get too big, present content with a bias, or try to profit.

I find it hard to believe my little project is the only one out there. We're working on web/p2p jointly, but there are bound to be others, and they'll all probably be open source. So once one good one comes out, we'll see lots of applications of this within research and academic communities.
OK, When will someone by burns210 · 2004-08-02 14:42 · Score: 2, Interesting

OK, these p2p apps are awesome, but I see a problem: they each need to maintain their own p2p system (protocol), either by forking it from another project or by writing it from scratch, or they need to piggyback on another network...

When will someone sit down, using an open source model of course, and write the 'granddad' p2p protocol? It doesn't have to require everything, it just has to be able to support everything... Encryption, hidden routing (not being able to tell who is requesting data vs. who is just passing data along), multiple source download, huge scaling, efficient and distributed search, etc.

This public network could become the de facto standard that open source apps work off of. As long as the protocol is the focus (a nice GUI as well, but separate the frontend from the backend), you could use it to link to files on your website, or you could have multiple apps (a music/Napster-like app, a scientific research paper app, a bibliographies app, a Usenet discussion thread app), each of them using a common protocol and routing between them, but with each app filtering out the noise it doesn't want.

It could be the killer app, it could have every major p2p app migrate to it. Project Gutenberg, Bibster, linuxiso.org, all using a common protocol and network.... *drools*
NLP would be nice by tgibson · 2004-08-02 14:46 · Score: 2, Insightful

eTBlast is a bibliographic search engine to which you submit an entire abstract. With a little natural language processing, it returns articles that have similar abstracts. Though the tool operates on the Medline database, there is no reason the algorithm couldn't be used with Bibster.
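
As a rough idea of what "submit an abstract, get similar abstracts back" can look like, here is a sketch using TF-IDF weights and cosine similarity. This is a generic stand-in technique, not eTBlast's actual algorithm (which is not described above); all function names and sample abstracts are invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Turn each document into a sparse {term: weight} TF-IDF vector."""
    counts = [Counter(tokenize(d)) for d in docs]
    n = len(docs)
    doc_freq = Counter()
    for c in counts:
        doc_freq.update(c.keys())
    # Smoothed inverse document frequency, so no weight is zero.
    idf = {t: math.log((1 + n) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    return [{t: tf * idf[t] for t, tf in c.items()} for c in counts]


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, abstracts):
    """Return (score, index) pairs for abstracts, most similar first."""
    vectors = tfidf_vectors(abstracts + [query])
    query_vec = vectors[-1]
    scores = [(cosine(query_vec, v), i) for i, v in enumerate(vectors[:-1])]
    return sorted(scores, reverse=True)


# Made-up abstracts, for illustration only.
abstracts = [
    "Peer-to-peer networks for sharing bibliographic metadata",
    "Memory consolidation during sleep in humans",
    "Sequencing the rabbit genome",
]
ranked = rank_by_similarity(
    "sharing bibliographic data over peer to peer networks", abstracts
)
print(ranked[0])
```

Nothing in this approach depends on a central database, so in principle each peer could score its own local entries against a submitted abstract and return only the best matches.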