P2P Bibliographies with Bibster

wait a minute... by Scythr0x0rs · 2004-08-02 10:11 · Score: 4, Funny

this is news for nerds guys...
the CVS server will slow down before the website.

Re:wait a minute... by Noksagt · 2004-08-02 11:02 · Score: 2, Interesting

the CVS server will slow down before the website.
The CVS is hosted by sourceforge, which can handle significant load. The website is hosted on some University computer & I had trouble reaching it when I was emailed the link. So it might not be able to handle the load as well.

Cool by Anonymous+Crowhead · 2004-08-02 10:14 · Score: 1

I am going to download it, a create a bunch of papers written by myself. Soon, I will be published in Science, Nature, and many other of the top periodicals of chemistry, physics and biology. Perhaps I will co-author a paper with Stephen Hawking.

We Need a News Version of This! by lofi-rev · 2004-08-02 10:17 · Score: 3, Funny

oh wait....

Seriously, having a collaborative system for journalism with moderation and web of trust like elements could be wonderful - anyone got any bright ideas on how to do it?

Re:We Need a News Version of This! by Anonymous Coward · 2004-08-02 10:32 · Score: 0

That seems hard! Why not regurgitate press releases from the government verbatim? Journalists aren't computer geniuses!

Wait...how do I submit this to the internet?
Re:We Need a News Version of This! by periol · 2004-08-02 13:21 · Score: 1

Well, I have been talking with a few friends about something along these lines - first a website, but then a p2p client that decentralizes information. We're right at the birthing stage, but that's probably why no one has done it. The only way to pull it off is through fair use, and the only way to pull that off is to be completely non-profit.

The ideal would be a system that posts articles from current news sources (full text, not just links), individuals on the site, with a commenting feature. The articles (and maybe comments)would be archived (again, text, but just text) and searchable. The idea is there, and we're starting the ball rolling on the project. The whole thing will be open source. If you're interested (in any capacity), get in touch with me... periol[at]gmail[dot]com
Re:We Need a News Version of This! by burns210 · 2004-08-02 14:33 · Score: 1

I have a very similar idea... now don't laugh at me... then i realized what usenet is(sue me, I never used it before).

I think a news sindication/decentralized publication would be the greatest application ever made... It would be a killer app for p2p... The uses are endless, but done right, it could be run as a server backend for dynamic websites(like Google News, almost) or by using routing and encryption algos could be the answer to anti-censorhip...

OK, those were to huge ideas that wouldn't be done right away. Back to the app... being able to
1. publish an article,
2.comment on an article,
3. moderate those comments(distributed over p2p, mind you) and
4. track authors
are the four biggies.

instead of an article(or all comments, since an article is just the root comment of a thread)being sent plain text, do a simple xml file out of it. This would allow for metadata to be sent with it, to allow for clients to easily sort and track it... maybe a unique ID for the article(X, where X is a really large numbers) and a unqiue sub ID for comments(X.Y, where Y is the comment order/time/something).

Heck, you could go so far as to give the option to use public key encryption, so that only readers who have the author's public key can read the article, and thus verify that the latest story from John, is actually John, and is not just spoofing the identity.

Much of the work is already there... Waste has public key encryption + sharing worked out, or atleast usable... MUTE, has the routing algorithms that hide who is sending/recieving/requesting the data... Now all it needs is some customizing, a forked protocol, and a gui...
Re:We Need a News Version of This! by dossen · 2004-08-02 23:47 · Score: 1

Heck, you could go so far as to give the option to use public key encryption, so that only readers who have the author's public key can read the article, and thus verify that the latest story from John, is actually John, and is not just spoofing the identity.

You've got that mixed up. By definition the public key is public, so ...only readers who have the author's public key... is effectively anybody who is interested.
Re:We Need a News Version of This! by burns210 · 2004-08-03 16:43 · Score: 1

you are right, but let me clarify what i meant.

using public key encryption means that John uses his private key to encrypt his article. Now, users who track John's articles can decrypt using his freely available public key. That means, anyone who wants can read John's articles, true, but it also means that when an article is said to be written by John, a user can prove it because only John can encrypt using his private key... it authenticates the author so you can't spoof someone's identity.

ahhh, p2p... by macshune · 2004-08-02 10:22 · Score: 4, Funny

Future conversation between two illustrious academics:

"Could you send over that citation for that lagomorph genome paper?"

"Sure thing. I'll send some Steely Dan too, it helps me when I read papers about the lagomorph genome."

"31337, thx."

Re:ahhh, p2p... by foobsr · 2004-08-02 10:54 · Score: 1

Which somehow also shows that an illustrious academic as a concept is a (near?) constant.

CC.

--
TaijiQuan (Huang, 5 loosenings)

People who cite will also read the paper by Anonymous Coward · 2004-08-02 10:22 · Score: 2, Interesting

People who cite will also read the paper before doing so. This system will be useful when one has a paper in hand, but does not have the bibtex entry. No one uses just a citation without the content of the paper.

So you have to prepare the content, and you might as well submit it to those journals, conferences :)

Shouldn't that be... by Anonymous Coward · 2004-08-02 10:24 · Score: 1, Funny

P2P isn't just for pirating music anymore?

Re: Psychic Answer by Anonymous Coward · 2004-08-02 10:25 · Score: 4, Funny

I'm seeing a URL...no, a number. Yes, it starts with a 5. I believe it's past 500. It's becoming clearer...I see the number 503.

Did you just ask a question? If you did, it appears the answer is "No"

Citation Index by wayward · 2004-08-02 10:25 · Score: 4, Interesting

That looks promising. Will there be an easy way to see a citation index - for example, listing all the publications that cite a given article? (Citeseer does this, and this can be important to academic types.)

So... by theM_xl · 2004-08-02 10:32 · Score: 2, Interesting

Is it just me or is a scientific database every idiot can add to a bad idea?

Re:So... by Rosco+P.+Coltrane · 2004-08-02 10:48 · Score: 5, Interesting

Is it just me or is a scientific database every idiot can add to a bad idea?

I suppose it's the same as a wiki: I too first thought it was the dumbest idea to allow everybody and their dogs to edit webpages, but in any wiki I used, the content always turned out to have a pretty good S/N ratio. I still don't understand why, but wikis work. Just look at wikipedia... So perhaps this will work too...

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
Re:So... by j1m+5n0w · 2004-08-02 10:48 · Score: 2, Insightful

Is it just me or is a scientific database every idiot can add to a bad idea?

Maybe. On the other hand, an encyclopedia every idiot can add to turned out alright. But they have a certain amount of centralized control to keep things from getting out of hand.
Fortunately, few idiots (or anyone else) have much of an incentive to falsify bibliographic data.
-jim
Re:So... by Anonymous Coward · 2004-08-02 12:33 · Score: 0

I think you are correct, one can imagine a rather large amount of misinformation floating around. Could have people walking around believing that aliens landed in the desert or something.
Re:So... by Odocoileus · 2004-08-02 13:03 · Score: 1

I tend to agree, unless someone else knows better. One can imagine a large amount of misinformation floating around. We could end up with a bunch of people believing that aliens crash landed in the desert. Or that processed food is healthy.

--
...
Re:So... by burns210 · 2004-08-02 14:24 · Score: 4, Informative

"I still don't understand why, but wikis work. Just look at wikipedia.."

Wow... Well, lets put aside the subtle notion that people are benevolent and never do wrong to a wiki, and realize the Wikipedia uses strict moderation and privledges, letting a huge moderation team track various pages along with the ability to ban users or lock pages from being edited(George W. Bush's page cannot be edited, for example).

Wikis work because they have a chain of command.
Re:So... by Anonymous Coward · 2004-08-02 20:56 · Score: 0

No, Wikipedia works because it has a chain of command. Most Wikis work because they don't have URLs that morons can find.
Re:So... by phiwum · 2004-08-02 21:26 · Score: 1

If an author doesn't know enough about an article to tell whether basic bibliographic information is right or wrong, he shouldn't cite the article.

Authors, titles, approximate year of publication, etc... Anyone citing an article should see at a glance that these are correct. So what damage can a forger do? Start inserting false publisher fields? A bit embarrassing for the author, perhaps, but nothing too serious.

--
Phiwum's law: anyone that names an obvious law after himself and then puts it in his own sig is just pathetic.

Full texts? User comments? by UniAce · 2004-08-02 10:48 · Score: 5, Interesting

What would be really nice is to have the full texts of articles available P2P. That's the advantage of using centralized databases from subscribing locations (like universities): you can sometimes access full text for newer articles with just one click. Swapping full texts would be tremendously useful (and would keep us lazy scientists from having to actually get up and go to the library). Yeah yeah, I'm sure there are copyright issues... but doesn't fair use apply somehow? I'm a psychology research assistant at a major university, and at weekly lab meetings we often send around articles by email for everyone to read and then discuss, and I've never even really thought about copyright of them until now. Isn't open sharing of knowledge at the heart of the scientific endeavor? Oh, and also: it would be awesome if user comments could be added to each citation. Like: "this was an influential paper that opened new directions for research on human memory," etc. Of course, you can also get a ROUGH idea of that kind of thing by how many times a paper's been cited by other papers, as someone else already said.

How 'bout an essayster? by Anonymous Coward · 2004-08-02 10:54 · Score: 0

Skip the whole bibliography bit, how 'bout one that shares essays and book reports? Take the whole Cliff Notes one step more and just pre-build book reports, position papers, classification papers, and other material to cover Freshman Lit, Philosophy/Critical Thinking, and Intro to American Literature. Heck, why stop there? You've got psych papers, econ, anthro...basically everything that'll keep a pure-bred math/physics type focused on their major.

Then you can run bayesian filtering to "learn" your writing style and apply "corrections" as needed to make it your own. Think of it as a more liberal copyleft. :)

This post is intended as a joke. I sure as hell don't advocate plagiarism or anything of the sort. Read your books, write your own papers -- you'll be a more well-rounded person in the end.

Standards based? by azaroth42 · 2004-08-02 11:06 · Score: 5, Interesting

The next big question is whether or not it's standards based. While it would be surprising if it used Z39.50, it would be a shame if it didn't use SRW and/or CQL.

Especially as NISO is recommending them in their current 'Metasearch Initiative' -- an industry/academic/government cross sector committee with the major players and interested parties for allowing cross searching of bibliographic databases with other sorts of things.

(ObDisc, member of both SRW Editorial Board and Taskgroup 3 of NMSI)

--Azaroth

Re:Standards based? by Noksagt · 2004-08-02 11:45 · Score: 3, Informative

Unfortunately not. Nor does it seem to use MODS XML for record storage (which, incidentally, will be used by OpenOffive.org's bibliographic and the bibliophile project, which hopes to do cross searching across the open source literature databases.

SRW/U hopes to supplant Z39.50. Not only does it use MODS, but it still uses ZeeRex and CQL .

For more nerdy e-refererence stuff, check out darcusblog
Re:Standards based? by J1 · 2004-08-02 20:09 · Score: 1

FYI, Bibster uses Semantic Web technology and standards, actually. Data is stored in RDF, and peers retrieve data using the SeRQL
RDF query language.
Re:Standards based? by Anonymous Coward · 2004-08-02 22:55 · Score: 0

And along the same lines, if one is interested, one should check out the Edutella project. Standards based & P2P.
Re:Standards based? by bhd6 · 2004-08-05 03:21 · Score: 1

I think that in referring to "standards," the poster above was referring to the communications protocol. SRW is a quite interesting one, taking all the experience of the z39.50 community and applying it to the world of XML and web services.

The other big issue I have with Bibster is that it is based on bibtex, which may be widely used in the hard sciences, but which is not international-friendly, has a bad data model insufficient to the task of representing the sorts of data that scholars in the humanities (almost none of whom read sourceforge) deal with, etc., etc.

But.. by iantri · 2004-08-02 11:12 · Score: 4, Interesting

What guarantees accuracy? What guarantees high-quality results?

If we were to look at another project, say, CDDB, which stores meta-data for CDs (Title, Arist, Track Listing), something not at all unlike storing meta-data for books (bibliographies), you'll note that CDDBs entries are frequently inaccurate, mispelled and just plain wrong.

When it comes down to it, I don't really trust Random Joe to provide accurate trustworthy info. It's not like its like Wikipedia, or anything, which has constant peer review and a clear history.

Re:But.. by burns210 · 2004-08-02 14:45 · Score: 1

but with the use of users and profiles, a person could become credible/reliable, or not, and the information would weed itself that way. p2p makes it harder to moderate, but not impossible. And the benefits most surely out weigh the added work.

I'm a geek... by lukewarmfusion · 2004-08-02 11:42 · Score: 2, Insightful

...married to a non-geek (getting her PhD in Psych). When I told her about this system, she said:

"My system's better anyway. I have a file, with the exact bibliography printed on the folder, for every article I've read or written. If I need one, it's right there. If I need to use the citation, I can just copy it from my Excel spreadsheet. Now why would this thing be better?"

Some people are born geeks, I guess.

Re:I'm a geek... by RealAlaskan · 2004-08-02 12:16 · Score: 2, Interesting

If I need to use the citation, I can just copy it from my Excel spreadsheet. Now why would this thing be better?
This would be better because when she reads a new article, she could get the bibliography from someone else, rather than having to type it in herself.
Of course, if she has read so few papers, and does so little writing, that Excel (and Word? Ick!) work for her purposes, then this might be an exercise in gilding lillies.
I use Emacs, with reftex and bibtex, and find that it works far better for my purposes than any of the several wordprocessors I've tried. None of them, including Word and OpenOffice, can equal that combo, with LaTeX for the typesetting. They're just not up to speed, for quality of the output or ease of use.

--
See what I've been reading.
Re:I'm a geek... by imkonen · 2004-08-02 15:10 · Score: 3, Interesting

"I have a file, with the exact bibliography printed on the folder, for every article I've read or written."
I tried to keep a system like that going for a while. It's one thing to be good about saying "Wow, that was a good article, I should fill out the bibliography right now in case I should like to cite it someday." It doesn't take much discipline since it happens roughly once a year. It takes a whole other level of discipline I just don't have to keep filling in those entries for articles I get bored with halfway through, stacks of articles my boss dumps on my desk, articles I read and decide are completely irrelevant to anything I'll ever be interested in, etc.
Nowadays I just use SciFinder or one of the other databases which can export in citation manager friendly format instead of typing in by hand. I'm not sure I see how P2P would make my life any easier. However these are all (SciFinder, SciSearch, ISI to be sure, not so sure about others) for fee databases that require my University to pay a subscription. I'm all for the free exchange of information, especially in the scientific community, so if this facititates it, I'm on board.
Re:I'm a geek... by lukewarmfusion · 2004-08-03 01:35 · Score: 1

She has less than I realized. In her two years as a grad student, she's collected about 550 articles total. An Excel spreadsheet will have no problem with that. Eventually, she might move to an Access database (or, if she'll let me, a SQL Server DB with a nice web interface).

By that time, she'll probably abandon her filing cabinets. It's one thing to keep a few hundred files, but we won't have room for ten years of her readings.
Re:I'm a geek... by RealAlaskan · 2004-08-03 03:55 · Score: 1

The big problem with using a spreadsheet for this is that in order to write a paper, you have to enter the bibliograpic information in a different format for each journal you might submit to. Bibtex automates that, so you enter the information, and the formatting is done for you.
As I said, I use Emacs with Reftex for writing. Reftex will let me search for citations in my bibliography file. I use a key made up of the first author's family name and the year. I have innumerable papers in my bibliography file, but typically there are only a dozen or so authors whose work I'm trying to keep in mind at once. It's not hard to remember that it was the '98 paper by C-somebody, and then C*98 brings up a very short list to choose from. I can highlight the right choice, and the reference is added to the text, and the bibliography entry is added to the paper's bibliography, and I'm back to entering text, with no further action. If I want to change the citation or bibliography style, there are about 50 or more to choose from, and changing between them is trivial.
As for the filing cabinets, I dealt with that problem by keeping electronic copies. In economics, most authors post working papers on their web sites, to get quick exposure, stake out territory, get feedback, and so on. It's common to see citations of work in progress in published papers. So, I'd keep the working copies and let the library warehouse the published copies. That option may not be open to you. I think that you said that she's working in psychology? Social scientists don't seem to post their work, for some strange reason. I've been looking for some papers in the fields of education and child development, and there is essentially nothing on line. Maybe you could scan and OCR the really important drawers in the filing cabinet? Don't worry about correcting the scans: she can fix the typos in the few parts she actually wants to cut and paste. If you do the work as she actually uses the papers, it won't be a huge project.

--
See what I've been reading.
Re:I'm a geek... by lukewarmfusion · 2004-08-03 07:35 · Score: 1

Psychology has a standard format - APA. You're right about the electronic copies, though. Many articles are paper only, or are tough to get online.

Her advisor is anti-computers. She's afraid that the computer will change the data when you're not looking. They employ about 15 people to run their lab (includes students, which receive credits and not money). I'd guess that most of those students could be removed from the process and replaced with a computerized data input process. You know, instead of 80 page surveys being mailed out.

My wife isn't entirely comfortable with the electronic process either. She likes paper copies, so there's a physical trail. She won't let me pay our bills online, so she does all of our bills herself. On paper.
Re:I'm a geek... by RealAlaskan · 2004-08-03 10:23 · Score: 1

APA is one of the formats that Bibtex supports. I used it for a term paper once, just to be different. Even if there is never a need to reformat the entries, it's a real bother to have to make the citation, and remember to copy the entry into the appropriate place, and so on. Is the APA style one of those which requires that the entries be alphabetized and numbered? Those are nightmares to handle by hand.
We used to perform some economics experiments by hand. It was a lot of bother, but for some things which didn't fit into the market-centered straight jacket of our software, it was easier to run them by hand, and then type in the data, rather than develop and debug a new application. Running them by hand definitely removed simplicity, and added chances for error.
Tell your wife that the computer does change the data when she's not looking. But, as soon as she looks, even if she just glances at it out of the corner of her eye, it changes the data back again. Challenge her to prove you wrong.
I bet she could come up with some nifty label for that sort of belief.

--
See what I've been reading.

Enough with "-ster" already! by Anonymous Coward · 2004-08-02 11:45 · Score: 0

Why does every new P2P app have to call itself This-ster or That-ster? Are the developers really so lacking in creativity that they can't come up with a new name?

Re:Full texts? User comments? by Anonymous Coward · 2004-08-02 12:11 · Score: 0

I was going to say exactly the same thing.

However, rather than YAP2PN, I would rather see it all integrated into some existing thing. I only want to have one client.

Re:Full texts? User comments? by j1m+5n0w · 2004-08-02 12:35 · Score: 4, Informative

citeseer has full text available for for most of its articles, and its a free service, so maybe copyright isn't such a big deal for some reason. Maybe it's because most papers in computer science are available from the author's website.

-jim

Bibster for Mac OS X? by Anonymous Coward · 2004-08-02 12:55 · Score: 1

I wonder why there is no Mac OS X version. There are many scientists on OS X. It can't be a very hard port since they have a linux version, can it?

Re:Bibster for Mac OS X? by Anonymous Coward · 2004-08-03 01:52 · Score: 0

This is Java based software, so porting to OS X would not have any difficult.

What an interface! by Sajma · 2004-08-02 13:20 · Score: 3, Interesting

A possible inquiry could be: I am searching for topics about peer-to-peer technologies.
As a result Bibster returns bibliographic entries concerning peer-to-peer technologies.

Next, they'll perfect image search:
A possible inquiry could be: I want to see defiance in the face of insurmountable odds.
As a result Imagester returns images depicting defiance in the face of insurmountable odds.

Seriously, are they offering anything better than standard keyword and author search? What I'd really like to see is such a bibliography database that ranks search results usign a PageRank-like algorithm (as I recall, the idea for PageRank derived from research on citation graphs, so this would bring things full circle).

I'd also like to see Google start parsing publications and indexing them by author, year, and citations. The bibliography databases that I'm familiar with require manual input of new entries; it would be cool if this could be done automatically instead. Of course, there will need to be some interface to correct erroneous entries, and this opens up a large can of worms.

Re:What an interface! by harmonica · 2004-08-02 17:35 · Score: 1

I'd also like to see Google start parsing publications and indexing them by author, year, and citations.

Google could start by making use of "author" and "date" meta elements of all web pages and providing a search field for them on the "Advanced search" page.
Re:What an interface! by J1 · 2004-08-02 20:33 · Score: 1

Seriously, are they offering anything better than standard keyword and author search?

Yes, though it may be hard to see this at first. The system makes it possible to query for specific properties of citation entries, which is more precise than simple keyword search. Also, in the current release of the software the interface is limited to a few 'fixed' properties, but there is no underlying technical reason for this, it could be easily extended to allow the user to search for arbitrary properties of any citation. Since the storage format is RDF, extension with properties as needed is trivial.

Re:Full texts? User comments? by periol · 2004-08-02 13:31 · Score: 2, Interesting

I've been working on a similar idea for news, and as far as I can tell fair use completely applies to this specific idea of yours - education and the arts, unbiased, not for profit.

There are already some sites out there doing something similar like the Media Awareness Project [mapinc.org] which collects and archives research on drug policy. From what I can tell, they only get sued when they get too big, present content with a bias, or try to profit.

I find it hard to believe my little project is the only one out there. We're working on web/p2p jointly, but there are bound to be others, and they'll all probably be open source. So once one good once comes out, we'll see lots of applications of this within research and academic communities.

Right direction or ..... by failedlogic · 2004-08-02 14:22 · Score: 1

I don't know if this is a direction I like seeing P2P networks go, in the sense that full articles would be available for download. With some tweaking of the idea, I think there could be an advantage.

Many universities are paying tons of money to privitized databases to store either full text articles (for some)or simply the abstacts so students can search and read articles to their hearts delight. They are, in my experience, unreliable as well. The systems crash, you get database errors or lose the connection.

With enough metadata I would hope someone could come up with a CCDB type system for universities which would at least have the abstract info (summary, author, journal name, date) etc to at least look up in the system. Decentralize it and share it on among all universities. Even if it just stays within 'academia' it would be great. Hopefully speed, reliability and accuracy would improve.

Re:Full texts? User comments? by burns210 · 2004-08-02 14:35 · Score: 1

sounds like almost like usenet, for the comments over p2p, and SubEthaEdit for the group editing, with the added ability to include hidden comments, ofcourse.

OK, When will someone by burns210 · 2004-08-02 14:42 · Score: 2, Interesting

OK, these p2p apps are awesome, but I see a problem, they each need to maintain their own p2p system(protocol), by forking from another project it or by writing from scratch or they need to piggyback another network...

When will someone sit down, using an open source model ofcourse, and write the 'granddad' p2p protocol? It doesn't have to require everything, just has to be able to support everything... Encryption, hidden routing(not being able to tell who is requesting data vs. who is just passing data along), multiple source download, huge scaling, efficient and distributed search, etc.

This public network could become the defacto to what open source apps work off of. As long as the protocol is the focus(a nice gui as well, but seperate the frontend from the backend), you could use it link to files on your website, or you could have multiple apps(a music/napster like app, a scientific research paper app, a bibliographies app, a usenet discussion thread app) each of them using a common protocol, and routing between them, but each app filters out the noise it doesn't want.

It could be the killer app, it could have every major p2p app migrate to it. Project Gutenberg, Bibster, linuxiso.org, all using a common protocol and network.... *drools*

Re:OK, When will someone by DrEasy · 2004-08-02 15:25 · Score: 1

Check out U-P2P (yes, I'm involved in this). It's a fairly easy to use P2P framework where the piping is already done, and all you need to do is specify the schema of the type of document you want to share. You can then create a "bibster", a "stampster", ..., and each community is itself part of a "communityster", so you can publish or discover communities using the same mechanism. Would that fit your description?

It's still beta stuff, but there's also some publications along with the code on the site.

--
"In our tactical decisions, we are operating contrary to our strategic interest."
Re:OK, When will someone by Anonymous Coward · 2004-08-02 17:51 · Score: 1, Informative

What you mean like http://www.jxta.org/...

NLP would be nice by tgibson · 2004-08-02 14:46 · Score: 2, Insightful

eTBlast is a bibliographic search engine to which you submit an entire abstract. A little natural language processing and the results returned are to articles which have similiar abstracts. Though the tool operates on the Medline database, there is no reason the algorithm couldn't be used with Bibster.

News that Slashdot won't report--Doom 3 pirated by Anonymous Coward · 2004-08-02 14:49 · Score: 0

Doom 3 has been massively pirated this weekend, at record highs. Apparently, it's shaping up to be one of the most pirated games ever. Estimates are that id Software has lost up to 2 million dollars. Activision isn't saying anything at this point. Gamespot and the BBC both have articles on the news. The PC Gamer editor has some words for the pirates in the BBC article. This setback is set to cost Activision and id Software millions.

Just what college students need: fake cites! by Tsu+Dho+Nimh · 2004-08-02 14:51 · Score: 1

There needs to be some way to double check the citations, or rate the sources of the cites, or those who like to pad their papers and make up scientific-sounding stuff for websites will have a good time with this. Too good, and it will be full of bogus references to Timmy's article on Cold Fusion.

Doom 3 pirated--news that Slashdot won't report by Anonymous Coward · 2004-08-02 15:06 · Score: 1, Interesting

Doom 3 has been massively pirated this weekend, at record highs. Apparently, it's shaping up to be one of the most pirated games ever. Estimates are that id Software has lost up to 2 million dollars. Activision isn't saying anything at this point. Gamespot and the BBC both have articles on the news. The PC Gamer editor has some words for the pirates in the BBC article. This setback is set to cost Activision and id Software millions. John Carmack is reportedly very unhappy.

Re:Doom 3 pirated--news that Slashdot won't report by Anonymous Coward · 2004-08-02 23:15 · Score: 0

Estimates are that id Software has lost up to 2 million dollars.
Except that it hasn't. A download is not a lost sale. The person doing the download may or may not have legitimately bought the product if the download was unavailable - you just can't tell.

Slick solution to a common problem in academia by __aapopf3474 · 2004-08-02 15:10 · Score: 1

Here in academia, a big problem that students and faculty face is managing their personal publications.

For example, a faculty member may be sponsored by several different projects, each of which wants that faculty member to update their web page with each new publication.
Odds are, most faculty will update their own personal page and possibly one project page. This leaves the other projects needing to harangue the faculty member in to updating their pages.

For example, a postdoc comes and visits, write a bunch of papers and then moves on. It would really be nice if the postdoc could take their publications with them to their next position.

For example, you are on an airplane and need access to your usual bibliography,

For example, all your publications are on one machine, and that machine is unavailable.

Bibster seems like a good start in addressing these issues.

Locally, Professor Edward A. Lee had a similar idea, with the added wrinkle of having centralized project specific servers check the repositories of individual researchers and update the project specific list of publications with the bibliography info and the paper itself.

GPG signatures? by jellybear · 2004-08-02 17:29 · Score: 1

Maybe trusted sources could sign bibliographies. You could add certain contributors to your web of trust.

Re:GPG signatures? by burns210 · 2004-08-03 16:40 · Score: 1

kind of a 'i trust the people my friends trust' setup, making the 'load' of finding reliable sources much less.

Forest for the trees... by tsm_sf · 2004-08-02 18:14 · Score: 1

A decentralized, indexed and well documented database that everyone can access...

What's the difference between a database and a hard drive again?

--
Literalism isn't a form of humor, it's you being irritating.

Re:Forest for the trees... by Anonymous Coward · 2004-08-03 02:10 · Score: 0

what's the difference between public distributed knowledge and a private and centralized scheme?

I think that the internet and distributed systems has gained a lot in making avaible information to anyone, so it's time to make efforts in creating structures with the information, that machines and people can understand.

Which raises the question. . . by munpfazy · 2004-08-02 18:21 · Score: 1

. . . will this actually be useful?

>This system will be useful when one has a paper
>in hand, but does not have the bibtex entry.

Perhaps I'm spoiled by working in a field with very good online databases and journals that require only brief bibliographic entries, but it's hard to imagine where this would actually be useful. 95% of the papers one has in hand were located via an online database and came with bibtex entries. On the rare occasion one finds a paper copy of an article and no bibtex entry, it's usually faster to generate one by hand than to find it in a database.

If there are people who find it useful, I'm happy for them. But, I don't see it myself.

It also seems like it could worsen the propagation of errors in citations. An interesting, if tangential, discussion of the topic is in a paper by Simkin and Roychowdhury. (Note that I'm not endorsing the authors' claim that propagating errors in citations indicate that papers have not been read. A more plausible argument is that authors tend to assemble their citations *after* having completed the paper and crib citation text in order to save time formatting their own. Then again, I suppose that suggests that there are a lot of people who actually will use a service like this one.)

Citeseer does not have all articles by Anonymous Coward · 2004-08-02 18:32 · Score: 0

Citeseer does not have acces to all articles, and therefore the results are not always useful. Actually my bibtex file contains many articles that they don't have, but it is somewhat bothersome to share the information with citeseer. I guess that citeseer could make a page that parsed bibtex files to add to the archive, but I haven't seen such a page (I didn't look lately)

The advantage of the p2p client is that it suddenly becomes easy for me to share my information with the world. I can just hope that citeseer grabs the opportunity to download a lot of popular entries to add to their data base.

If course a lot of problems would be solved if "web of science" made a better user interface that allowed me to export my search results to bibtex, or to easily search among articles written in a given year. Similarly it would help a lot if "scitation" was able to support more journals.

Finally I wish to point out that this client is a step towards a better and fairer world. In poor countries, many institutes cannot afford web of science, so this p2p client will provide a nice and cheap way for them to get some useful info. Hooray for bibster :)

Automatic check through wget by Anonymous Coward · 2004-08-02 18:43 · Score: 0

Many big journals have very standardized url's, so it should be possible for the client to use wget to check for the validity of the entries. This would allow me to check my own bibtex file, but it would also allow my client to kindly warn other clients with wrong entries or to create a blacklist of potential liars on the web.

Spying on the competitors by Anonymous Coward · 2004-08-02 18:49 · Score: 0

HOw about all entries from a particular IP-number?
Oftentimes several researchers will be working on the same problem. The p2p client might allow me to spy on competitors to guess what they are working on. Is this good? Does the client allow me to keep some entries secret while sharing others with the world?

Character encoding? by CRCulver · 2004-08-02 20:06 · Score: 1

In what character set is the bibliographic information stored? If one has a library whose titles are in a zillion different scripts, then it is really essential that one keeps one's bibliography entries in UTF-8. Does this system use such an international encoding, or does it expect us to submit to romanizing all our foreign-script titles just to avoid the issue?

FREENET by Anonymous Coward · 2004-08-02 20:10 · Score: 0

oh, you mean Freenet?

Re:FREENET by burns210 · 2004-08-03 16:48 · Score: 1

well, almost... except not...

freenet doesn't scale well, it has huge routing and discovery issues, making it very unusable... a WWW sized freenet would be crippled, not improved. These are bugs in the code, they need to be fixed.

Freenet doesn't allow the hosting of files, it only allows for submitting. I can't have freenet://wikipedia.org hosted on my own box, and thus ensuring that it is online. Freenet only lets me insert a file or website. That site has to be resubmitted repeatedly(daily, weekly, hourly even) creating multiple versions. PLUS, once a site is submitted, nothing gaurantees that it stays on the freenet, it only is in the user's caches until it gets replaces.

Searching and recieving files in freenet is crippled and broken. Even on good releases, when it 'works', the speed and reliability make it not even qualify as a novelty...

Freenet doesn't work... And this is coming from someone who would love to see freenet succeed.

Re:Full texts? User comments? by Anonymous Coward · 2004-08-02 21:24 · Score: 1, Insightful

OK, so if citeseer has text for most articles and abstracts + citations for all, then explain why we need a P2P service to do less?

Er, ah, um, am I just completely incompetent? by phiwum · 2004-08-02 22:04 · Score: 1

Well, don't answer that. This isn't really about me. I hope.

I've installed the thing. It seems to see peers. So I thought I'd search for a very, very common author. I entered Dana Scott. Nothing. I entered Tanenbaum. Nothing. I entered local boy Vaandrager. Nothing. I entered Barendregt. Nothing. I entered "concurrent". Nothing.

I entered my name. I got everything I've ever published. But then I had imported my own Bibtex files, so I'm not surprised (I've never cited any matches for the above). I entered "coalgebra". I got matches from phiwum again.

Is the user base small? Skewed? Am I just incapable of using the damn thing?

(Note: I don't *think* it's firewall problems, but I could be wrong. I don't see anything in the logs though. But I'm a liberal arts moron, so don't expect much from me.)
Worse than this: I'm a philosopher now. I'm not really doing computer science. I'm starting to guess that this tool won't be too useful for me. At least not for a while. Not until Metaphysics gets its ACM topic category.

--
Phiwum's law: anyone that names an obvious law after himself and then puts it in his own sig is just pathetic.

Re:Er, ah, um, am I just completely incompetent? by Noksagt · 2004-08-03 02:39 · Score: 1

The user base is VRRY small. Before I submitted it to slashdot, there were six peers. And I assume it is skewed too--the ACM box at the bottom suggests that it is only for Comp. Sci. texts & experience suggests that that is probably a select subset of this.

The real trick is getting your peers to buy in to the program--they will likely have many references you'd be interested in anyway.

Re:Full texts? User comments? by Onewheel · 2004-08-02 22:23 · Score: 1

What would be really nice is to have the full texts of articles available P2P.

That's quite easy to do: if I have the article in ps or pdf, then the name of the file is the name of the bibtex-key. And every article is in the 'articles' directory next to the beloved .bib file.

it would be awesome if user comments could be added to each citation.

I use the annote field. However, how can you be sure that the review is accurate?

Text interface? Please? by phiwum · 2004-08-02 23:14 · Score: 1

Am I the only one that tried this thing out and thought, "Damn. Look at that real estate."

I'd like it more if I was uploading in the background and my queries had a lighter, smaller interface, say a shell interface. Better yet (much better) an xemacs interface that works well with reftex.

I know that the latter is much too much to ask for a young product, but I hope that the authors give developers (not me) some APIs to get some lighter weight clients out there.

Assuming I ever get a non-local match for my queries, that is.

--
Phiwum's law: anyone that names an obvious law after himself and then puts it in his own sig is just pathetic.

Re:Full texts? User comments? by neglige · 2004-08-02 23:52 · Score: 1

What would be really nice is to have the full texts of articles available P2P.

S2S is such a network for academic users as the target group. It is currently in a test phase. Sponsored by the German government. Also includes an expert client, where you can sign yourself up as an expert for a specific area and get to answer questions. According to the current statistic, the network provides over 1 million documents.

Homepage is here, but in German: http://s2s.neofonie.de/

--
My cats ate my karma. They also wrote this comment.

Re:Full texts? User comments? by BarryNorton · 2004-08-03 01:55 · Score: 1

Citeseer only sees about 70% (iirc) of the papers in Computer Science, and basically none outside... and its attempts at BibTeX are usually rubbish... and it's up and down like a tart's... (well, until the mirrors are properly sorted and stable).

Slashdot Mirror

P2P Bibliographies with Bibster

79 comments