Haystack: A More Compelling View Of Your Data
Peristaltic writes "MIT's Haystack project has released the source for it's "Universal Information Client", Haystack.
In their words: 'Haystack looks into the use of artificial intelligence techniques for analyzing unstructured information and providing more accurate retrieval.' Unlike some attempts I've seen in the past to pull it all together on my desktop, Haystack shows some promise -- One of it's more useful features allows you to take the information you've been wallowing through, and have Haystack continually refine a 'dynamic hierarchy' until you get what you need. Haystack also performs some neat tricks such as combining Email, IM, web pages, etc. into a single inbox."
Haystack also performs some neat tricks such as combining Email, IM, web pages, etc. into a single inbox
It may just be me, but this is a feature I never want.
I do not want 1 large program to run all of my applications. I do not want to get my email, from where I get my web pages, and my IM. I don't want any of this.
I am quite happy with seperate programs which I can use at my pleasure. I'm happy with the lack of bulk, and the fact I can change an email client without changing a web choice. (although I only use pine anyway.)
Is this just me? Do all of you want your programs shoved together in one large application?
I didn't get any options on my cell phone (like text messaging) because I purchased a cell phone. I wanted a cell phone. To make calls. Nothing else.
http://use.perl.org
From the system requirements:
- Pentium III 700mhz-based computer or better (Pentium 4 2ghz strongly recommended)
- 12 megabytes of RAM (768 megabytes strongly recommended)
s/strongly recommended/REQUIRED/
Wow. Looking at the Haystack site with Mozilla looks awesome! I don't know if it's my version (1.4rc1) or some weird image setting, but the main image on the page stays stationary as I scroll around, but the clipping of the image changes. It's really hard to describe, but looks awesome.
Of course, IE just renders it properly. BOOOORING.
like finding that pesky needle?
Can it organize 3 gigs of random pr0n?
All Troll + "offtopic" mods are meta moderated as "Unfair", because you abused the system.
I see Haystack and I think of one of the old, old IDS systems/companys. Stalker, right?
On the other hand, it's a damn fine name for both types of product, and I haven't seen anything on the Haystack IDS in a long while...
As much as I hate to say it, doesn't this look exactly like the "Outlook Today" page from MS Outlook?
Or maybe that's just because the A/B comparison is too easy because they're both open on my PC at work now...
Have you seen my stapler?
I'm worried that this could be used for data mining/viewing by the federal gov't. Having this software know all about you is a bad idea.
What does this mean? I'm confuzzled... Hmmm... E-mail, webpages and IM into one inbox... Wait a second, IM in an INBOX? IM is INSTANT MESSAGING. How will this work?
"Your effort to remain what you are is what limits you."
Not too much to ask, it doesn't even need to be truly a filesystem. Just overload all the file access commands (At this point, probably easier to just write a new filesystem)...
Group data by category, content, whatever. "Symlink" to the inodes, and you're off. We don't need AI for that and I think it would be a more complete solution. I don't see an AI engine that can correctly categorize my mp3's, I don't think I'd trust it for all of my data yet. Let's start small and get usable systems.
Spiffy program though, wish it weren't in Java... wish it weren't 42MB... wish it ran smoothly under Linux. I'll stop complaining now.
On a side note, Did anybody else find that scrolling image annoying and mentally confusing. Er, I'll really stop complaining now.
Dacels Jewelers can't be trusted.
Can you say SKYNET ?
Now the world has gone to bed, Darkness won't engulf my head, I can see by infra-red, How I hate the night.
The ultimate test for such a system is putting my inbox into the information stream. At the end of the day, 99% of it better be trashed automagically.
I've heard about another tool much like this one, which have been developed through eXtreme Programming. It is a hybride between assembler, aspectoriented Java and Perl, and is platform independent as long as the platform is Linux or FreeBSD or, perhaps, an obscure Java-based realtime-appletkernel. It is used to produce a new kind of buzzwords.
Isn't this just the equivalent of the usual attempts by designers to make applications to be everything to everyone instead of just doing one frigging job well?
"Information in One Place" - I already have access to all my personal information and files in appropriate windows. So does anyone using a basic OS.
combining Email, IM, web pages, etc. into a single inbox
Whatever happened to the "does one thing, and does it very well" philosophy? If I sorta remember that I got something in an e-mail, I look in my e-mail. What's the advantage of throwing away that piece of information (where it came from)?
Yes, it's nice to use the computer to do grunt work for us, but there are some things that are better left to the user. Some of us like to come up with little "systems" for organizing things that are unique to us. We've all heard stories of the receptionist who files contacts under 'D' because new contacts are always invited for Drinks. An AI is not going to be any more rational than that, and the kooky system it devises won't be in our heads--it'll be in some obfuscated format that nobody will understand, not even the ditzy receptionist.
For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
Perhaps these teams should share some notes. I'm not sure about the web page info inclusion, but I'd love to use a top notch PIM that could data mine MY data.
As for the resources... perhaps you could tell it when you wanted it to data mine, so the mining would ot interfere with other work.
# 512 megabytes of RAM (768 megabytes strongly recommended)
Damn.........!!!!!
1. No sig. 2. ???? 3. Profit!!!
What's interesting about Haystack is that it seems to take advantage of such an algorithm, and in such a manner as to permit multiple feeds simultaneously on harvested data. Hopefully, the source is available somewhere to check out, because I'm curious about how they managed it.
I'll believe in their AI when I can type "X free" as a search query and it returns a link to www.xfree86.org instead of a million links to pr0n sites. Does this AI learn what people search for usually ? is it able to determine over time that capital-"X" and "free" in my particular searches are about opensource graphical software, unlike the same query by the dirty old man next door ?
By the way Haystack people, when you use titles and phrases containing "universal", "seeks to bring [...] to the average user", "artificial intelligence" , it trips my PR bullshit meter. I was about to bail out when I noticed the download link.
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
I thought Emacs already had that!
You could replace a desktop with this sort of interface, where apps you run would integrate into the one tool.. publishing information about their progress, etc
meh
Isn't haystack the problem that this tries to fix? I think this project should have been called 'needle' or possibly 'findy.'
Haystack looks into the use of artificial intelligence techniques for analyzing unstructured information...
Does this mean that we finally got a tool that can help us understand our politicians?!
I didn't see an explanation on why is that????
Suppose it has something to do with the JVM or is it a source code issue???
As far as I could see they don't provide an answer....Any guesses??
1. No sig. 2. ???? 3. Profit!!!
There was only one measly screenshot in the overview section, and NO screenshots in the screenshot section, so here's another one.
up and running is now like trying to find a needle in a haystack.
*rimshot*
Thanks /.
Are you bovilexic? Moo!
Honest to fuck, I'm going to add to my resume:
- Knows the difference between "its" and "it's" and will wield them correctly at the slightest provocation.
But can it find an apostrophe in the possessive third person neuter pronoun? No, because there isn't one.
Opera does all of that - well, okay, it doesn't do Instant messaging, but i don't use that anyway.
But it does have a download agent, a web browser, a mail client and a newsreader all in one.
And its only a 3.7Mb download.
An infinite number of monkeys will eventually come up with the complete works of
Its about time someone pointed this out!
When will this site be able to correct it's own grammatical mistakes?
Nothing like slashdotting MIT to make you feel like you've accomplished something! How's your precious class-A IP registry now?
Sincerely
Bunker Hill Community College
Sure would be nice if this ran fluently on other platforms.
This reminds me of the digital dashboard concept from MSFT and others a while back. Basically the dashboard aggregated email, stock feeds, etc... and displayed it all together... Reuters, Avaya, and some other companies even came up with slick features to display IM, and Voicemail inside the email discussion thread as well. I really don't see this as anything new or noteworthy... Seems like they got a long way to go.
http://web.archive.org/web/20020210150513/http://h aystack.lcs.mit.edu/index.html
It's assumed that if you don't run windows you are inteligent enough to organize your own info.
Its a grass-roots movement that will have to act anonymously.
So imagine a Linux distribution with the future ReiserFS as the filesystem, and something like Haystack (rewritten in C) as the desktop, with a nice API...
Gnome and KDE have done a great job at catching up to the Microsoft desktop, maybe even improving it incrementally...but something like this would make Windows and OSX look as primitive as Win 3.0.
Or use someone elses bandwidth at any rate. /.ing an .edu site is quite impressive.
Gaint haystacks
And if, after thinking, you still hold the same view, please promptly uninstall your Operating System of choice.
The REAL jabber has the user id: 13196
What you do today will cost you a day of your life
So now instead of finding the needle in the haystack, we have to figure out how to put the needle in the haystack?
Doesn't sound so hot to me...
It's assumed that if you don't run windows you are inteligent enough to organize your own info.
That would be much funnier if it didn't run on Linux.
Wait a minute .....
"and bubblesort just won't cut it"
:D
well, see; that was your problem right there: radix sort!
What's the big deal? Yahoo and other portals have had customizable news/weather/mail/todo/calendar pages for years.
http://216.239.33.100/search?q=cache:KtKSHcwESUQJ: haystack.lcs.mit.edu/+&hl=en&ie=UTF-8
A single all powerfull programme to do everything ... hmm sounds like Windows ... or Sauron's ring
for Haystack at LCS recently, and was not that impressed. It is designed to do certain kinds of tasks very well (e.g., editing things that are embedded in other types of information - the tests given were things like "edit this picture that's a part of this entry in your Outlook address book"). Unfortunately, at the expense of making these tasks as close to one-click as possible, other things (versatility the most, but also common sense design) have failed.
I find it easy enough to edit information of the "My Documents" variety without worrying about how it is integrated into other information on my computer, and I'm sure other readers here do, as well.
The best way to actually use this software would be in the case where John Q. has a specific task to do over and over again but isn't ready to tackle a batch process.
KISS.
Haystack is an interesting idea, but I have a hard time distinguishing what it does from what, say, Lotus Notes does. And Lotus is _terrible_.
I like the idea of bringing all my information together in one place. I don't like the idea of only having it in that one place. What I would like would be an application that can watch how I use the computer, then bring those applications together to make it more seemless.
For example, I have about four different calendars in my life: the work calendar, the one on the cell phone that I use for stuff that I can't miss, the calendar that schedules airplane rentals, and (of coursE) my girlfriend's calendar. So how do I bring those all together, and yet still be making entries in them separately?
The same is true for information. I have a primitive blogging system (really just a bunch of text files that are date coded), I have work documents that I use regularly, I have web pages that I monitor (sometimes a little too often) and I have textbooks that I'm reading (instrument flying at the moment). So how do I get all these forms of information - or at least an index into them - together in one place? But again, without changing the current organization scheme.
This is the tool that will make the computer a lot more useful - an actual organizational tool.
Rudy
1. 2.
A file system with the power and flexibility of a relational database ceases to be a file system. What are things like "cp" supposed to mean? How do you transfer "a row" through a serial connection? What kind of transactional guarantees is it going to make; if it's going to make DBMS guarantees, it's too slow for many file system applications, and if it's not going to do that, is it really a DBMS?
If you want a database, just use a database. MySQL and various embedded databases are widely available on Linux now; no need to clutter up the kernel.
tried to download it and it looks like it is /.
I like clickable links
Representing data is all fine and good, but how about a program that actually _derives_ data worth representing?. For me, being able to represent my sock drawer in a nice, organized manner doesn't do me much good.
Contact Me (got tired of viruses emailing me).
"Ever start a conversation in an instant messenger, switch to e-mail, then switch back to instant messaging just to realize that you had forgotten what the conversation was originally about?"
No.
I'm with those who prefer their apps separate.
Not quite as impressive as it might seem. Research groups running inside LCS all run their own servers, so we're probably just banging on some poor Dell box. Also, lots of groups use their workgroup server for everything like nfs/smb/print/web and even number crunching. Maybe they're testing Haystack on that server too... ouch. When we've slashdotted the main MIT web site (web.mit.edu) then I'll be impressed.
I've seen it on other pages using IE
Big whoop
I need something against my haystack, not a software that does more sedimentary data storage.
Emacs already has all that.
Emacs!
Vi!
Come on come on, one and all, joint the flamefest!
Mr. Godwin, meet the nazis of the unix world.
Infuriate left and right
The sys reqs for Haystack do seem a bit absurd. Wouldn't Haystack really have it's place on a PDA where you'd want fewer programs sucking up your hard disk space without giving up functionality? I'd freaking love it if my iPAQ had one interface for all my needs. All they have to do is "shrink it" down req wise...but i guess if it were as simple a job as putting a word in quotes then it'd be done already.
"No, seriously, I AM a wallet inspector"
I am attempting to run it for the first time, it's been going for 10 minutes and still running (I hope, damn Java app).. I'm on a machine with a 2 GHz CPU, and 1 GB RAM.
Six Degrees by Creo is another attempt to do this same sort of thing, except that it's commercial and it's been available for Mac OS X and Windows for several months.
OK, it finally finished.. then... CRASHED! I'm done. Deleting this, and Java from my system :-)
(no text)
I've been waiting for this for a few weeks now. I've been looking for a PIM that has email, calendar, and tasks. Apart from Outlook, what product has that? I have recently tried:
Outlook
OSAF's Chandler PIM
Haystack
Pogomail (not a PIM)
Eudora (not a PIM)
Mozilla
I am now using Mozilla because it has bayesian spam filtering built in and because it has a calendar plug in.
I have decided not to use Haystack. It is simply not production ready, and I'm sure the guys at MIT wont mind me saying so. It crashes. It locks up. It doesnt have undo!!!! I cant tell you how many times I screwed up one of the panels and couldnt get it back. I also couldnt figure out how to delete spam. I get about 200 emails per day, of which 8 arent spam. I could use a pop filter, but I have an emap client too.
However, I am very impressed by this software and it is absolutely the way forward. I *want* my information integrated. I want my tasks to automatically reference the people I need to do them with and the web pages I used for reference and the dates in my calendar. I want my contacts to appear in many different categories, instead of as a different copy in each category all of which I'd have to update.
I want email and calendar and tasks to be like a light switch or a tv. I want to just turn it on and it all be there. This software is fabulous and you would all benefit from giving it a test drive, even if you ultimately uninstall it.
included Haystack into a pending patent, which they keep open for this purposes, with enough buzzwords to make Haystack appear to part of the original patent application, which could be several years old.
The USTPO will approve it no doubt, and then the lizard will start a crusade against those who 'stole' his 'Intellectual Property', siting the original application date of the patent as proof that he had developed Haystack much earlier than MIT.
I haven't met a lawyer yet who didn't think he wasn't an expert in every other discipline on the planet.
"One of it's more useful features allows you to..."
Please people, engineers, and programmers--English grammar is no different from any science, there are rules and you should learn them!
it's == it is
The correct rendering of this sentence would be:
"One of [its] more useful features allows you to..."
Even a second grader knows the difference, please educate yourselves and others, don't use the excuse "but I can calculate quadruple integrals, spelling and grammar don't matter." They do, it's the bare minimum. I can also calculate quadruple integrals, but at the same time write grammatically correct sentences, so what's your claim now? Don't be ignorant, broaden your knowledge or you will pass on these and other fine mistakes to your peers and children, that in and of itself should be considered a crime.
No wonder that the app is slow, it was made with java. I can't understand why people still use that slow framework.
Scopeware Vision is similar but better than this. It only requires 128 megs of ram!
Try the 30 day free trial. It rulz!
So is this the emacs or the microsoft.bob of net apps? Or maybe the microsoft.bob.emacs of net apps, heheheh.
operates you!
No more nasty "thinking." It will tell you "where you want to go today," how you want to get there, how you want to see it and how you store it for retrieval.
Choice is good, so choose not to be bothered with making your own choices. That's the ultimate choice. Brainless sheep for diversity unite!
Of course this is just a preliminary step along the way to the ultimate goal. In the Brave Newer World of the future you will turn your computer on, stick it in the closet, and it will simply do everything for you on it's own. Read and answer your mail, conduct your business, play your games, etc., all without any interaction with you at all. Think of the time and productivity savings!
Once a year it will print out a summary of how your life is going on a 3X5 card in easy to read large type which it will place in a location it deems to be most suitable for such an item.
Bend over. It won't hurt your computer a bit.
KFG
Vi rulezzz! James Joyce rulezzz also!
I know because I added that feature myself :)
After 15 minutes of bootup/installation process with 100% CPU and 150 MB memory usage it just disapeared from process list.
Not good enough for usability experts.
is extended ext2 attributes (google for the kernel patch and userland utils)
It allows you to give attributes to files, (like "category")
I understand there's also gnome support.
THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
From the Design Principles:
"...provides a single, uniform interface for manipulation of e-mail, instant messages, addresses, web pages, documents, news, bibliographies, annotations, music, images, etc."
"...attempts to match a user's own focus on objects in view and what can be done with them. An operation (such as spellchecking, sending an e-mail message, or rotating an image) can be invoked at any time on any object for which the operation "makes sense" (i.e. a blob of text, a person, or an image respectively)."
Back in the heady days of the PPC 601 and the Newton, one of Apple's software groups was working on this problem exactly. While I don't think OpenDoc could organize your information, it was certainly a uniform interface for manipulating stuff, with the focus on the stuff, and not the application in use. At that point, about seven years ago, I naively believed that one day OpenDoc would provide an environment in which I could edit a web page and all elements (including raster and vector images) without having six applications loaded. Ha!
idd, all of the above and learn to use quotes "X free"
;p
:)
however i must say i prefer "XXX FREE"
or better yet, an 'i feel lucky' on
"* my big black * you * *!" (very funny
seriously, go to searlores.org and learn to go00ogle
With Unlimited calling, versus a pay per text message, of course I would choose a call.
I seem to forget that not everyone has that. So obviously my reasoning is different than everyone elses.
Understand your point of view very much now.
http://use.perl.org
Just tried it. The UI response time is so slow that I uninstalled it ten minutes later out of frustration.
All you need do to find a needle in a haystack is to sit down in the haystack. Murphy was an optimist.
Mac OS anything? You a Mac weenie buddy? A ONE button mouse idiot? Go home, and do not come back without a note from your mother.
..the idea of storing away a "UI continuation" as an object, which can be used to restart an operation from some pending point.
Integration of email and IM is already available in commercial products like WebSphere Portal Server.
catch (ModDownException mde) {post.modUp("Interesting")}
i just wonder about one thing, can one build extensions to it? like say instead of building everything from a webreader to whatever the haystack system just act as a kinda glue and whoever can maybe write a better web part or mail or maybe some games to it? if so then i see this as the next gen UI! context sensitive action menus (rightclick menus), the ability to gather names for anything from meeting chedules to mass mail sendout from anywhere, having the ability to not only bookmark pages but allso actions, and i must say i realy loved the idea that any action you want to do and that needs more feedback puts a box to the side so that you can go around until you are satisfied. the worst idea every for the desktop methafor was the part about stacking windows! im downloading, maybe it will chug badly on my pc but i belvie for the most part that its the "database" system that is the reason for this as its digesting every mail, im and whatsnot that passes tru the system! hmm, someone commented that all this was stuffed into emacs. i wonder what FSF/stallman was aiming for with that one, having it run as the UI on top of the hurd directly? i wonder why they dont just merge the 2 projects and realease it as GNU...
comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
So, is this similar to Microsoft BOB?
Lotus Notes.
I used this product back in 1998 on my then job. I liked it pretty well then, and I understand they've made some improved versions even.
Buy Text Processing in Python
of this discussion and I'm posting late so this might be a little redundant.
,you can just use the file system. You could just think about a computer in terms of the HFS that everyone has. But does everyone actually want to? A Tree based file system works for some people, but others like lists, and want everything to be in huge (really huge) lists. Instead of having to drill down into their deep, nested file system, some people like to have all their files in one folder. (Usually the desktop)
After reading the information at their web site I think that they are trying to do the same thing that Apple is doing with tthe iApps, just that Apple is slower because they chose to integrate them differently and Apple also has to produce something that people can use now.
They want to basically integrate the different types of data that we all use on a daily basis, email, web, IM, etc in to one data repository so that any program can use them. Instead of having to write a seperate address book for your word processor and use that for mailing labels, just have one large address book for your entire computer. Apple's address book does the same thing. Instead of having a different spelling checker for every single application, use one global one that has a custom dictionary, that way you don't have to enter your last name into like 5 dictionarys.
They want to provide a unified interface to your data instead of 5 disparate ones. Apple is trying to do the same thing. Trying to seperate the data on the computer from the interface. So that the user only has to deal with the interface and not the details of files.
Yes
Some people want to deal with their data in tree format, some in list format, maybe some remember everything by name and others like lots of pretty colors and icons to represent their data. People who have used the mac might know about Finder Labels that let you assign colors to files and folders. So you can then make say a red word document called 'resume' and that way you can remember that it was for emergency use only. Or some other such scheme that one might invent for their own data.
The point is that everyone is different and today's programs don't allow you to interface with your data in many ways. Since every program has it's own data store, it makes it hard to move data from one application to the other. Word maybe your favorite word processor, but when you want to use features like the mail merge, you have to massage your data from whatever program you are using to keep track of all your friends. You could use the outlook integration features, but even then it's using 2 seperate programs to access your data, since Outlook is storing it. Really, shouldn't there be one source for all your personal data on a computer and then outlook draws from that when it needs addresses or Word draws from it when it needs addresses.
alt.binaries.erotica.brunettes.
I use Emacs for mail, news, IM (via erc/bitlbee), as frontend to various websites and wikis, and as a planner/calendar.
I've been planning to switch my phonebook over to BBDB as well.
doesn't mean it's actually a properly framed idea. The market pressures of usability have pretty much spelled out the answer long ago... different functionality calls for discrete apps in this instance. There's simply not enough synergy between IM and Email being in the same place to make it worthwhile... it's just cluttered. It would have happened long ago and been successful if it were useful because it's not technically very difficult to accomplish. Blah. Some ideas that come out of that place are pretty weak (and others rock.) Oh well.
What I think would be cool would be a multidesktop type of environment. No, I'm not talking about multiple virtual desktops either.
You could have a different desktop for each project. You might have several emails for the given project, a few documents and spec sheets, some pictures, and some code. Keep the hierarchical file system underneath. Everything on the desktop is a link to something in the filesystem. Make it easy to copy, manipulate and navigate between different desktops. Basically, this would be an alternative hierachy, independent of the filesystem hierarchy.
No, I don't trust in god. He'll have to pay up front, like everybody else.
http://www.enfish.com
Same thing -- hard to make it fast enough.
Reading through the information on the haystack website I got a small deja vu feeling... a few years ago I designed a system that indexes information on your harddrive using (among others) the google directory to create dynamic categorical hierarchal views on your data ( also in RDF ).
Since then the project kind of stopped, but if you're
interested in this sort of research, please check it out at http://atoms.sourceforge.net/
Being the work of a single person, atomsnet is far from a polished application. I still believe the underlying idea has potential, though. I just can't find the time to expand the number of indexing plugins to make it more useful at the moment.
NB A paper about atomsnet was published at CIVR2002
I work in a small financial services company that contracts through a much larger one. The amount of contacts that each of us makes each day is mind boggling. Haystack was such an exciting idea (I read about it yesterday) that I had to run out and see it for myself. Well, haystack itself is something like what we need-- but it's current iteration is way too buggy.
/. posts, I found out about Creo's product, Six Degrees. Unfortunately, I've found this review of that product:
1 36-1.html?tag=rating
So, reading other
http://www.cnet.com/software/0-3227888-1205-20242
That ain't good!
But, to everyone who's reading, I've got to explain just why this is such an exciting idea to mo. Okay, as I said, we recieve an amazing number of contacts each day. If we could track that, we would be infinitely more efficient. If there were a program that could track that, as well as correlating documents on the hard disk to those people, we'd be golden! The trick, of course, is to have the program do this all transparently, because no one really feels like doing all of the correlation manually. In the end, color me excited, very much so, about Haystack.
Also, if anyone knows of any software for windows that does what I'm looking for, please let me know at
J Gad ik i@L U C.E D U
Thanks!