New Continuous Support System
An anonymous reader writes "eWeek is reporting on a new continuous open-source support system that helps to keep tabs on your mission-critical applications by providing constant diagnostic monitoring. The system is designed to match specific 'signatures' from your applications to a database of over 200,000 possible 'problem' signatures and alert the user for correction or analysis. From the article: 'SourceLabs' Continuous Support System features what Sebastian calls "adaptive diagnostic probes" that are fully integrated and configured for customer environments. The probes identify production issues and begin to gather diagnostic information to help get to the root of the problem, he said. Indeed, the probes can be configured so that as soon as a problem occurs, the SourceLabs support team extracts system information to find and resolve the problem. And the system includes a database of more than 200,000 signatures of problems that might occur.'"
I dont understand. Is this an advertisement?
Was that a news article or a fluff press release? It'd be nice if the editors could let us know in advance when a slashvertisement plug is posted to the front page.
Captain, I am receiving unusual data from the alien probe.
Analysis Spock?
Insufficient data. It may be a successful penetration from the Romulan sector. Or...
Or?
Or accounting is performing their end of month reconciliation jobs.
Is it just me or is the FA completely devoid of useful information about exactly what and how the "SourceLabs Continuous Support System, technology " works? A non article. I have no idea how it differs from say Zabbix or Nagios.
Deleted
I don't really trust these signature-based systems. Like viruses, you have to update them whenever there are new ones out, which means that the problem has to occur in order to get its signature. And, if you have something like this, you probably don't want the problem to occur at all.
Send email from the afterlife! Write your e-will at Dead Man's Switch.
So it scans application log files for errors and then helps people find fixes for them? (I think, TFA was a bit light on details.) News for nerds, maybe. Stuff that matters, definitely not.
he system is designed to match specific 'signatures' from your applications to a database of over 200,000 possible 'problem' signatures and alert the user for correction or analysis.
The interesting thing is that no matter which 'signature' is noticed, the alert always reads "omfg n00b! read the fvcking manual!"
See my journal for slashdot ID's by year. Mine created in 2005. http://slashdot.org/journal/289875/slashdot-ids-by-year
http referrers from slashdot.org :)
--fatboy
How is this different from splunk? Now if it fixed problems for me...
http://igotyoursidekick.spreadshirt.com
Yes, but is it proactively-positive-change-process-oriented?
I am really fascinated by this trend of selling support for open-source software. If a company creates a free, open-source product, and then uses support as their business model (RedHat, for example), doesn't that produce a conflict of interest in regards to the quality of their product? If the product is difficult to use, they will make more money off support. If it's rock-solid and completely intuitive, their revenues will crumble. Am I making any sense?
If you take away my morning coffee, I can probably generate all 200,000 matchable problems in a day's work...
Maybe they should just assume the marketing and sales adage "The customer is always right" and just forgo the whole support system all together.
P.S. Sorry for the lack luster sarcasm, but a story about customer support and problem signatures is a bit to exciting for me to make fun of. Seriously.
Berserk Manga > All
What kind of signatures? What kind of diagnostics? What the hell, exactly, is this article about?
And no, I'm not going to RTFA...if the submitter isn't articulate enough to succinctly describe what it is he or she is submitting, I'm not going to waste my time following the link.
Instead, I'm going to waste my time writing inane comments such as this...
The most common problem in their database -- PEBKAC.
#include ".signature"
I can't count how many times I've heard this before. You either get spammed silly by alerts or turn the alerts down and then do what you did before you bought the product.
Sometimes you can get some use out of them but you've got to spend a whole lot of time with it in setup and ongoing adjustments.
Too many managers buy these things expecting a "Magic Bullet" solution.
The obscure we see eventually. The completely obvious, it seems, takes longer. - Edward R. Murrow
http://www.sourcelabs.com/
Seriously, when your pc is hooped, some diagnostic program looking at threads and poking around in ram isn't going to tell you much. I don't see this as being much more useful than the windows 'An application has crashed, LOL if you haven't backed up your work'.
It's already working - they have redundancy in place in this advertisement :). In case you missed the first occurence of 200,000, the second should help!.
This sounds like a marketing gimmick from AOL to add even more useless software to my computer and use even more system resources.
Eat all you want and NEVER gain a pouind with our revolutionary Continuous Support System! That's right folks! And how much does it cost? Don't answer yet!.....
I judt got a nre Kinesis keybiartf so please excusr ant egregiou typos.
Anyway, it was a very interesting and difficult problem. One of the biggest rubs was the level of assurance you had to provide. In otherwords, can you let the system make changes on its own or should it just recommend changes? If the system mis-diagnoses even one problem, it might break more stuff than it fixes. Most monitoring tools have big problems with 'false positives'. Add to that that the system can't necessary 'undo' all changes. Our solution was to allow the administrator to run the system in a variety of modes so they could choose if the system applied the fix automatically, with approval, or just suggested how to fix the problem.
As for how the system actually works, it basically takes a middle approach between ML (machine learning) and KR (knowledge representation). Basically, either you can hard code all the types of problems you have in a KR language, or setup some big neural net (or other ML algorithm) and let the system 'learn' problems. We split the difference and added some domain knowledge. Certain types of 'features' (parts of a diagnose such as the disk is slow) were diagnosed by ML algorithms, but ultimately KR rules written by Exchange experts actually diagnosed the problems and suggested repairs. A very time consuming, but more reliable solution (but less cool).
"Those that start by burning books, will end by burning men."
From the company website:
I've written a program, and immediatly adapted my failure signatures database. My database now contains 3 failure signatures:
- No output
- Non zero return status
- Any output that is not 'Hello world'
If it wasn't the first program I've ever written and I had more time, I probably could get to 200000.
...write a fluff piece and post it on /. - the lashback and muttered grumbles will ensure product recognition.
Well, you know the old saying: "Any sufficiently advanced technology is indistinguishable from a rigged demo". - $RANDOM
"unprecedented timeliness and effectiveness for enterprise software support"
I'm sorry, but I thought I need support when something unusual occurs or I want to do something unusual with the software. Timeliness and effectiveness is allways required, but how can a 'bot provide support? Support is one of things that explicitly is *not* provided by software but by humans, no? Our does this software include automatic hacking attacks and phone pranks on OSS developers that don't update, bugfix or document their projects or what?
Marketing babble. Won't work.
Let me guess: Some guy at marketing discovered how neat filtering and spidering works with Regular Expressions combined with some http lib and had is favourite programmers bolt some system together. Sorry, guys, but you've got yourself a piece of shelfware on your hands, errm, shelf. No Wiener. Back to square one.
We suffer more in our imagination than in reality. - Seneca
Error 104,237: User incompetence
...for NewsForge, but we don't have staff to waste on regurgitating press releases. We tend to wait until we can either review a product ourselves or until we can find some actual companies using the product and talk to them about their experience with it.
But that's just us...
- Robin
Here's the one-sided phone conversation, as heard from a neighbor of the support person at SourceLabs.
Hey, is Arnold around? This is Frank over at SourceLabs.
Hey, Arnold. It's me again. How's it going tonight?
Oh, really, it's 2:30am there? Wow.
Yeah, I know what you mean. Yeah, it's raining here in Seattle, of course.
Hey, listen, the reason I'm calling is because your shit, yeah, yeah, it's crashing again.
Hey, don't blame me. Talk to your manager about it.
Well, he's the one that bought this support.
Listen, though... the stack trace pops up on my screen here and I'm supposed to give you a call.
Well, yeah.
Yeah.
I mean, it's 24x7. You're somewhere in that 24 and somewhere in that 7, so here I am.
Yeah, I don't enjoy this either.
I know what you mean.
Well, the stack trace looks like your Oracle database is hosed again.
Yeah, tell me about it.
Well, you're using the thin-client drivers.
Looks like you can't get any JDBC connections. What a bitch.
I know, sucks that your site is down. What a pisser.
Well, most people monitor this kind of basic stuff on their own.
Yeah.
Uh huh.
Well, maybe some log4j and Nagios would work. Or something.
Yeah, really. It'd save the time it takes me to call you. Good thing you're only taking like 100 orders/minute at this time of day. Heh heh heh.
Yeah, I had to wake my ass up early this morning, too. I'd almost rather be doing drywall at the new McDonald's.
Yeah, ok, cool. Well, see if you can get your Oracle P.O.S. back up again.
Definitely.
Cool.
Well, I'll probably talk to you soon. Bye!
The system is designed to match specific 'signatures' from your applications to a database of over 200,000 possible 'problem' signatures and alert the user for correction or analysis.. News from 2010 - A company has created a system for a computer to self-diagnose, making humans obsolete. Unfortunately, a test drive of the system in front of the press resulted in the computer getting extremely confused.
RUN CARTMAN RUN!
I have mod points and I am not afraid to use them.
My firm uses a product called Black Box to do some similar things in the .NET world. It detects exceptions (based on how you compile the application into your code) and allows both messaging to a host server as well as data collection for collecting data on exceptions that might occur in production environments.
This is stricly Buzzword 2.0 compliant! It is pro-proactive positive 2.0 change 2.0 process 2.12b3 synergetically synergetic! Ok, Buzzwords 2.0 beta 3. I think the Buzzwords 2.0 are just writing up a finalized spec for iso.
I have freaks! I did something right...
Nice, however the "I don't like apps controlled by someone else" crowd will look at this "product" with suspicion.
The product might work something like Zenprise for Microsoft Exchange. The Zenprise product does the following:
This is very different from a Nagios system -- Nagios only gathers static sets of telemetry and presents a monitoring view of these -- Nagios doesn't interpret any of the information to come up with specific root causes for the problem or resolution plans for the problem.
Nagios might show, via a graph: your disk space remaining on drive M: has consistently been growing on the Exchange mail server for the past 2 hours and is reaching 95% capacity. Nagios cannot say: your disk space remaining on drive M: has consistently been growing because mailstore MS1 has been growing because userX has been sending a flood of e-mails because their client has been infected with virusY, which is something a tool like Zenprise might be able to tell you.
Here's a basic explanation of how products like Zenprise might do this.
OK, I'm thinking that this is just an expensive version of http://www.zenoss.org/home/index.html.
I'm still looking for a way to monitor all my favorite software for updates and possibly even download/install them automagically. Sure some programs have Internet updating capability, but I want an all in one app! Every time I want to make a new install CD that puts all my apps on for me I spend hours going through bookmarks getting the latest versions. Some combination of RSS feed reader and web page scraper is prob what is needed, but with the ability to download files. Hell I've been thinking of writing a basic one in AutoIT3 but don't really have the time to invest in doing a new project from scratch. I spend more time updating software than I do diagnosing or fixing problems.
Jonah HEX
Horror & SciFi Erotic Nudes
It's not just no link in the summary, but none in the article, neither.
Bruce P. summarizes it below, and a poster above mentions Zabbix and Naggios.
There's been a bunch of interested work in monitoring and diagnostics with "Netsaint / Nagios for some time. SysAdmin has had a few *very* cool articles about not just network monitoring with it, but resource monitoring and preventative maintenance of all kinds.
IT Groundwork's done some very interesting things.
SpikeSource is doing similar stuff (presumably so "you don't have to").
Splunk is interesting (w/r/t checking log entries against know issues in an automated fashion.)
We've leveraged Nagios for "preventative diagnostics" of our Test, Dev and Prod environments. It's worked very well at our scale.
I'm less inclined to get excited about stress testing Java middleware as my hope is JBoss, IBM / Websphere, BEA and Oracle would already be doing that for me. If I'm using Tomcat or Resin, it probably means it's because I can and am less concerned.
I'm going to check out Zabbix now - thanks for the tip.
S
http://www.meanbusiness.com/
When we put together multiple-redundant systems with a view to achieving high availability, we tended to find failure modes which we called 'sympathy sickness'. One of the pair would fail for an unanticipated reason, and then that would induce a new failure mode in the second; and you'd have to diagnose a more complex failure situation by hand to get things going again. I've got this suspicion that having a list of 200000 'problem cases' to look for will just ensure that you don't find any of these 200000 problems. We have 'elephant dust' in our pockets; it is very successful at keeping elephants away, I have never seen one in my street at all ever ...
I mean, I've been to a McDonald's in Las Colinas.
My turnips listen for the soft cry of your love