Slashdot Mirror


Ask Slashdot: Knowledge Management Systems?

Tom writes: Is there an enterprise level equivalent of Semantic MediaWiki, a Knowledge Management System that can store meaningful facts and allows queries on it? I'm involved in a pretty large IT project and would like to have the documentation in something better than Word. I'd like it to be in a structured format that can be queried, without knowing all the questions that will be asked in the future. I looked extensively, and while there are some graphing or network layout tools that understand predicates, they don't come with a query language. SMW has both semantic links and queries, but as a wiki is very free-form and it's not exactly an Enterprise product (I don't see many chances to convince a government to use it). Is there such a thing?

134 comments

  1. enterprise grade is weasel. by nimbius · · Score: 4, Insightful

    Kinda offtopic, but bear with me. Enterprise grade is what closed source rolled out once they started losing sales to well maintained and stable open source projects. it comes with support contracts and licenses, but not much else. Just as many closed vendors will disappoint you with their support as open source. You could argue wikimedia is enterprise grade, because it supports 1.21 million accounts. but unless and until the business is committed to defining exactly what they mean by "enterprise grade" you have nothing to go on other than "software that requires a purchase order and recurring license"

    that having been said, check out foswiki. search and control are all pretty good.

    --
    Good people go to bed earlier.
    1. Re:enterprise grade is weasel. by Anonymous Coward · · Score: 0

      "Software that requires a purchase order and recurring license" can have its benefits when you are looking for someone to bear legal responsibilities when something goes wrong.

    2. Re:enterprise grade is weasel. by Kohath · · Score: 4, Insightful

      Theoretically. In actual reality, seeking a legal remedy is very expensive and is reserved for extreme problems.

      The real leverage is the money you're going to pay them next year. Or not pay them.

    3. Re:enterprise grade is weasel. by Hognoxious · · Score: 4, Interesting

      If they're big enough to be worth suing, they're big enough to afford decent lawyers.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    4. Re:enterprise grade is weasel. by Anonymous Coward · · Score: 0

      He never said it had to be open source.

    5. Re:enterprise grade is weasel. by Anonymous Coward · · Score: 3, Interesting

      "Software that requires a purchase order and recurring license" can have its benefits when you are looking for someone to bear legal responsibilities when something goes wrong.

      Good luck ever successfully suing and winning against Microsoft et. al. This canard should be roasted over the legal contracts and licensing paperwork.

    6. Re:enterprise grade is weasel. by Tom · · Score: 2

      I'm a big fan of Free Software, don't get me wrong.

      But in certain environments, the effort it takes to get something into the project is much, much less if it comes from HP or SAP or Microsoft, or even from RedHat or, you know, some other company instead of github. It might be bullshit, but it's a corporate reality, like it or not.

      foswiki seems to be just a regular wiki. For that the company would go with Confluence, I'm sure. I'm looking for something more than a wiki. Something that understands meanings and relations. You know, Wikipedia is cute for storing texts and hyperlinking them, but it can't do cute things like "show me all cities bigger than one million people in french-speaking countries" without someone manually maintaining a list.

      --
      Assorted stuff I do sometimes: Lemuria.org
    7. Re: enterprise grade is weasel. by GrantRobertson · · Score: 1

      Seriously, since when has anyone successfully sued for non-working, off-the-shelf software?

    8. Re: enterprise grade is weasel. by GrantRobertson · · Score: 2

      Semantic-based systems require EVERYONE who enters information to PROPERLY tag it with ALL the CORRECT metadata. Having supported real-world (AKA stupid, lazy) users for almost 15 years, I can tell you that ain't gonna happen. No matter what the users or management claim at the outset. You will be stuck, furiously applying tags well into the wee hours to "fix" the system "you said would work."

      This is a losing game. Avoid playing at all.

    9. Re:enterprise grade is weasel. by Kartu · · Score: 1

      Not sure if that was the OPs point, but I'm thinking more about the patent trolls.
      If you bought the software, it's sellers problem.
      With free open source software, it's yours.

    10. Re: enterprise grade is weasel. by Tom · · Score: 1

      That is why I'm looking for a system that understands these problems exist and helps solving them, e.g. by formalizing data input.

      That is why I think something like SMW won't cut it, because the system needs structure enforced instead of relying on users always doing the right thing all the time.

      It's probably possible to hack something together with SMW and Semantic Forms and some other extensions, I just really wonder if I'm the first person ever who wants structured, query-able documentation?

      --
      Assorted stuff I do sometimes: Lemuria.org
    11. Re:enterprise grade is weasel. by azav · · Score: 1

      "is weasel"??

      What does that mean? That's pretty crappy English. I've got no idea what you're trying to say. Is weasel-y? Is some type of mammal? Nothing "is weasel" except another weasel.

      --
      - Zav - Imagine a Beowulf cluster of insensitive clods...
    12. Re: enterprise grade is weasel. by Anonymous Coward · · Score: 0

      Semantic-based systems require EVERYONE who enters information to PROPERLY tag it with ALL the CORRECT metadata. Having supported real-world (AKA stupid, lazy) users for almost 15 years, I can tell you that ain't gonna happen. No matter what the users or management claim at the outset. You will be stuck, furiously applying tags well into the wee hours to "fix" the system "you said would work."

      This is exactly the objection I've always had with the big semantic pushes (Semantic Desktop, Semantic Web, etc). I spoke from the anticipation that no one would ever get the tagging right. You, however, give us this report through unhappy experience. My condolences to you.

      I saw this train-wreck coming. Unless the AI gets to be reliably good and cheap--or all of your user base is highly disciplned and has a very low error rate--I'd consider the effort a pipe dream.

    13. Re:enterprise grade is weasel. by Gob+Gob · · Score: 1

      Yeah I had an account that I wanted to win so I did a find and replace for "SaaS Solution" with "Enterprise Grade Cloud Technology" on the {successful} proposal.

      Truth be told its still an ancient and buggy PHP app.

  2. Documents and search by Kohath · · Score: 4, Interesting

    Just store a bunch of documents somewhere with a search feature that does full text indexing. Or use a simple Wiki system.

    Anything more complicated than that and you'll be the only one using it. Other people won't care enough to spend their time entering data into specific fields and learning a query system.

    1. Re:Documents and search by Mostly+a+lurker · · Score: 2

      Yes, a wiki combined with something like Google site search makes the information maintainable, and accommodates unanticipated queries very well.

    2. Re:Documents and search by Tom · · Score: 1

      Anything more complicated than that and you'll be the only one using it. Other people won't care enough to spend their time entering data into specific fields and learning a query system.

      These are high-class professionals. If the solution is better than what we have now, it will have support.

      And no, documents or a wiki won't cut it. I need to store and extract meaningful relational information.

      --
      Assorted stuff I do sometimes: Lemuria.org
    3. Re:Documents and search by Zontar+The+Mindless · · Score: 2, Funny

      wiki /wi' ki:/, n.: place where knowledge goes to die.

      --
      Il n'y a pas de Planet B.
    4. Re:Documents and search by Hognoxious · · Score: 2

      Free text search is great if everyone can spell. Good luck finding a bug about "leep years".

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    5. Re:Documents and search by Kohath · · Score: 1

      Really? You've surveyed them, asking whether they'd be willing to spend the extra time and effort structuring the information and learning the query system?

      Most people won't. If your group will, then they must

      A. have a difficult problem that's already using up their time, that the Knowledge Management System solves. Since you don't have a system picked out, it's unlikely this is currently true. Maybe someday though.
      B. Already use similar procedures for their jobs. Someone who fills out forms all day may be willing to fill out 1 more. Or
      C. Not be very busy. Busy people don't have time to jump through extra hoops for vaguely-defined benefits, regardless of how high class and professional they are.

    6. Re:Documents and search by Tom · · Score: 1

      Due to NDAs I can't speak about details, but both B and to some extent A are true, together with one other factor that gives us a window to introduce a new system.

      Anyway, I didn't come here to discuss the pros and cons of the management side, I'm trying to figure out if there are better options than Confluence or Word.

      --
      Assorted stuff I do sometimes: Lemuria.org
    7. Re:Documents and search by Kohath · · Score: 1

      Ok. My advice is for the general case. Perhaps your case is special.

    8. Re:Documents and search by Anonymous Coward · · Score: 0

      Many good free text systems have soundex like facilities, and probabalistic matching allows you to do the google style "did you mean" stuff.

    9. Re:Documents and search by Hognoxious · · Score: 1

      Even if those worked perfectly (and they don't) that's sticking a bandaid on it.

      You don't run a business system with free text. You have defined fields, some of which have a limited set of inputs. In a word, structure. Otherwise somebody will type MRA when they mean RMA.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    10. Re:Documents and search by jafiwam · · Score: 1

      Just store a bunch of documents somewhere with a search feature that does full text indexing. Or use a simple Wiki system.

      Anything more complicated than that and you'll be the only one using it. Other people won't care enough to spend their time entering data into specific fields and learning a query system.

      Don't be too sure that using a simple tool will be enough to get people to care enough to spend their time entering data, or learning to query the system.

      Simplicity of the tool is NOT the problem. There are very very few people who want to document, and even fewer who can do it well.

      In general, people are too lazy to even use Google to solve simple problems that actually help them in their daily lives. They won't use whatever the submitter creates either. Oh, they will when the boss tells them too, for a little while. But then they'll be right back to not knowing, or standing in the knowledgeable person's doorway asking stupid questions.

      There's little point in doing anything with the tool besides creating it and using it for oneself.

    11. Re:Documents and search by aix+tom · · Score: 1

      I have had good experiences with Semantic Mediawiki in two settings:

      A) My workplace, where developers and admins manage the data.
      B) My RPG group, where role players manage the data.

      It *totally tanked* in two settings where people saw our developer/admin wiki and "wanted something similar set up" for them to use:

      C) in my workplace for the marketing people.
      D) in my workplace for the procurement people.

      The that A) and B) had in common, they *knew beforehand* what data they wanted to store for a server, an application, a sheduled task, a hero, an NPC, etc... so it was pretty simple so setup up about a dozen semantic templates and have them start entering the data. In cases C) and D) the people had no idea what they wanted to put into the system.

  3. Atlassian Confluence by Anonymous Coward · · Score: 1

    Atlassian Confluence may be the thing you're looking for.

    1. Re:Atlassian Confluence by Anonymous Coward · · Score: 0

      that costs a lot of money, he didn't ask "how can I go broke like a sucker?"

    2. Re: Atlassian Confluence by OzeBuddha · · Score: 1

      I've seen the confluence wiki product used in very large corporates, it tracks changes, links in with your corporate sign on data and seems to be pretty robust. We were using the GUI prototyping plugin a lot as well

    3. Re: Atlassian Confluence by GrantRobertson · · Score: 1

      The only to truly guarantee he has the buy-in he claims is for him to choose a very expensive package. Then "non-ignoring of sunk costs" will kick in and management might actually force his "highly professional" users to enter metadata properly.

      That, or finding a different job, may be his only hope.

    4. Re: Atlassian Confluence by Tom · · Score: 1

      Why the negativity?

      I want to solve a problem, and I have an idea that would improve on the current solution, if I can find something that is not a hack or a DIY. Maybe I will find, maybe I will find out it doesn't exist. I already spent many hours searching and comparing and didn't find, but I already got some good pointers that I can follow for more information. All is good.

      --
      Assorted stuff I do sometimes: Lemuria.org
    5. Re:Atlassian Confluence by Anonymous Coward · · Score: 0

      The problem with confluence isn't that it's expensive, the problem is that it's a dog. You'll need i7's on every desktop to plow through the javascript fast enough to make people want to use it.

      And you need people to WANT to use it. It's has to be a joy to enter/edit info, or your info will get out of date FAST.

  4. Nope by wonkey_monkey · · Score: 4, Funny

    Is there an enterprise level equivalent of Semantic MediaWiki, a Knowledge Management System that can store meaningful facts and allows queries on it?

    I asked my Knowledge Management System and it said that no, no there isn't.

    --
    systemd is Roko's Basilisk.
    1. Re:Nope by Anonymous Coward · · Score: 0

      Canada's federal government uses GCconnex

    2. Re:Nope by Anonymous Coward · · Score: 0

      I asked my Knowledge Management System ...

      Methinks, asking the wife is guaranteed to create a logical paradox.

  5. You mean, like SharePoint by Anonymous Coward · · Score: 0

    From what you asked and the example you gave, it seems like SharePoint or a system like it is what you're looking for, or are you just so Microsoft averse that you never even looked at their products?

    1. Re: You mean, like SharePoint by Anonymous Coward · · Score: 0

      Many of those who has used sharepoint are migrating away from it. Usually to open source and probably license costs are the reason.

    2. Re: You mean, like SharePoint by Anonymous Coward · · Score: 0

      Sharepoint does that have search functionality. It is used for storing documents. Unusable for sharing information

    3. Re: You mean, like SharePoint by Anonymous Coward · · Score: 0

      [quote]Sharepoint does that have search functionality. It is used for storing documents. Unusable for sharing information[/quote]

      You have absolutely no idea what you're taking about. SharePoint is an amazing product. Also there are billions of dollars in development behind it and it's mature.

    4. Re: You mean, like SharePoint by Alan+Shutko · · Score: 1

      [quote]Sharepoint does that have search functionality. It is used for storing documents. Unusable for sharing information[/quote]

        You have absolutely no idea what you're taking about. SharePoint is an amazing product. Also there are billions of dollars in development behind it and it's mature.

      I have never seen an installation of sharepoint which was good for sharing information. It's probably possible to build something that people find usable with it, but it's like recommending a hammer and lumber to someone asking for a house.

    5. Re: You mean, like SharePoint by Anonymous Coward · · Score: 0

      Really? So automatic notification when a document changes, workflows to alert others of what they need to do, being able to collaborate on a document at the same time, being able to publish BI from a ton of different data sources, the ability to send links to documents to others outside your company and have the same respect rights management, etc. That's not to even mention their wiki templates, meeting templates (attendees can share calendars, documents, be alerted automatically. when a meeting they couldn't attend has notes/docs), Intranets, department web pages, the bazillion apps and templates there are for it. That isn't information sharing? WTF man.

      I swear sometimes the people on here make comments out of their ass talking about some shitty implementation they saw 15 years ago. There's a reason SharePoint is used by tens of millions. Read about it.... https://en.wikipedia.org/wiki/...

    6. Re: You mean, like SharePoint by GrantRobertson · · Score: 1

      SharePoint doesn't support true semantic metadata. All your documents would have to be HTML and you would need a separate tool to add the RDF markup.

    7. Re: You mean, like SharePoint by Alan+Shutko · · Score: 1

      So, this is based on the current sharepoint installation at a Fortune 25 company.

      Automatic notifications of changes are great. Workflows might be ok, but I've seen very few sites using them internally.

      There is no collaborating on a document at the same time. There's a checkin/checkout model. While Excel offers true simultaneous editing of a file on a shared drive, that's gone if it goes in a sharepoint. Documents with OLE linking don't work. It has some limited BI capabilities, which is nice, but it's hard to embed real BI solutions (BizObj, Tableau, etc) into sharepoint so there are either links or we're dropping exported files in a document library. It would be nice to send links to people outside our company (and you can define federated identity) but that definitely requires a lot of configuration to make happen. (It's not currently set up in our company.)

      As it is, since everything in Sharepoint seems to site-based, we have hundreds of individual sites across multiple sharepoint farms. There's no global way to search all share points. When there is a search, it's really, really bad compared to what people get from Google. (And glacially slow compared to google, but I suppose if we dedicated google-scale infrastructure to sharepoint, it might be better.) As a result, people do not use search. It's almost never a successful tactic. There's no automatic clustering of content like "See Also" or "Related documents".

      Most groups end up using a single document library as a shared drive and maybe add a shared calendar. Meeting sites are set up by very few groups only for standing meetings, because it's a lot of work for each meeting. If one is set up, that information is siloed away from everything else. The wiki pages work, even though they aren't as easy to use as a normal wiki.

      I'm sure that all of these problems could be fixed by working hard enough. That's my point: Sharepoint is a tool that groups could use to build a decent information sharing platform, with suitable care, planning, adoption of third party apps, etc. It's not a good information sharing or knowledge management tool out of the box.

      And yes, there's a reason that it's used by tens of millions: it integrates with the Office products and is sold alongside the other MS enterprise offerings, and is therefore bought by lots of IT departments where the purchasers of the software are separate from the people who end up having to use it.

    8. Re: You mean, like SharePoint by Anonymous Coward · · Score: 0

      Nah, it's like recommending a snap-together system of a714-6b building units, with the option of using anything from the a714 series of building products from the friendly folks a Friendlyblock. You can't use nails or screws to connect anything to them, though, so be sure to also recommend they buy ALL the series a714 building products. Well, license them.

      Seriously, what he needs is not Microsoft products. The basis of this should be text and a database.

    9. Re: You mean, like SharePoint by swalve · · Score: 1

      That's because any IT generalist can get a sharepoint site working. Kinda like saying "Word sucks! I've never seen anything good that had been written on Word!"

  6. SharePoint * by Anonymous Coward · · Score: 0

    SharePoint. Create a form that takes in the information and saved to a list library. That is just one way I can think of on top of my head. There are probably a dozen more ways using SharePoint if I think about it. Make sure that if you are using SharePoint 2010, that you have FAST installed for search queries. Heck you can even load up your Word files and still search based off of those. With SharePoint 2013, you just need to have Enterprise Search enabled, they took the FAST guts out and added them to SharePoint search.

  7. SharePoint by thegarbz · · Score: 1

    SharePoint can do anything, and anything it can't do, just slap in a bit of jquery.

    1. Re:SharePoint by Tom · · Score: 1

      Can you point me to more detailed instructions than "it can do anything"?

      The idea of extending SharePoint is good. I found http://www.semantic-sharepoint... with a quick Google. Is that what you mean or something else?

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re:SharePoint by angel'o'sphere · · Score: 1

      I believe your parent was joking.
      Sharepoint is a glorified versioning system, mounted into the windows file system and somehow integrated into the office suit of Microsoft.
      It has no query language, nor does the versioning/office integration work reliable.
      However it produced a lot of jobs for mediocre programmers who now tweak and configer it with VisualBasic .Net ....
      Obviously the whole thing relies heavily on tools from microsoft, like IIS, databases and backup solutions etc.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    3. Re:SharePoint by thegarbz · · Score: 1

      I was part joking.

      The part bit was because I've used Sharepoint in a KMS before. But it paled in comparison to the system it replaced. None the less IT and the CTO were on the Sharepoint can do everything bandwagon so they did everything with it. I'm glad I don't work there anymore.

    4. Re:SharePoint by Tom · · Score: 1

      However it produced a lot of jobs for mediocre programmers who now tweak and configer it with VisualBasic .Net ....

      I already want to puke.

      I've been using Sharepoint a fair bit, but I can't say I've grown to like it, or developed an interest about it.

      --
      Assorted stuff I do sometimes: Lemuria.org
    5. Re:SharePoint by Tom · · Score: 1

      I was part joking.

      What a shame. No, actually I'm part happy about that because I'm no big fan of SharePoint, but if there were something built on top of it, that would make the convincing and introducing part easier.

      --
      Assorted stuff I do sometimes: Lemuria.org
  8. Oracle Knowledge, formerly InQuira by Anonymous Coward · · Score: 0

    The Oracle Knowledge product, which was InQuira Knowledge Management until it was acquired by Oracle ~2012. We've built an integrated knowledge management / troubleshooting tool that's deployed to 100k call center agents.

    1. Re:Oracle Knowledge, formerly InQuira by tehcyder · · Score: 1

      The Oracle Knowledge product, which was InQuira Knowledge Management until it was acquired by Oracle ~2012. We've built an integrated knowledge management / troubleshooting tool that's deployed to 100k call center agents.

      The only sane reason for recommending an Oracle product is that you are working on commission from Oracle. Anything else is just insanity or pure evil.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
  9. Can you stick wiith one solution? by joe_frisch · · Score: 5, Informative

    A big issue to be aware of for information management systems is the large training effort to use them and the effort to move your documentation into them. We have had problems that a new system is brought in. It takes literally a few years for employees to get their work in the new system and get comfortable with it - then a newer latest and greatest system comes out. We now have 20 years of documentation in half a dozen different places - each of those places originally declared as our "permanent solution".

    You need to budget a lot of training and content transfer time. If you just hope it happens naturally, you will be very disappointed.

    If you don't want to spend time and $ on training and moving documents, your best bet IS just files in a directory tree with a normal OS provided content search. If people use keywords in documents, that is good enough for 95% of all documentation uses, and its free.

    1. Re:Can you stick wiith one solution? by Tom · · Score: 1

      your best bet IS just files in a directory tree with a normal OS provided content search

      Not useful for my case. I know that I will need to query the system in more complicated ways than keyword searches allow. I need to know how data flows and how systems are connected. I want to store relations as well as information, that's why I thought about SMW, except that it's too much of tinkering. I've been in this area for a while, I have a fairly good understand of what is possible how. If someone took SMW and turned it into a solid product with Enterprise support, I would propose it immediately. I found a semantic plugin for Confluence, but I'm uncertain about its capabilities, and I don't want to be locked in on Wikis. Maybe someone else has a non-wiki solution?

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re:Can you stick wiith one solution? by angel'o'sphere · · Score: 1

      JetBrains, the company behind the Java IDE also offers a combined Issue-Tracker / Wiki called YouTrack.
      That is primarily an issue tracker. However it has an excellent custom querry language. Ofc you can define your own issue type, 'knowledge' whith a simple lifecycle: posted, accepted.
      From the UI it might look to much like an typical issue tracker, but with the options to attach files, tag your 'issues' group them into projects and stories and the extended query language and various views, the options to write plug ins for it, I could imagine it could easily be adapted for your use.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    3. Re: Can you stick wiith one solution? by Anonymous Coward · · Score: 0

      Semitic-wiki for content.
      An in-house search engine that crawls the wiki the wiki hurly.
      A custom writtn program that has buit-in AI/Machine learning that builds pages on the wiki, with semantic data correctly demarcated, to use for queryong the wiki, when keywords, and the sesrch engine don't provide instant answers.

    4. Re:Can you stick wiith one solution? by Tom · · Score: 1

      How would I store that this application is stored on that server located on these networks, exchanging these types of data with this and this other systems?

      --
      Assorted stuff I do sometimes: Lemuria.org
    5. Re: Can you stick wiith one solution? by Anonymous Coward · · Score: 0

      I always say that no software vendor will ever produce a product with a red button for you to press that does exactly what you want.

      As you say this is an IT project It sounds like you need a CMDB that can store relations and metadata.
      Couple this with confluence for hard documentation and you should be covered.
      If not, roll your own but these days I would go for nosql to store metadata.

    6. Re:Can you stick wiith one solution? by angel'o'sphere · · Score: 1

      Never dug into it. I only proposed it because of its query language and its "wiki like" UI ... if you understand the gap between a wiki and a ticket system/issue tracker.

      As it stores everything in an ordinary database, and the tool itself is extendable via plug ins, I would focus on figuring if the "UI part" makes any sense and tackle the data sharing/integration afterwards.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    7. Re: Can you stick wiith one solution? by Anonymous Coward · · Score: 0

      One of the modelling tools supporting https://en.wikipedia.org/wiki/ArchiMate?

    8. Re:Can you stick wiith one solution? by thegarbz · · Score: 1

      I've never seen a solution be replaced due to features or for fun. They typically get replaced due to obsolescence, defunct vendor and lack of support, or because the vendor is trying to fleece the client with an insanely expensive support contract once their solution is in place.

      Sticking with one solution is not as easy as you think.

    9. Re: Can you stick wiith one solution? by Tom · · Score: 1

      As you say this is an IT project It sounds like you need a CMDB that can store relations and metadata. Couple this with confluence for hard documentation and you should be covered.

      It is the "couple this" part that brings me here. I want to couple it better than "look here for this and look there for that". If I can integrate these two, I would be covered. Any pointers?

      --
      Assorted stuff I do sometimes: Lemuria.org
    10. Re: Can you stick wiith one solution? by Tom · · Score: 1

      This deserves to be modded up, it is the best answer so far, and integration with the business layer is a big added benefit I didn't even consider.

      I will look into it, thanks.

      --
      Assorted stuff I do sometimes: Lemuria.org
    11. Re:Can you stick wiith one solution? by greenfruitsalad · · Score: 1

      hello fellow former oracle customer

  10. not the system by Anonymous Coward · · Score: 0

    You are not managing knowledge, you are managing laziness. Writing something up, even in a crude way, is fun and refreshing. After all the specialist can claim to have shared his knowledge and get praise from the team lead. This applies to all teams where the definition of "done" includes documentation.

    The next day, however, maintaining documentation is as hard as it is maintaining code and releases. It's boring work, just like fixing old bugs. The rockstars have moved on, and code rots like the documentation.

    The essence is that there needs to be a clear focus on the value of it, if it has any. Otherwise it will be a waste of effort down the line, regardless of the system used.

  11. Don't try to be too clever by Peter+(Professor)+Fo · · Score: 2

    The number one problem is at the input interface: People will only use it if it's useful or there is someone standing over them with a cosh. So how do you do that? By finding applications they find useful for their knowledge or sharing knowledge. Progress report, interface specs, requests for changes or whatever the knowledge generators want. So it's a management problem.

    Say to management, "I have this as a solution, I think it's the most flexible, can we give it a try? Look! I've piloted it on my latest project and see what it can do... Think how useful if..." When management champions it there is some chance of it working. Until then paddle your own canoe and offer to show people how clever you are.

    It's a good overall question, but exactly the same issues apply to 'Enterprise'(whatever that is) and novelists trying to keep track of places, people, timelines, todos, feedback etc. Until you've really put any solution (I've tried all sorts over 35 years and keep coming back to a book of notes or a master notes document.) to the test by actual use you won't understand the practicalities. The human brain is a pretty good filter if you can do basic organisation and remember to make notes/put things in the right place.

    1. Re:Don't try to be too clever by Tom · · Score: 1

      and novelists trying to keep track of places, people, timelines, todos, feedback etc

      That was actually more helpful than you might think, because I'm a roleplayer and GM a lot and I know this problem very good. It's an interesting way to look at it and yes, it is very similar.

      keep coming back to a book of notes or a master notes document.)

      The problem is that the complexity of the system is too large for notes.

      --
      Assorted stuff I do sometimes: Lemuria.org
  12. Why not Semantic Mediawiki by Anonymous Coward · · Score: 0

    I'm not sure why you're discounting Semantic Mediawiki as an option. Sure, it's not as adaptable as a proper relational database system and a custom interface built on top of it, but on the other hand it is more free-form, higher-level, and you can encode whatever information you like into the individual pages, link between them, and then query the pages based on that information and present the results of those queries to the user. It's not hard to use the CSS to customize the appearance a bit. It's not perfect, but I've used it for a project with ~10000 individual pages and tens of thousands of links between them all (over 50000). Works fine. Mind you, I didn't establish all those links manually, so an "untrained user" situation might not work as well. Hardware-wise I've got it on a vanilla i3-class machine with an SSD, a linux install, apache web server and database. It's nothing special. With that setup it isn't slow at all, even when a page display generates a few hundred individual queries at a time, but then I haven't tried it under very heavy load (probably wouldn't survive a slashdotting). Together with MediaWiki it has pretty effective caching options to speed pages up that don't change often. I only found one thing it didn't do that I wanted, so I modded the code in one module to implement the new feature. The source code is open so it was relatively easy to do once I understood how the combination of Semantic MediaWiki and MediaWiki itself work together.

    It's not clear to me how much you've experimented with this option, but if you haven't done so yet it is worth setting up a test case and hammering it with some load tests matching your expected "enterprise" demands.

    1. Re: Why not Semantic Mediawiki by Anonymous Coward · · Score: 0

      He's secretly hoping someone will suggest foxpro.

  13. docuwiki or confluence by Anonymous Coward · · Score: 0

    At my last job, we used Docuwiki for all the teams to store their documentation. IT used it to document how to setup systems with kickstart, modify DNS, add the node to puppet, etc. It had most of the features and was web-based, so you could get to from anywhere. It was easy to maintain and backup since it was static files and web server.

    My current job uses Confluence which has the same features as Docuwiki but is integrated with their bugtracker and source code control offerings. It needs a lot more juice to run and maintain and should have been put on something other than a small VM. With 500 people banging on it, it frequently crashes.

    If all you want is something quick and easy and will scale to 100 easily, Docuwiki should work for you.

  14. Why not? by jon3k · · Score: 1

    (I don't see many chances to convince a government to use it)

    The government uses a tremendous amount of open source software, I don't see any reason they wouldn't consider mediawiki? Plus, everyone's heard of wikipedia,it's a pretty easy sell: "You've used wikipedia right? We're going to use the same exact software that runs wikipedia - and it's free!"

  15. open source cms? by kennethmci · · Score: 1

    I developed a large information based system recently and we used Drupal 7 and a plugin to push the content to OpenCalais which then tagged the content with the semantic info back into the drupal system. You can then use a faceted search which will allow you to drill down to your data.

    1. Re: open source cms? by GrantRobertson · · Score: 1

      Could you explain a little more about this plugin? Did you build it yourself?

    2. Re: open source cms? by kennethmci · · Score: 1

      https://www.drupal.org/project... is a module which pushes your saved content to receive the rich semantic data autotagged from the open calais web service. then you can install a faceted search module ( https://www.drupal.org/project... is an example of the module from Drupal 6 ).

    3. Re: open source cms? by kennethmci · · Score: 1

      forgot to say, you can get a pre-setup version of drupal to do some of this too : http://openpublishapp.com/ - at the time this was already setup to work with opencalais - ( http://www.opencalais.com/cala... )

  16. The answer would seem to be "no" by msobkow · · Score: 3, Insightful

    Seeing as I've seen Tom reject every single suggestion anyone has had, I guess the answer to his question is "No."

    --
    I do not fail; I succeed at finding out what does not work.
    1. Re:The answer would seem to be "no" by Tom · · Score: 1

      I may not have been precise enough in describing what I'm looking for. Or I set people off in the wrong direction by using SMW as an example, because it's the closest I know to what I want. Or maybe it really doesn't exist.

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re:The answer would seem to be "no" by roachmotel3 · · Score: 1

      You really should check out Oracle Knowledge.

    3. Re:The answer would seem to be "no" by Anonymous Coward · · Score: 0

      You really should check out Oracle Knowledge.

      ...and SharePoint. Seriously, if he's not looking at anything but open source, he's crippling himself. And those that brought up training and migration issues are also credible in their critiques. Look at *ALL* options and make an educated decision. Otherwise, all you prove is that you're the dumbass.

    4. Re:The answer would seem to be "no" by Anonymous Coward · · Score: 0

      Seeing as I've seen Tom reject every single suggestion anyone has had, I guess the answer to his question is "No."

      You should be 5-Insightful for that comment.

    5. Re:The answer would seem to be "no" by Anonymous Coward · · Score: 0

      "No."

    6. Re:The answer would seem to be "no" by Tom · · Score: 1

      That would fly, but from the little information I could gain it's basically a CMS with good search capabilities. Does it have any kind of structured data, ontology, etc?

      --
      Assorted stuff I do sometimes: Lemuria.org
    7. Re:The answer would seem to be "no" by Anonymous Coward · · Score: 0

      That would fly, but from the little information I could gain it's basically a CMS with good search capabilities. Does it have any kind of structured data, ontology, etc?

      Yes, along with the ability to add "custom fields" as needed to define it better.

      However, as someone else pointed out earlier, the more complex you make it, the less people will use it and eventually it will be seen as "more trouble than it's worth".

  17. API Dokumentation + Usecases + Glossary ... by Qbertino · · Score: 2

    ... Howtos + 5-minute Screencasts are what you're looking for.

    Most KMSes/DMSes are crap - wether FOSS or not. Don't burden yourself with an extra system that is more trouble than use. Verbose opening comments of classes, API docs with examples, documented Usecases, double-checked by the users, Howtos and Screencasts are what you're need and want.

    Once you've generated the final docs, give them a nice design, some search-thingy with elasticsearch or something and put the Howtos andd Screencasts Front and center along with some Intros for n00b users.

    All that is done best with textfiles and API doctools + proper versioning. Perhaps some diagrams of archticture, setup and Main Usecases nicht help.

    KMSes are the Fallout of 2000s mid-execs bullshit-bingo sessions and IMHO hardly ever worth the hassle.

    My 2 cents.

    --
    We suffer more in our imagination than in reality. - Seneca
    1. Re:API Dokumentation + Usecases + Glossary ... by Tom · · Score: 1

      This doesn't work and everyone knows it. Yes, if you have a few people on staff for the documentation, you will end up with a nicely designed, professionally looking piece of documentation. That may or may not have a relation to what is actually implemented. The more time passes, the more it will move towards "may not".

      That is because in real life scenarios, knowledge in the head of people working the system every day trumps knowledge stored in Word files. Because they are too clumsy and reading them is too slow. It is faster to find someone who knows and ask them.

      But if you could query the documentation, that would be faster. If I need to know which systems handle billing data and on which networks they are located. Or if I want to understand how this record of data moves through the system before it ends up in the database because I'm tracking a bug.

      KMSes are the Fallout of 2000s mid-execs bullshit-bingo sessions and IMHO hardly ever worth the hassle.

      Maybe. The ones I've seen so far are pretty shit. That's why I ask here, because I'm sure somewhere is a good one or two.

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re:API Dokumentation + Usecases + Glossary ... by giuntag · · Score: 1
      Disclaimer: I've been working in the CMS area for the last 8 years, not KMS, but here you go:
      • what you are looking for is a project, not a product
      • defining taxonomies, tagging content, making people follow agreed-upon workflows, naming conventions, 'this goes there' conventions takes a lot of time and effort
      • the best "tool" you can buy is a dedicated knowledge management official (the 'webmaster' of lore ;-)
      • 2nd best tool is buy-in from all parties. KMO can train them, provide them with self-learning material, reassure them, etc. Better convince top-level mgmt first, though
      • KMS products are just toolboxes. Each take a lot of config time, learning, setup, workarounds
      • go OSS, as you will need to tweak and patch them as needed. Avoid proprietary like the plague
      • pick something running on SOLR - it has a diverse ecosystem of semantic analysis plugins, and its world-class in searching
      • you might want to start using a Document Management System (Alfresco) or a CMS with a flexible content model (eZPublish)
      • Confluence is just a glorified wiki, it leads to duplicate content everywhere because of its simplistic permission model, document model and missing multi-positioning in the content tree
    3. Re:API Dokumentation + Usecases + Glossary ... by Tom · · Score: 1

      You are probably right that a knowledge manager would be useful at this stage. And yes, I understand I won't be able to buy a product and press a button and everything is good. But I need to start somewhere, and the right technology choice is usually a good starting point.

      Just that I would like to put the metadata into the documentation while it is being written, instead of using some kind of search or analysis in an attempt to extract it later on. It seems stupid to me to not do it right the first time.

      --
      Assorted stuff I do sometimes: Lemuria.org
    4. Re:API Dokumentation + Usecases + Glossary ... by Anonymous Coward · · Score: 0

      If you want a convenient solution, go with a wiki. If you want the 'enterprise' primateur, go with SharePoint, or (god help you), something like OpenText.

      Most enterprise solutions are obsessed with security. Convenience and discoverability goes out the window when you have this orientation. Auditability isn't the main issue (surprisingly enough) because wikis have this in spades. No tool is perfect though and I've found the wikis lack in administrative capabilities.

  18. Prolog by ZeroWaiteState · · Score: 1

    In the title. Thank you and have a great day.

    1. Re: Prolog by zarr · · Score: 1

      True

    2. Re: Prolog by garethjrowlands · · Score: 1

      no.

  19. Will the semantic data be input and maintained? by Blaskowicz · · Score: 1

    Even a basic wiki or any kind of system (let's say internal IM or some stuff to schedule when the meeting room gets used) may get approved, set up and then virtually unused. Or in stronger terms it will be unused.

    e.g. the tags attached to slashdot stories. At least I've noticed that today clicking on them brings a list of stories (it used to not work I think). But it is likely that 80% of stories (or a lot more) that would warrant relating to a given tag are missing, and many tags were one-time snarky remarks. Now that they don't fail they do seem to bring very interesting content though.

    Semantics technology seems ideal for e.g. a database of animal or botanical species with people paid to exhaustively maintain the data. Or a collection of towns, some "booming", some "decaying", some linked to others in a certain way?
    Thus you may want to define some areas of knowledge where the semantical features will really be used more than in others, and somehow get it enforced through policy?

    1. Re:Will the semantic data be input and maintained? by Tom · · Score: 1

      Thus you may want to define some areas of knowledge where the semantical features will really be used more than in others, and somehow get it enforced through policy?

      That is the plan. Ideally, however, the two (strict areas and less strict ones) are in the same system so the documentation is easy to improve whenever someone feels the need.

      --
      Assorted stuff I do sometimes: Lemuria.org
  20. SMW *is* used in the enterprise. by caitriona81 · · Score: 2

    Looking at http://semantic-mediawiki.org/... there are a few examples that can definitely be considered enterprise users, including some high-risk government users (NASA uses it to plan EVAs for the ISS for example).

    The "enterprise mentality" makes most of the alternatives too cumbersome to actually be effectively used - ultimately you have to have buy-in from you users, or what management wants is not going to matter - if it's not pleasant to use you'll be back to emailing 70 different versions of the same different Word document around in a few months time with file renames as your only version control (if you are that lucky).

  21. Government usage by Anonymous Coward · · Score: 0

    I don't see many chances to convince a government to use it

    In fact, the UK Environment Agency uses SMW for it's Restore Rivers project: https://restorerivers.eu/wiki/index.php?title=Main_Page

    So apparently it is possible to convince government to use it.

  22. Some tools by angel'o'sphere · · Score: 3, Informative

    http://protege.stanford.edu/ Java Desktop Application.,Used to define/manage ontologies. Not sure if they have a web version meanwhile and if comes close to what you need. However it supports plugins, perhaps the frontend can be adapted to access a centralized DB. Oh, found it: http://semanticweb.org/wiki/We...égé.html

    This is a info page with an overview about various tools: https://en.wikipedia.org/wiki/...

    Did you stumble over this: http://www.w3.org/2001/sw/wiki...? Dozens of various tools mentioned.

    Another tool, I stumbled iver, but did not use it yet: http://oboedit.org/

    And then there is https://jena.apache.org/docume...
    But that is more a programming API to dynamically create classes to store/manage data in an ontology described database. (Did not use it yet, but looks promizing)

    And then we have this: http://semanticweb.org/wiki/To...

    BTW, I can offer remote programming/assistance in such tools.

    --
    Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
    1. Re:Some tools by Tom · · Score: 1

      These are really good resources, thanks a lot.

      If we go this direction, I have your e-mail.

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re: Some tools by Anonymous Coward · · Score: 0

      If you need basic semantic knowledge management then the brain ekp might work. In addition to free text you can establish a more fluid connection between ideas than the tired paradigm of top-down folder management. Beyond that you might want a content management system which lets you build queries on the data, but that's only going to be as successful as your requirements will let it.

  23. RDF/OWL route by chizor · · Score: 1

    you might want a tool storing claims in the form of RDF triples; these can be used as the basis for deductive reasoning. one standard in this area is the Web Ontology Language. several software reasoners are named in the wikipedia article. also, for existing knowledge bases, see: https://en.wikipedia.org/wiki/Web_Ontology_Language#Public_ontologies

    --
    ... !
    1. Re:RDF/OWL route by Anonymous Coward · · Score: 0

      Try Virtuoso, and also dig up a KDE distro circa 2013 (Nepomuk) which uses RDF stored in Virtuoso; lots of people hated it but it handled sophisticated queries and worked for me with ~500k documents. (you would probably have to either drop your Word stuff in the store as ODF or write a indexer for Word docs)

    2. Re:RDF/OWL route by hey! · · Score: 1

      I looked into RDF and OWL a few years back. OWL is more powerfully expressive, which is not always a good thing.

      RDF more closely matches the way relational databases work; in many ways SPARQL is like SQL without the relational model baggage (or the awesome query optimizers that come with that). OWL on the other hand understands a much more powerful subset of first order logic, including generalizations such as such and so cannot be true for any object of a certain type. On the surface this seems like it would be great -- it eliminates the need for things like the trigger mechanism that all non-trivial relational database platforms provide. RDBMSs need triggers because the underlying relational model doesn't have any provision for expressing logical constraints; it's only good for saying the FOO of BAR is BLECH -- just like RDF.

      But the downside of having a larger subset of f.o.l. is that you have to pay much, much more attention to the logical consistency of your model. This may sound to naive ears like something you always want to do anyway, but in practice it's often simply impossible. Consider the problem of two groups who don't view the world the same way sharing some kind of information base. Because they don't agree with each other and probably never will, their models of the world are inconsistent with each other. That precludes them for sharing an OWL dataset unless you've carefully segregated out the areas of disagreement, which probably means that the common bits meet neither group's requirements. However neither would have any objection to asserting that the AUTHOR or WAR AND PEACE is LEO TOLSTOY -- exactly kind of simple, uncomplicated assertion that the relational model and RDF were designed to represent.

      The conclusion I drew is that OWL is a fine language for creating logical models, but that RDF was more practical for systems where diverse users share data. OWL might well be very useful on the boundaries of such systems, enforcing policy rules about data that is allowed out or into the system from outside.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    3. Re:RDF/OWL route by Tom · · Score: 1

      Yes, the more I read about the subject, the more I tend towards simply using RDF to model the knowledge data, mixed with some kind of wiki or reader to contain the prose parts.

      I still think mixing them is more natural to people who write the documentation, and that's what I like SMW for, because you can simply write things like "Server A is on vlan [network:123]" and both human and machine can understand the meaning of that.

      But I can't believe a hack to MediaWiki is the best that's available.

      --
      Assorted stuff I do sometimes: Lemuria.org
  24. Convince a government to use it??? by Julz · · Score: 1

    Surely you just say something like... Look the cost is in the staff who you already have, it's Open Source and sits on top of an Open Source application, it's free, the platform to host it is either free or low cost commodity, plenty of people use it already (proven technology), it'll look good that the government is investing time not necessarily money from tax payers and it's using Open Source and Open Standards, so you're not tied into some niche technology only supported by a select few large corporation that when things go wrong give you zero support or guarantees (contrary to the contract). That last one is just my rant from an experience I've had already when I failed to convince a company to use Open Source and things went south big time.

    --
    When shit hits the fan get some of these https://youtu.be/pY-GncsZ-UE
  25. Leverage existing file systems by Tablizer · · Score: 2

    large training effort to use them and the effort to move your documentation into them

    There was a discussion on Slashdot a couple of months about a more sophisticated file system.

    In my opinion we should extend file systems instead of replace them because users and support staff are used to them and their stuff is already there.

    If file systems easily allowed meta data to be attached to files and folders, then semi-structured queries etc. could be done on them. "Views" could be made of combinations of folder trees, similar to RDBMS views. Rough example:

    define view foo as
    /serverX/zip/grog/stuff/*.pdf
    /serverX/zip/grog/stuff/snerg/*.*
    /serverY/derf/*.* -subfolders
    end view;
    select file_name, create_date, author, office
    from foo where create_date <= '12/31/2009'
    order by file_name;

    While there is an existing standard on file meta-data, it appears inconsistent across vendors/OS/versions, poor support by file API's, and poorly tested.

    I'm thinking of adding a secondary system on top of the file system to store meta-data rather than depend on vendors' meta-data. I have a rough-draft for an open-source product. (It won't be very fast, but if it catches on, optimizers could be added.)

    It could also serve light-duty CRUD, such as specialized tracking systems.

  26. What I've found searching today by Anonymous Coward · · Score: 0

    I've found a couple companies that might be what you're looking for. I'd be interested in talking (erin at hyperbuddha.net) if you find something that fits your requirements; a good semantic system is something I've wanted for systems engineering of certain projects, but I haven't found one yet.

    I'm treating Semantic+CMS as kind of equivalent to a KMS, but maybe I'm misunderstanding there.

    http://www.webnodes.com/
    http://redlink.co/ which I found through http://www.iks-project.eu/
    http://flow.li/ seems to be using a KMS to organize data on the backend for the purpose of making publishing easier.

  27. Git + Grep by Anonymous Coward · · Score: 0

    Git + Grep is fully decentralized, works off line and easily scales the amounts if information you are likely to have. Its modular (you can replace grep with other tools) and it allows users to use their preferred editor. It keeps backups, and great version history. Its also tamper resistant, though be aware of the potential weaknesses with sha-1. Its portable between many OSes, and there are several existing tools for exporting data from it for use in other systems. There are also available privet or hosted web UIs, and a lot of people who already know how to use it, and lots of documentation and good long term support.

  28. I've got to disagree. by Anonymous Coward · · Score: 0

    I set up a wiki for engineering, and it's been a huge success. All the key information for a project, design documents, vendor data sheets, build procedures, test procedures, area all there for all of engineering to see. If you ask a question and the answer isn't on the wiki, the policy is add it to the wiki and reply with a link.

    I'd be interested to hear about where approaches like this have failed.

    1. Re:I've got to disagree. by Zontar+The+Mindless · · Score: 1

      For individual projects and small teams--yeah, sure. We even use some for those things.

      The biggest problem with wikis is that they're flat and amorphous. The *two* biggest problems with wikis are that they're flat, amorphous, and they don't scale. The *three* biggest problems problems with wikis are that they're flat, they're amorphous, they don't scale, and they possess no intrinsic semantics. The *four* biggest problems, then--*Amongst* the biggest problems with wikis are issues such as... Wait, I'll come in again.

      Amongst the greatest problems with wikis are such diverse issues as being flat, being amorphous, non-scalability, lack of intrinsic semantics, and nice red uniforms--oh, damn!

      --
      Il n'y a pas de Planet B.
    2. Re:I've got to disagree. by rp · · Score: 1

      The problem is that organizing stuff and keeping it organized requires a tremendous continuous investment of time on the part of the users no matter how much support you get from software. There is no magic bullet. Something flat and amorphous at least stands a chance of being used at all. Wikipedia has a couple of post-hoc information structuring projects; I think it's a better approach than overdoing organization before the content is entered.

  29. IBM Knowledge Center by Anonymous Coward · · Score: 0

    Disclaimer: I work on the project, don't get any kickback from sales.

    The first thing that comes to my mind when talking about enterprise-grade KMS is ibm.com/suppory/knowledgecenter, you write html | DITA | EHS type content and index/share it using KC, there is an offline version available as well.

  30. Do you need Latent Semantic Analysis and search? by dsmatthews9379 · · Score: 1

    There are tools for that, http://www.x-media-project.org...

  31. Re: why the negativity? by Anonymous Coward · · Score: 0

    Experience.

  32. How semantic should it be? by KonstantinVoznesensk · · Score: 1

    As for query "all the cities with more than a million people, in frenchspeaking countries" - we are not there yet. But could it be submitted as city.country.legalLanguages ="French" city.population >= 1M - if yes, and you are fine with the fact that numerical values will only be from some "fact box" - then it could be done, I think even sort of auto-complete for users wouldn't be too hard to implement.

  33. alfresco by Anonymous Coward · · Score: 0

    Cms

  34. This book may help by Anonymous Coward · · Score: 0

    Take a look at the book:

    Jörg Rech; Björn Decker; Eric Ras Emerging Technologies for Semantic Work Environments: Techniques, Methods, and Applications
    IGI Global, 2008

    Do not be fooled by word "semantic" in it, some chapters will let you understand the requirements for KMS better. And when requirements are understood,
    it will be much simpler to come to suitable solution.

  35. Palantir by Anonymous Coward · · Score: 0

    Its not cheap, but its very, very good.

  36. I'm not sure it does exactly what you need... by Anonymous Coward · · Score: 0

    But had you have a look at Collibra? There is a lot of noise around it for data management and governance where I work. Not sure if you have the ability to query it though.

  37. Desktop tools - with shared database by Anonymous Coward · · Score: 0

    DEVONThink with database synchronisation comes close to what you are asking for. Mac only (although there is a webserver built in). Fantastic AI augmented classification, searching and document association.

  38. DMS + Librarian by Anonymous Coward · · Score: 0

    I worked in KM systems for years. What we found was:
    * 2 people will edit a wiki article. Everyone else will read and complain.
    * document management systems without a strong, smart, structure suck. Metadata in documents are pretty much worthless without manual validation and double checks.

    The short answer is to get a librarian and use a DMS with full text search, but do NOT allow normal document files. Only allow text and simple HTML. Page layout crap like powerpoint, doc, xls ARE NOT knowledge. The words are. Simple lists (ordered/unordered/nested) are amazing organization tools. If you lose the fight to avoid "office files" (and I expect you will), try to get the data in ODF files so people aren't stuck using a specific version of a paid application for access. PDF may seem like a good idea, but it is not.

    Options are Xerox Docushare, if you have money, and Alfresco, if you have even more money. The average cost of a free Alfresco deployment is $250K.

    However, if you can make a wiki something that people will edit as part of their jobs, encourage edits internally, great! Don't make anyone use wiki-markup. Only 2% of nerds will, not anyone else. Heck, I setup our mediawiki system and found the wiki syntax too difficult - I already use other markdown languages and didn't want to pollute my existing knowledge.

  39. Clarifying what you really want: by GrantRobertson · · Score: 2

    Well, I had written out an incredibly long response, on my phone, and then lost it when I went to look something up. You are lucky I actually care about this topic a lot or I would have just blown the whole thing off...

    Anyway...

    First, a clarification question: Will your users actually be submitting whole documents (whether in the form of a wiki page, html content, a .PDF file, or a word processing document) and THEN supposedly selecting multiple snippets of that document and adding metadata about those snippets? Or, will they be submitting those same documents and then merely adding metadata about the document as a whole? {Though from your description of the kinds of queries you want to be able to do, this does not seem to be what you are doing.} OR, will they be merely entering independent "factlets" of information and you want those to be structured?

    Your original question lead all of us to believe that the first option is what you want. That is the option that is almost impossible to get done reliably.

    I will wait for your answer before writing more.

    1. Re:Clarifying what you really want: by Tom · · Score: 1

      There is existing documentation emerging from the project phase that is largely written in Word and is planned to be added to a wiki or something similar, where it can be better maintained and updated. So every piece of documentation will be touched anyway.

      The metadata I am looking for is not about whole documents. I don't give a rats ass about that, if I am looking for the document about encryption protocols, I already know where to find it. I want factlets. From a security perspective, when a network is compromised I want to query the configuration database to find all systems on that subnet, but also the documentation to figure out the destinations of all data flows out of that subnet, for example. When a server fails, I want to understand which other systems that rely on it will likely fail next. When an application is reworked, I want to know which other applications are connected in one way or the other.

      All this understanding of the system architecture is usually in the heads of people and hidden somewhere in tomes of documentation and graphs that you need to read and understand. I want that knowledge query-able.

      --
      Assorted stuff I do sometimes: Lemuria.org
    2. Re: Clarifying what you really want: by GrantRobertson · · Score: 1

      OK, next question:
      Is it absolutely required that the data be embedded in, and tagged within, your documents which are stored in the wiki?

      RDFa is RDF in attributes. It is a standard for embedding tags within html documents to indicate RDF triples. It is assumed that the snippet in the document is the subject. That is wrapped in an html tag. Then attributes in that html tag are used to indicate the predicate and object via IRIs. I actually worked with the REFa working group for a short while. My problem with RDFa, and RDF in general, is the dogged insistence that everything be stored only in triples. There is no way to add metadata ABOUT the triple. Who created it? When was it created? Etcetera.

      In addition, all of the RDF triples have to be mined out of the html documents and stored in a database before they can be queried. ALL of the discussion within the working group assumed that the tags would remain relatively static and, once they had been mined out of documents all over the internet, then the real work would be in querying the database. But those of us in the real world know that is unrealistic. In your case, your system would have to grind through your entire document store every night, mining out a fresh RDF database for you to query the next day.

      On top of all that: There really are no tools to make inserting those RDFa tags easy. I was brought into the working group as a technical writer, tasked with the chore of explaining RDFa to web content writers so they could understand it well enough to be able to insert the tags by hand. Except RDFa is insanely convoluted. It would be like writing an entire interactive web site without using a framework or any existing libraries. Just hand coding every function. NOT GONNA HAPPEN.

      Earlier, you said you figured other people would have wanted to do this before. Of course they have. It's called "The Semantic Web," and Tim Berners-Law has been championing it for decades now. But it has never been widely accepted due to the dogged insistence that everything be done using the all-powerful, magic triple. You would not believe the insanely complex data structures necessary to build real, usable databases with only triples. It would be as if everything in the world were built with Lego because that was all engineers had learned to use.

      So, I think you would be far better off with a separate database to contain your information. The records in that database could include citations pointing to your documents, in case someone wanted to drill down to the true source of the info.

      Your information is structured but the structure varies enough that you might need to use quite a bit of creativity to use a standard relational database. But it should be possible. Or you could design an XML schema to define and "contain" the structure of your information. There are existing XML database engines and query languages. Either way, you are going to need to custom-build your solution.

      The last option is to just shop for an existing software solution for tracking network data. Look for one that has a Notes field that allows hyperlinks. Then you can insert links to your documents. In the end, this may be the most realistic solution. Network tracking and documentation software is a very mature field. There will be lots of features available that you would have be able to build on your own. You can still incorporate your documentation library, but from the other way around. Instead of storing your information IN your documents and mining it out, you store your information in a purpose-built SYSTEM and then refer to your documents as supporting evidence.

      Well, I gotta eat breakfast and go to jury duty. Let me know what you think.

    3. Re: Clarifying what you really want: by Tom · · Score: 1

      Firstly, thanks a lot for your time. This is exactly the kind of exchange I was hoping for (well, that and a small hope someone would post "sure, look here at http://semantic-documentation-...").

      Is it absolutely required that the data be embedded in, and tagged within, your documents which are stored in the wiki?

      Not absolutely, but I am afraid that if prose and metadata are seperate, they will go out of sync. One thing I like about SMW, for example, is how easy it is to update both at the same time, it comes so naturally that you don't even realise it.

      RDFa is RDF in attributes.

      Yes, if there were an editor (you already point out there isn't, damn) this would be really cool.

      Earlier, you said you figured other people would have wanted to do this before. Of course they have. It's called "The Semantic Web,"

      Yes and no. I want something very similar, except with a more narrow focus and thus easier to make possible. I'm currently looking at ArchiMate following someone elses pointers and the advantage of modelling business processes in it as well. Someone in the Netherlands has studied the exact problem I'm trying to solve and published a small paper.

      You would not believe the insanely complex data structures necessary to build real, usable databases with only triples.

      I believe. The database design for my online game (see footer) is anything but trivial and it has many places where I've encountered exactly this problem of complex relations between entities.

      So, I think you would be far better off with a separate database to contain your information. The records in that database could include citations pointing to your documents, in case someone wanted to drill down to the true source of the info.

      Yes, I've been thinking along similar lines, except that I would like to embed or something, at least one step better than linking, but that's a UI issue.

      Yes, in most documentation you have some prose and then tables or diagrams containing hard data. It would be possible to do it this way, if the systems are well integrated.

      You can still incorporate your documentation library, but from the other way around. Instead of storing your information IN your documents and mining it out, you store your information in a purpose-built SYSTEM and then refer to your documents as supporting evidence.

      That is another option. What I have learnt from all the answers here put together is that no such solution exists (what a shame. Anyone got VC and looking for a Startup concept?) and that it will either have to be built or put together from existing components. That turns it into an integration issue.

      Very interesting answers, thanks. I will follow up on them and talk to others in the company to find the solution that best fits.

      --
      Assorted stuff I do sometimes: Lemuria.org
    4. Re: Clarifying what you really want: by GrantRobertson · · Score: 1

      OK, well I will just leave you there. Part of what I want to do in my Ph.D. work is to tackle this problem. But I got a ways to go first.

    5. Re: Clarifying what you really want: by Tom · · Score: 1

      That's interesting. I won't be able to wait for that, unfortunately. But maybe we should stay in touch, because I reckon that I will encounter this issue again and might tackle it as well.

      --
      Assorted stuff I do sometimes: Lemuria.org
    6. Re: Clarifying what you really want: by GrantRobertson · · Score: 1

      Just google the word "ideationizing" and my name.

  40. Oracle, SMW by movdqa · · Score: 1

    Oracle has a Semantic layer over their RDBMS that comes with their Spatial package. My only knowledge of it comes from talking to a product manager about it a long time ago. My son works in a shop which uses SMW for their LIMS. I'd say that it's an enterprise application with him as the main developer and a few other non-IT/CS/EE folks writing queries against it. He's like a developer + DBA and everything goes through him and he writes almost all of the forms and reports from requests by his department. I've been fairly impressed with the size of their application that's been developed by a pretty small and mostly non-technical team. He has to fix SMW problems from time to time too. SMW had some rather severe problems with maintenance and I think that the group that used to do it is in Germany. There is a place to get maintenance I think but I don't know how reliable it is. He goes in and fixes problems with SMW when there's a problem that they can't work around. That's part of the fun with Open Source.

  41. Semantic web tools by Anonymous Coward · · Score: 0

    You can always convert your data into RDF triplets using Jena and store them in fuseki (like sql server but for facts). You can then query the data using Sparql. However you need an ontology first to organize your facts. There is an ongoing effort to move us government data to semantic web (nih repos for example). This is the vision Tim Berners Lee wants for the future and he has several vision papers on this subject ( the benefit of moving to linked data etc.) which can be a good starting point for convincing ppl to follow the semantic web principles.

  42. Use a graph database? by Eyeballs · · Score: 1

    From:
    http://radar.oreilly.com/2013/07/why-choose-a-graph-database.html

    "Instead of de-normalizing for performance, you would normalize interesting attributes into their own nodes, making it much easier to move, filter and aggregate along these lines. Content and asset management, job-finding, recommendations based on weighted relationships to relevant attribute-nodes are some use cases that fit this model very well."

    An example of this is/was FreeBase:
    https://www.freebase.com/
    (Look at the query examples.)

  43. The Brain by Anonymous Coward · · Score: 0

    http://www.thebrain.com/