Slashdot Mirror


UML, PostgreSQL Get Corporate Support

tcopeland writes "An article on NewsForge highlights some changes in the upcoming PostgreSQL release (v7.5) that are funded by Fujitsu. PostgreSQL core team member Josh Berkus says that "Tablespaces, Nested Transactions, and Java support" are being underwritten by Fujitsu; this has also been mentioned on the postgresql-hackers list. He also says that 7.5 will be "...the most significant new release of the software since version 7.0 almost four years ago". Good times for PostgreSQL users!" And ggoebel writes "Jeff Dike posted a notice to the UML [User-mode Linux] developers mailing list: 'The first bit of news is that as of last Monday, I am working for Intel. They generously offered a full-time position, off-site, with my time mostly spent on UML. This basically means that UML is no longer a part-time, after-hours thing for me, so we should start seeing more work happening on it, especially compared to the last month or two.'"

58 of 213 comments (clear)

  1. UML is pretty awesome by Anonymous Coward · · Score: 3, Informative

    It's really the future of "shared" webhosting because it balances the power of a full server against the cost of a shared one. Some hosts like JVDS and RimuHosting are already doing this and it's great.

    1. Re:UML is pretty awesome by gtrubetskoy · · Score: 5, Interesting
      It's really the future of "shared" webhosting because it balances the power of a full server against the cost of a shared one.

      I respecfully disagree. While UML gives you excellent isolation, it is an extremely inefficient way to virtualize your server since it does not take advantage (by design) of all the optimizations that UN*X provides. UML is great for kernel developers and applications where isolation is far more important than performance.

      In Linux virtual server hosting, the future will be Linux VServer Project

      (ok, I'm somewhat biased, I admit)

    2. Re:UML is pretty awesome by drinkypoo · · Score: 2, Insightful
      Linux VServer might be the future, but the documentation is so far in the past, we'll never know.

      Why are Wikis always touted as the solution to documentation, yet every time I try to find useful information in some project Wiki, it is always useless?

      Ahh, there's a paper on VServers. Sounds kind of like jails with more separation. However, the filesystem separation of UML is a feature. VServers are good for completely managed hosting, I'm sure, but UML is the answer to people who want to get whole machines. Instead of leasing a computer to someone, you can lease a UML instance, and no matter what they do to it, they (in theory) cannot cause problems for anyone else.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    3. Re:UML is pretty awesome by Anonymous Coward · · Score: 3, Interesting

      There are plenty of "customers" out to cause problems. There are many that will take advantage of anything they can if it'll benefit them, regardless of how it affects the rest of the people using the server. I browsed some of the VServer hosts, and they all seemed more expensive than the UML host I have.

  2. That's all fine and dandy, but... by Mr.+Spontaneous · · Score: 2, Interesting
    'The first bit of news is that as of last Monday, I am working for Intel. They generously offered a full-time position, off-site, with my time mostly spent on UML. This basically means that UML is no longer a part-time, after-hours thing for me, so we should start seeing more work happening on it, especially compared to the last month or two.'

    Will this mean that Intel might have a chance to influence its development? The true benefit of projects such as this is their independence from the big brother corporations who attempt to control the industry/market.

    --
    Its all fun and games until someone loses an eye... then its just fun.
    1. Re:That's all fine and dandy, but... by jbolden · · Score: 4, Interesting

      Well obviously if they are funding development then they will have influence on what gets worked on. What political agenda do you see Intel as likely to have on advancing user mode? It would seem to me that this is fairly typical of Intel software devleopment for the last 15 years -- making sure that there is publically available code highlighting how to do cool things with their CPUs.

    2. Re:That's all fine and dandy, but... by SIGALRM · · Score: 5, Insightful

      The true benefit of projects such as this is their independence from the big brother corporations

      You mean like Sun and HP funding the Apache group?

      Or Novell and Ximian underwriting the Mono Project?

      Or IBM contributing to F/OSS?

      Do you think these and other projects would be where they are today without the backing of serious money/resources?

      --
      Sigs cause cancer.
    3. Re:That's all fine and dandy, but... by Mr.+Spontaneous · · Score: 3, Insightful

      I just think that, with the funding, the projects are encouraged in a certain direction. This isn't always bad, but seeing something with a "Optimized for the Pentium 4" logo always makes me wonder what would have happened if it didnt have this funding. (I would say the same thing if it was optimized for AMD)...

      --
      Its all fun and games until someone loses an eye... then its just fun.
    4. Re:That's all fine and dandy, but... by bfields · · Score: 2, Insightful
      I just think that, with the funding, the projects are encouraged in a certain direction. This isn't always bad, but seeing something with a "Optimized for the Pentium 4" logo always makes me wonder what would have happened if it didnt have this funding.

      You've seen free software projects with "Optimized for the Pentium 4" on them?

      I think people may not realize the extent to which free software development is already corporate-funded.

      --Bruce Fields

    5. Re:That's all fine and dandy, but... by cleverhandle · · Score: 3, Interesting

      Loosen up that tinfoil hat, man. This is a pretty natural project for Intel to invest in. Improved User Mode Linux leads naturally to more shared servers, as others have detailed. And, in the interest of efficiency, those shared server operators will be interested in nice, juicy processors to allow more virtual servers on the same piece of physical hardware.

      Sounds like a simple business investment to me - no need to search for conspiracies here.

    6. Re:That's all fine and dandy, but... by Sxooter · · Score: 3, Insightful

      That's not the same thing. If the higher ups at Oracle tell the Oracle developers to include a dubios feature that may be bug ridden, the developers DO IT because they were told to.

      If IBM tells the apache group to put in dubious and buggy code, the apache group tells them to buzz off.

      There is a difference, even if it isn't obvious at first glance.

      --

      --- It is not the things we do which we regret the most, but the things which we don't do.
  3. clarification please... by Anonymous Coward · · Score: 3, Informative

    UML ....

    (1) Unified Modeling Language?

    or (2) User Mode Linux?

    Methinks (2), given that I work alot with (1) and have never heard of Jeff Dike

    1. Re:clarification please... by mrwiggly · · Score: 3, Funny

      Sheesh, click on the link! It's user mode linux.

  4. Solid stuff, that PostgreSQL... by tcopeland · · Score: 5, Interesting

    ...RubyForge has been running on it for almost a year now, no problems.

    Only a half million records and only about 75K queries a day, so it's not a huge DB... but it's definitely getting the job done.

  5. UML is no longer a part-time, after-hours thing by Anonymous Coward · · Score: 4, Funny

    This basically means that UML is no longer a part-time, after-hours thing for me

    You have my deepest sympathy.

  6. UML by Un+pobre+guey · · Score: 5, Informative
    OK, UML is User Mode Linux. Got it. No, no, I'm not confused, I get the coincidence with the other extremely widespread use of the acronym. No prob, Dude.

    1. Re:UML by timothy · · Score: 2, Interesting

      Lorcha --

      You're right; I'd meant to parse the name and add in a link (as I now have done) to the project's web page.

      timothy

      --
      jrnl: http://tinyurl.com/c2l8yr / foes: http://tinyurl.com/ckjno5
    2. Re:UML by Ignignot · · Score: 3, Funny

      Egad, a slashdot editor has apologized for giving an incomplete story blurb! Did I cross over into... The Twilight Zone?

      Thanks timothy

      --
      I submitted this story last night, and it didn't get posted.
  7. UML by lorcha · · Score: 2, Informative
    Who the hell is Jeff Dike and why is he working on the Unified Modeling Language? And why does Intel care about it?

    Oh, you meant User-mode Linux? Well, why didn't you say so? Sometimes I think these writeups are intentionally confusing.

    --
    "Avoid employing unlucky people - throw half of the pile of CVs in the bin without reading them." -- David Brent
  8. Table spaces? by AKAImBatman · · Score: 5, Interesting

    Does this mean that PostGreSQL will actually be able to write *directly* to disk cluster? That would be one serious performance boost! My only request is that they do us all a favor and make sure that we can fragment the tables across spaces. It tends to suck when one table fills an entire drive, and it refuses to use all the space on the other drives.

    1. Re:Table spaces? by jadavis · · Score: 5, Informative

      "Tablespaces" allow you to put individual tables on different storage devices. Prior to tablespaces, an entire database had to be on one device*.

      You are referring to two completely different technologies:

      (1) "Writing directly to disk cluster" - By that you seem to mean direct disk access, not through the filesystem. I don't even think this is part of the PostgreSQL TODO, because there is just not a very strong need. Are you experiencing performance problems in this regard?

      (2) "fragment tables across spaces" - By that you mean "Table Partitioning". That allows you to break up a single table across multiple storage devices. That would be very valuable technology, but as far as I know, won't make 7.5.

      If all these features really work out for 7.5, they should call the release 8.0, and maybe they will.

      *: There are some tricks you can use if you need to move a single table to a different device prior to 7.5. I think symlinks work fine, but if it's important, I'd wait for 7.5 or ask on the -general list to make sure it's correct.

      --
      Social scientists are inspired by theories; scientists are humbled by facts.
    2. Re:Table spaces? by rgigger · · Score: 2, Informative

      No it does not write directly to the disk cluster if you mean that it can write to a raw unformatted disk. They want to get all of the nice buffering for free from the os because they can't beat it's performance yet. Writing directly to the raw disk would slow it down right now. They are going to reconsider this if someone can write a caching system that can beat the os but so far that hasn't happened.

      They also do not have table partioning. It has been discussed and it is a high priority feature but it doesn't seem like anyone has seriously tried to tackle it yet. I'm guessing that it will be on the radar for the next release though.

      Tablespaces basically just lets you partition your db across different volumes but a single table cannot be split up.

      I am not a developer but this is what I have gleaned from the hackers list.

    3. Re:Table spaces? by jadavis · · Score: 4, Insightful

      They are going to reconsider this if someone can write a caching system that can beat the os but so far that hasn't happened.

      It's a little more complicated I think. Using the filesystem has other advantages as well:

      (1) PostgreSQL can work well with other applications running. Let's say you invent the best caching algorithm possible, then you still have two seperate caches, one for PostgreSQL and one for everything else. That means you have to dedicate the machine to PostgreSQL and have a high PostgreSQL cache (but any other app will suffer), or give postgres a low amount of cache space and it will suffer.

      (2) The postgres developers don't want to worry about the bugs involved in making their own filesystem. Also, who's to say they can make a filesystem as fast right off the bat? It might be a huge development effort, with relatively minor benefit for most people.

      --
      Social scientists are inspired by theories; scientists are humbled by facts.
    4. Re:Table spaces? by GooberToo · · Score: 4, Informative

      "Tablespaces" allow you to put individual tables on different storage devices. Prior to tablespaces, an entire database had to be on one device*.

      Strictly speaking, that's not true. You can move things around manually, and some have done so, but it's not pretty, not easy, and not easy to maintain. Implementation of tablespaces in PostgreSQL simply allows its users to easily do what was previously an arcane-voodoo art. So clearly, it's a big step up. But, you already knew that. ;)

      "Writing directly to disk cluster" - By that you seem to mean direct disk access, not through the filesystem. I don't even think this is part of the PostgreSQL TODO, because there is just not a very strong need. Are you experiencing performance problems in this regard?

      That's correct. AFAIK, there is no desire to implement raw partition support. The speed difference is minimal and the required code is large. Basically, you wind up writing a FS and associated buffer management into the database. The return generally is not very high. It used to be, many years ago. These days, filesystem technology and implementations are plenty fast. Those that want raw partition access, IMO, are simply living in the past.

      If all these features really work out for 7.5, they should call the release 8.0, and maybe they will.

      You are correct. Accordingly to the list, the numbering constantly goes back and forth. From what i gather, they are waiting to see what features actually make it in. Depending on the scope of changes, they'll then determine the version number. As a rule of thumb, people are calling it 7.5, simply because nothing else has been blessed.

      Please don't think I'm correcting what you've said. You've said nothing that I disagree with. I'm simply adding a followup remark. ;)

      Cheers!

    5. Re:Table spaces? by Just+Some+Guy · · Score: 4, Interesting
      By that you mean "Table Partitioning". That allows you to break up a single table across multiple storage devices.

      For the uninitiated and lazy, is there any compelling reason why that's better than putting the database files on a RAID and letting the OS split the table across devices?

      --
      Dewey, what part of this looks like authorities should be involved?
    6. Re:Table spaces? by kpharmer · · Score: 4, Informative

      > For the uninitiated and lazy, is there any compelling reason why that's better than putting the
      > database files on a RAID and letting the OS split the table across devices?

      Sure, you might want to distribute your data across multiple arrays. For example - keep your logs and tempspace on an fast & expensive raid 0+1 array of fast (15k drives). Then put small OLTP stuff on a another raid 0+1 array. Then put your huge graphic images, documents, etc on a much more economical RAID5 array.

      I use multiple arrays all the time for performance and economics (in db2 & oracle) - this is cool to see postgres pick itup.

    7. Re:Table spaces? by Java+Ape · · Score: 3, Informative
      The answer to this question depends on what your database looks like. For most small, general purpose databases the RAID approach is great. Fast, simple and not much planning required.

      However, for larger or more complex systems there are some advantages to splitting tables over multiple disk systems. For example, tables with lots of little niggling disk writes (access tables, change logs, temp tables) can go on a fast (possibly striped) disk system. You don't have to waste high-priced, high performance RAID on archived data (if it crashes, restore from tape), or on large media files etc stored as blobs or clobs.

      These are just examples, but on a large server with several different disk sytems available, this technology lets the database designer match storage system performance characteristics much more accurately than a simple raid.

  9. I'm a programmer by Tim+C · · Score: 3, Insightful

    I suspect a lot of people here are. To me, and probably to most of them, UML is Unified Modelling Language. Hell, do a google search for UML and the top hit is to the UML website.

    I know it's too much to ask OSS projects not to pick confusing acronyms and names, but I'd like to think that story submitters or at least editors could a little clearer.

  10. Re:Good to Hear... by Twirlip+of+the+Mists · · Score: 4, Informative

    the primary DB System for so long has been MySQL. PHP coders don't have too much for an alternative

    Au contraire, there are PHP interfaces for PostgreSQL, Oracle, Sybase, and MSSQL built right in to the source distribution. I seem to recall that back in the Bad Old Days before Mac OS X, when you had to compile things yourself, building PHP with all the necessary libraries was a huge pain, but now it's a trivial thing. Marc Liyanage maintains a PHP module package that snaps right into the built-in Apache web server on your Mac, and it already has most of the necessary bells and whistles built in.

    --

    I write in my journal
  11. Re:Good to Hear... by nojomofo · · Score: 3, Insightful

    the primary DB System for so long has been MySQL.

    Care to qualify that statement? Ever hear of Oracle? Or DB2 or SQL Server or Sybase or...?

  12. Good tools out there for PostgreSQL.... by tcopeland · · Score: 3, Informative

    ...some on PGFoundry, some still on GBorg.

    PLUG: For example, there's this little SQL query analysis utility!

  13. All Welcome and expected - expect more.. by eamacnaghten · · Score: 4, Insightful
    This is great news, not only for the projects involved, but for FOSS in general.

    Also this is consistent with the Open Source Paradigm. Where it is in the interests of companies to improve the software, and the advantages far outweigh the disadvantages of them not being exclusive. It is this philosophy, in my opinion, that will beat proprietary software models such as Microsoft, and it is these companies that are key in stopping those who want to halt the advancments of FOSS using idiotic patents and other invalid IP arguments.

    --

    Web Sig: Eddy Currents

  14. OLAP still missing... by zeux · · Score: 3, Interesting

    I'm really looking for an OLAP implementation on PostgreSQL... It's getting more an more important but it's still not there!

    I made my company switch from SQL Server to PostgreSQL but now I have to export data every day from PostgreSQL to SQL Server just to get my OLAP reports!

    As soon as OLAP is there I'll definitely get rid of SQL Server.

    1. Re:OLAP still missing... by GileadGreene · · Score: 2, Insightful

      I love PostgreSQL. I support Open Source. But I have to say: perhaps you shouldn't have forced a switch to PostgreSQL if it didn't actually meet all of your (and more importantly your company's) needs. Sounds like SQL Server was the better solution in this case.

    2. Re:OLAP still missing... by stuktongue · · Score: 2, Informative

      Okay, I'll bite. While I am certainly not an OLAP expert, I have found a need to learn a little about it and I plan to use it as part of an application I am developing for personal use.

      For the uninitiated, OLAP stands for online analytical processing. In layman's terms, this refers to the process of interactive analysis of data, typically via incremental queries that progressively slice, dice, and refine the data set in order to reveal non-obvious relationships between various parameters.

      OLAP is typically performed on data that is of medium-age; i.e., not just current data, as would be found in a typical operational database, but maybe not the full long-term historical data, as would be found, say, in a data mining environment. Of course, different types of data and different application scenarious make such generalizations somewhat problematic, but, generally, OLAP is focused on analysis of, say, the last year or two of data. Regardless, the data sets returned by OLAP queries are typically quite large. As a result, special techniques, distinct from those used for traditional transaction processing, are usually employed in order to meet query response time requirements, which are often key requirements for OLAP systems.

      One technique often employed is the use of so-called "star" or "snowflake" schema. This form of schema is quite different from the very normalized schema of transaction processing systems in that the data are organized into central "fact tables" with related dimension tables. Dimensions are things like date, location, product, etc., and have attributes that allow fine-grained querying of the facts in the fact tables. These dimension tables are also constructed in a way that reflects natural hierarchies; e.g., a date dimension would allow queries by year, month, week, day, etc.

      While such schema can be defined in traditional transaction processing systems, OLAP-aware database systems typically incorporate design elements that optimize processing of queries on such schemas. OLAP queries are focused on examining aggregates of data across the various dimensions, such as sums, averages, etc. These aggregates may be precomputed on selected chunks of the overall data set to speed up online queries, but the query processor needs to be able to identify opportunities to take advantage of such things. So, optimizing queries for OLAP is a key feature of an OLAP-aware system.

      Another feature of an OLAP-capable system is some sort of API for creating the various components needed, e.g., the schema, definitions for any pre-computed aggregates, defining rules for "rolling up" from lower levels of a dimension's hierarchy to higher levels, etc. Oracle's OLAP, for instance, provides several techniques for accessing OLAP data and metadata, but they mostly boil down to either a Java API (high-level) or a more arcane, lower-level API for more direct access. The API(s) available to program an OLAP application can be critical in determining the ease with which applications can be created, and the types of applications that can be created.

      Does this help a little?

    3. Re:OLAP still missing... by Sxooter · · Score: 2, Informative

      Take a look here:

      efeu

      --

      --- It is not the things we do which we regret the most, but the things which we don't do.
  15. Postgres is kicking butt by johnnyb · · Score: 5, Interesting

    and taking names. In addition to Fujitsu's additions, they are also doing point-in-time recovery. They have multiple replication solutions. It's an absolutely wonderful database to develop for.

    It's got several really cool features, such as the ability to create your own index types, the ability to create your own column types, the ability to create rules for updating views, and a lot of other things that make it an absolute joy to work with.

    The only thing I don't like about it is that it needs the ability to read bytea's as if they were BLOBs. Then life would be perfect!

    From Fujitsu's pile, tablespaces is the most interesting feature I see - and that's actually pretty cool. That's one of the things that really allows you to realize the logical/physical separation that relational databases promise.

    1. Re:Postgres is kicking butt by Sxooter · · Score: 2, Informative

      Do a google search for slony. It's in early beta right now, but looks very promising.

      --

      --- It is not the things we do which we regret the most, but the things which we don't do.
  16. This rules! by Anonymous Coward · · Score: 5, Interesting

    I'm loading more than half a million records into a Postgres db on my iBook as I write this, and I gotta say that pgsql is cool as hell. The data type support alone (polygons?!?!) makes it worth the small amount of extra effort it takes to get it up and running.

    Postgres flat blows away MySQL in every way I can thnk of except for the fact that one has to "manually" vacuum (cleanup + reindex) the db ... but that's what cron is for. The only things I miss from my MSSQL days is the ability to do on-the-fly data type changes on columns; this is actually a good thing because now I'm not so lazy about designing the db right in the first place. ;-)

    If you're out there playing with MySQL or MSSQL, you owe it to yourself to give Postgres a shot.

    1. Re:This rules! by tesmako · · Score: 3, Interesting
      Just having a polygon datatype is kinda cool in itself, but the fact that PostgreSQL really supports using R-tree indexes and thus make efficient geometric queries quickly and easily is really great.

      PostgreSQL is probably the most well-polished and useful open source project there is (gcc being the runner up, I skip linux since there really are plenty of decent OSS alternatives to it). Good going PostgreSQL team!

  17. Good news! by rfernand79 · · Score: 5, Interesting

    Certainly good news! :) PostgreSQL is a very robust and complete database, enjoyed by many academic users (mostly because of its excellent implementation of different SQL standards...) It's nice to hear that a company is backing them up now. UML and Intel, really cool, too. It's not as good as Linus/OSDL, but definitelly equivalent to the Linus/Transmeta years. So, in general, is this the road for the free world now? Backed up by powerful companies who also benefit? I certainly hope so.

  18. More servers running PostgreSQL... by tcopeland · · Score: 2, Informative

    ...can be found on the Big List Of GForge Sites.

    Props to Tim Perdue for picking a solid database on which to build GForge!

  19. Re:Good by Anonymous Coward · · Score: 2, Interesting
    Same with ours.

    Although recently one of our employees demo'd a "clone" (not of all the features, but enough to show it's real) of our system ported to PostgreSQL.

    It's being considered for some new (possibly lower margin, so free is good) products in the product family.

    The old "pgadmin II" tool had a useful migration tool, so other than stored procedures, the upgrade from MSsqlserver to PostgreSQL is supposedly quite smooth. That tool is still available but is hard to find because the newer pgadmin III doesn't (yet) have the migration feature.

  20. User-Mode Linux Management by Anonymous Coward · · Score: 5, Informative

    ...If you want to manage a lot of UML virtual machines, I _highly_ recommend UMLazi. It has a very slick configuration file format-- configuration directories instead of a single file, which makes it really easy to manipulate with scripts--, and they've obviously put a lot of thought into security.

    I had a few problems getting it started, but the developers were very helpful.

  21. Why corporate self-interest can be good for OSS by j.+andrew+rogers · · Score: 5, Interesting
    There is more than just Fujitsu supporting PostgreSQL and the reasons there is corporate interest is pure unadulterated self-interest of the best kind.

    Postgres is getting really close to the functionality and capabilities of the Big Commercial Enterprise DBMS, close enough that anyone can see that bridging that gap is quite doable. Most of the arguable weaknesses in Postgres are in the more esoteric high-end feature space, as it is already strong and quite feature complete for most routine RDBMS work. And the upcoming new version addresses a great many of those weaknesses. As the article said, this is going to be a major release.

    The self-interest part is that it is a HELL OF A LOT CHEAPER for a corporation to pay people to add those last few features and bits that they want to Postgres than to pay an unholy amount of money to buy the required Oracle licenses. The Postgres engine is clean and fundamentally pretty good in an engineering sense, and so enterprise feature tweaks are relatively cheap. It is all about dollars and sense at the end of the day. Purchasing Postgres plus feature development is almost always going to be vastly cheaper than buying Oracle. And unlike Oracle, it is pretty much a one-time fixed cost. It is worth repeating that the engineering strength and scalability of the underlying Postgres platform is the primary reason the market is evolving this way. The gap between MySQL and high-end RDBMS is comparatively much too great for a company to fund closing that gap because a lot of additional arguably unrelated work may be required because of the internals. This increases time to delivery of features, increases the cost of adding high-end features, and increases the risk of problems.

    If Oracle suddenly dropped its enterprise licensing costs by a couple of order of magnitude, then it would seriously threaten Postgres development. But since that is unlikely to happen, corporate money will continue to flow into making Postgres a formidable Oracle replacement, which it is already well on its way to being.

    1. Re:Why corporate self-interest can be good for OSS by nsayer · · Score: 2, Informative

      Using a raw partition typically means bypassing the filesystem code in the OS. Since most databases simply consist of a small number of large files that are randomly accessed by the database system, the overhead of the filesystem is unnecessary. Not having a filesystem between your database and the disk also means faster crash recovery - there's no need to run FSCK on the (largely irrelevant) filesystem AND run a database consistency check - you can jump right to the latter.

      You're right about this being for dedicated postgres boxes, but then dedicated database machines are exactly what you find in large enterprises. The "dot com" I work for has a big iron Sun running Oracle and nothing else, and a large number of smaller machines that do the "everything else". I think you'll find that fairly typical.

    2. Re:Why corporate self-interest can be good for OSS by stuktongue · · Score: 2, Informative

      My experience is with Oracle, so my comments here will be mostly restricted to that context. You are correct in saying that database servers are best dedicated to that function alone; the resources involved (memory, network, etc.) in running a non-trivial database server usually demand their own machine.

      I take some exception, however, to your view on raw partitions vs. filesystem-based storage. At least in the Oracle world, most studies and expert opinion I have viewed generally recommend against use of raw partitions. With appropriate use of RAID and suitable filesystem selection, the overhead associated with filesystem storage is usually not considered significant, despite many folks's assumptions otherwise. When you consider the difficulties in managing storage over time--e.g., altering tablespace mappings to files, expansion of tablespaces, equalization of I/O--use of filesystems makes such administration much more straighforward. Tom Kyte, a highly-respected technical expert at Oracle, highly recommends against the use of raw partitions unless you just can't stand the 2-3% performance hit.

      That said, raw partitions have been required in "Real Application Clusters" (RAC) environments (previously known as Oracle Parallel Server (OPS)), at least until the mainstream acceptance of so-called cluster filesystems. It is my understanding that Oracle's work on clustered filesystems is aimed at allowing RAC systems to enjoy the substantial benefits of filesystem storage.

  22. GUI Tools by Bilbo · · Score: 3, Interesting
    I just started using pgadmin3, and I have a feeling I have a way to go before I get really good at it. I could really use a migration tool since, as with the previous poster, we have a lot of SQL Server based applications I'd like to see moved to something like PostgreSQL.

    Frankly, I still like the old TCL based "pgaccess". It was buggy as all get out, and really bogged down on larger databases, but it had some really nice tools such as the visual query designer.

    The article mentions a couple of other GUI tools for accessing and maintaining PostgreSQL databases. Has anyone else used these, or are there other tools that people like?

    --
    Your Servant, B. Baggins
    1. Re:GUI Tools by lexus99 · · Score: 2, Informative

      Actually, you may find pgadmin2 a better choice for now. It has a migration plugin that works wonderfully. ASAIK, this plugin is not yet available for pgadmin3, and doesn't appear to yet be a priority, as it should IMO.

      PGAdmin2 is not available for Linux. I can only assume you use Linux since you mentioned pgaccess. I've not heard of a Win port of it, but since it is written in TCL/TK, it would probably be fairly easy to port. PGAdmin2 may even run fine under WINE (not tested)

      However, with that said, the former poster was correct, MS Access DOES work very well with postgresql. There are a few problems, but I've always managed to work past them.

      LeX
  23. windows port by MagicMerlin · · Score: 3, Informative

    7.5 will contain a native windows port with no external dependencies. You can find the current binary version here.

    Even though it is currently in beta it works very well. The port is now being downloaded over 2000 times a week and increasing all the time.

  24. Also in PostgreSQL 7.5 - Native Windows Port by john_smith_45678 · · Score: 3, Informative

    Looks like version 7.5 will also include a native Windows port. Prior to this, PostgreSQL on Windows has always required Cygwin (which offers a lot of great stuff in and of itself) to run.

  25. Re:Good-Postgres and SQL Server by randall_burns · · Score: 2, Interesting

    I'm a SQL Server DBA and Python/Perl/Postgresql developer.

    My sense is that it would be possible to extend Postgres to have a mode fully compatible with Oracle and/or Microsoft SQL Server. What this might mean is having SQL interpreters fully compatible with the quirks of Oracle and SQL Server-identical system tables available and identical libraries. I think Oracle will be the first target here because Oracle licensing fees are much higher than SQL Server--and parts of SQL Server are harder to re-engineer(i.e. DTS and some of the scheduling stuff).

    Databases are a great Open Source target because scripts are open _and_ customers frequently control their data file format.

  26. Google didn't exist when user-mode linux started by mec · · Score: 2, Informative

    Jeff Dike started user-mode linux in February 1998

    message from jeff

    Unified Modelling Language may have existed in early 1998; I first saw it in April 1999. But Unified Modelling Language was a lot smaller back then.

    And Google did not exist in February 1998!

    These days, when I need to name something, I stick the name in google and check for conflicts.

  27. advanced features can be ignored by brlewis · · Score: 2, Interesting

    apt-get install postgresql will work fine for most common installations. Once you've created your database there's really nothing simpler about mysql; PostgreSQL is simply more adhering to SQL92. There's very little reason to ever choose MySQL anymore.

  28. Re:Google didn't exist when user-mode linux starte by red+floyd · · Score: 2, Informative

    Rumbaugh, Booch, and Jacobsen started on UML in the mid 90s.

    According to this, UML 0.9 was from 1996, UML 1.0 was 1997.

    --
    The only reason we have the rights we have is that people just like us died to gain those rights. -- Cheerio Boy
  29. Re:Good by KenSeymour · · Score: 2, Insightful

    Desipite this being moderated offtopic, I found this to be true myself.

    I suspect there are more system administrators reading slashdot than programmers.

    So if you are being paid to program and you want to work with PostgreSQL, your best bet is to talk your current employer to switch.
    That is because almost no one is hiring programmers for PostgreSQL or MySQL.

    Or you can keep using Oracle or MSSQL and put marketable skills on your resume.

    --
    "We can't solve problems by using the same kind of thinking we used when we created them." -- Albert Einstein
  30. And that's a problem, because....? by hargettp · · Score: 2, Insightful

    I've used the OLAP stuff on SQL Server, and I honestly didn't find it much help. From what I saw, it was practically a *report writer* front-end on top of the underlying database--not nearly enough power for the problems my company has.

    True OLAP often involves many layers of analysis, with many steps of processing. I had hoped that SQL Server OLAP would help manage all that, but it doesn't do enough. To be fair, there are some nice tools to graphically create what amouont to some stored procedures, but after a while, an experienced SQL developer will just revert to lots and lots of stored procedures chained together in a job that runs on a regular basis. Consequently, all that mysterious logic about *how* a number in the middle of an OLAP report came to be calculated is still stored in a procedure that can't easily be modified. OLAP in SQL Server gives the illusion that analysis is easy, but, alas, it is not.

    I'd be curious to hear from others who have used SQL Server's OLAP and other tools like Cognos, Oracle, or Siebel, etc. What OLAP features *should* PostgreSQL adopt?

  31. Re:what's the point? by sumbry · · Score: 4, Interesting

    You've obviously never run a large database before. While a single RAID partition is fine for most uses, when you get into situations where you measure queries by how many are run per second then things really start to hit the fan.

    Tablespaces allow you to do things like place a table that is 90 percent read and 10 percent write on one RAID array while taking another table that is maybe 50 percent write and 50 percent read on another table and then taking the Postgres WAL and placing that on a completely different array.

    Table usage varies greatly across large databases. Some tables barely get touched, others get written to alot, others get read from alot.

    I'm currently running a database where our peak loads are around 35 queries, per second. I've actually symlinked table locations to put my most heavily accessed tables on a seperate RAID array from the rest of my database. This gave me a 3 fold increase in speed. This is really noticed when we do things like VACUUM the db.