PostgreSQL 8.0 Enters Beta

You can save a few clicks... by tcopeland · 2004-08-10 04:03 · Score: 5, Informative

...and jump right to the beta announcement message.

--
The Army reading list

where to download by jbellis · 2004-08-10 04:06 · Score: 5, Informative

http://developer.postgresql.org/beta.php

the windows installer is at
http://pgfoundry.org/projects/pginstaller

BitTorrent Downloads by Anonymous Coward · 2004-08-10 04:07 · Score: 5, Informative

http://bt.postgresql.org

Bittorrent [was Re:where to download] by dfetter · 2004-08-10 04:08 · Score: 4, Informative

http://bt.postgresql.org

Join the torrent! :)

--
What part of "A well regulated militia" do you not understand?

I recommend Mysql users to take a look at PG by xutopia · 2004-08-10 04:09 · Score: 4, Insightful

it is the most advanced Open Source database there is. If anything pissed you off with MySQL chances are Postgresql will have a solution for you.

Re:I recommend Mysql users to take a look at PG by Jason+Earl · 2004-08-10 04:16 · Score: 5, Informative

While it is true that the PostgreSQL project doesn't include full text searching there is a full text searching engine for PostgreSQL. The "problem" is that it is licensed under the GPL, not a BSD-style license, and so it is not included in the official distribution. Here's the link.
OpenFTS has been around for quite a while, and is used pretty heavily, so there really is very little reason to put up with MySQL's many shortcomings.
Re:I recommend Mysql users to take a look at PG by rtaylor · 2004-08-10 04:19 · Score: 5, Informative

In short, yes and perhaps.

Replication (master & multiple slaves) works great with Slony.

Fail over should not be done by the database, but by the operating system or an external monitoring system (Big Brother). RedHat Enterprise with the standard failover configuration works fine.

You just need to tell Slony that the new node has been elected as master. It's not as clean as it could be (no gui tools, etc.) but it is functional enough for the .org servers (Affilias).

--
Rod Taylor
Re:I recommend Mysql users to take a look at PG by Svennig · 2004-08-10 04:22 · Score: 5, Interesting
Okay, I'll bite
Stored Procedures. They arent functions. Functions are different. Functions should be called inline, from within SQL statements. There should be a difference.
When looking towards migrating to an OSS database from MS SQL Server I looked into how easy it was to use the postgres stored procedure/functions/things. I couldn't find any equivalent of returing a resultset. In MS SQL you use:
CREATE PROCEDURE name AS select * from test

The only way that I could find to do that in postgres was:
1. Return a cursor reference, in which case throuch JDBC you need to obtain this horrid postgres specific class from the driver.
2. Return a set of results (or was it records, my mind fails me), but I couldn't get this to work
3. Return a set of created types, but this doesnt work well for dynamic queries and makes maintainance a headache - you'd end up defining hundreds of types!
4. So it seems that we're stuck with MS SQL server.
  If someone DOES know how to do this is a non-evil manner, please tell me! Were planning on doing an upgrade, and I'd rather not have to fork out the money for SQL Server licences etc...
Re:I recommend Mysql users to take a look at PG by JamesKPolk · 2004-08-10 04:36 · Score: 5, Informative

Using postgresql 7.4.2:

create function testfunction() returns setof record as 'select name from companies;' language sql; select name from testfunction() as names(name char(50));

That will print all the company names in my database.
Re:I recommend Mysql users to take a look at PG by Ragica · 2004-08-10 05:55 · Score: 4, Informative

"tsearch2" which openfts is based on is BSD licenced, and included in the PostgreSQL contrib package. It's pretty good, and all you need for creating full text searchable indexes.
The documentation describes the differences between tsearch2 and openfts like this:
OpenFTS is a middleware between application and database, so it uses tsearch2 as a storage, while database engine is used as a query executor (searching). Everything else (parsing of documents, query processing, linguistics) carry outs on client side. That's why OpenFTS has its own configuration table (fts_conf) and works with its own set of dictionaries. OpenFTS is more flexible, because it could be used in multi-server architecture with separated machines for repository of documents (documents could be stored in file system), database and query engine.

Die IT theme by ScytheBlade1 · 2004-08-10 04:10 · Score: 4, Funny

Die IT theme.

Can it Compete with Oracle or DB2? by Greyfox · 2004-08-10 04:12 · Score: 4, Interesting

I was interviewing with a company a while back that was planning to move from an Oracle based set-up to a Postgres based one. They were pushing enough data around daily that they were having to redesign their collector software to be more efficient, so I was somewhat concerned that Postgres wouldn't give them the performance they needed, but I haven't really seen enough of it to be able to comment.

On the other hand, I've seen both Oracle and DB2 corrupt indexes and database table data in various circumstances (Usually the failing of a DBA in some capacity or other.) I'd be curious to see how the various databases stack up against each other without the hype that most of the parties that publish such studies usually bring to the table.

--

I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

Re:Can it Compete with Oracle or DB2? by rtaylor · 2004-08-10 04:29 · Score: 5, Informative

PostgreSQL is a touch slower than Oracle in some special cases (single user on multi-cpu machine doing large or very complex queries) since Oracle can split the work up amongst CPUs.

With 50 to 100 users, both have similar performance for many workloads (you need to test your specific workload), but Pg may require a touch more tuning.

Pg is, however, quite a bit more reliable than Oracle as far as corruptions go (keeping in mind this is a .0 release) so long as the hardware is adequate (doesn't lie about writes hitting disk when they're really in cache).

There are ways of purposfully crashing PostgreSQL as an authenticated user (particularly if you have root access and can write functions in C), but in standard operation it's quite reliable.

I cannot speak for DB2, but based on their Docs I think Pg is nearly as feature complete in most areas, and much more complete in many others.

--
Rod Taylor
Re:Can it Compete with Oracle or DB2? by Stone316 · 2004-08-10 04:33 · Score: 5, Informative

In my 8 years as an Oracle DBA I don't think i've ever seen a corrupt index. Saying that, I don't even know how you could force a index to get corrupted so I don't think were seeing all the info here.

There is alot of hype in the database market as is there with any other area such as OS preference. A good DBA, like a good software developer will pick the database that fits the needs. Saying that, usually DBA's (like anyone else) are under certain constraints from management but heres my take:

For:

Mission Critical Apps and large DB's: Oracle or DB2
Mid-Range apps/Mid-sized DB's: The above + SqlServer, PostgreSQL.
Small: SqlServer, PostgreSQL. (+ Oracle and DB2 if you like to toss around the $$).

I'd feel comfortable using PostgreSQL for upto and including mid-sized DB's. Currently I wouldn't use mySQL for anything until they fix their Gotcha's.

If you really and I mean really have faith in your developers and they have reviewed these gotcha's i'd consider MySQL but unfortunately, as every DBA knows this isn't always possible. It only takes one bad apple to mess everything up.

Of course there are many other factors to choosing a database but the core comes down to your employees and licensing. If all your DBA's have extensive training on Oracle then it doesn't make much sense to retrain them to use another DB just because it maybe a better fit. Also, if you have a server license for Oracle it doesn't make much sense to buy another so you can use DB2.

At our shop we have a mix of Oracle and SqlServer. But we do installs for DB2, Redbrick, informix, everything and anything under the sun.

Anyways, thats my 2 bits.

--
"Thanks to the remote control I have the attention span of a gerbil."
Re:Can it Compete with Oracle or DB2? by Stone316 · 2004-08-10 04:48 · Score: 4, Insightful

There's more to a database than feature set. While postgreSQL has enough features for 98% of the applications out there it isn't that simple.
In 8 years i've never lost data from an Oracle database from a software problem. I've seen data get lost, corrupted for a number of reasons including incompetent DBA's and their managers. (Off topic but once I was called into a major telephone company because they had some corrupted blocks from a hardware issue. I asked them how long did they keep their backups, 3 months they replied. Ok, how long have you known about this problem, 5 months. Hrmm... 3 - 5 = -2. You can fill in the rest of the story..)
I haven't used postgreSQL in a production environment long enough to know how stable it is. The reason companies choose commercial vendors for their DB's are two fold:
1) Track record. They know if they pick Oracle or DB2 they are getting a solid,proven database system. And there is an abundance (if not expensive) of knowledge out there. Their support is bar none and for the most part I have been extremely satisfied.
2) Managemnt loves to save money. Who wouldn't want to be able to say they saved the company millions of dollars by going to a 'free' or inexpensive (if you pay for support) rdbms like postgresql or mysql. One reason, blame. Opensource still has the stigma attached to it that no one is accountable for bugs. If a critical revenue generating db goes down because of a bug a manager needs to be able to point their finger. Until this stigma goes away we won't see alot of opensource adoption in the DB market. Whether this is right or wrong doesn't matter, people don't like change and opensource is a huge change for alot of managers.

--
"Thanks to the remote control I have the attention span of a gerbil."

Native Win support - awesome by Stone316 · 2004-08-10 04:13 · Score: 5, Interesting

I think this was a major stumbling block for postgreSQL's adoption. I'd love to use it here at work for some small projects but unfortunately were getting more and more windows servers. PITR recovery is a must for any production database these days. Maybe there are some 3rd party packages but I don't think mysql supports this yet. This is great news and I hope it spurs a new round of adoption for pgsql!

--
"Thanks to the remote control I have the attention span of a gerbil."

Windows support by Anonymous Coward · 2004-08-10 04:14 · Score: 5, Interesting

I think Windows support is the only reason MySQL is so popular. PostgreSQL has always been ahead of MySQL in terms of everything but speed. But everybody is familiar with MySQL because, when you want to pick something up, you pick the one that will work with your system, and most people are on Windows.

Up until this point, you have had to install hundreds of MB of cygwin to get PostgreSQL to work on Windows. I think it's a little late to usurp MySQL's market share, especially as MySQL is now entrenched in the cheap web hosting market, but at least PostgreSQL might get the respect it deserves.

Re:Windows support by Khomar · 2004-08-10 04:34 · Score: 4, Insightful

I think Windows support is the only reason MySQL is so popular...ahead of MySQL in terms of everything but speed. (emphasis added)

You cannot overlook the speed aspect as well. For many, many databases, the special features of foreign keys, stored procedures, etc. are not required. I have worked extensively with Oracle databases in the past, so I am well aware of the advantanges of these advanced features. However, in my current company, there has been no need. Most of our databases, while large, consist of a very small number of tables. The vast majority of our searches are performed on a single table, and these searches are completely optimized for speed. Stored procedures will not help in these scenarios, and foreign keys are not needed. I imagine there are a lot of web sites and applications that have the same characteristics.

I will not deny that the Windows support is huge, but it is not the only factor in MySQL's court. Speed is a huge issue, especially in the database world.

--
I believe in de-evolution. God made the world perfect, man fell, and its been going downhill ever since!

full text search by jbellis · 2004-08-10 04:16 · Score: 4, Informative

I guessed you missed OpenFTS, which has been out for a couple years now.

Some nice performance enhancements by grunt107 · 2004-08-10 04:19 · Score: 4, Interesting

The cross-datatype comparison indexing is very important (ex. '1' = 1), as well as index usage on OR clauses. Both of these before would cause full table scans, which is very costly on VLDBs (Very Large DataBases).
The improvement to the VACUUM I/O processor is important for Postgre to be used on a multi-app server. The 'play nice' feature will allow one server to house the DB AND web servers (albeit at a performance hit to the DB processes).
Overall, a nice improvement.

Thank you Fujitsu And Afilias. by ron_ivi · 2004-08-10 04:20 · Score: 4, Interesting

I just wanted to say thanks to Fujitsu for helping pay for this

Fujitsu foots the bill for new PostgreSQL database features Thursday July 01, 2004 (07:04 AM GMT) ... Berkus described the new Fujitsu-formed features as follows: * Tablespaces is a means of partitioning large amounts of data easily and efficiently on separate storage devices, a key requirement for maintaining PostgreSQL's performance on large databases in the hundreds of gigabytes, and terabyte range; * Nested Transactions allows application developers a very granular level of control over database commits and rollbacks, which is particularly significant for maintaining data integrity and porting applications from other database platforms; * Robust support for stored procedures in Java that exceeds the goals of the SQLJ specification in the ANSI SQL99 standard.

And thanks to Afilias (the guys who run the .org domain) - from the same article:

More recently, .org and .info domain registry company Afilias has sponsored developer Jan Wieck to work full time on developing a new, enterprise-class replication system for PostgreSQL called Slony-I, to be presented next month at OSCON in Portland, Ore.

Re:What the versions mean by SSpade · 2004-08-10 04:25 · Score: 4, Informative

In the case of postgres there are three digits in the version - a.b.c

If your upgrade increases either a or b then the on-disk structure of the database has changed, and as part of the upgrade you'll need to dump the database out to a backup file and restore it.

If the only change is in the final digit c then there's no on-disk change and you can upgrade just by upgrading the binaries. That tends to mean that final digit upgrades are bug fixes, and you should always do the upgrade.

So if you're running 7.4.1 you can easily, and definitely should, upgrade to 7.4.3. But upgrading to 8.0 is a marginally more time-consuming upgrade that you may not want to make on a production system unless you want the new features in 8.0

There's no 'technical' difference between a first digit change upgrade and a second digit upgrade. There's a difference in expectations though, and the version following 7.4.* has enough major new features to justify a major version jump to 8.0.0, with all the positives and negatives that jump implies. The upgrade path would have been identical had it been called 7.5.0 though.

replication is not a failover solution by RelliK · 2004-08-10 04:28 · Score: 5, Interesting

PostgreSQL supports replication BUT replication is absolutely useless as a failover mechanism because it is asynchronous. That is, when you commit a transaction, you cannnot be sure if/when it gets propagated to the slaves. For true failover you need distributed transactions. Neither MySQL nor PostgreSQL support them, but curiously, Firebird does.

--
___
If you think big enough, you'll never have to do it.

Re:replication is not a failover solution by wieck · 2004-08-10 04:51 · Score: 5, Informative

Looking with a narrow view at situations that need 100% guaranteed zero transaction loss on failover, you are right. Many businesses however can live with a little (few seconds) lag and the risk of losing the last couple of transactions, given that there is a mechanism to later analyze the failed server (after recovery) and find out what had been lost, to solve these cases manually or inform users/customers.

The true failover functionality you are talking about will be the goal of my follow-up project Slony-II, which will implement synchronous multi master replication for PostgreSQL. The design phase will start in about 3-5 months.

Sincely, Jan

--
It takes a real man to ride a scooter ... what are you compensating for?

Re:Newbie Question - UI Tool by rycamor · 2004-08-10 04:47 · Score: 4, Informative

Try PGAdmin III.

Re:DROP COLUMN, ALTER COLUMN TYPE and SQL-99 Specs by rtaylor · 2004-08-10 05:17 · Score: 5, Informative

You'll like this, actually...

If it can be cast directly (integer to numeric) it will do so implicitly.

So, integer to smallint will do part of the work -- but if it fails (comes across a number that won't fit in the smallint) then it will rollback -- nothing lost but a little time (most PostgreSQL commands are atomic and transactional).

However, you can use an expression to do the conversion if you have something stranger in mind -- it's essentially run as an UPDATE.

Excuse the crappy formatting.

BEGIN;

SAVEPOINT altertab;

ALTER TABLE tab
ALTER COLUMN text_col TYPE bool
EXPRESSION (CASE WHEN text_col = 'SOMETHING'
THEN TRUE
ELSE THEN FALSE
END);

ROLLBACK TO altertab;

ALTER TABLE tab
ALTER COLUMN text_col TYPE bool
EXPRESSION (CASE WHEN text_col IN ('SOMETHING', 'OR', 'ANOTHER')
THEN TRUE
ELSE THEN FALSE
END);

COMMIT;

--
Rod Taylor

Really cool stuff in PostgreSQL by cmowire · 2004-08-10 05:41 · Score: 4, Informative

user defined types, including the ability to create customized indicies for them. So there's a type specifically to handle encrypted authentication passwords, for example, so you don't need to mess with crypt() all of the time.
Inheritance for tables, so you can agregate different types of data without needing to explicitly set all kinds of primary keys.
Full ACID support, like a real database
MVCC instead of row or table locking.
BSD license

--
Gentoo Sucks

Re:P.I.T.R by GooberToo · 2004-08-10 05:52 · Score: 5, Informative

Btw, does PostgreSQL have row-level locking yet?

Static docs provide your answers.

Remember, one of the points of using MVCC is to avoid row locking whenever possible. But, I think you'll be hard pressed to come up with a situation where your desired locking facilities are not provided for with PostgreSQL.

To quote the online documentation:
"In addition to table-level locks, there are row-level locks. A row-level lock on a specific row is automatically acquired when the row is updated (or deleted or marked for update). The lock is held until the transaction commits or rolls back. Row-level locks do not affect data querying; they block writers to the same row only."

Re:Database Questions by ciroknight · 2004-08-10 05:59 · Score: 4, Insightful

To the second question: Nobody in the open source world.

Microsoft's on the way to doing it, and there was a small project that existed for a few months that allowed GNOME to access a database as a file system (It was very nasty; involved a kernel patch (a CORBA orb) that nobody was too happy about, so the project never took off.)

I've thought about the problem a few times. It requires the kernel pass information back to user space, unless the database was actually incorporated into kernel space (and that won't blow over well for a number of reasons). Passing the data back to user space requires a messaging system. The problem is, there are very few messaging systems out there designed specifically for kernelspace-userspace communication. CORBA was one developers answer; my answer would to be to ground up a protocol (because I feel that a network messaging solution, i.e. TCP/IP, can't be secured well enough in the long run).

Lastly, a daemon needs to exist to listen to the calls from the kernel and interpret them into SQL. This could be built into the kernel itself, but once again you have to question the security of the kernel building an SQL query that would go directly to the database server. Also, one of the better parts about this daemon would be metadata extraction; since the daemon is virtually transparent to both the user and the database server, the metadata can be completely ripped from the data and stored in a seperate table to allow for much faster, more optimised searches. EXIF Tags can be copied from JPEGs, ID3 tags copied from MP3s, etc.

Ideally, the daemon would be pluggable, allowing for anyone's metadata extension to be added after compiliation, but I believe that it's important to have a functional system before having a featureful system.

If you'd like to talk more about it, I'm really open to the idea of finally having an SQL-based file system. A relational database file system is the future; if we get there before Windows, we can add yet another example of the speed of open source development.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush

MySQL only fast for single user? by wieck · 2004-08-10 06:11 · Score: 5, Informative

You really need to update your "known facts".

Using a TPC-W style benchmark suite implemented with Apache, PHP4 and either MySQL 4.1.1 or PostgreSQL 7.4.2, I get more or less the same performance. Because of the transactional requirements and the update concurrency, all tables are InnoDB, of course. Based on that I cannot but contradict your claims about MySQL's scalability (and I am a PostgreSQL CORE developer). It keeps well up and is stable even under heavy load. Where the test uses a stored procedure in PostgreSQL, it must use a bunch of PHP code and separate query calls in the MySQL case, but that is exactly what developers do today and since the Apache server is part of the benchmarked system, this is as fair as possible.

That said, Apache+PHP+DB is the environment most people are talking about when they speak about simple to medium complex Web applications. With the scalability and performance being head to head, why would someone voluntarily miss stored procedures, views, triggers and all the other yet to be done for MySQL features? And while the (new in 4.1) subselect support makes it possible to get all of the TPC-W functionality implemented at all, to get it running fast enough in MySQL one has to rewrite some queries in a manner that I would call unmaintainable code. These complex features are not something where you can say "Transactions, checkmark". You have to look at how complete the implementation is and how well the query optimizer can deal with queries that use that feature.

So looking at the two right now, with the performance advantage gone, and the Win32 support knocking at the door, replication available and tons of well settled features in the HISTORY that are still on MySQL's ROADMAP, PostgreSQL is not just the better choice in some cases. It is ahead ... except for MySQL's outstanding marketing.

Sincerely, Jan

--
It takes a real man to ride a scooter ... what are you compensating for?

Re:not yet on par with MySQL by Bromrrrrr · 2004-08-10 06:21 · Score: 4, Interesting

..Mysql is more friendly towards programmers.

Are you kidding me? If you like to do everything in your app that _should_ have been handled by the DBMS then yeah, sure :)....MySQL is more friendly to the programmer who hasn't got a clue about databases maybe, if you know what a real DBMS can do it becomes a pain in the ass to work with.

you can't have a DBMS which makes both happy: there are opposing roles.

Well, I don't think I'd agree with that, but if anything I'd say MySQL is the easier one to administer. Postgres needs a lot more thought with regards to tuning and user management.

Especially user management in Postgres is, in my view, horrible and a lot better in MySQL. Well, can't have it all I guess :) I'd still take Postges over MySQL any day.

--

What a rotten party, have we run out of beer or something?

Windows Native Support by adolfojp · 2004-08-10 07:29 · Score: 5, Interesting

I believe that native windows support for PostgreSQL is essential, not necesarily to deploy apps in that enviroment but to test and develop them. When I started using MySQL on my windows box, I had also looked into PostgreSQL. The lack of windows binaries for PostgreSQL made MySQL the default choice for me. On features alone PostgreSQL wins hand down. Also, in my experience, the faster performance of MySQL over PostgreSQL dissapears when I use InnoDB tables for transactional data processing. The doors to PostgreSQL have been open to many developers stuck in the windows world. Perhaps I will try PostgreSQL for my ASP.NET apps in addition to my trusty MySQL.

Cheers
Adolfo

32 of 368 comments (clear)