PostgreSQL 7.4 Released

Ah! Just in time by hattig · 2003-11-18 07:00 · Score: 2, Interesting

For me to migrate my company's systems from MySQL to Postgres. Shame that my nice O'Reilly book won't cover the new features ... but I probably am not so advanced as to need them!

Now how does it compile and run on FreeBSD / x86-64?

Rock on! by JohnA · 2003-11-18 07:02 · Score: 5, Interesting

I use PostgreSQL extensively, and I have had a hard time convincing my-mySQL (I'm so clever) exclusive friends to give it a try.

One thing that should be noted is that the JDBC drivers (http://jdbc.postgresql.org) are now among the best I've used. For those developing Java apps, the choice is now even more clear.

Re:Rock on! by Evil+Adrian · 2003-11-18 07:04 · Score: 2, Insightful

Are they having a hard time convincing you to try mySQL, too? ;-)

--
evil adrian
Re:Rock on! by IpSo_ · 2003-11-18 07:15 · Score: 4, Interesting

Show them this:

MySQL Gotcha's

If they still don't seriously consider at least switching away from MySQL, I would be very scared.

--
Open Source Time and Attendance, Job Costing a
Re:Rock on! by mattcasters · 2003-11-18 07:48 · Score: 2, Informative

I beg to disagree on the JDBC claim.
As long as Statement.setFetchSize() or Statement.setFetchDirection() are not supported, it's close to useless to me.
Try going over a couple of milion records if you doubt it's really needed.
Not that MySQL's driver is any better in this regard.
Real JDBC drivers like the ones for Oracle, DB2 and (gasp) even SQL Server support these hints without a problem.

Matt

--
News about the Kettle Open Source project: on my blog
Re:Rock on! by slamb · 2003-11-18 08:48 · Score: 2, Informative

I beg to disagree on the JDBC claim. As long as Statement.setFetchSize() or Statement.setFetchDirection() are not supported, it's close to useless to me.
Cursors are supported now. Nic Ferrier wrote a patch for this back in April, and I think it got applied in the beginning of May. There's a trick I can't remember right now to enable it (maybe setting the result set type/scrollability first with JDBC2 methods), since it's not quite as efficient for smaller queries. I'm sure the people on the pgsql-jdbc mailing list would help you if you can't figure it out.
Keep in mind that the JDBC drivers shipping with any given version of PostgreSQL are likely not the best available drivers to use with that version. The JDBC people don't make changes to the release branches, unless they started recently. The best available drivers tend to be the ones from their website or in CVS HEAD.
Also, if you stumble on a page listing the compliance features/misfeatures of PostgreSQL, it's hopelessly out of date. I hope they're not still linking to it. The picture is much brighter than what that page suggests.
Re:Rock on! by the_mad_poster · 2003-11-18 10:05 · Score: 5, Insightful

Trading stability for speed is stupid.

Period. End of story. The job of a RDBMS is not "be fast".

Maybe then, they need to stop claiming they're building a relational database management system because they're obviously not and anyone who thinks they are is addled in the head.

...are trivial to avoid in the code.

If the application layer has to handle data integrity, the system behind it isn't relational and it's arguably not even doing the job of a DMBS. More like a convenient indexing tool.

MySQL makes a fine database management system where it doesn't matter if your data gets mangled and all you want to do is fast, simple SELECTs, but what irritates the "zealots" like me is that MySQL folks will actually sit and argue that MySQL is even remotely close to a RDBMS. pgsql and their ilk aren't truly relational either.. but they're a heck of a lot closer.

--
Alito: A vote for Alito is a punch in the eye to put that bitch back in her place!

Impressive but... by slayer99 · 2003-11-18 07:02 · Score: 3, Interesting

....still no native replication. MySQL has this one single advantage over Postgres.
Oh, raw disk use would be nice too.

--
Martin Brooks / Slayer99 #linux / UIN 2178117

Re:Impressive but... by Paul+Jakma · 2003-11-18 07:31 · Score: 5, Informative

still no native replication.

One of the new features in 7.4 is the replication technology contributed from eServer.

--
I use Friend/Foe + mod-point modifiers as a karma/reputation system.
Re:Impressive but... by bovinewasteproduct · 2003-11-18 07:50 · Score: 3, Insightful

Huh?

ERServer was released open source months ago. Check out GBorg for more information.

And why do YOU need raw disk access? The PostgreSQL developers belive (and rightly I think) that the operating system can do the caching better than they can. Why re-write the wheel? Operating systems have came a long way in the last 10 or 12 years. Days of slow access to the disk are long gone.

BWP
Re:Impressive but... by gabe · 2003-11-18 08:32 · Score: 2, Informative

MySQL has raw disk usage thanks to InnoDB. The only draw back is that it's slower than using regular InnoDB files in an actual filesystem... So, MySQL clearly doesn't have any advantage over PostgreSQL since it has raw disk support.

Looking for replication? Use the eRServer replication that comes with PostgreSQL now.

--
Gabriel Ricard
Re:Impressive but... by thenextpresident · 2003-11-18 11:46 · Score: 2

Where is the notice for this? The only thing I find in the release notes on replication is:

"New /contrib/rserv replication toolkit (Vadim)"

I can't seem to find it any place else? The reason I went with MySQL over PostgreSQL was the desire/need for easy replication. From what I know, PostgreSQL has some third party tools, but I wanted something that was apart of PostgreSQL itself, and this would be nice to see.

--
Jason Lotito
Re:Impressive but... by Sxooter · 2003-11-18 13:06 · Score: 2, Insightful

Errm, have you seen what's involved in actually _using_ this replication system?

Yes I have, and it's not all that hard to set up. Not a simple drool and click interface, but no harder than setting up the ftsearch or a few other projects I've put online

And actually, I'd rather have the database decide what gets cached and what doesn't.

Really? Even if it makes the wrong decision? What if the decisions it made would be exactly the same, is it still worth the 100s of man hours to create such a cache manager and maintain it?

You present no metrics for WHY the database should decide what stay in cache, yet you assume it would make a better decision. There are tons of things still to be done in Postgresql, (like the win32 port, Point in time recovery, 2 phase commit, and replication) to piss away time on a project of dubious returns like this.

--

--- It is not the things we do which we regret the most, but the things which we don't do.

Yummy, or about time. by blanks · 2003-11-18 07:03 · Score: 5, Informative

IN/NOT IN subqueries are now much more efficient.

Queries using the explicit JOIN syntax are now better optimized.

New multikey hash join capability.

Cursors conform more closely to the SQL standard.

Sounds like they pushed closer to the SQL standards, good job guys.

Does anyone here know more about this "New client-to-server protocol" they speak of?

--
TruePunk | Games

Re:Yummy, or about time. by Anonymous Coward · 2003-11-18 07:11 · Score: 2, Interesting

I helped debug the new client/server protocol along with Anders Johnson. Query results > 4k (eg, retrieving everything at once, or large binary fields) can be compressed with gzip or bzip2. If you run postgresql on a dedicated box, this can make a significant difference in network transport speed.
Re:Yummy, or about time. by bovinewasteproduct · 2003-11-18 07:55 · Score: 2, Interesting

Does anyone here know more about this "New client-to-server protocol" they speak of?

They needed to change the protocol to support such nifty features as proper error codes (SQLSTATE I belive) and better transaction control. I'm hoping that we'll see multi-stage transactions in 7.5.

Older clients can still talk to newer servers, they just won't support the new features.

Re:This could be good... by Docrates · 2003-11-18 07:03 · Score: 5, Insightful

If you ask me, I'm glad that people opine and wage war when it comes to PostgreSQL vs. MySQL. MySQL has done for PostgreSQL what windows has done for Linux: make it want to thrive, compete and prosper.

This is also why Microsoft WANTS there to be an enemy: they need someone to compete against to continue improving their product (which they do, even if we hate to admit it).

If you don't believe me, ask Dubya.

--

There are two kinds of people in the world: Those with good memory.

Question: discuss among yourselves by Uma+Thurman · 2003-11-18 07:03 · Score: 3, Interesting

I run postgres on my own database servers (when I'm not making movies, that is). Now, there's a distributed database project associated with Postgres, trying to add replication into the databases' bag of tricks.

Lotus Notes implements e-mail and lots of other things on top of a database engine that performs replication. So, could Postgres be used to develop a Lotus Notes type application with replicated databased for e-mail, calendars, team rooms, etc?

--
This is America, damnit. Speak Spanish!

Re:Question: discuss among yourselves by axis-techno-geek · 2003-11-18 07:26 · Score: 2, Insightful

Use phpGroupWare or something but please for the love of god, don't port Notes anywhere!
The Monks said it all (about Notes): Nice Legs, Shame About Her Face.
...or even the Manic Street Preachers: If You Tolerate This Your Children Will Be Next.

--
This is not the sig line you are looking for... -- Old Jedi Sig Line Trick
Re:Question: discuss among yourselves by IANAAC · 2003-11-18 08:22 · Score: 2, Interesting

Not sure why you were modded insightful, but... Care to share why Notes shouldn't be ported? Fact is, the server already runs on Linux. All that would be needed is the client, which is a fair bit better than Outlook. But if you can get away from big clients, I agree that web-based is the way to go.

Re:This could be good... by grub · 2003-11-18 07:06 · Score: 4, Funny

This is also why Microsoft WANTS there to be an enemy

Umm.. In the enterprise db market there is a little company called Oracle...

--
Trolling is a art,

postgres isn't used in the enterprise by Anonymous Coward · 2003-11-18 07:07 · Score: 2, Funny

because no one can effectively pronounce it.

Is it pronounced "Post Grays"?

"Post Grez?"

"Post Gress?"

"Post Gray?"

"Post Gruh?"

I've also heard people refer to it as "progress"...

Seriously, though, as an IT manager, you can't 'sell' the concept of using an OS RDBMS that you can't even pronounce. People use MySQL because its pronounceable. My Sequel. Simple. Yellow. Different.

Those guys over @ Postgreays need to figure out a better friggin name for their RDBMS if they expect it to be used in the enterprise.

Re:postgres isn't used in the enterprise by drgroove · 2003-11-18 07:12 · Score: 3, Interesting

Actually, Sequel was the name for the original Standard English Query language invented by IBM. SQL was the name of the second version of the language, also invented by IBM.

http://www-106.ibm.com/developerworks/db2/librar y/ techarticle/0301jones/0301jones.html
Re:postgres isn't used in the enterprise by AstroDrabb · 2003-11-18 08:54 · Score: 2, Informative

In the PostgreSQL FAQ, PostgreSQL is pronounced Post-Gres-Q-L.

--
If Tyranny and Oppression come to this land,
it will be in the guise of fighting a foreign enemy. -James Madison

Autovacuum by Space+cowboy · 2003-11-18 07:07 · Score: 2, Interesting

Ok, this is nice - One thing that's always put me off postgres is the need for vacuuming. It seems to me it's a labour-intensive task that the computer ought to do on its own, without my help :-)

Simon.

--
Physicists get Hadrons!

Re:Autovacuum by bmarklein · 2003-11-18 07:26 · Score: 3, Informative

While vacuuming you can't really do much else with the database.

Not true. Starting in 7.3, the default version of VACUUM no longer locks the table. From the 7.3 docs:

Plain VACUUM (without FULL) simply reclaims space and makes it available for re-use. This form of the command can operate in parallel with normal reading and writing of the table, as an exclusive lock is not obtained. VACUUM FULL does more extensive processing, including moving of tuples across blocks to try to compact the table to the minimum number of disk blocks. This form is much slower and requires an exclusive lock on each table while it is being processed.
Re:Autovacuum by kcbrown · 2003-11-18 07:30 · Score: 2, Informative

It's still a labour-intensive task - for the dbms. While vacuuming you can't really do much else with the database.

That's true if you're doing a VACUUM FULL. But that is now something that generally needs to be done only rarely (this will depend greatly on how you use the database). The database will now re-use freed tuples (VACUUM FULL compacts the physical table, thus eliminating such free space entirely and causing new data to be appended to the end of the physical table), so all that's usually needed is to find such tuples and mark them as being free.
The autovacuum process invokes the standard VACUUM (without FULL) which does the "lightweight" VACUUM processing. It will also automatically update the statistics associated with the tables so that the planner will make better choices about things like whether to do a full-table scan or an index scan, the type of join to use, etc.
VACUUM is much easier on the database than it used to be. It still isn't optimal, but if you read the pgsql-hackers mailing list you'll find that the developers are discussing at length how best to minimize VACUUM's effect on the database performance.

--
Use 'slashdot stuff' in the subject line in any email you send me if you want to get past the spam filter.
Re:Autovacuum by Zeut · 2003-11-18 08:47 · Score: 5, Informative

Ok, there is a lot of talk about vacuum what it does / doesn't do and what effect autovacuum has. Here are the details (FYI, I wrote pg_autovacuum).

Recent versions of postgresql don't take your database offline during vacuum. However, the vacuum process is an I/O intense process and can still, even 7.4, slow the server significantly while it's running. Work is has alredy been done in the 7.5 development tree to address the I/O storm created by vacuum.

Typically, you setup cron to run vacuum your entire database nightly. This is fine, except it has two main problems. 1) It wastes time vacuuming large tables that probably don't need it (think audit train table that only gets inserted into). 2) It probably doesn't vacuum tables that are constantly updated often enough, which results in bloated data files, and slower queries.

The new pg_autovacuum daemon addresses both of these concerns by monitoring database activity (using the stats system). When it sees that a table has has been modified enough to warrant a vacuum then it does so, when it sees that a table might benifit from a analyze only, then it does that. And when a large table doesnt' need to be vacuumed, it doesn't vacuum.

Windows by kannibal_klown · 2003-11-18 07:07 · Score: 5, Interesting

I'm sure I'm gonna get modded down for this, but does anyone know when is there gonna be a version that can run in windows natively (without using Cywin)?

I ask because we are FORCED to use Windows boxes at work, and they gave all of the developers 2. We can't reformat and put linux on (or do a dual-boot) because they check to make sure everything is status-quo. And right now the atmosphere around here is not the greatest, so I'd rather not risk anything with the PHB's by trying to trick them.

I usually have my 2nd machine as a server running mySQL as a testbed for my database apps. I'd LOVE to switch to Postgresql, but I'm limited as to what I can do.

Any idea when a Windows native version will be available?

Re:Windows by RayMarron · 2003-11-18 07:22 · Score: 5, Informative

And here's a link to the native Windows 7.2.1 release. I've tried it on NT4 and 2000. It seems to works just fine.

http://techdocs.postgresql.org/guides/Installing On Windows

--
ON DELETE CASCADE
Re:Windows by justins · 2003-11-18 08:02 · Score: 2, Informative

What's wrong with Cygwin. Not on the "approved" list? Unless that's the problem I wouldn't hesitate to use it; it's pretty darn solid today.

Performance.

--
Now before I get modded down, I be to remind whoever might read this that what I am saying is FACT. - bogaboga
Re:Windows by Anonymous Coward · 2003-11-18 08:54 · Score: 2, Informative

See http://momjian.postgresql.org/main/writings/pgsql/ win32.html

with the latest update on this issue.
Re:Windows by gabe · 2003-11-18 08:55 · Score: 5, Informative

CommandPrompt Mammoth PostgreSQL for Win32, Mac OS X, Linux
http://www.commandprompt.com/entry.lxp?lxpe=295

dbExperts PostgreSQL for Windows, Mac OS X, Linux
http://www.dbexperts.net/postgresql

PowerGres (threaded Windows PostgreSQL
http://osb.sra.co.jp/PowerGres/introduction-en.php

--
Gabriel Ricard
Re:Windows by PizzaFace · 2003-11-18 09:49 · Score: 5, Informative

"Create native Win32 port" is one of three "urgent" items on the PostgreSQL to-do list, and Bruce Momjian publishes a detailed status report on the ongoing work. No one on earth can tell you when the work will be complete.

Why use PostgreSQL over MySQL? by IpSo_ · 2003-11-18 07:11 · Score: 5, Interesting

Because they care about your data, among other things! You could have the fastest database server in the world, but if you find your data is corrupt, or truncated without warning, it doesn't do you much good.

Here is huge list of MySQL Gotcha's that absolutely floored me when I first read it. In my opinion, a "gotcha" in regards to a database is a "Bad Thing(tm)"

MySQL Gotchas"

--
Open Source Time and Attendance, Job Costing a

Re:Why use PostgreSQL over MySQL? by schon · 2003-11-18 09:09 · Score: 2, Interesting

Just because mySQL fails to implement many parts that standard SQL programmers consider important doesn't mean it's unreliable.

You're right. The fact that it will silently accept bad data, and change it to something else, means it's unreliable.

Re:This could be good... by phlyingpenguin · 2003-11-18 07:12 · Score: 5, Interesting

I'm using Postgres in a company that analyzes statistics, and maintain a ~250GB db. Trust me, when you're talking about doing a seq. scan on 250GB of data, the preformance differences are MUCH more than minor. The reports of speed increases don't cite minor increases either, they cite really major changes.

Taken from the presskit:

PERFORMANCE
Several major performance enhancements have been added in version 7.4, enabling PostgreSQL to match or exceed the speed of other enterprise database systems. These include:

* Hash aggregation in memory to make data warehousing and OLAP queries up to 20 times faster;
* Improvements in subquery handling by the planner resulting in up to 400% speed increases in some complex queries;
* New script to set more reasonable postgresql.conf defaults for shared buffers, yielding better "out of the box" performance;
* New wire protocol (version 3) increases the speed of data transfers;
* Enhanced implementation of functional indexes allows better indexing on custom data types and composite fields;

PgAdmin 3 by DAldredge · 2003-11-18 07:14 · Score: 2, Interesting

PgAdmin 3 is also ready and in now multiplatform.

Re:PgAdmin 3 by stoolpigeon · 2003-11-18 07:22 · Score: 3, Insightful

And it is very, very nice. If you have people resistant to using pgsql (which I like a lot personally) pgAdmin III will give them all the GUI stuff they are used to from programs like SQL Server Enterprise Manager.

The postgreSQL community is extremely helpful, key developers are very active in helping out users and addressing issues rapidly. It is a project that just exemplifies what is good about open source.

I am compiling 7.4 on my development server right now in preparation for moving our production server soon. I guess maybe I sound like a fan boy but as a database administrator I just can't over emphasize my joy at getting to work with such an excellent product.

--
It's hard to believe that's how Micronians are made. Why don't we see it right now by having you both kiss one another?

Full text searching improved and other goodness by SuperBanana · 2003-11-18 07:14 · Score: 5, Informative

Full text searching also got another overhaul- I plan on messing around with it when I get some free time. They've included a .sql file you can just import into an existing DB.

The real power here is that the index is quick to update, and as a result, can be done in real-time via triggers and stored procedures- neither of which you can do with MySQL :-) The new release is also even more SQL compliant- something else MySQL can't claim. PostgreSQL is both SQL92+98 compliant if I recall.

It can't be said enough- PostgreSQL is now MUCH faster...and due to features like stored procedures, triggers, and some of the best locking available combined with some of the best transaction support, it's actually far faster at many of the same tasks if you take advantage of these greater abilities.

Even back as early as '99, PostgreSQL absolutely mopped the floor with MySQL when as little as 10% inserts or updates were thrown into a select test. Why? Piss-poor locking and zero transaction support. The stuff you have to do in the application layer to make up for proper(or ANY) transaction support will make most benchmarks completely pointless.

MySQL always has, and always will be, a DB best suited for blogs and 2-guys-in-a-garage; it's slapped together, has a low featureset, and is not standard-compliant. PostgreSQL is not an enterprise fish(replication still needs work if I understand it correctly)- Oracle, DB2 etc have that market pretty well covered- but it's great for everyone else who isn't, say, a multibillion $ company...if those people just bothered to have an open mind instead of pointing their fingers at benchmarks showing MySQL running out of an in-ram-only table can select 50,000 rows faster than PostgreSQL can, and whining about how they need to make a cron job to vacuum/vacuum analyze tables at an appropriate time(with autovacuum, also in this release, there goes that excuse!)

--
Please help metamoderate.

Re:Full text searching improved and other goodness by phlyingpenguin · 2003-11-18 07:22 · Score: 2, Interesting

I hear that. When you're talking about a large database (More than 50k rows, things closer to 400M rows/table) Postgres is the only way to go after you figure out that MySQL has NO functionality in the realm of RDBMS. In fact Postgres is even better when you find all of the different things you can do with your data afterwards (via custom aggregates, custom sequences, functions, triggers, views, etc.) that the dorks in their garages who only 'SELECT * FROM BLOG' are even further pushed out of the picture.
Re:Full text searching improved and other goodness by rtaylor · 2003-11-18 07:42 · Score: 2, Informative

Yes, there are plans to make a native windows port -- it's actually about 75% of the way there.

The big issues now are safety. For example, sync() has traditionally been used for checkpoints to ensure everything is on disk. Windows does not seem to have an equivelant (though several close options).

If your interested in helping out, please send a note to pgsql-hackers for a list of current issues for win32.

--
Rod Taylor
Re:Full text searching improved and other goodness by abulafia · 2003-11-18 08:08 · Score: 2, Informative

Does PostgreSQL have a good GUI component for direct database manipulation?
Several. Take your pick. (Check the administrative link off the nav, and also look through the developer tools.)
I understand that PostgreSQL requires cygwin to run on a Windows platform. Since my company is a Windows shop for now (I have not yet been able to convince them to move to Linux) this is an important issue for me. How big of a performance hit is there for running through the cygwin interface? In other words, is PostgreSQL faster than MySQL in Windows?
Can't help you here, I've never done it. Sorry.
Are there plans to go to a native Windows platform?
Yes. There are technical hurdles, but it is actively being worked on. Check out the list, if you're interested.
w

--
I forget what 8 was for.
Re:Full text searching improved and other goodness by AKAImBatman · 2003-11-18 08:16 · Score: 2, Informative

> Does PostgreSQL have a good GUI component for direct database manipulation?

Traditionally, this has been the job of third party developers. SQL Server is the one exception as it tries to make you dependent on the GUI tools. Still, if you want a GUI tool, both pgAdmin and my own DataDino are good alternatives.

--
Javascript + Nintendo DSi = DSiCade

Replication Replication by augustz · 2003-11-18 07:18 · Score: 3, Insightful

For some applications with a chance of growth I've had two issues with Postgresql. One is that despite the fact that they have talked about being an "enterprise level" database for ages, we found that in any kind of swift moving transaction enviroment we had to VACUUM pretty regularly. How they expected folks to leave pgsql running over extended periods of time (months -> years) is beyond me. Looks like they may have solved it. It will be interesting to see if the systems can take a pounding and stay up 24/7 for a while without slowing to a crawl.

The other issue has been replication. With mysql this has saved our bacon more then once. Nead to do intensive analysis on live data and don't want to disturb active system? Set up a nice slave and query away.

Want basic fault tolerance? Set up a slave, you have a live mirror of the data.

Have lots of queries coming in (load balance the reads at least).

PostgreSQL now has some type of replication available from PostgreSQL Inc, but it looked to me like somewhat of a hodge podge of perl, triggers and who knows what else.

I think I'll try it out, and if I can get the same replication speed as I do with a mysql array I'd switch over, but first glance it didn't look like I would. Anyone compared the replication performance yet (and ease of setup, I was very impressed with mysql in this regard).

Re:Replication Replication by scrappy · 2003-11-18 07:38 · Score: 2, Informative

re: replication ... the version of eRServer that PgSQL, Inc released OSS earlier this fall is Java based, actually ... only perl is for the admin scripts, and the OSS community is in the process, I believe, of getting rid of that also ...

See eRServer on GBorg

Jan Wieck is also just in the process of re-writing replication based on the experiences of eRServer, again, as an OSS project, and it can be viewed at:

Slony-1 on GBorg
Re:Replication Replication by RelliK · 2003-11-18 08:38 · Score: 2, Insightful

PostgreSQL now has some type of replication available from PostgreSQL Inc, but it looked to me like somewhat of a hodge podge of perl, triggers and who knows what else.
Strange. That's how MySQL looks to me...

--
___
If you think big enough, you'll never have to do it.

There is no such thing as "faster" by axxackall · 2003-11-18 07:20 · Score: 3, Insightful

There is no such thing as "one DBMS is faster than another". Instead, when you compare the performance, you should always say what's the test to compare performance.

I work with both of them, so I can compare both. Personally, I see many cases, especially when the data model is complicated enough, when PostgreSQL is faster than MySQL. But for that I am spending some extra efforts, because many OSS projects are ported to work with one DBMS, not with both.

I love PostgreSQL and it's functionality but unfortunately there are still many developers of other open source projects who heard about MySQL and did not do any research for existing alternatives and thus made his project based on MySQL.

And, again unfortunately, while PostgreSQL is very close to SQL standard, but MySQL is not that close, so you cannot just substitute the database library - you have to re-write (and thus re-test) all SQL code of the project. So, that's why I still have to use MySQL.

With all my respect to great technical quality of PostgreSQL software, I think PostgreSQL team doesn't do a great job to make PostgreSQL being popular. The athmosphere in PostgreSQL community reminds me the one of BSD (read: very unfriendly).

--

Less is more !

Re:There is no such thing as "faster" by ma_sivakumar · 2003-11-18 22:56 · Score: 2, Informative

The athmosphere in PostgreSQL community reminds me the one of BSD (read: very unfriendly).

In my experience the PostgreSQL user mailing list pgsql-general@postgresql.org is one of the most friendly mailing lists I subscribe to.
You get answers to trivial questions as well as very complex ones. I have not seen anyone flamed for asking something. People are very helpful. I have seen a couple of cases where problems affective live databases were sorted in the mailing list threads within hours.
I do not understand what do you mean by "very unfriendly")
(I use postgresql for our application)

--
yAthum UrE yAvarum kELir All the places are our place, everybody is our kin. (A Tamil Poet - 2000 years ago)

Best New Feature by corby · 2003-11-18 07:25 · Score: 4, Informative

New autovacuum tool

The new autovacuum tool in "contrib/autovacuum" monitors the database statistics tables for "INSERT"/"UPDATE"/"DELETE" activity and automatically vacuums tables when needed.

Not really what I meant... by Space+cowboy · 2003-11-18 07:33 · Score: 2, Interesting

Hmm. re-reading my post, I can see why you replied like you did, so let me spell out the case I have:

There's a DB with (currently) about 30 million rows in it, it's the audit trail for a clients adserver. Queries are run on the db by their clients to figure out their usage, click-throughs etc. over periods. The ad-server gets anything from 0 to ~20 inserts per second across some 200 sites, and depending on the client policy, is cleared daily,weekly,monthly or yearly of that clients data.

Now, my understanding of the vacuum command was that it effectively took the DB offline (not good with the hit-rate I have), and my understanding of 'auto-vacuum' was that it would negate that effective downtime. It appears that that is not the case.

MySQL copes with this quite happily. I was simply investigating whether postgres would cope equally happily... more strings to the bow etc...

Simon

--
Physicists get Hadrons!

Re:Not really what I meant... by Qzukk · 2003-11-18 08:00 · Score: 3, Interesting

Now, my understanding of the vacuum command was that it effectively took the DB offline (not good with the hit-rate I have), and my understanding of 'auto-vacuum' was that it would negate that effective downtime. It appears that that is not the case.

Normal VACUUM commands do not lock tables as of 7.3. Only the full vacuum command does this, which you probably only need to use when permanently retiring a client, rather than just rotating out their data, as the lock is used to actually repack the database on the filesystem to reduce the filesize.

For your needs the normal vacuum provided by auto-vacuum should be sufficient, as you're going to fill up the empty filespace with the next period's data anyway.

--
If I have been able to see further than others, it is because I bought a pair of binoculars.

No flames... by Anonymous Coward · 2003-11-18 07:47 · Score: 5, Interesting

We have one client who uses Linux, the rest are all Windows-based... Is there an unbiased (as far as can be) comparison ?

Now, see your problem?

Neither PosgreSQL or mySQL are full, complete, and utterly perfect implemtations of a database. Neither is Oracle, BTW.

mySQL got a HUGE push some time ago. Back then, mySQL couldn't be beat for handling read-only (Actually, highly read-almost exclusively always). mySQL was a champ when you had a web site, mostly static catalog of products (for example), and had really limited demand for SQL (Like one query that read 'select * from catalog;')

That basis of comparison is no longer true.

So, at the time, hords of little corporate minions lined up and specified mySQL. Not a bad bet at the time, but open mindedness only seems to happen once in computer circles. Day 1 you have a need, day 2 you actually research available solutions, and day 3 you declare a "winner" and it is forevermore cast in stone as the "one true solution". The fact masses of people tend to go thorugh the same process at basically the same time doesn't help. Thus the broad noise that mySQL is "the Answer(tm)".

Anyway, postreSQL has always sought to compete in the full function space. Oracle was/is a much better "comparison" to postgreSQL than mySQL.

Now, both mySQL and postreSQL have improved over time - greatly. postgreSQL seems to be focused on getting things "correct", while mySQL doesn't seem so concerned. Bascially postgreSQL will not provide a feature, while mySQL will hacking it together in some bizzare way (re: early "transaction" handling). mySQL has quite a few anti-social behaviors. Over time, their refinement of those various behaviors drive certain development costs and create some degree of lock-in dependency (a continuing basis for self-justification).

Bottom line, if you invested in learning and implementing mySQL, and it is still working for you, then there is absolurely no need to be concerned with postgreSQL yet.

If you are in the database selection mode, you should surely look towards postgreSQL and try to de-hype yourself from any pro-mySQL bias. Hype has inertia and much of the pro-mySQL hype is based on old comparisons and narrow needs. Yes, evaluate both, but don't assume mySQL or postgreSQL is "better" based on what you hear.

Re:Ok, so which is faster *now* by Anne+Thwacks · 2003-11-18 07:53 · Score: 2, Insightful

Do you want speed? Or would you prefer your data to be consistent and safe?

They are tools for different jobs:

MySQL is intended for systems where the data is uploaded, and thereafter never changes significantly - eg static data accessed via the web.

PostgreSQL is intended for things like payroll systems where some values persist for years, while others change daily.

Without triggers, you cannot expect to maintain data integrity with online data input and a wide range of input methods. ie any system with an expected live lifetime exceeding a few months

--
Sent from my ASR33 using ASCII

Re:Enough of the anti-MySQL garbage by Anonymous Coward · 2003-11-18 07:53 · Score: 2, Insightful

Untrue? Since when has MySQL had stored procedures? Do you even know what stored procedures are? Do you run 10 database queries on every web page?

How about subqueries? Do you run a query and then run queries on the results in the application?

Simple selects might be slightly faster in mysql, but very few applications do a single simple select at a time.

Did anyone notice today that.... by icejai · 2003-11-18 08:04 · Score: 2, Informative

MaxDB is out?

MaxDB is a sort of merger between sapdb and mysql. As of today, MaxDB includes features such as:

* Views * Server-side cursors * Stored procedures and triggers * Automatic failover (to a standby server) * Scheduling and automatic messaging on alerts * Snapshots * Archive tables * Synonyms

And these are features in addition to mysql's feature set!

Check it out guys! http://www.mysql.com/products/maxdb/index.html

But don't be in too much of a rush to upgrade, mysql interoperability is slated for Q1 of 2004.

Enjoy!

Re:Did anyone notice today that.... by Anonymous Coward · 2003-11-18 08:26 · Score: 2, Funny

these are features in addition to mysql's feature set!

Wait - MySQL has a feature set?

I thought that it's lack of features was what made it better than everything else.

damn.. now I'm all confused.

Re:Enough of the anti-MySQL garbage by thing12 · 2003-11-18 08:12 · Score: 4, Informative

But MySQL has had transactions and row-level locking for quite some time now

Sure, row-level locking is nice -- even MSSQL has that. PostgreSQL has MVCC - so that writers never block readers and likewise. Complete data consistency (i.e. repeated reads give the same results) from the start of a transaction to the end of it. Can MySQL do that? (I am actually asking....)

Windows via cygwin by rjamestaylor · 2003-11-18 08:21 · Score: 2, Informative

As soon as a MySQL version runs on Windows without cygwin, a PostgreSQL version will be available.

What? You didn't know MySQL ran via cygwin?

C:\mysql\bin\cygwinb19.dll

MySQL just has a better installation process is all.

BTW, I run cygwin on my WinXPPro laptop (the only way I'll get UNIX-like OS features on my Dell Inspiron 5150 sadly) along with KDE3. Yep and uh-huh.

--
-- @rjamestaylor on Ello

Re:Enough of the anti-MySQL garbage by schon · 2003-11-18 08:23 · Score: 2, Insightful

he is an idiot who doesn't understand "different tools for different tasks"

So a guy I work with was bitching about his Geo Metro the other day - he was complaining (for the umpteenth time) that it takes too long to drive the 300 miles to work each day.

So I say to him (again for the umpteenth time) "Dude, you work across the street from an airstrip, and your driveway is large enough to function as a landing field - get a plane."

So the next day, he tells me that he decided to take my advice and try a plane - a small single-prop Cessna.. then he says that it sucked! It was slower than his Metro!

"How can that be?" I asked.

"Well, every time I got the damn thing over 40, the wheels came off the ground!" he says.

"So? It's a plane, not a car - that's what's supposed to happen." I said.

"That sucks!" He says, "I don't know how to fly a plane! I'm sticking with my Metro - it's faster than that Cessna!"

Then I tell him that the Metro is only faster if you don't know how to fly a plane.

And then he went and muttered something about "different tools for different tasks" - completely missing the point that he's ignoring the correct tool because he's unwilling to learn how to use it.

There's a moral in there - I'll let you figure out how it's applicable to this conversation.

Never used PostgreSQL by DroopyStonx · 2003-11-18 08:55 · Score: 2, Insightful

I'm thinking I should give it a while. I hope it's better than MySQL, because, in my opinion, MySQL is overrated.

I've recently started a project using MySQL because we want to get away from the ridiculously overpriced licensing that MS makes you get. I'm not trying to be a troll or anything, but I'm trying to give a realistic viewpoint by someone who's used to MS crap, but really wants to switch over to open source.

I love Open Source and I definintely try to keep an open mind about it (hey, whatever gets the job done at are more efficient and cost-effective manner is good stuff) and am by no means a Windows/Linux zealot.

Take a look at MySQL's current state. Way behind on the times... for example: it *still* doesn't have stored procedures. Do you realize how annoying it is hardcoding SQL statements? There's complete lack of subquerying, which really makes it a pain to do certain calculations often requiring additional queries, which is extremely inefficient (although I do understand that it's currently in alpha).

Things like this.. make me think twice. If these useful features haven't even been implemented yet, then how can I (someone who's used to using MS crap) trust it?

I'm gonna give PostgreSQL a try and hopefull it has more functionality (and stability) than MySQL!

--
We have secretly replaced these Slashdot mods' sense of humor with a rusty nail. Let's see if they notice!!

Re:Never used PostgreSQL by tangledweb · 2003-11-18 10:59 · Score: 2, Informative

Take a look at MySQL's current state. Way behind on the times... for example: it *still* doesn't have stored procedures. Do you realize how annoying it is hardcoding SQL statements? There's complete lack of subquerying, which really makes it a pain to do certain calculations often requiring additional queries, which is extremely inefficient (although I do understand that it's currently in alpha).

No, stored proceedures are in Alpha. Sub queries have been in production for a year.

Re:Win32? by Zeut · 2003-11-18 09:15 · Score: 2, Informative

Originally it was hoped that 7.4 would have a native win32 port. In fact much work has been done, but it didn't get done in time for 7.4. Work continues, and it's getting closer. Everyone working on it hopes that it will be included in the next version of PostgreSQL, but no one will promise. Such is life in an Open Source project.

plphp by matchboy · 2003-11-18 10:13 · Score: 2, Interesting

PHP geeks, don't forget to check out plphp. You can create functions/triggers with php for PostgreSQL.

--

Robby Russell
PLANET ARGON
Robby on Rails

Re:This could be good... by kfg · 2003-11-18 11:22 · Score: 3, Informative

Yes, MySQL is small, light and fast and I use it as my general light duty DBMS, but I'm not religous about it. When the going gets tough I switch to something tougher.

Looks, it's not because you can't do things with MySQL. It's how you have to go about doing them. That lightness and speed comes at a price, it's an engineering tradeoff. There's no such thing as a free lunch and all that.

What it gives up is intergrety constraints. If you don't spend the cycles to insure data integrity you can be smaller and faster.

So let me ask you, how fast do you want your data munged?

If you don't want your data munged at all and you're using MySQL you need to pass off integrity issues to your app. Well, there you are using cycles again. The DBMS is faster, but now your app is slower (yes, you're still saving a bit of disk access time, which can add up. That's a flaw in SQL itself. There are alternatives.). More importantly you're using your time as a developer to reinvent the integrity constraint wheel in every app. Coding time goes up. Bugs go up. Support issues go up. All to accomplish something that is a logical function of the DBMS. That's why we call them a DBMS in the first place. It has been argued that MySQL doesn't even meet the definition of a DBMS.

Once I had data
My DBMS munged it
But damn it was fast!

Again, don't get me wrong, I use MySQL, but I use it in full knowledge of what it does and does not do and what it does not do is guaruntee the consistency and integrity of my data.

And I have better things to do with my time than recoding DBMS functions into my apps. I use MySQL where data integrity isn't a critical issue.

KFG

Mysql speed by vlad_petric · 2003-11-18 12:17 · Score: 2, Informative

Your comment is implying that mysql is actually faster than postgresql. That's simply wrong.

Yes, if you run simple queries in a single user scenario you do get better performance with mysql than with pg. With more complex queries and more users however, the simplistic query optimizer and concurrency manager that mysql has makes it perform worse than pg.

--

The Raven

PostgresQL is slow... by puppetman · 2003-11-18 13:03 · Score: 3, Insightful

We are in the midst of moving our databases away from Oracle. There were three contenders: MySQL, Postgres, and Matisse (OODBMS).

Speedwise, PosgreSQL trails the pack by a fair bit. Sometimes it would be comparible to Oracle, and other times it wouldn't be without a fair bit of tuning. Outer-joins, for example; the optimizer can't seem to make heads or tails of it.

I spent two years lurking on the Postgres lists, and when doing performance testing, was asking for help tuning queries and the database in general; this isn't a statement made based on, "I tried it once, and it didn't work."

The guys on the list (especially Tom Lane) were very helpful and polite, but I just couldn't get reasonable performance out of the database without doing some serious SQL-rewriting (our CTO thinks that relational databases require too much tweaking already; putting optimizer hints into the queries is just too much).

Overall, the database is great - great feature set, great developers, and a good support community, but the optimizer is not efficient enough (search for the word optimizer in the PostgreSQL lists, and you'll find hundreds of posts where the optimizer is doing a sequential scan and ignoring indexes when it should be using those indexes).

MySQL (4.0.16, using InnoDB tables) has foreign keys, transactions, etc. I haven't been able to crash it yet (I miswrote a query on purpose, and let it run over 2 days at 99% CPU, and the machine stayed up, and is still up a week later).

MySQL doesn't have triggers or stored procs, but as a DBA and senior developer, I can honestly say that's a good thing.

- if you modify a table that a trigger or stored proc uses, chances are the trigger and stored procedure are invalidated quietly behind the scenese - the database doesn't tell you until you call the stored procedure or execute a statement that causes the trigger to be executed.

- debugging a stored procedure or trigger is not easy.

- people tend to forget about triggers and stored procedures; they're hidden logic that can cause no end of problems.

- triggers and stored procedures are (in most cases) database-dependant; they are a huge hinderance when moving to another database. We have 12,000 lines of Oracle stored procedures. I dislike them.

- the database is for data storage. It's not for application develoment. Keep the business logic in the application, and the data-storage logic in the database. Oracle is trying to sell their RDMS as a development tool to justify the price. Don't believe the hype.

PostgreSQL is trying to position themselves as an Oracle replacement, and thus have a similar feature set. PostgreSQL is also very good at very large databases (probably even more so than MySQL, at least until InnoDB gets multiple tablespaces in the next release).

Databases with simple queries where results are not needed instantly would do well with PostgreSQL.

Re:PostgresQL is slow... by chriskl · 2003-11-18 13:29 · Score: 2, Informative

This is just not true. If you were unwilling or unable to tune your DBMS, you can hardly go blaming PostgreSQL!

All you need to do is reduce your random_page_cost a bit, set your effective_cache to an appropriate value and bob's your uncle.

PostgreSQL can do optimisations that MySQL can only dream of. The reasons most of those posts are complaining about the optimiser is, let's face it, the users in question don't know about indexes. Another whole lot of them are about the slowness of IN () queries - which is now hundreds of times faster in PostgreSQL 7.4.

The deal with outer joins in PostgreSQL is that they are executed in the order you specify in your query. This means that you simply just have to have a couple of tries with it to make sure you haven't written a totally degenerate query.

There is discussion on allowing the optimiser to re-arrange outer joins, however it is a rather difficult problem theoretically to be able to prove that a certain rearrangement will still be an equivalent query.

Chris
Re:PostgresQL is slow... by mijp · 2003-11-18 22:25 · Score: 2, Informative

Speedwise, PosgreSQL trails the pack by a fair bit. Sometimes it would be comparible to Oracle, and other times it wouldn't be without a fair bit of tuning. Outer-joins, for example; the optimizer can't seem to make heads or tails of it.

The 7.4 release is much better on explicit join optimization (the formerly behavior of using explicit joins as optimizer hints has been disabled by default). So using outer-joins should be improved. PostgreSQL is advancing.

Re:This could be good... by Sxooter · 2003-11-18 13:22 · Score: 2, Interesting

Not exactly. If you try to enter a date of 'grandmas panties' as a date, postgresql will NOT accept it. MySQL will turn it into 0000-00-00.

If you try to insert 8123928392382923 into an int4 in postgresql, it will throw an error. MySQL will turn it into 2^31 (or 32 if it's unsigned).

If you make a not null column in postgresql, it will NOT ACCEPT A NULL, while MySQL will silently convert it to an acceptable default.

MySQL will allow you to define foreign key references to tables that can't support them, and silently fail to create or maintain them.

That's the problem, the silent failures / data munging. If there was a switch that I could throw that would let MySQL run ONLY on innodb tables and ONLY work the right way I might be tempted to use it, but their "what, me worry?" attitude towards data is a bit disheartening.

--

--- It is not the things we do which we regret the most, but the things which we don't do.

on the other hand by kpharmer · 2003-11-18 15:40 · Score: 2, Interesting

Might not always be the fastest...but I can't imagine developing a database application - and putting all the join code in the app - that could instead be a subselect, inline view, etc.

I'm not a huge fan of triggers - but often find circumstances in which their careful application has saved a huge number of hours.

Stored procedures are another example of a technology that when used wisely can result in a huge improvements to flexibility and adaptability. My favorite use of them is when on a fast-moving project - I can have the developers create an object model and almost immediately start developing to it. My first task is to knock out a set of stubbed stored procedure (or sometimes views) that map to that object model. Often within a week they've got a fully working application (with a fake backend) to continue testing and tweaking. Then I can take my time mapping the object model / stored procedures to a relational model - perhaps using some advanced features that would have been time-consuming to build and test. The net result is drastic reduction in development time, complete dba-control over all queries, and the ability to change the model occasionally (as performance or whatever demands) without any impact to the developers.

The concern that stored procedures aren't portable isn't completely valid either. Stored procs shoudn't be complex - simple ones can be easily ported between most databases.

Views, unions, and subselects are other sql capabilities that the mysql-ab team has frequently referred to as being only useful to 1% of the applications out there. And unlike the triggers and stored procs described above - I'd consider these core database capabilities.

not broken by kpharmer · 2003-11-19 05:55 · Score: 2, Interesting

> Anyone who needs to use COUNT() is completely out
of luck. Can't use PgSQL. This problem has not been
> fixed in the latest release:

why would you be doing count(*) on *huge* tables without a where clause anyway? I mean, sure - I run queries like that all the time - in adhocs, where a scan of 100+ million rows might take 10-30 seconds.

But I can't think of the last time I've built an application and needed to do count(*):
- without a where clause
- on a huge table
- and needed fast response time.

Also note that most databases don't seem to store the min/max values for non-index columns to speed up this situation. Informix does, IIRC, but I don't think that Oracle does.

So yeah, it would be nice if this was improved, but really - I use this functionality all the time, and am completely unaffected by this issue.

72 of 451 comments (clear)