Slashdot Mirror


PostgreSQL on Big Sites?

An anonymous reader asks: "I've been using PostgreSQL for years on small projects, and I have an opportunity to migrate my company's websites from Oracle to an open-source alternative. It would be good to be able to show the PHBs that PostgreSQL is a viable candidate, but I'm unable to find a list of high-traffic sites that use it. Does anyone know of any popular sites that run PostgreSQL?"

24 of 89 comments (clear)

  1. Several examples by IO+ERROR · · Score: 5, Informative
    --
    How am I supposed to fit a pithy, relevant quote into 120 characters?
  2. The .org registry? by tzanger · · Score: 5, Informative
    1. Re:The .org registry? by jadavis · · Score: 2, Informative

      and the .info registry

      Both are run by Afilias, which is a big user and big developer of PostgreSQL. They're the ones that did the work on the Slony-I replication server.

      --
      Social scientists are inspired by theories; scientists are humbled by facts.
  3. recent interview with Josh Berkus by oreaq · · Score: 4, Informative

    MadPenguin has an interview with Josh Berkus, one of the core team members of PostgreSQL.

  4. well oracle IS pretty good by Naikrovek · · Score: 3, Insightful

    I've never used PostgreSQL so I can't and won't say anything about it other than this: Make sure Postgres does everything you need and can perform similarly to Oracle in your environment.

    We momentarily thought about dropping Oracle for PGSQL at my last company, but after we hired a consultant to do everything he could with Postgres to improve performance, Oracle was still a clear winner for us.

    I don't know if he was incompetent or what, but the performance numbers weren't even close with what we needed it to do.

    If your database will run just as well on PostgreSQL, I say go for it. If you go with PostgreSQL and it doesn't perform as well as Oracle in your environment, your management will have serious doubts about open-source software from then on, and that's a stain that is hard to get rid of.

    in short: choose based on your needs, not based on the fact that one is open and the other isn't.

  5. Need more info by The+Slashdolt · · Score: 2, Informative

    Is your companies website essentially read-only page loading? If so, why not just go with MySQL. Do you really need MVCC in a read-only scenario?

    On the other hand, If your company is doing transaction processing, like a customer facing product ordering system (think amazon), its a lot more than just having to sustain certain volumes. The reputation of your company and its ability to make money by selling products will rely entirely on your database. In a best case scenario there may be no difference between oracle and postgres. But imagine the worst case scenario. Peak volume, company is making $1M/hour in sales on the web, db dies and won't come up....who you gonna call?

    There's more to the equation than up front cost and ability to handle volumes....

    --
    mp3's are only for those with bad memories
    1. Re:Need more info by snorklewacker · · Score: 4, Insightful

      > Is your companies website essentially read-only page loading? If so, why not just go with MySQL.

      MyISAM can't handle a database of larger than 2 gigs. Once you switch to another table backend, MySQL's vaunted performance advantage pretty much evaporates.

      > Peak volume, company is making $1M/hour in sales on the web, db dies and won't come up....who you gonna call?

      My DBA, assuming I'm running point-in-time recovery. That's all Oracle is going to tell you to do. The unemployment office if I'm not. Although PITR in pgsql is something of a PITA, which just might go to recommend Oracle for the time being.

      --
      I am no longer wasting my time with slashdot
    2. Re:Need more info by The+Slashdolt · · Score: 2, Interesting

      Assuming the relational database is storing relational data, its hard to imagine a 2G database needed to store read only pages. Of course, some people store pdf's, word docs, etc into relational db's as clobs. This is a complete waste of resources. I would use MySQL in a read-only scenario where transactions are unneeded.

      I am also assuming that the guy who is posing this question IS the DBA. At least I sure hope so, for whoever is the DBA's sake. Your scenario is a best case recovery scenario using point-in-time recovery. What if its the point-in-time recovery that is broken/buggy? As a DBA, who do you want to deal with? Who are you gonna call?

      --
      mp3's are only for those with bad memories
    3. Re:Need more info by snorklewacker · · Score: 3, Insightful

      > What if its the point-in-time recovery that is broken/buggy? As a DBA, who do you want to deal with?

      If I'm doing a million bucks an hour, I damn well had better be running a replica, so let's add that to the solution menu too. pgsql's replication ain't terrific either. Works, but not too flexible. Score another for Oracle.

      Anyway, if Oracle's PITR is broken/buggy, you are screwed screwed screwed. First, let's forget the fanciful notion that you can sue them. Now you're part of the support machine, the wheels of which grind exceedingly slowly and roughly.

      I don't often like to plug source access because it's extremely overrated, but as a last resort, if you can instrument your database startup with a debugger and trace the point of failure, you now have an advantage FAR greater than that Oracle is going to give you once while your trouble ticket clears through the dozen support techs who repeat the same useless advice and tie up your time.

      I also don't like to sling the term "FUD" around, because it's so often this shibboleth of the open source crowd, anything they disagree with, but what Oracle employs against solutions like PostgreSQL is often pure FUD. "Who you gonna call? Who's behind your data? What will you do WHEN it breaks? Scary scary scary, you just don't knooooowwww!!" I could probably turn around to an Oracle rep and say "right, that's about the same sort of feeling I get when dealing with YOUR support organization as well."

      If I'm doing a million bucks an hour, I'm probably picking Oracle too, because it's had more years to shake out PITR, hot backup, and clustering than pgsql has, so there's more of a body of knowledge accumulated on it. I just don't like the climate of fear going around when there's plenty of Oracle disasters to look at and learn from as well.

      --
      I am no longer wasting my time with slashdot
    4. Re:Need more info by The+Slashdolt · · Score: 2, Interesting

      That wasn't my point. My point was that given the current information his question cannot be answered. I then gave two examples. If his company wants a database for a scenario that is a read-only webpage scenario, it doesn't really matter if you use an open source db or a commercial one. Whatever works best and cheapest.

      On the other hand, if his companies business is reliant on this database for its core revenue generation then this is a business decision and not a technical one. Cost is only a minor factor in this decision. And actually, open source vs. commercial is a minor factor as well. If postgres offers equivalent uptime, equivalent performance, equivalent service, equivalent ability to hire skilled resources worldwide, etc then it should be a no-brainer.

      --
      mp3's are only for those with bad memories
    5. Re:Need more info by yamla · · Score: 3, Informative

      FUD. MyISAM certainly can handle a database of larger than 2 gigs. It can even handle _tables_ larger than 2 gigs. "As you can see, MySQL remembers the create options exactly as specified. And it chose a representation capable of holding 91 GB of data." (p.38, High Performance MySQL: Optimization [sic], Backups, Replication & Load Balancing, by Jeremy D. Zawodny & Derek J. Balling, published by O'Reillly, April 2004.

      --

      Oceania has always been at war with Eastasia.
  6. How's .org and .info by snorklewacker · · Score: 4, Informative

    SPI, the authoritative .org registrar, and Afilias, the authoritative .info registrar both use PostgreSQL for their registration databases.

    --
    I am no longer wasting my time with slashdot
  7. Companies that use PostgreSQL by bendsley · · Score: 4, Informative

    Their website shows that BASF uses PostgreSQL as their DB.

    www.basf.com

    They're an enormous company. I've always heard too that PostgreSQL is much better for larger sites. Cannot say for sure though as I have never used it.

    --
    Alcohol & calculus don't mix. Never drink & derive.
  8. PGSQL by hey! · · Score: 2, Informative

    There's no reason however to write all your SPs in PLSQL. Oracle supports stored procedures in Java, as does Postgres.

    This not only makes it easier in some instances to migrate some applications to PGSQL, it also improves performance (JIT compiling). You don't say exactly where the performance bottlenecks are, but this could improve performance and close the gap between PGSQL and Oracle.

    That said, if you've been working for years on tuning your Oracle physical design to a fare-thee-well, it's going to be nearly impossible to beat, supposing the transaction volume and query performance are the chief issues.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  9. Apple Remote Desktop by dadragon · · Score: 2, Informative

    Apple's remote desktop 2 package uses PostgreSQL for its data store.

    link

    --
    God save our Queen, and Heaven bless The Maple Leaf Forever!
  10. OpenACS by aquarian · · Score: 2, Informative

    OpenACS has been Postgres-based for a long time, as a free alternative to Oracle. You can get plenty of Postgres information at www.openacs.org The folks there have been using it for years for all kinds of sites, so it's pretty well tested. OpenACS is a unique system using AOLServer and TCL, but the database performance should translate to whatever server/scripting platform you're using.

  11. Note that PostgreSQL is being benchmarked... by tcopeland · · Score: 2, Informative

    ...on some hefty hardware these days. This post talks about running it on a 16 CPU machine...

  12. PostgreSQL on Big Sites? by Anonymous Coward · · Score: 2, Informative

    We are an e-Learning company which started 4 years back with very little startup budget. We have been using Postgres for 4 years now and it has never let us down. We never imagined our company would grow so big so fast. Today we provide an ASP solution for over 10,000 users from around 20 companies. Postgres scales very well and is quite responsive. In the past we have had periods of 100% CPU utilization but postgres did not crash on us. You have to know how to configure it correctly and is will perform as well as a commercial DB.

  13. postgresql goodness by mcguyver · · Score: 2, Interesting

    My company uses PostgreSQL and are pretty happy with the performance. The only problem we had was in November when the Google spider went crazy and hit us a few million times a day for a few weeks. After a few hours of optimization, the sites were running smooth. A few years ago we had to come up with a db platform and we were a small company. We could use oracle but it's all around expensive. Oracle software, support, licensing, and engineers are expensive. Mysql's transaction support was too bleeding edge at the time. What I like most about postgresql is the transition from oracle to postgresql is smooth and most our engineers come from an oracle environment. Plus postgresql has adequate transactions support, subselects and functions...and it's free.

    (In defense of Google, their spider did not intentionally go crazy - we have distributed webservers on seperate IPs so the spider can't tell if it's pounding one particular site. However Google only spidered more pages as a publicity stunt before MSN search was released so maybe they are to blame...)

  14. How much traffic are we talking about? by jbrayton · · Score: 5, Insightful

    This question really requires more data. How much traffic are we talking about? How much data are we talking about? And then there are all sorts of variables, like the type of content begin stored in the database, the number and types of queries that are done on each page, and the type of caching your application is doing.

    Also, if Oracle is already purchased and paid for, you will have a difficult time making a business case for PostgreSQL.

    Don't get me wrong, I like PostgreSQL. But you will want to have a reason for switching, aside from PostgreSQL being open source.

  15. Re:If you can afford Oracle by jadavis · · Score: 4, Insightful

    Just because your company can spend hundreds of thousands of dollars (or millions for a large installation) on something that's really orthogonal to the actual business that your company is in, doesn't mean you should.

    If I was a PHB type for an online retailer and I looked at the costs and noticed that 50% of our profits are going to Oracle rather than to our pockets, I'd have some questions for the IT guys like:

    (1) Are we a retailer or a data warehousing company?
    (2) What is Oracle and why is it so expensive?
    (3) Can you get the same job done with less money? If so, what costs, benefits, and risks might we see?
    (4) My friend's IT guys use this thing called Post-whatever-SQL, and it costs $0. Is Oracle kinda like that?

    --
    Social scientists are inspired by theories; scientists are humbled by facts.
  16. PostgreSQL Replication & Culture by jgardn · · Score: 2, Insightful

    I tend to think of the PostgreSQL replication problem the same way people approach any problem: None of the solutions are endorsed as the "official" answer to the problems (because there is no absolute authority on these issues.) All have their shortcomings. All have their benefits. It's up to you to decide which combination of problems and benefits you want.

    PostgreSQL, like Linux, is more like an ecosystem of software, where you can go and pick and choose or even write your own stuff. It's not as diverse or as popular as Linux. As far as database systems go, however, it is the most diverse project out there.

    Oracle, on the other hand, is like Stalinist Russia. You can't pick and choose. There's only one model of car. And you have to buy the same bread at the same price at the same time as everyone else. And consumer input? Practically zero.

    PostgreSQL's biggest advantage is that it is extensible. The simple testimony to that fact is that there are numerous excellent extensions out there, written for a variety of purposes. Oracle is not, and will never be as long as it is closed source. (Doesn't this sound like Linux vs. Windows to you? It should. PostgreSQL people pride themselves on their openness.)

    Finally, one caution I like to give to people who are looking at PostgreSQL. Is it panacea? Of course not. There are problems with it. The problems are different than the problems of Oracle. But they exist. The biggest difference is the culture and the philosophy. So when you choose your database, choose the culture and philosophy you desire or agree with the most, and the software and solutions will come naturally and you'll be much happier in the end. This may mean that you prefer the world of Oracle over PostgreSQL. That's not a decision I can make for you.

    --
    The radical sect of Islam would either see you dead or "reverted" to Islam.
  17. Re:Yahoo, Google, etc. by jadavis · · Score: 2, Insightful

    The challenge to port an Oracle application to PostgreSQL is much less than to port an Oracle application to MySQL. Particularly in the training department, since MySQL is the most unlike Oracle.

    Also, the application matters a lot. MySQL is very effective as a cache to hold a relation. It would not surprise me if many of those companies use Oracle/DB2/MSSQL/PostgreSQL as a backend database, and then use MySQL to cache some of the data for fast access. If you list the companies using PostgreSQL extensively, they are likely to be using PostgreSQL as a replacement for the likes of Oracle. If you list the companies using MySQL, that's probably not the case, it's more likely that they're using it as a complement to Oracle or for a purpose that you normally shouldn't use Oracle for.

    I would say out of the relational databases, MySQL is the most different in terms of application domain, functionality, performance, and behavior.

    --
    Social scientists are inspired by theories; scientists are humbled by facts.
  18. PostgreSQL powers the WhitePages Network by sbaylis · · Score: 2

    I work at WhitePages.com, one of the top 100 US websites, and we use PostgreSQL in our mix of databases. We have the entire US and CA white and yellow page data loaded into PostgreSQL and we see awesome performance from our configuration. We've got over 250,000,000 rows of data and a *lot* of indexes making our database about 375G. We run over 1,500,000 queries per server per day which is about 100 per second at peak. Under load tests, we've seen almost triple that volume from the same servers. However, all of our use of PostgreSQL is entirely read-only in production. So, while you can point to us as a "high-traffic site" using PostgreSQL, you should be aware that our usage is likely very different than your needs.