Slashdot Mirror


Java Performance Tuning, 2nd Ed.

cpfeifer writes "Performance has been the albatross around Java's neck for a long time. It's a popular subject when developers get together "Don't use Vector, use ArrayList, it's more efficient." "Don't concatenate Strings, use a StringBuffer, it's more efficient." It's a chance for the experienced developers to sit around the design campfire and tell ghost stories of previous projects where they implemented their own basic data structures {String, Linked List...} that was anywhere from 10-50% faster than the JDK implementation (and in the grand oral tradition of tall tales, it gets a little more efficient every time they tell it)." Want to kill the albatross? Read on for the rest of cpfeifer's review of O'Reilly's Java Performance Tuning, now in its 2nd edition. Java Performance Tuning, 2nd Edition author Jack Shirazi pages 570 publisher O'Reilly and Associates rating 9/10 reviewer cpfeifer ISBN 096003773 summary It's the most up to date publication dealing specifically with performance of Java applications, and is a one of a kind resource.

Every developer has written a microbenchmark (a bit of code that does something 100-1000 times in a tight loop and measure the time it takes for the supposed "expensive operation") to try and prove an argument about which way is "more efficient" based on the execution time. The problem, is when running in a dynamic, managed environment like the 1.4.x JVM, there are more factors that you don't control than ones that you do, and it can be difficult to say whether one piece of code will be "more efficient" than another without testing with actual usage patterns. The second edition of Review of Java Performance Tuning provides substantial benchmarks (not just simple microbenchmarks) with thorough coverage of the JDK including loops, exceptions, strings, threading, and even underlying JVM improvements in the 1.4 VM. This book is one of a kind in its scope and completeness.

The Gory Details
The best part of this book is that it not only tells you how fast various standard Java operations are (sorting strings, dealing with exceptions, etc.), but he has kept all of the timing information from the previous edition of the book. This shows you how the VMs performance has changed from version 1.1.8 up to 1.4.0, and it's very clear that things are getting better. The author also breaks out the timing information for 3 different flavors of the 1.4.0 JVM: mixed interpreted/compiled mode (standard), server (with Hotspot), and interpreted mode only (no run time optimization applied).

Part 1 : Lies, Damn Lies and Statistics
The book starts off with three chapters of sage advice about the tools and process of profiling/tuning. Before you spend any time profiling, you have to have a process and a goal. Without setting goals, the tuning process will never end and it will likely never be successful.

The author outlines a general strategy that will give you a great starting point for your tuning task forces. Chapter 2 presents the profiling facilities that are available in the Java VM and how to interpret the results, while chapter 3 covers VM optimizations (different garbage collectors, memory allocation options) and compiler optimizations.

Part 2 : The Basics
Chapters 4-9 cover the nuts and bolts, code-level optimizations that you can implement. Chapter 4 discusses various object allocation tweaks including: lazy initialization, canonicalizing objects, and how to use the different types of references (Phantom, Soft, and Weak) to implement priority object pooling. Chapter 5 tells you more about handling Strings in Java that you ever wanted to know. Converting numbers (floats, decimals, etc) to Strings efficiently, string matching -- it's all here in gory detail with timings and sample code.

This chapter also shows the author's depth and maturity; when presenting his algorithm to convert integers to Strings, he notes that while his implementation previously beat the pants off of Sun's implementation, in 1.3.1/1.4.0 Sun implemented a change that now beats his code. He analyzes the new implementation, discusses why it's faster without losing face. That is just one of many gems in this updated edition of the book. Chapter 6 covers the cost of throwing and catching exceptions, passing parameters to methods and accessing variables of different scopes (instance vs. local) and different types (scalar vs. array). Chapter 7 covers loop optimization with a java bent. The author offers proof that an exception terminated loop, while bad programming style, can offer better performance than more accepted practices.

Chapter 8 covers IO, focusing in on using the proper flavor of java.io class (stream vs. reader, buffered vs. unbuffered) to achieve the best performance for a given situation. The author also covers performance issues with object serialization (used under the hood in most Java distributed computing mechanisms) in detail and wraps up the chapter with a 12 page discussion of how best to use the "new IO" package (java.nio) that was introduced with Java 1.4. Sadly, the author doesn't offer a detailed timing comparison of the 1.4 NIO API to the existing IO API. Chapter 9 covers Java's native sorting implementations and how to extend their framework for your specific application.

PART 3 : Threads, Distributed Computing and Other Topics
Chapters 10-14 covers a grab bag of topics, including threading, proper Collections use, distributed computing paradigms, and an optimization primer that covers full life cycle approaches to optimization. Chapter 10 does a great job of presenting threading, common threading pitfalls (deadlocks, race conditions), and how to solve them for optimal performance (e.g. proper scope of locks, etc).

Chapter 11 provides a wonderful discussion about one of the most powerful parts of the JDK, the Collections API. It includes detailed timings of using ArrayList vs. LinkedList when traversing and building collections. To close the chapter, the author discusses different object caching implementations and their individual performance results.

Chapter 12 gives some general optimization principles (with code samples) for speeding up distributed computing including techniques to minimize the amount of data transferred along with some more practical advice for designing web services and using JDBC.

Chapter 13 deals specifically with designing/architecting applications for performance. It discusses how performance should be addressed in each phase of the development cycle (analysis, design, development, deployment), and offers tips a checklist for your performance initiatives. The puzzling thing about this chapter is why it is presented at the end of the book instead of towards the front, with all of the other process-related material. It makes much more sense to put this material together up front.

Chapter 14 covers various hardware and network aspects that can impact application performance including: network topology, DNS lookups, and machine specs (CPU speed, RAM, disk).

PART 4 : J2EE Performance
Chapters 15-18 deal with performance specifically with the J2EE APIs: EJBs, JDBC, Servlets and JSPs. These chapters are essentially tips or suggested patterns (use coarse-grained EJBs, apply the Value Object pattern, etc) instead of very low-level performance tips and metrics provided in earlier chapters. You could say that the author is getting lazy, but the truth is that due to huge number of combinations of appserver/database vendor combinations, it would be very difficult to establish a meaningful performance baseline without a large testbed.

Chapter 15 is a reiteration of Chapter 1, Tuning Strategy, re-tooled with a J2EE focus. The author reiterates that a good testing strategy determines what to measure, how to measure it, and what the expectations are. From here, the author presents possible solutions including load balancing. This chapter also contains about 1.5 pages about tuning JMS, which seems to have been added to be J2EE 1.3 acronym compliant.

Chapter 16 provides excellent information about JDBC performance strategies. The author presents a proxy implementation to capture accurate profiling data and minimize changes to your code once the profiling effort is over. The author also covers data caching, batch processing and how the different transaction levels can affect JDBC performance.

Chapter 17 covers JSPs and servlets, with very little earth shattering information. The author presents tips such as consider GZipping the content before returning it to the client, and minimize custom tags. This chapter is easily the weakest section of the book: Admittedly, it's difficult to optimize JSPs since much of the actual running code is produced by the interpreter/compiler, but this chapter either needs to be beefed up or dropped from future editions.

Finally, chapter 18 provides a design/architecture-time approach towards EJB performance. The author presents standard EJB patterns that lend themselves towards squeezing greater performance out of the often maligned EJB. The patterns include: data access object, page iterator, service locator, message facade, and others. Again, there's nothing earth shattering in this chapter. Chapter 19 is list of resources with links to articles, books and profiling/optimizing projects and products.

What's Bad?

Since the book has been published, the 1.4.1 VM has been released with the much anticipated concurrent garbage collector. The author mentions that he received an early version of 1.4.1 from Sun to test with. However, the text doesn't state that he used the concurrent garbage collector, so the performance of this new feature isn't indicated by this text.

The J2EE performance chapters aren't as strong as the J2SE chapters. After seeing the statistics and extensive code samples of the J2SE sections, I expected a similar treatment for J2EE. Many of the J2SE performance practices still apply for J2EE (serialization most notably, since that his how EJB, JMS, and RMI ship method parameters/results across the wire), but it would be useful to fortify these chapters with actual performance metrics.

So What's In It For Me?

This book is indispensable for the architect drafting the performance requirements/testing process, and contains sage advice for the programmer as well. It's the most up to date publication dealing specifically with performance of Java applications, and is a one-of-a-kind resource.

You can purchase Java Performance Tuning, 2nd Edition from bn.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.

65 of 287 comments (clear)

  1. Isn't this the compiler's job? by chrisseaton · · Score: 4, Insightful

    If all these performance hacks are documented, why doesn't the compiler implement them?

    I've often found that will bytecode languages (Java, C#...) the bytecode instructions are made for the language so that the compiler can just throw them out easy peasy, but they seem to overlook the sort of optimizations that C compilers, for example, work hard to implement.

    1. Re:Isn't this the compiler's job? by chrisseaton · · Score: 3, Informative

      Think about the string object problem - people have to use stringbuffer because strings are immutable.

      When a program thrashes strings around, why doesn't the compiler detect that, and switch to a string buffer object to perform those operations, and then convert the final result back to a string?

    2. Re:Isn't this the compiler's job? by cmburns69 · · Score: 4, Informative

      With non-bytecode langauges, the compiler can optimize to the environment. It can re-order code based on the fastest execution time for the platform the code is compiled for.

      Java (and other bytecode languages) were desinged to run well not just on a single platform, but on a variety of platforms. So as a trade-off, you lose environment-specific optimizations at compile time.

      JIT JRE/compilers can work to prevent this. They can further optimize the bytecodes at execution time because they are platform specific.

      An online Starcraft RPG? Only at
      In soviet Russia, all your us are belong to base!

      --
      Online Starcraft RPG? At
      Dietary fiber is like asynchronous IO-- Non-blocking!
    3. Re:Isn't this the compiler's job? by jd10131 · · Score: 2, Insightful

      Having a compiler swap objects out from under you would be

      1. Tricky to implement properly
      2. Rude

      However, emitting a warning when the compiler thinks you could do a better job with a StringBuffer might be nice.

    4. Re:Isn't this the compiler's job? by briaydemir · · Score: 5, Insightful

      If all these performance hacks are documented, why doesn't the compiler implement them?

      The most common reason is that most performance hacks and optimizations are not decidable, and you want a compiler to implement only decidable algorithms becuase those are the ones that enable a compiler to be deterministic. It is usually much easier for a person, i.e., human, to determine what can be done, than it is for a machine to determine that exact same thing.

      Consider the following piece of code.

      boolean f(int[] a, int[] b)
      {
      int x = a[0];
      b[0] = a[0] + 2;
      int y = a[0];
      return (x == y);
      }

      Does f always return true? Only if we can prove that a and b never point to the same array. A person maybe able to do this, but a machine would have great difficulty (assuming the machine could even do it).

      So to sumarize, compiler's don't implement many optimization hacks becuase then they might not be deterministic, and that is a bad thing.

    5. Re:Isn't this the compiler's job? by gmarceau · · Score: 3, Interesting

      For most of theses transformations to be correct, the compiler has to prove there is only one pointer to the object -- in the whole program. Whole-program analyses are expensive, and so are point-to analysis. And there is just not that kind of time to spare in a JIT, where every second spent analysing the program is time spent not executing it.

      The optimization the book proposes are all hit-or-miss adventures. Even for a programmer with intimate knowledge of the code, it is sometime difficult to predict if a change will help or imper performance. The compiler has even less chance to do so correctly -- and nobody like a compiler which slows down their code trying to optimize it.

      --
      This post was compiled with `% gec -O`. email me if you need the sources
    6. Re:Isn't this the compiler's job? by ruriruri · · Score: 2, Insightful
      Actually, most java compilers do exactly what you suggest. However, consider:

      String foo = "Ripple in still water\n";
      foo += "When there is no pebble tossed\n";
      foo += "Or wind to blow.";
      In this case, the compiler does the following steps:

      1. Create new string "foo" with the initial value (this may already exist).
      2. Turn foo into a StringBuffer and append "When ..."
      3. Turn foo back into a String.
      4. Turn foo into a StringBuffer and append "Or..."
      5. Turn foo back into a String.

      So you can see this is not optimal. A human could easily rewrite the code as:

      StringBuffer foo = new StringBuffer( "Ripple in still water\n" );
      foo.append( "When there is no pebble tossed\n" );
      foo.append( "Or wind to blow" );
      String bar = foo.toString();
      Here, we've saved one StringBuffer creation and one StringBuffer.toString() operation. Hooray us! But I do not believe any java compiler will make this level of optimization. Perhaps they are smarter now.
    7. Re:Isn't this the compiler's job? by Anonym0us+Cow+Herd · · Score: 2, Interesting

      If someone were building a compiler that contemplated making such optimizations, wouldn't it be better to have an option to output optimization hints to the programmer? If I use a certian switch, the compiler emits another output file of notes about the source code, and what optimizations it suggests might be in order.

      Naturally, the programmer might see that since both array parameters a and b point to the same array, that this is not really a possible optimization. This realization by the programmer is equivalent to the programmer realizing to insert an assert( a != b );

      Why not move the optimizations into the source code directly. This makes the optimization transparent. Furthermore, all optimizations were then supervised by the programmer, not automatically hacked in by the compiler, which could fail if the same array were passed in for both parameters a and b.

      --
      The price of freedom is eternal litigation.
  2. Definite purchase by Timesprout · · Score: 3, Informative

    I have drastically cut back on my tech book purchases in recent times but this book will definitely be on my shopping list. The First edition offered many insights into not only getting the best performance from Java but also solid guidelines for when and where to apply optimisations.
    As a side note I would disagree about performance being an albatross for Java. Well written Java code can be very high performant just as poorly written code in ANY language can perform slowly. Many of the performance issues associated with Java are inexperienced developers using inappropriate methods and objects.

    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
  3. New Title by borg05 · · Score: 3, Funny

    Java Performance Tuning: A course in C programing

    1. Re:New Title by Anonymous Coward · · Score: 2, Funny

      Writing secure server applications that take less than several minutes to respond: A course in C++ programming

      Writing complex enterprise systems that don't get cancelled, because they don't perform and are written by former Taxi drivers turned Java programmers: A course in C++ programming

  4. Is this a review or a synopsis? by rfischer · · Score: 3, Insightful

    there is a difference, you know.

  5. Correct ISBN is 0596003773 by zipwow · · Score: 5, Informative

    The bn.com link is broken for me, here's the correct ISBN:

    0596003773

    --
    I don't know which is more depressing, that 2/3 didn't care enough to vote, or that 1/2 of those that did are crazy.
  6. Java Strings are the main problem by Anonymous Coward · · Score: 3, Interesting

    Each String is around 64 bytes of memory minimum. What a stupid decision to make such a fundamental data type so heavy weight.

    1. Re:Java Strings are the main problem by sohp · · Score: 2, Funny

      ... because no one will ever need more than 64 bytes.

  7. Java performance better in the Sun IDE? by 192939495969798999 · · Score: 3, Interesting

    I have noticed my JAVA programs run considerably faster under the Sun Forte/One IDE. Once the JAVA app is on its own (especially through a browser), it slows considerably. Does anyone else have experience with this phenomenon?

    --
    stuff |
  8. Process by spakka · · Score: 2, Insightful

    The book starts off with three chapters of sage advice about the tools and process of profiling/tuning. Before you spend any time profiling, you have to have a process and a goal. Without setting goals, the tuning process will never end and it will likely never be successful.

    No, you have to profile first. Profiling will tell you whether there is even any point in tuning, and, if so, what goals are reasonable.

    1. Re:Process by egoots · · Score: 2, Insightful

      No, you have to profile first. Profiling will tell you whether there is even any point in tuning, and, if so, what goals are reasonable.

      It's a classic chicken and egg conundrum... If your program meets your performance requirements, why spend time profiling in the first place (but perhaps this is always necessary with Java apps).

      I still believe that premature optimization is way too prevalent, unnecessary, and problematic. I recommend the following approach:

      Make the program function correctly first

      If it doesnt meet performance requirements, profile it to find out where the problems are

      If necessary, optimize code highlighted by profiler

  9. What performance issues? by Anonymous Coward · · Score: 5, Funny

    Java has performance troubles? I thought we were all supposed to deny that. Did I miss a memo or something?

  10. Insightful? by rfischer · · Score: 4, Insightful

    Remember there is a distinction between client- and server-side Java. Java on the server makes me very happy.

  11. Java doesn't cut it by AirLace · · Score: 3, Interesting

    We ported some of our internal Java business applications to C# for use with Mono, and emperical results already suggest the solution is several times faster than the Java code. The port was very easy, with each line of Java code mapping onto one line of C# or less. Porting the UI to Gtk# was more difficult, but we find the Gtk# code more maintainable and the UI, along with the Gtk+ WIMP plugin integrates much more nicely with Windows than SWING. We'll be investigating a switch to Linux over the next few months for some of our Point-of-Sales terminals as a result, and it should be easy thanks to the portability of Mono and Gtk#.

    We also ported some of our backend tools for use with Mono. In use with the newly released Mono JIT runtime, Mini, we've achieved some truly stunning results. It turns out that some of the optimisations in the new JIT are better than those used by GCC, so once the code is loaded in memory, it performs better than raw C code. Although I don't yet have hard numbers to back up these result (the transition is still in progress), it has to be said that Mono is the real answer to Java performance. Being Open Source, we can also contribute back to the runtime to make it better suit our needs. It also plays nicely with RedHat 9's NPTL threading implementation, which is more than I can say for the current crop of Java JREs.

    1. Re:Java doesn't cut it by sbrown123 · · Score: 3, Interesting


      We ported some of our internal Java business applications to C# for use with Mono, and emperical results already suggest the solution is several times faster than the Java code.


      You could have saved yourself some porting by just compiling your java code with GCJ. GCJ allows you to compile your java byte code to native executables.


      Porting the UI to Gtk# was more difficult, but we find the Gtk# code more maintainable and the UI, along with the Gtk+ WIMP [sourceforge.net] plugin integrates much more nicely with Windows than SWING.


      Use SWT with Java. SWT uses Windows native widgets on Windows or GTK on Linux.


      We also ported some of our backend tools for use with Mono. In use with the newly released Mono JIT runtime, Mini [ximian.com], we've achieved some truly stunning results. It turns out that some of the optimisations in the new JIT are better than those used by GCC, so once the code is loaded in memory, it performs better than raw C code.


      Either your telling a big lie or dont have your facts straight. Unless you can show hard facts your not going to sway anyone into believing interpreted code outperformed compiled.


      it has to be said that Mono is the real answer to Java performance.


      Yes, if you dont mind porting existing working code.


      Being Open Source, we can also contribute back to the runtime to make it better suit our needs. It also plays nicely with RedHat 9's NPTL threading implementation, which is more than I can say for the current crop of Java JREs.


      What is exactly wrong with Java's use of native threads on Linux boxes?

    2. Re:Java doesn't cut it by AirLace · · Score: 4, Insightful

      You could have saved yourself some porting by just compiling your java code with GCJ. GCJ allows you to compile your java byte code to native executables.

      This might become an option in a few years, but the GNU classpath is as yet not complete enough for our years. We actually didn't find gcj output that performant, despite it being compiled to native code. The JRE still beat it in many cases.

      Use SWT with Java. SWT uses Windows native widgets on Windows or GTK on Linux.

      We also investigated this. SWT is a _horrendous_ API which offers very little abstraction. You end up writing your code once for the Gtk+ target, and again for the native Windows target. It isn't really a cross-platform abstraction like WxWindows, and it's probably the reason why the Eclipse codebase is so large. You end up writing your application for each UI target platform. Gtk# runs and integrates with the platform instead, so you only write your code once.

      Either your telling a big lie or dont have your facts straight. Unless you can show hard facts your not going to sway anyone into believing interpreted code outperformed compiled.

      I did mention the results are empirical, but they're also pretty obvious from where I stand. You don't need benchmarks when something performs, in some cases, eight times faster than the original implementation. I may well put togther some benchmarks and post them to mono-list or linuxtoday.com. I don't have benchmarks yet; does that make me a liar? Sigh.

      What is exactly wrong with Java's use of native threads on Linux boxes?

      It's pointless to interface with the threads layer directly when pthreads exists. It makes the runtime essentially unportable to other unices/operating systems. Mono plays nicely with the environment, so the runtime can just be compiled on any POSIX-compilant system. Linux is great, but being attached to it so firmly that your application breaks when Linus changes some internal interfaces is not.

    3. Re:Java doesn't cut it by AirLace · · Score: 2, Insightful

      Sorry, that should be "for our uses".

    4. Re:Java doesn't cut it by mritunjai · · Score: 3, Insightful
      I've been using eclipse and SWT for quite sometime.
      We also investigated this. SWT is a _horrendous_ API which offers very little abstraction. You end up writing your code once for the Gtk+ target, and again for the native Windows target.
      Complete BS !!

      SWT offers a very high level of abstraction. If you want a still higher level of abstraction, then use the jface interface.

      I've written a filesystem tool for QFS (QNX file system) and it runs without a single line of modification on QNX, windows and solaris!

      SWT is a very sweet API. After using the utter crap that is Swing, its refreshing to see SWT. It uses native widgets so the app doesn't feel "out-of-place" !!

      Combine this with the fact that SWT is as fast as any other GUI toolkit interface in a higher level language (higher than C/C++) and that its a filesystem tool, nobody ever suspects that its written in java !!!

      And no dude, eclipse codebase is not huge... its not just another IDE as you think... its a complete platform!! You can write your own whole software platform using that baby!!

      --
      - mritunjai
  12. Re:Web pages by catch23 · · Score: 3, Interesting

    Why would the users care whether you use a J2EE or PHP4 backend? You must be thinking of Java applets which very few few people actually use for any "web page development". And the chapters 15-18 that talk about J2EE, what's wrong with using J2EE for the generation of webpages and content?

  13. Inherent performance issues by MSBob · · Score: 2, Informative
    There are certain design decisions that were made by the java team that limit java's performance in a number of ways. Lack of stack objects comes to mind and collections that cannot store basic types.

    That said for most network centric applications java is plenty fast. Now if we only stopped short of introducing the unbelievable overhead of XML's excessive verbosity...

    --
    Your pizza just the way you ought to have it.
  14. idiots.; by mandrake*rpgdx · · Score: 2, Interesting

    Why does programming languages have to be an either or situation? Everyone here assumes that anyone who programs in JAva does not know C/C++...why is that? Can't someone know multiple prog langs? I know many (too many too really list here) and find it asinine that people really think that everyone should just program in one lang.

    1. Re:idiots.; by iapetus · · Score: 4, Insightful

      Because the sort of people who like to get involved in discussions about whether C# is 'better' than Java or Java is 'better' than Perl or crunchy peanut butter is 'better' than textured masonry paint can't cope with more than one thing at a time, and tend to apply their religious zealotry with great vigour.

      Those of us who can program in more than one language and know that sometimes it's a matter of choosing the right tool for the job (peanut butter for sandwiches, masonry paint for walls) tend to go through three stages:

      1) Try to engage in such discussions on the premise that there's actual intelligent debate going on.

      2) Discover ourselves becoming violently opposed to whatever rant we're reading at the time, writing tracts about how Java sucks when we're reading the work of a Java fanatic and drooling about the glory of Java when faced with a C++-toting moron.

      3) Either give up in disgust and let the language fanboys get on with it, or sit on the sidelines and snipe at both sides - similar to stage 2, but more consciously applied. Normally that progresses towards giving up, though, since the zealots are just too easy and predictable...

      --
      ++ Say to Elrond "Hello.".
      Elrond says "No.". Elrond gives you some lunch.
  15. More efficient != better by brianjcain · · Score: 3, Insightful
    "Don't concatenate Strings, use a StringBuffer, it's more efficient."

    Perhaps it is more efficient. I say, let the compiler do it for me. Code like this:
    final String foo = frob + " noz " + baz.barCount()
    + " bars found";
    is much more readable/maintainable than
    StringBuffer fooBuff = new StringBuffer();
    fooBuff.append(frob);
    fooBuff.ap pend(" noz, ");
    fooBuff.append(baz.barCount());
    fooBuff.appe nd(" bars found");
    final String foo = fooBuff.toString();
    1. Re:More efficient != better by Anonymous Coward · · Score: 2, Insightful

      That's not the kind of situation that's being talked about. In fact, that case is already done by the compiler.

      final String foo = frob + " noz " + baz.barCount()
      + " bars found";

      becomes

      final String foo = new StringBuffer (frob).append (" noz ").append (baz.barCount()).append (" bars found").toString ();

      The problem is when people do things like:

      String s = "";
      while (hasMoreData ()) s = s + nextCharacter ();

      which becomes

      String s = "";
      while (hasMoreData ()) s = new StringBuffer (s).append (nextCharacter ()).toString ();

      which has an extraordinarily higher cost than

      StringBuffer b = new StringBuffer ();
      while (hasMoreData ()) b.append (nextCharacter ());
      String s = b.toString ();

    2. Re:More efficient != better by blamanj · · Score: 5, Informative
      Actually,that's exactly what the compiler does. The problem occurs in cases like this:
      String foo = "";
      while (source.hasMoreTokens()) {
      foo += source.nextToken();
      }
      where you are creating a destroying a large number of strings. In this case, using a StringBuffer is far more efficient and doesn't really harm readability.

    3. Re:More efficient != better by avandesande · · Score: 2, Informative

      Totally agree. Unless you are doing bioinformatics routines or building multimegabyte docs, you don't need the string buffer. Might as well keep the server busy while you are waiting for that 30ms query to complete.

      --
      love is just extroverted narcissism
  16. String/StringBuffer by toriver · · Score: 4, Informative

    It does under the hood whenever you use + for concatenation; this is why using String + String in a loop is ineffective: You create a new StringBuffer object per iteration. The solution in this case is to declare the StringBuffer outside the loop and use append() explicitly within.

    For concatenating two strings, the concat() method can be faster than using StringBuffer, since it only needs to create a new char[] and do a (fast) arraycopy from the two internal arrays.

    Also, everyone should be aware of the 1.4.1 memory leak associated with using StringBuffer's toString() and setLength() methods.

    1. Re:String/StringBuffer by bay43270 · · Score: 2, Interesting

      It does under the hood whenever you use + for concatenation; this is why using String + String in a loop is ineffective: You create a new StringBuffer object per iteration. The solution in this case is to declare the StringBuffer outside the loop and use append() explicitly within.

      I think your missing the point the parent was trying to make. Aren't there much bigger things to worry about than writing around bugs in the compiler?

      Two years ago, everyone used StringBuffer.. today, everyone knows the use of the plus operator is as fast as it should be, so they opt for readability. Except people, who (like yourself) still use a StringBuffer when doing concatenation in a for loop.

      Two years from now, the compiler will be optimized to use a StringBuffer *even* when the concatenation takes place in a loop. My code will still be readable (and run a tiny bit faster). Your code will run at the same speed and people will scratch their head - "why was he using a StringBuffer for concatenation?"

      Why can't we just write our Java code as readably as possible, and then go back over it when we're done with an optimization tool looking for bottlenecks?
  17. Who cares? by Elgon · · Score: 5, Insightful

    Okay,

    flippant comment but let's think about this for a second: The majority of the time the alleged efficiency advantage is small or, as is generally the case, a pointless optimisation. Java coders seem to have the major efficiency/speed hangup - they use it to lord it over scripting programmers but they want/lack/desire the swiftness of C. (And yes, I do program in Java.)

    To my mind, this is approching the problem from entirely the wrong direction: CPU time and CPU power are far cheaper than developer time and designer time. Therefore, rather than use some cobbled-together hack, use the standard implementations and take the performance hit.

    This will be cheaper, probably 95% as efficient and, most importantly, be 195% easier to maintain or change at a later date. Consider the big picture rather than a single aspect.

    NB - YMMV, for certain apps, it really does make sense to break all of the above ideas and principles, but if you REALLY need it to run that fast, you should be using C anyway.

    Elgon

    1. Re:Who cares? by Jord · · Score: 4, Insightful
      True, going back and fine tuning to gain a 2% speed increase (example) is a waste of time. However the value I see in books like this is in training/teaching the developer to write more efficient code the first go around. If you get out of the habit of doing String + String + String and use StringBuffers instead your code is more efficient from the beginning.

      That is the value I see from books like this.

    2. Re:Who cares? by Randolpho · · Score: 3, Informative
      Perhaps you should pick your examples better. Here's an exerpt from the StringBuffer JavaDoc:
      String buffers are used by the compiler to implement the binary string concatenation operator +. For example, the code:
      x = "a" + 4 + "c"
      is compiled to the equivalent of:
      x = new StringBuffer().append( "a" ).append( 4 ).append( "c" ).toString()
      Granted, people should get in the habit of coding optimizations automatically, but in this case it's actually more efficient to do String + String + String; it takes less time to code than typing the method calls, and is easier to read/understand.

      Which just brings me to my biggest beef about Java: no syntactic sugar. Operator overloading should be a part of Java, and bugger whatever the purists say. I want to save time typing dammit! :)
      --
      "Times have not become more violent. They have just become more televised."
      -Marilyn Manson
    3. Re:Who cares? by evilpenguin · · Score: 2, Interesting

      I used to side with the purists. I've seen operator overloading so badly abused in some C++ programs that it is terrible.

      But I recently had to write a Java program that did financial calculations (more rare, even in business software, than you might think). You don't want to use floating point (for all the classic reasons), and, in this case, you don't want to use integers because you need power functions for interest calculations and so forth.

      The classic solution appears to be to use the BigDecimal class. I decided to wrap this in my own Money class that would include the financial functions and would also do currency conversions.

      This kind of spcialized or "value added" numeric type is exactly where operator overloading should be available. Java would take a huge leap forward in utility if you could just overload the =, +, -, *, and / operators (= being assignment).

      I understand why Java's designers did what they did. If they don't want to do this, then for goodness sake, do a hack like "+" support for Strings and "," support in for statements to give at least BigDecimal the basic operator set!

      Or bite the bullet and give us operator overloading. I'm come around 180 degrees on this issue. I want my overloaded operators!

      While I'm at it, let me trot out my favorite Java rant. Sun should LGPL or BSD License the entire Java SDK and APIs. (I actually think they should GPL it, but I'd settle for this). They are their own worst enemy. Java has a strong market position, and I don't think its going to go away any time soon, but it could completely destroy C# and .NET if some of the nagging problems could be dealt with by those who care. They are so concerned about "control" of the "standard" that they are endangering their de facto standard. Sure, there's gcj and kaffe, but why not just get us all working right on the real thing?

      I would say the same thing about IBM's Java SDK, but I believe (someone correct me if I am wrong) they have some licensed Sun intellectual property in their SDK.

      I actually worry about C# because it does support operator overloading. It is the only thing tempting about it to me.

    4. Re:Who cares? by maraist · · Score: 2, Interesting

      I'm unsatisfied by the idea that hardware is cheaper than developer time. Word Perfect alledgedly attempted to make a version in java, but scrapped it because of speed concerns.

      If your product just barely runs within an acceptible time-frame, then you confronted with the probability that a given customer will agree with you. If a customer doesn't agree, then they will not use your product.. Thus while you save money on developer time, you lose potential customers (or existing ones). Worse, late in the game (after spending significant R&D time), management may decide to scrap the project when performance doesn't cut the mutards.

      Yes, I realize that there are full-fledged GUIs written in java (Idea/eclipse/etc), but I definately notice their performance penalties (even on a 2GHZ machine with 0.5Gig mem). Moreover, the memory isn't enough to keep these monsters happy.

      Now imagine a work-station runing a dozen java apps, each taking up a majority of the CPU. Current hardware can't even support this sort of environment without some hair-pulling.

      Moreover, the idea that you can "throw hardware at it" assumes a certain level of parallelizability. Some problems are inherently serial and there is a definate max performance a single thread can achieve in a given generation.

      Moreover there is an incredible danger in sacrificing performance to the hardware gods.

      We're going to assume for the moment that we're talking about real developers here and that the idea of O(k), O(n) and O(nlog(n)) are religiously adhered to.

      Assume that a given method is a k times slower than it could be for a given environment (where k need not be an integer; e.g. allowing 1.2). Now extend this process for an entire enterprise app.

      Now realize that methods are generally part of loops, the delay is amplified. While the app may only utilize nlog(n), there are m such loops (where m grows as if n). Thus we have m^2 = (k*m)^2 => k^2.

      Now if we only have k == 1.2, then we're fine, but since we have an interpreted environment, simple operation really take many times longer than equivalent c. Thus k is really 4 to 20 times it's conceptual cost. Thus k^2 => 16 to 400.

      A java app has the potential to be 400 times slower than a c equivalent. Given the fuzzy numbers being used, I'll simply state that:
      An arbitrarily abstract language is capable of producing sufficient values of k that there exists a level of complexity of code that will produce wait times that exceed the performance capabilities of a given hardware generation.

      In short, we can get carried away in abstracting the hardware and overcomplexifying the software to the point of self-destruction.

      One good example was an [unnamed] JDBC driver that wrote very elegant object oriented code but unfortunately accounted for 90% of the CPU and memory load of neighboring application. The overhead was small but subtle. Each component allocated and threw away objects.. Each such operation required potential garbage collection and initialization. There was never anything more than O(n), but there were several such O(n) calls within a single JDBC method invocation. The k's compounded enormously.

      The thing is that you wouldn't really notice the overhead if you only accessed the JDBC once or twice. Unfortunately we happened to utilize jdbc calls in loops, which exposed the innefficiency. By using a different driver (which performed optimizations such as utilizing a static StringBuffer to avoid ANY memory allocations) the JDBC overhead became negligable.

      The moral of the story is to identify (at design time) critical loops AND critically accessed sections and consider performance.. This is profiling 101, but I hear more and more java-developers arrogantly transcend such problems.

      --
      -Michael
    5. Re:Who cares? by onash · · Score: 2, Insightful

      And what I don't understand is how we can talk about Java OO-purist when the primitive data types like integers aren't object and needs a wrapper to stick it in a container. That and working with Strings is what I don't like about Java.

      I'm pretty sure that a modern compiler should be able to optimize things like that easily by now. If Sun is just holding on to old crap like that just because its old, then Java is doomed to be replaced.

      but still I use Java because the IDEs like Eclipse and IntelliJ IDEA are so nice..

  18. Re:Oxymoron ? by Randolpho · · Score: 2, Insightful

    Amazingly, Java actually performs very well once the JVM loads. Sure, it can't match uber-efficient c code, but, let's be honest here, how much c code really is efficient? I'm sure it's less than c programmers like to believe. ;)

    That said, "slow" performing Java GUI aps are not so much the fault of the platform itself as they are the fault of the Java programmer's inability to deal efficiently with threads.

    --
    "Times have not become more violent. They have just become more televised."
    -Marilyn Manson
  19. Albatrosses by colin_zr · · Score: 2, Interesting

    Want to kill the albatross?

    Ick.

    /. editors: please improve your literary references.

    The albatross doesn't need killing -- it's already dead. The albatross was hanging from the mariners neck because he had killed it, and by doing so had brought bad luck upon his ship.

    Quoting from memory here, because I can't be bothered to go find my copy of the poem:

    "God save thee, ancient mariner,
    from the fiends that plague thee thus.
    Why lookst thou so?" "With my crossbow
    I shot the albatross!"

    ...

    Ah well-a-day, what evil looks
    had I from old and young.
    Instead of the cross, the albatross
    about my neck was hung.

    ...

    Oh happy living things! No tongue
    their beauty might declare.
    A spring of love gushed from my heart
    and I blessed them unaware.

    That self-same moment I could pray
    and from my neck so free
    the albatross fell off and sank
    like lead into the sea.

    As I said, that's from memory, so there are probably plenty of mistakes in there, but I'm sure a little googling will turn up a proper copy of the poem.

  20. Re:Don't use Java.... by vingilot · · Score: 3, Insightful

    Don't be an idiot. The size of the standard api does not relate to any inefficiency java has. How can the number of standard classes translate to inefficiency What is the magic number of standard classes to be "just right"?
    The best thing about java is the richness of the api. And the size of the documentation. C++/C should take a page from java's book in this department.
    You don't have to use the standard classes, go ahead and write the classes you need.

    Jonathan

  21. It's all about the VM by slagdogg · · Score: 4, Insightful

    I read the first edition of this book completely. There are some good tips for extracting a few percentage points of improved performance. However, nothing has as profound an impact as simply using a better VM ... for example, many of my applications saw 25%+ speed increases simply by switching from the 1.2.x series VM to the 1.3.x series VM. Java does a pretty could job as a language of encouraging best practices, i.e. the inclusion of a standard StringBuffer. Extreme optimization at the code level will always be limited given the high abstraction of the language. However, extreme optimization at the VM level is a very real thing, and it doesn't take a whole lot of effort for the Java programmer.

    --
    (Score:-1, Wrong)
  22. Insightful!? by Jerk+City+Troll · · Score: 2, Insightful

    Java isn't just about applets. In fact, applets are the least used feature of Java -- they're a neat little toy useage. Java is used primarily for back-end code now. Servlets talking to databases, for instance, are where Java is most often found.

  23. Re:Oxymoron ? by wfmcwalter · · Score: 4, Interesting
    I don't quite know what it is that they've improved, but JVM startup time seems to have gotten dramatically better somewhere betweek JDK1.3 and JDK1.4.1. On my mainstream winXP machine I can have a text-mode java HelloWorld running to completion in 0.2 secs (tested using cygwin's "time" command). That's a huge improvement over the several seconds it used to take, and makes writing little command line utilities in java a practical prospect.

    Previously, the startup slowdown was due to the system having to load, verify, and link the twenty or so classes a simple program depends upon. Pjava and J2ME-CDC solved that by storing an image of the heap with the system classes already loaded, verified, and linked (and quickened) so the system was run-ready almost immediately. I wonder if the J2SE folks picked up on that? Alternatively, they could just be skipping the verify for those classes in the signed rt.jar, and offline preverify them prior to signature - the verifier always was the slow part of the process.

    Your point about threads is well taken, and applies more generally to much of java programming. Java's language and libraries make it all to easy to write architecturally-slow programs - you really still have to fully understand what you're doing in order to write a decent program, regardless of the language.

    --
    ## W.Finlay McWalter ## http://www.mcwalter.org ##
  24. Re:Sysadmins don't buy into this article. by markv242 · · Score: 5, Interesting
    I forwarded this article to our sysadmins, and they basically laughed it off. At least for web-based applications, almost none of these points apply. The article talks about the author's hatred of JVM startup times (which don't apply for an application server) and exception verbosity (which any real java developer will catch and deal with himself, rather than leaving it up to the sysadmin(?) to deal with).

    I challenge you to make a C++/C# application that is thread-safe and can scale to millions of pageviews per day without writing a ton of supporting code. With a good J2EE app server, a java coder essentially just has to wrap his thread-unsafe code in a syncronized() statement, and he's done-- his app is now thread-safe.

    Additionally, the "cross-platform doesn't matter for sysadmins" is a false statement; our CIO asked our net ops group "what would be the impact of us moving to an Intel platform?" and our sysadmins (after consulting with the coders) replied "absolutely no impact". That made our CIO very, very happy. Again, I challenge you to move your C++ apps from Solaris to Linux, or even to Windows, without any hiccup.

    All of these other arguments are very specious: "I don't have enough RAM" will get you a reply of "go down to Fry's and spend $125 on another GB" every time. Processor speeds, even on Sun boxes, is getting to the point where the processor will never be a bottleneck for anything. Sure, java won't run as fast as a natively-compiled app. Neither will perl, php, tcl, or what have you. Raw processor speed is not as important when you have a couple of GHz to play with.

  25. Re:Don't use Java.... by boomgopher · · Score: 4, Insightful

    Java would've been far better if they'd stuck to a few basic classes, and let people develop the classes they need as they go.

    Well, gosh, you go right ahead and write your own replacement classes for everything that Sun has done already. What's stopping you?

    That's exactly why I like Java. They have a lot of good built-in libraries that cover a wide-range of applications. I don't have to reinvent the freaking wheel every time I write an app.

    --
    Your hybrid is not saving the environment. Its purpose is to make you feel good about buying something.
  26. Java performance is second priority for us by snatchitup · · Score: 3, Insightful

    The bottleneck in our applications is not how fast whatever server-side language we use, and I imagine this is similar is most IT shops.

    Our bottleneck is how fast we can execute lots and lots of stored procedures in our SQL and Oracle databases.

    It really hasn't mattered if one of our coders has been terminating loops via try{}catch{}, or ending on a condition.

    The most important thing has been, "Does each line, each method, each class do what it's actually supposed to do?"

    Our bottlenecks have always been flow back and forth between different systems, including Lotus Domino, Oracle, MS SQL Server, Websphere, etc. etc.

    Java is a small player in all this... C++, C#, Fortran, Lisp would not speed this up for us.

  27. Re:Pre-written appendix for Java Tuning by egomaniac · · Score: 4, Informative

    That article is the most absurd joke I have ever read. He spends half the article complaining about Java's startup time (which (A) does not apply in any server situation, and (B) is unfair, because you don't count the machine's bootup time when talking about the performance of C programs, do you?).

    Then he invents other ways to talk about the startup time without seeming to talk about the startup time (for instance, trussing Hello World results in a ton of output, but naturally that's Java starting up and loading its classes. Again, do you consider what the machine has to do to boot itself up when you're talking about C programs?). I will point out again that Java's startup time is almost irrelevant, especially in a server environment (which is what he's talking about).

    The rest of the article is picking on the "jar" tool. jar is a program written in Java. Criticisms against the jar tool no more reflect on Java than criticisms against gzip reflect on C. The fact that jar doesn't do a good job of reporting errors is (A) irrelevant, because it's a developer tool and we know how to read exceptions, and (B) still more irrelevant, because how well it reports errors has nothing to do with what language it was written in. Tons of C programs have lousy error reporting as well, such as a number of Unix utilities I might name.

    Further, this article is obviously very old. He's talking about Java 1.1.8, which is what, five years old now? Might as well criticize Linux by talking about obscure video driver bugs that were fixed five years ago. Obviously, that's not the article's fault for having been written so long ago, but it is the parent poster's fault for bringing it up as if it is somehow still relevant.

    --
    ZFS: because love is never having to say fsck
  28. Re:Java is plenty fast by be-fan · · Score: 4, Informative

    The FFT benchmark is a very specific case. Once the JIT kicks in, it's not Java vs C++ anymore, it's the JVM optimizer vs the GCC one. Contrary to popular belief, the GCC optimizer is very good (check out benchmarks vs ICC at coytegulch.com). However, the FFT benchmark is a case where the additional information available to the JIT optimizer allows it to outperform native code. The whole benchmark is so small, it probably even fits in cache, and doesn't really stress any of the performance pitfalls of the language itself. Now, if you have a larger application, that doesn't consist of a single inner-loop, and meanders through a lot of varied code (ie. most real applications) then the performance story will be very different. At that point, Java's performance faults (excessive bookkeeping overhead, object allocation/deallocation, overhead from the JVM, etc) come much more into play.

    --
    A deep unwavering belief is a sure sign you're missing something...
  29. Re:Sysadmins don't buy into this article. by markv242 · · Score: 2, Insightful
    "You should _always_ try to find the best (or most optimal, I should say) approach/solution to your problem."

    I disagree.

    You should always try to find the best, most efficient, most cost-effective approach/solution to your problem.

    If your internal time is billed out at $50 per hour, and you want to save your company money, you aren't going to spend 4 hours to create a custom garbage collector just to save another 5k of RAM-- you're going to go out and buy another stick of memory.

    I agree wrt bad coding habits and the like, but everything has its price. If someone can push an application out the door rapidly that can still be easily maintained and only requires a bit more memory or a bit faster processor, I'm more than willing to expense the money for that new hardware.

  30. Re:The original post is wrong, anyway... by tgd · · Score: 2, Informative

    Well, there are two ways you can prove it -- write a test... do a million iterations of one, versus a million of the other and watch your RAM usage and time it takes. Alternately, compile it and then use a java disassembler, and look at the resulting code. The string concatination is very heavily optimized, and structures like that where you are concatinating hardcoded strings basically does the equivalent of interning them (there aren't real string objects for those). Compilers are smart. If you concatinate to produce the string, it knows what you are trying to do and can optimize. If you self-optimize and get it wrong (which is what happens when you use a StringBuffer), it doesn't know what you are doing and can't optimize it. Two seconds of google searching turned up this, if you don't want to test it yourself: http://www.precisejava.com/javaperf/j2se/StringAnd StringBuffer.htm

  31. Another explanation for performance increases by zipwow · · Score: 2, Insightful

    Since other posters have already indicated that gcj /does/ lead to better performance, I think I have a cause for your performance increase beyond "Java sux":

    Re-implementation removed the bottleneck.

    What kind of profiling did you do against your original Java application? Where was the time being spent? I've worked on some pretty high-performance java applications, and have found them to be quite scalable.

    If you're talking about GUI responsiveness (not client/server or high processing interactions), then you may have a point. All the nefarious interactions between the platform-specific GUI toolkits and their OS of choice (this applies both to Windows and Linux) do a lot of very specific optimizations that just can't be done as well cross platform.

    Interestingly, the original AWT used components based on native ones for just this reason, but that turned out to be problematic.

    Anyway, if you have the intention of supporting your claim that your application had performance problems due to Java itself, I'd be interested in hearing about your profiling process.

    -Zipwow

    --
    I don't know which is more depressing, that 2/3 didn't care enough to vote, or that 1/2 of those that did are crazy.
  32. Re:Pre-written appendix for Java Tuning by Admiral+Burrito · · Score: 2, Funny

    Java exception verbosity is a serious problem. Many times I've heard of "java.lang" errors. The correct solution to this problem is to use C-style exception handling:

    try {
    // code
    }
    catch (Throwable t) {
    System.err.println("segfault");
    }

    As you can see, Java is every bit as good as C.

  33. Re:Java is plenty fast by ChrisRijk · · Score: 2, Insightful

    The FFT benchmark is a very specific case.

    Why? It's smaller than most code, but why does that inherantly benefit Java?

    Once the JIT kicks in, it's not Java vs C++ anymore, it's the JVM optimizer vs the GCC one.

    That's the whole point. Unless you only care about programs where the entire execution time is a few seconds, the JVM optimisation time isn't going to be much of an issue.

    However, the FFT benchmark is a case where the additional information available to the JIT optimizer allows it to outperform native code.

    I compiled specifically for the machine I was running on. I tried everything I could to make it faster than Java. For Java VMs, being able to get "additional information" is what always happens. It's not specific to the benchmark.

    The whole benchmark is so small, it probably even fits in cache, and doesn't really stress any of the performance pitfalls of the language itself.

    The code is 10KB file with a number of critical functions. A good optimiser would have to do inlinging, loop unrolling as well as a lot of data-flow optimisation. I ran it across a range of data sizes, and Java did better at bigger array sizes (until memory bandwidth was the limiting factor). You have it the wrong way around - the smaller the code/data, the more language specific issues show up.

    Now, if you have a larger application, that doesn't consist of a single inner-loop, and meanders through a lot of varied code (ie. most real applications) then the performance story will be very different.

    Unless your app's performance is dependant on I/O, OS calls or strings, making the application bigger is not going to make things very different. Actually, it can give JVMs a number of extra advantages due to the super-inlining capabilities available to run-time generated code.

    At that point, Java's performance faults (excessive bookkeeping overhead, object allocation/deallocation, overhead from the JVM, etc) come much more into play.

    I don't know what you mean by "bookkeeping overhead" but C code has to allocate/deallocate memory too, and has problems like memory fragmentation. How things compare with a garbage collector depends on the code, system, JVM etc, but modern JVMs can handle 10s of gigabytes of data in the heap. Simply to say that GC is slow is simplistic. Unless you have code that only runs for a few seconds, "overhead from the JVM" (if by that you mean the optimiser) isn't going to be a problem.

    I'm not saying there's not areas where Java is at a disadvantage (I've listed some in this post). In some areas, it's going to remain inherant, but in others, JVMs are becoming advanced enough that issues like run-time optimisation can be a distinct advantage. Anyway, as far as I'm concerned, I have no problems with Java performance for anything I've done, including GUI code.

    These days when writing highly optimised web-server code in Java, I have to get a super-accurate timer since Java's standard timer is only accurate to one millisecond, and that's too coarse grained to tell how much different various optimised I add make. This is where all the HTML is dynamically generated for each request btw.

  34. Re:Sysadmins don't buy into this article. by pmz · · Score: 2, Interesting

    If your internal time is billed out at $50 per hour, and you want to save your company money, you aren't going to spend 4 hours to create a custom garbage collector just to save another 5k of RAM-- you're going to go out and buy another stick of memory.

    This reminds me how broken many (most?) corporate accounting systems are. Where I work, for a stick of RAM (or software, or whatever), it would take at least four hours spread over a couple weeks just to figure out who to submit the request to, wait for our "purchasing agent" to get a couple signatures from bureaucrats, wait for the purchase order to work its way to the top of a pile, and finally get the RAM only to discover they ordered the wrong type. All the while, they'll happily pay for labor hours wasted on slow computers with inadequate RAM (for example).

    Why there is such a fundamental disconnection between spending money on labor versus spending it on time-saving equipment and software leaves me questioning reality.

  35. Blah blah performance tuning... by Lardmonster · · Score: 3, Interesting
    So people write their own versions of linked lists and strings, to get up to 15% performance improvement. Well whoopie-fsck.

    How much does that extra development time cost?

    Writing ones' own java.lang.String takes time. Writing routines to convert com.donkeybollocks.String to java.lang.String and back again takes time. Supporting it takes time. And time is money. Me, I'd rather spend an extra £100 on a faster processor, or a Gb of RAM, and take a 25% performance improvement.

    Come on guys, one of the major wins of the OO methodology is code reuse. Time was when programmers would always have to write their own I/O routines - I thought those days were long-gone. Rewriting fundamental parts of the Java API is just plain silly, unless it has a bug or a serious limitation (eg, it's non-threadsafe).

    --
    The more advanced the technology, the more open it is to primitive attack
  36. Re:Sysadmins don't buy into this article. by Bodrius · · Score: 2, Interesting

    I don't know what pletorah of things they taught you in CS school, but much of the wisdom they taught some of us can be summarized:

    - Big O matters. Optimization of constants is an expensive luxury.
    - Reimplementing the wheel for the sake of marginal efficiency is a sure way to get a square and inefficient wheel.

    Most algorithms of any common use are provided in the standard libraries of each language. If not there, any algorithm can be implemented in any language by virtue of its Turing-completeness. This guarantees you bigO efficiency, which is what matters in the long run.

    The article complains about Java being slow for the sake of its pcode nature. That's a constant factor, not bigO. It's automatically defeated by "CPU is cheap, RAM is cheap", i.e.: constant factor acceleration is cheap.

    You better have a good reason to worry about constant factors: if your program demands so much from the machine that the constants make the difference on whether it's practical or not, you better be experimenting with the 'bleeding edge' or there's something really wrong with your program.

    Efficient algorithms are used on every language by any programmer worth 2 bucks. Java has the advantage of implementing a bunch of them on standard libraries that work quite well, thank you. Someone who uses bubblesort in Java outside of a classroom is not lazy, he's an idiot. Implementing bubblesort is more complex and expensive than calling Arrays.sort().The same thing actually applies to any programming language.

    If your concerns about speed as a typical sysadmin (servers and workstations) or even worse, as a developer, are dominated by constant factors, it's time to go back to take data structures and algorithm analysis at CS school.

    --
    Freedom is the freedom to say 2+2=4, everything else follows...
  37. Not a bad albatross by jemenake · · Score: 3, Insightful
    Performance has been the albatross around Java's neck for a long time..
    Every time some C/C++ snob snipes Java for being slow, this is what I tell them:

    When I write a Java program... if it's too slow today, then, in time, the problem will go away without any more effort on the part of the programmer. In a year from now, we'll certainly have faster computers, which will make up for any speed problems.

    On the other hand...

    A year from now, we will almost certainly not have CPUs that are suddenly immune from dangling pointers and memory leaks.

    In other words, there are not plausible, near-future-forseeable advancements in computing hardware that could fix the worst problems of C/C++. Meanwhile, the near-future advancements in hardware are almost guranteed to fix Java's worst problem.

    The same holds true for doing your computing today... regardless of what hardware is available a year from now. Personally, I'd rather have a slow program that could keep running than one that was really fast, but crashed before I could save my work.
  38. Re:Java is plenty fast by be-fan · · Score: 2, Informative

    Why? It's smaller than most code, but why does that inherantly benefit Java?
    >>>>>>
    The reason it inheretly benefit Java is because of the characteristics of the Java language. First of all, it's a JIT language. Thus, if you have a tight inner loop, the JIT optimizer can optimize the hell out of it (even more so because it has access to runtime information that the static C++ optimizer does not) and just hand it over to the processor for execution. The JVM isn't even executed again until the loop is over. This situation doesn't invoke any of the language overhead that makes Java slow. This overhead takes many forms. The JVM has a large instruction cache footprint. Java objects all have an extra header containing type information that causes a data cache footprint and impacts memory bandwidth. The garbage collector can be a big problem. As Tannenbaum said, avoiding disaster is more important that optimal performance. The average case allocation/deallocation might be quite fast for a garbage collector, but when an actual collection occurs, you get that "disaster" that you're trying to avoid. The collection process thrashes the cache and occupies the application for a (comparatively) long time. When a function is invoked, the JVM has to check to see if the JIT has the code already cached. This takes time. Move beyond that to the Java APIs themselves. The Java APIs are designed for purity and ease of use. APIs like the C++ STL are designed for pure performance. For example, a dynamic cast is inherently slow, by the nature of the operation. Yet, the Java Collection APIs require them for every access. All Java class methods are by default indirect. Last time I measured (on a PII 300) indirect calls are about 10 times slower than direct calls. A small numeric benchmark doesn't hit any of these performance issues, but real application code hits them, hard.

    --
    A deep unwavering belief is a sure sign you're missing something...
  39. Re:New math? by Anonym0us+Cow+Herd · · Score: 2, Informative

    You only need 4 bytes to make reflection work. Each object has a pointer to what class it is. The class information has everything else, the vtable so to speak, what interfaces are implemented, etc. All String objects have one single pointer to a common String class object the defines what a String is, what its members are, what its methods are, etc. everything needed for reflection. Think of the vtable as being more than just an array of method pointers, but meta information (the class object) about the object. More than just a vtable. Therefore, reflection requires no overhead really, since an OO language needs a vtable pointer.

    --
    The price of freedom is eternal litigation.
  40. Re:New math? by julesh · · Score: 2, Insightful

    There's an additional 16 bytes involved in the object class descriptor pointer and the reference to the object's monitor (*2 - one for the String and one for the char[], which is a fully fledged object in its own right), plus probably a couple of bytes overhead for the memory allocator. About 32 bytes seems reasonable. I think the point to make is, though, there is no _sensible_ way of making the string any smaller without sacrificing performance. Plus the objects have the ability to share the array between two strings that have similar data (say one is a substring of another) and that _substantially_ reduces memory requirements. I'd say the Java string implementation is about as good as it gets.

  41. Re:Sysadmins don't buy into this article. by Bodrius · · Score: 2, Interesting

    20% sounds like an inflated number. The typical app spends most of its time waiting for I/O or doing silly stuff for user interaction. A 20% overall difference would imply what, 50%, 100% slower when it's actually working?

    There's some overhead, but it's never that bad. Sure, the overhead matters, which is why there's an investment on improving VM technology, providing access to native operations, etc.

    But also the worst overhead offenders are not VM issues, but application design issues: blocking I/O, threading bugs, NOT using multithreading when you should, etc. Also, using Swing/AWT in a non-trivial GUI. I'm beginning to consider Swing/AWT just a giant bug.

    However, let's assume the 20% speed difference in an application...

    It's not that constant factors don't matter at all, it's that they matter the least. And when you have so many more important problems to fix, they take the last place in the priority list.

    By definition, choosing your implementation language is an early decision that takes place long before that list is filled up. Although speed is always a concern, algorithmic speed is more important than PL speed, and independent of it. So PL speed shouldn't have that much to do with the decision.

    Saving 20% in hardware is great, and the efficiency marketing advantage is important too. But those advantages are worth nothing if your application is buggier, less extendable, less flexible than your competitors, and specially if it gets to the market AFTER your competitors.

    Developing applications in C is more expensive and complicated than developing them in Java, and the difference is typically more than the difference in speed. (I'm not saying that C applications are inherently worse than Java apps, just that to develop the same application with the same extensibility, features and stability takes more time, and a bunch of non-standard libraries).

    Development costs add much more quickly than hardware costs, and unlike hardware costs, development costs are not guaranteed a return in performance. These are not dollars and cents, but hundreds and tens of dollars: development is more expensive than hardware.

    Then you go to the client 6 months after your competitors and try to sell them the application. If they haven't already bought from the competition, you'll try to convince them that although your application is more expensive, and altough it hasn't been tested in the market for as long as the other ones, and although they'll need a C programmer versed in your favorite non-standard libraries to maintain it (you do give them API documentation and the tools to maintain it, right?), your application will save them a few bucks in hardware.

    Unless the difference in hardware costs has more than 4 digits, I think your customer will advice you to take an accounting class.

    If the difference is more than 4 digits, you are pushing the technology and you need to care about constant factors.

    Either that or you're not dealing with a typical application. For example, a scientific analysis program that spends most of its time in pure computation needs all the juice it can get. Although I understand Java works fine for pure computational tasks.

    Now, if you can make your applications in C as cheap, fast, and safely as with Java, then you have great C developers so you should just keep doing that. Most people can't.

    --
    Freedom is the freedom to say 2+2=4, everything else follows...