No, It's Not Always Quicker To Do Things In Memory

← Back to Stories (view on slashdot.org)

No, It's Not Always Quicker To Do Things In Memory

Posted by Soulskill on Wednesday March 25, 2015 @03:45AM from the performance-that-fails-to-perform dept.

itwbennett writes: It's a commonly held belief among software developers that avoiding disk access in favor of doing as much work as possible in-memory will results in shorter runtimes. To test this assumption, researchers from the University of Calgary and the University of British Columbia compared the efficiency of alternative ways to create a 1MB string and write it to disk. The results consistently found that doing most of the work in-memory to minimize disk access was significantly slower than just writing out to disk repeatedly (PDF).

26 of 486 comments (clear)

Min score:

Reason:

Sort:

Check their work or check the summary? by s.petry · 2015-03-25 03:51 · Score: 4, Insightful

'll have to dig through their testing and methods, but this seems pretty fishy given the summary.
Seek/Read/Write time of a disk is always slower than memory. No exceptions to the rule exist given current commodity hardware. Bus length to a disk is also much longer than to memory. Again, there are no exceptions given commodity hardware.
Won't be the first time someone reported that the laws of physics don't exist for something, and I'm sure it won't be the last. Maybe someone with free mornings in the US can break it down better than the summary.

--
-The wise argue that there are few absolutes, the fool argues that there are no probabilities.
1. Re:Check their work or check the summary? by LordLimecat · 2015-03-25 03:54 · Score: 4, Interesting
  
  Tl; DR:
  They used python and java. Sort of hard to develop a meaningful thesis on general programming when you're that far up the abstraction stack. Who knows, maybe python and Java suck at memory management (GASP).
2. Re:Check their work or check the summary? by Frnknstn · 2015-03-25 04:06 · Score: 5, Interesting
  
  It's not even the choice of tools, they seem to willfully misuse the languages to get poor results.
  
  --
  If it's in you sig, it's in your post.
3. Re:Check their work or check the summary? by Anonymous Coward · 2015-03-25 04:16 · Score: 5, Insightful
  
  Let me guess
  1. They used "" + "" instead of StringBuilder
  2. They didn't actually flush the file bytes to disk, so it's really a comparison of stupid programmer in-memory string cat and intelligence caching of file writes.
  3. They intentionally engineered a scenario that reported data that was contrary to reality in order to get clicks
4. Re:Check their work or check the summary? by bondsbw · 2015-03-25 04:23 · Score: 5, Informative
  
  Specifically, the time measured to write to memory uses the following code:
  for (int i=0; i < numIter; i++) { concatString += addString; }
  The time measured to write to disk uses the following code:
  for (int i=0; i < numIter; i++) { writer.write(addString); } writer.flush(); writer.close();
  In Java, strings are immutable. Each string concatenation produces a new string on the heap, and the old string is unchanged. So there are numIter strings created in memory, and I assume garbage collection will probably happen at some point once enough memory is used. O(n) reads and O(n) writes to the heap with O(n^2) memory usage plus an unknown number of garbage collections. This can cause considerable slowing of the in-memory algorithm.
  That algorithm is then compared with one that does numIter writes to a buffer, which is then flushed to disk at the end. O(n) writes to memory buffer (no need to re-read memory) using O(n) memory space, followed by O(1) writes to disk and O(n) disk space used.
  Granted, it's been over a decade since I took algorithms so I wouldn't doubt that someone can show how I am off, but this kind of thing should be simple to spot for anyone who has an undergrad CS degree.
  PS - I love how the paper makes this aside as if it doesn't matter tremendously:
  
  Java performance numbers did not change when the concatenation order was reversed in the code in Appendix 1. However, using a mutable data type such as StringBuilder or StringBuffer dramatically improved the results.
  
  --
  All my liberal friends think I'm a conservative, all my conservative friends think I'm a liberal.
5. Re:Check their work or check the summary? by danlip · 2015-03-25 04:30 · Score: 5, Interesting
  
  The language is not the problem, the code is terrible. They did String concatenation in the most expensive way possible. I'm pretty sure if you used a pre-sized StringBuilder it would be faster in memory.
  They also make some very novice benchmarking mistakes.
  This is actually a pretty good interview problem. Anyone who writes code like that should not be hired, even for a junior position.
6. Re:Check their work or check the summary? by halivar · 2015-03-25 04:43 · Score: 4, Insightful
  
  And this is why we should not teach CS101 in Java or Python. If they'd been forced to use C this whole experiment would have turned out differently. Even the professors are getting lazy, now.
7. Re:Check their work or check the summary? by OverlordQ · 2015-03-25 07:50 · Score: 4, Informative
  
  THATS THE ENTIRE POINT OF THIS PAPER.
  
  It is easy to explain the results: In high-level languages such as Java and Python, a seemingly benign
  statement such as concatString += addString may actually involve executing many extra cycles behind
  the scenes. To concatenate two strings in a language such as C, if there is not enough space to expand
  the concatString to the size it needs to be to hold the additional bytes from addString, then the
  developer has to explicitly allocate new space with enough storage for the sum of the sizes of the two
  strings and copy concatString to the new location, and then finally perform the concatenation. In Java
  and Python strings are immutable, and any assignment will result in the creation of a new object and
  possibly copy operations, hence the overhead of the string operations. The disk-only code, although
  apparently writing to the disk excessively, is only triggering an actual write when operating system
  buffers are full. In other words, the operating system already lessons disk access times. A developer
  familiar with the language and system internals readily notices the causes of this observed behaviour,
  but this behaviour may be easily missed, as indicated by examining similar cases in production code.
  
  --
  Your hair look like poop, Bob! - Wanker.
This is the dumbest research I've seen this year by MobyDisk · 2015-03-25 03:53 · Score: 5, Informative

This is the dumbest research I've seen in 2015. There was actually no computation involved -- they just wanted to write a long string to disk. They concluded that adding the superfluous step of concatenating strings in memory, then writing to disk, was slower. Well duh! That's not what memory is for!
Re:It depends by Lunix+Nutcase · 2015-03-25 04:04 · Score: 4, Insightful

Even the slowest DDR3 SDRAM has more memory bandwidth and magnitudes faster access time.
Re:It depends by greg1104 · 2015-03-25 04:09 · Score: 5, Informative

SSDs and disk speed have nothing to do with this. None of these writes are hitting disk. All they've shown is that when you cache a write to disk, the operating system might add data to it more efficiently than the slow Python and Java string code can expand a string.
Re:It depends by hcs_$reboot · 2015-03-25 04:11 · Score: 4, Insightful

RAM *is* faster (by far) than any persistent media 9SSD, HD...). So whatever the test, the algorithm is probably bad,

--
Slashdot, fix the reply notifications... You won't get away with it...
Re:It depends by jedidiah · 2015-03-25 04:25 · Score: 5, Insightful

A more accurate title would be: "You can be sufficiently stupid with your memory access that it's faster to do disk IO."
Java is not the only system that can manifest this.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:It depends by ShanghaiBill · 2015-03-25 04:32 · Score: 5, Insightful

Even the slowest DDR3 SDRAM has more memory bandwidth and magnitudes faster access time.
Indeed. Their results make no sense. They are doing something weird. For instance, their paper says that concatenating a million one byte strings into a single million byte string takes 274 seconds. That should take much less than one second. Their code is listed at the end of the paper, and they seem to be assuming that "flush" means the code is actually written to disk. It does not. It just means the bytes were passed to the operating system.
The real story here, is that if you don't know how to write code properly, then string concatenation can be really slow.
Was their paper peer reviewed?
Re:The new antipattern by Anonymous Coward · 2015-03-25 04:42 · Score: 5, Insightful

Sorry but you'll need to do it without using any memory. We need to make it fast.
Memory bandwidth is about 20Gb/s. Disk bandwidth is about 0.05Gb/s. The performance consequences of this are obvious to anyone who knows how basic arithmetic works.
The results they got are invalid because their test framework is broken. This is exactly why everyone should be forced to learn C/C++ or Assembler in college/university. The reason for the crap result is they did not preallocate their buffers so they wasted all their execution time allocating and reallocating larger buffers from the heap. The disk APIs have their own internal buffer implementations, that were not written by idiots, that manage this correctly which is the cause of the difference.
Re:It depends by Anonymous Coward · 2015-03-25 04:54 · Score: 5, Funny

Was their paper peer reviewed?
It just was. Why do you ask?
lololol
Re:It depends by PacoSuarez · 2015-03-25 04:55 · Score: 4, Informative

[...] For instance, their paper says that concatenating a million one byte strings into a single million byte string takes 274 seconds. That should take much less than one second.
I didn't RTFA, but after reading this I am certainly not going to. This C++ piece of code takes around 0.01 seconds to run on my computer:
#include <iostream>
#include <string>
void build_string(std::string &s, std::string r) {
for (int i = 0; i < 1000000; ++i)
s += r;
}
int main() {
std::string s;
build_string(s, "a");
std::cout s.length() '\n';
}
Stupid is as stupid publishes.... by TiggertheMad · 2015-03-25 04:58 · Score: 5, Insightful

I just scanned the paper, because their claim seem to be idiotic. It looks like they are appending a single byte on the end of a string in memory and on disk. For the memory operation, this will result in a string copy since strings are immutable, vs. doing a one byte file append onto the disk. The former is increasingly expensive and the latter is a fixed cost, so after infinite operations, the disk cost becomes far less than the memory operation. If this is indeed their claim, and I am not missing something, then they should be collectively slapped for wasting our time by writing this paper. If this is really your use case, write some proper data structures to manage your data in a sane fashion.

So yes, if you do stupid things, you can make bad engineering decisions look like good ones.

--

HA! I just wasted some of your bandwidth with a frivolous sig!
python and java by Spazmania · 2015-03-25 05:10 · Score: 4, Informative

They tested using strings in python and java, both of whose string libraries are very much overweight. And they tested by concatinating strings in a way that requires constant reallocations and memory copies versus pushing data to fixed size disk buffers in the OS cache.
So... surprise! When writing data sequentially the C implementation of disk buffers is faster than the java and python implementations of strings.

--
Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
HOT BREAKING NEWS! by Alsee · 2015-03-25 05:12 · Score: 5, Funny

NEW SCIENTIFIC DISCOVERY!
For n equal to one million, an O(n^2) algorithm is slower than an O(n) algorithm. Even when the O(n^2) algorithm is run in RAM, and the O(n) algorithm is disk writes being buffered and optimized by the operating system.
I'll take my Nobel Prize now, thank you.
-

--
- - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
We're all doing it wrong! by jetkust · 2015-03-25 05:29 · Score: 5, Funny

Maybe we should store our files in memory and load them into the harddrive to do calculations.
Re:It depends by Penguinisto · 2015-03-25 05:38 · Score: 4, Interesting

That's the very first thing I thought of... what if the code were written in a lower-level language (and not in fucking python or Java!), then made do this task on Windows $latest, OSX $latest, Linux $latest, maybe a resurrected DOS $latest for reference, etc... I mean, it can't be that hard to write this thing in C and port it as needed.
Doesn't seem very scientific at all otherwise. I mean, are they testing memory versus disk, are they testing memory vs. disk performance in a given specific language, or what? Maybe they just needed to flesh out their abstract a bit more to reflect this?

--
Quo usque tandem abutere, Nimbus, patientia nostra?
Re:It depends by sjames · 2015-03-25 05:41 · Score: 4, Insightful

It makes perfect sense once you read the paper. The conclusion is techniocally correct but deceptive.
The results apply in the case of Java and Python where strings are immutable objects. They also used buffered I/O handled by libc. When you concatenate immutable strings, you must allocate a new string large enough to hold both parts, then a memcpy from both of the parts is performed to construct it. The parts are eventually garbage collected.
In contrast, writing to a file with buffered I/O means just copying the additional write buffer to the current end of the buffer and moving updating the accounting information.
As a result, in both cases, only one actual filesystem transaction takes place writing out the complete string. Thus, the actual practical difference between the two methods is that the 'in memory' version copies the memory around many times while the 'disk i/o' one copies the data once (in multiple steps, but each byte sees one copy).
That seems like a bit of a no-brainer, but the point is valid because many programmers may deceive themselves into thinking the 'in memory' method is faster because they don't take the file i/o buffering and the way immutable strings are handled into account.
Re:It depends by lgw · 2015-03-25 05:41 · Score: 5, Insightful

How in the world? Trivially. They're doing it in an O(n^2) way - it's the only explanation.
If you use string concat library code naively, you can end up "copy the string, add one byte, repeat" easily enough in languages like Java. And it's not exactly breakthrough research to discover that O(n) disk can be faster than O(n^2) memory for large enough n.

--
Socialism: a lie told by totalitarians and believed by fools.
Re:It depends by gnupun · 2015-03-25 06:50 · Score: 4, Informative

There's nothing wrong with Java or Python, but the programmer is inexperienced. Java and Python strings are immutable. So, any time they concatenate a single character to an existing string, the Java runtime creates a brand new string, leaving the original string intact (since it is immutable). So if they create a million character string using using million concatenations, guess what, a million new strings are created and that's very slow. A better solution is to use a mutable String aka, StringBuilder.
But the right solution is to use a small buffer, say 16KB to 100KB in size, fill that with characters and flush that buffer to disk every time it's full. The speed would be same as any other method, but the max memory used is 20x smaller.
Re:It depends by Anonymous Coward · 2015-03-25 06:51 · Score: 5, Insightful

And they're using BufferedWriter to write to the file which, as the name suggests, is buffering the data *in memory* before writing it.
So the result of the paper is actually O(n) in memory algorithm outperforms O(n^2) in memory algorithm for data sizes of 1MB. Hardly surprising.