Slashdot Mirror


Ask Slashdot: Why Is It So Hard To Make An Accurate Progress Bar?

hyperorbiter writes "How come after 25 years in the tech industry, someone hasn't worked out how to make accurate progress bars? This migration I'm doing has sat on 'less than a minute' for over 30 minutes. I'm not an engineer; is it really that hard?"

41 of 736 comments (clear)

  1. it's because by Anonymous Coward · · Score: 5, Funny

    Comment loading ...

  2. Uhmmm. by Fellon · · Score: 5, Insightful

    Yes it is "that hard".

    --
    I did it all for the penguins!
  3. Yes. by kc9jud · · Score: 5, Funny

    Yes it is. And to be fair, it's a lot more accurate than Nostradamus ever was.

  4. because by ILongForDarkness · · Score: 5, Insightful

    Things are asyncronous. You wait for things from disk, ram, user input, over the network etc. How long it will take is non-deterministic. So a task composed of a bunch of these little pieces will be non-deterministic too.

    1. Re:because by eksith · · Score: 4, Insightful

      This. Also, there may be other applications/processes that may start up, suddenly demand resources or jump in priority, which inevitably result in a progress bar going backwards.

      --
      If computers were people, I'd be a misanthrope.
    2. Re:because by geekmux · · Score: 5, Interesting

      Things are asyncronous. You wait for things from disk, ram, user input, over the network etc. How long it will take is non-deterministic. So a task composed of a bunch of these little pieces will be non-deterministic too.

      If this is true (which it is), then logic would dictate that it is pointless to represent a progress bar in measurements of time.

      Perhaps the larger question is how much longer are we going to continue acting stupid about it, rather than trying to make a clock out of it.

    3. Re:because by NotQuiteReal · · Score: 5, Insightful

      Plus, you are lucky even get a "guesstimate bar". Showing progress with bad estimates is more reassuring than just a "working on it spinner". Either is an order of magnitude better than an unresponsive GUI because the programmer didn't put that stuff in another thread.

      Personally, I like the "list of what I am going to do + check them off as they are done" approach, if the "progressing task" is of varying descriptions.

      That said, for some tasks, you have to consider the "Heisenberg effect" a bit, depending on the work being done - do you want your progress bar pestering the "worker" for progress reports all the time, or just doing its job.

      --
      This issue is a bit more complicated than you think.
    4. Re:because by ILongForDarkness · · Score: 5, Insightful

      The list/status bar solution is nice for another reason: if your program does hang your users know where in the process it happened. You get more useful feedback from the users, sometimes they might be able to troubleshoot it themselves (oh it is dying when connecting to my fileserver, perhaps I"m not connected to the network). Heck it is useful for course grained optimization you see the logical steps that take the most time and can drill down into speeding them up as much as possible.

  5. Various reasons by GoodNewsJimDotCom · · Score: 5, Insightful

    One reason is the progress bar starts out as just a generic tool to show that your loading hasn't froze. At first it is parsed correctly with the elements to be loaded, but as scope increases and more things load, it can get sketchy later on.

    Another reason is it is difficult to estimate time left. If you look at some old FTP programs, they'd estimate the rest of the download's time based on how fast the previous has taken. Future lag, fragmented files, etc aren't taken into consideration.

    There's a bunch more reasons, but namely the progress bar's main purpose is to show you that the whole system isn't locked up, which they've been doing well for the past 30 years or so.

    1. Re:Various reasons by uncqual · · Score: 5, Insightful

      And a spinning asterisk or swirlie does that fine, without implying anything about "progress" or "time left".

      The spinning asterisk or swirlie just tells you that the thread updating the asterisk or swirlie hasn't frozen -- it often doesn't tell you anything about the underlying activity the user is waiting for.

      --
      Why is there an "insightful" mod and why isn't it "-1"? If I wanted insight, I wouldn't be reading /.
  6. Re:no, it's easy. by dotHectate · · Score: 5, Funny

    You know someone is going to take your suggestion literally as a tutorial on how to implement a progress bar - later they'll come back with some mystical crash always happening at 0%.

    --
    Patience is a virtue, but haste is my life.
  7. It's mathematically impossible. by PseudoThink · · Score: 4, Informative

    See http://scribblethink.org/Work/kcsest.pdf and http://scribblethink.org/Work/Softestim/softestim.html

    (No, I'm not being serious. The topic just reminded me of when I once jokingly justified a poorly estimated ETA on a "simple" development project by referencing the above paper.)

  8. MSIE by Anonymous Coward · · Score: 5, Interesting

    My favorite terrible progress bar was Internet Explorer, back in its early days of essentially being a renamed version of NCSA Mosaic. When attempting to load a site that wasn't available, the progress bar would slowly creep towards complete, despite the server being completely unresponsive. Then after a long while the browser would give up and stop the progress bar. Why on earth would the progress bar move if the server is completely unresponsive? Who programmed this? I would hope that they, like the inventor of Clippy, suffered a terrible death by fire.

  9. Can't Go Backwards by naroom · · Score: 5, Insightful

    Consider this: Once you've put progress on a bar, you can't take it off. Suppose you start a process that should take 20 minutes, and do the first 5 minutes, progress is now at 25%. But then, partway through, something unexpected happens and you realize the process is actually going to take 40 minutes. You can't take the progress "back" now, that would disorient the user. So you have to rescale the remainder of the bar.

    1. Re:Can't Go Backwards by Jafafa+Hots · · Score: 4, Insightful

      I've seen some progress bars that do exactly that - go backwards.

      --
      This space available.
    2. Re:Can't Go Backwards by citizenr · · Score: 5, Informative

      This is why you put two progress bars. One general and one for sub processes.
      http://www.codeproject.com/KB/progress/SplitProgressBar/SplitProgressBar.gif

      --
      Who logs in to gdm? Not I, said the duck.
    3. Re:Can't Go Backwards by shutdown+-p+now · · Score: 4, Interesting

      I've seen progress bars that go backwards, but it usually indicates reversal of something that is done previously. E.g. in MS Office installer, if you cancel the installation midway through, it will "rewind" the progress bar while cleaning up.

    4. Re:Can't Go Backwards by Agent+ME · · Score: 5, Insightful

      What? Why the fuck would a progress bar go backwards? A progress bar shouldn't be pegged to the fraction of time of the job done. It should be pegged to the actual progress. If progress starts going slower, then the progress bar should start moving slower.

    5. Re:Can't Go Backwards by The+Snowman · · Score: 5, Funny

      Yes and it's those--as well as those that dance left and right for no damn reason--that really piss me off.

      You mean like this one? http://xkcd.com/612/.

      --
      24 beers in a case, 24 hours in a day. Coincidence? I think not!
    6. Re: Can't Go Backwards by LesFerg · · Score: 5, Insightful

      Realistically a large IO bound task is something you hand off to low-end process to get it done as fast as ppssible. Would you over-ride the O/S to transfer that 10gigs in 100 meg blocks and report each bit back to your high level UI so that th progress bar is accurate?

      --
      If I had a DeLorean... I would probably only drive it from time to time.
    7. Re:Can't Go Backwards by asliarun · · Score: 5, Insightful

      To put it another way, it is impossible to make an accurate progress bar because it is impossible to predict the future. That's all.

      You can make assumptions, extrapolation, use past performance,etc but you are never going to get it accurate.

      This is like getting pissed at your GPS because it told you you will reach in 22 mins when it actually took you 35.

      It is either a dumbass expectation from the user or a dumbass design decision. You might have as well used an hourglass so the user could spend their time shaking it trying to get the sand to fall faster.

    8. Re:Can't Go Backwards by ibennetch · · Score: 5, Funny

      I think I prefer going backwards to what I once saw on (what I remember as) a Microsoft Office installation, probably 15 years ago. The progress bar ever so slowly crept upwards...98, 99, 100% done...then 101, and so on. It finally locked up somewhere after 140%.

    9. Re: Can't Go Backwards by Impy+the+Impiuos+Imp · · Score: 5, Interesting

      We used an industrial strength version of cp in one product, which used DMA for much faster copy, and would write out status every 20 Mb that it could resume from in power loss, and would also, via shared memory variable, publish progress in bytes, all so you could have an accurate progress indicator.

      Some people do know what they are doing.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    10. Re:Can't Go Backwards by Arancaytar · · Score: 4, Insightful

      Yeah, but then users complain because the bar is weirdly scaled, like 95% taking up a second and the remaining 5% an hour. You can try to profile it, but the actual time is determined by the local hardware after all: Processor speed, number of cores (which affects only some operations), network speed, free memory - there's no way to predict how much of the time will be spend on each task.

      (And if the progress bar isn't an indicator of execution time at all, then the arbitrary completion percentages assigned by the developer mean little. In that case, you may as well go back to hourglasses and spinning beachballs...)

    11. Re:Can't Go Backwards by JonJ · · Score: 4, Funny
      --
      -- Linux user #369862
    12. Re:Can't Go Backwards by fnj · · Score: 5, Insightful

      Suppose a bar can never back up. Then its speed may vary enormously, making the visual how-done-am-I representation valueless.

      It is HARD.

      Define "fraction of job time done" numerically. Suppose you have to download 100 packages and then install them. How many % of the total should the download represent? Does the guy have cable or dialup? And how many % does the install represent? Does he have a slow USB hard drive or a ridiculously fast SSD? What if the first file downloads at 1 MBps and the 17th one at 139 Bps? I've seen that happen (varying server load, per-file mirrors). Which installs faster, a package containing 250 files totaling 10 MB, or a package containing 10 files totaling 100 MB - hint; it depends.

      Generally, might as well show a rotating propeller or a cylon helmet moving dot, neither of which has a beginning or an end. Just an indication that something is going on, and the system doesn't look frozen. Maybe a couple of numbers under it showing # files done out of # total files.

    13. Re:Can't Go Backwards by rioki · · Score: 5, Insightful

      What I like is the Step X of Y types of progress indicators. Sure step 5 of 12 might take way longer then the rest but at least it is honest. It also helps to tell the user what it is currently doing. You know that downloading drivers may take a while when the internet is slow (or down).

    14. Re:Can't Go Backwards by telchine · · Score: 5, Funny

      Sure step 5 of 12 might take way longer then the rest but at least it is honest. It also helps to tell the user what it is currently doing.

      Reticulating splines?

    15. Re:Can't Go Backwards by hattig · · Score: 4, Interesting

      I hate those, they're totally pointless. "I'm possibly doing something, I don't know how much of it I've done, and I don't know when it will end, and I won't show you when I started".

      To be honest, progress bars shouldn't be used for indeterminate timescales. If you can guess a time remaining, then say that, as this can be changed, and it is visually more useful than a context-less progress bar that can go in reverse.

  10. Cost Benefit by infernalC · · Score: 4, Informative

    Progress bars do not make sequences of actions complete any faster. In fact, they make them slower.

    That being said, take for example an installer that must perform the following steps during an upgrade:

    0. Figure out how many files need to be replaced.
    1. Replace 30 files of varying sizes.
    2. Add 10 files.
    3. Update a half million rows inn a table with a million rows setting a column to a computed value based on some predicates.
    4. Run a third party installation mechanism (MSM?) for a supporting library, etc.

    Modern computers are time-sharing systems. Each process that involves computation is at the mercy of the scheduler in the kernel to give it the cycles it needs to complete. That means that even if you measure the time it takes to complete some process, it's not going to be the same a second time, because the installation process doesn't get undivided attention.

    Steps 0 - 2 - you're at the mercy of the IO buses, hard disk, antivirus software interfering, etc.
    Step 3 - What shape are the database statistics in? How efficiently can you apply the predicates? What does the distribution of the data look like? You can't tell this ahead of time...
    Step 4 - Does this third party installer provide you some sort of metrics as it runs?

    These are the sorts of problems to be overcome to do an accurate progress bar. In short, they aren't worth overcoming.

  11. consider a rocket launch countdown by at10u8 · · Score: 4, Insightful

    For over 50 years rocket launch countdowns have not run in a linear fashion, sometimes even being set backwards.

  12. Physics! by ignavus · · Score: 5, Funny

    You can work out where you are (% completed) or how fast you are going (rate at which the progress bar is growing), but not both at the same time.

    It's simple quantum mechanics.

    --
    I am anarch of all I survey.
  13. Re:Yes by greg1104 · · Score: 4, Interesting

    I would have stopped at "yes" without the optimism, and it's not even a platform/processor issue. This is a fundamentally unsolvable thing, since in some cases it reduces to the halting problem. That best you can do is come up with an approximation.

    But the approximations will be wrong too. Simple example: write speed. If you write a block to disk, and it fits into the operating system cache, a write happens almost instantly. If the cache is full, you might have to spend tens of seconds waiting before that write occurs. Here's what happens with installers, copy programs, and a large chunk of other things. You write until the cache fills, and those happen at memory speeds. Then, all of the sudden, you grind to a halt when the cache fills. You won't see any progress, sometimes for minutes--it can take a while to chug through gigabytes of random writes. I show an example of this on my blog on Linux, and this problem gets worse as memory increases, not better.

    What does that look like to the user? They get a progress bar, it zooms along for a while, and then it cranks to a halt. Then it hangs for a bit, starts moving, and the whole thing completely changes scale. Does this sound familiar? That's what people complain about, right? You can't make that go away without building a model of the caching mechanism that's more accurate than the cache itself. After all, if you could predict this was coming, the OS could have done a better job scheduling I/O with that information, too. Think about that for a minute: to write a really good progress bar for write operations, you have to do a better job on I/O scheduling than Linux does.

    If you step back and say "well let's approximate how long disk I/O takes then and base the bar on that", you'll discover that doesn't work either. There's over a 100:1 difference between the fastest and slowest storage on the market. Good luck modeling that accurately enough to predict the future, too.

  14. Re:sometimes by PlusFiveTroll · · Score: 5, Informative

    The Windows8/Server 2012 dialog is much better in this case.

    http://encosia.com/blog/wp-content/uploads/2012/09/windows-8-file-copy-dialog.png

    It draws a graph showing you the current rate, in which you can see the average over time.

  15. Re:Crappy software by Jiro · · Score: 5, Informative

    The progress bar on the HTTP download doesn't show the amount of remaining time in the bar. It shows the number of bytes remaining in the bar; the number of bytes remaining can't go into reverse. The time remaining is showed as a numeric value for how long it would take assuming the speed is the same as the speed so far; if the transfer suddenly slows down, this value can go into reverse.

  16. Because it should be a graph. by gargleblast · · Score: 4, Interesting

    It is hard to make an accurate progress bar because it shouldn't be a bar at all - it should be a graph.

    Consider the humble download: bytewise, it might be 97 percent complete, but at the last moment, the bps rate has fallen. With a progress bar indicating a percentage and an estimated time, it might say 97% complete, 3 seconds to go. If the progress indicator was a graph, you could tell that the bps rate has fallen, and that the 3 seconds to go estimate (probably based on a linear extrapolation of progress to date) does not apply.

    I have never seen it done though. Partly, because I have never done it.

  17. The 99%... by DavidD_CA · · Score: 4, Insightful

    The public opinion of the Progress Bar would be considerably more favorable if programmers would simply treat 100% as if it were 75%.

    In other words, do all the stuff you have to do, measuring progress and whatnot, but when you're actually at 80%, report yourself at 60%. Likewise, when you're at 95%, say you're at 70%.

    Then, only when you really are completely finished, you jump from 75% to 100% in under a second.

    Complaints gone.

    --
    -David
  18. You know what I'd like to see? by k3vlar · · Score: 5, Insightful

    You know what I'd like to see more than a working progress bar? A "Cancel" button that actually stops the f*%! process! .
    I don't want to finish the sub-process I'm currently doing (which has probably stalled)... just FREAKING STOP.

    If you (programmer) want to close connections, or save the changes to the disk, do it in the background. Making me sit there for another 10 minutes while you're "cancelling..." is not helpful. I will force close your program. Failing that I will hard-reset the computer. Seriously.

    --
    Unlike porn, which yada yada rimshot hey-ooh!
  19. Re:Crappy software by nabsltd · · Score: 5, Insightful

    Give this man some upward moderation.

    The problem is that the question is wrong. It's trivial to make a progress bar...just sum up all the things you have to do, and move the bar each time a "thing" is done, rounding to the nearest pixel. It doesn't matter if the "thing" is a byte to copy, a file to install, or any generic task. As long as you can add one to a counter each time you have done another "thing", you can then display it graphically.

    The actual complaint is about displaying accurate time remaining to complete the task, which really has nothing to do with the display of the progress bar. Instead, it involves guessing about how long each remaining "thing" will take to complete, and then displaying that sum of those times. This is hard because no matter how accurate the data used to make the guess, something outside the control of the program can disrupt the processing.

  20. We will have .... by PPH · · Score: 5, Funny

    ... a functional progress bar in 5 years.

    No, wait. It seems to have stalled.

    --
    Have gnu, will travel.
  21. Re:Crappy software by Zadaz · · Score: 4, Insightful

    Except even the number of tasks is often variable over the life of the task.

    Take for example loading a web page. It starts out as 1 task: Get a page from the server. Once you've done that, how many more requests will that first request generate? Impossible to tell. It could be none. It could be hundreds, and some of those can generate their own requests. (etc, etc.)

    The answer is this: Some feedback, no matter how incorrect, is better than no feedback at all.