Hacker News is very active - I suspect a lot of geeks are there. What does slashdot offer that is truly unique any more (not counting anon coward shenanigans)?
I publish in the field of atmospheric science. The American Meteorological Society journals are the "gold standard" amongst my research peers. While I am not happy with how much it costs to publish and subscribe to the AMS journals (the former is covered by NSF grants, the latter is "subsidized" by the university), all I know as a scientist is (a) I have to publish there to get maximum impact (b) I have to read everything in there to keep up and because the authors will likely be reviewing my future manuscript submissions. If, tomorrow, the AMS journals drastically changed their subscription model - making it open access, dropping the cost to subscribe and/or publish, etc., it would not make a single bit of difference to me, and to probably most other scientists publishing in AMS journals. Everyone would continue to submit and review, because that's where you publish, and if you don't publish there, the "right" people might not read your work.
A major change in the business model for journals would radically affect the journals themselves, and in atmospheric science in the US, the AMS itself which produces the journals. I don't think many of us weather-heads want to see the AMS fail because it does a ton of great stuff (advocacy, education, conferences, etc.), like similar organizations in other fields. But, without us scientists doing peer review for free, all "refereed" journals cease to exist, but so long as you have scientists willing to do the peer review, journals can certainly exist without much staff (many of whom are publishing scientists themselves). So we collectively wield a lot of power should we choose to use it but I personally don't know any other scientists who are willing to die on that particular hill, we're all mostly busy doing our science and trying to scrape some funding together in an extremely competitive environment.
I suspect governments themselves (as was mentioned upthread with the EU) will have to catalyze the real change. Currently AMS journal articles are open to the public after 1 year, with select articles available right away. If NSF decided tomorrow that it was not allowable to ask for more than [dollaramount] of page charges per article in NSF budgets, it would have a real, immediate impact on the problem addressed in the article.
Anyhow put me down as another academic type who definitely agrees that things should change, but also as someone who is not going to spend any time advocating for it simply because it's too big of an issue and I have grants and papers to write (sorry, just being honest here). Switching to a new (read: inferior) set of journals is not a viable option, and that is where the journals wield their real power - we need them (because all the important people read Important X Journal), and they need us to maintain their quality via the peer review process. It's probably going to take an outside force to break the current model; it's hard enough to get funded/tenured/promoted as it is, much less without deliberately publishing in the "wrong" journals.
Thanks for that - I am somewhat pleasantly surprised. I think the general scientific community that uses these things will be by and large happier - they seem to ramping up the clock speed on the CPUs (good ol' Moore) to get a bunch of their performance, and stuffing more cores on a node, etc... performance that will show itself w/out rewriting code. The accelerators will be there and I will be focusing on figuring out ways to exploit them in our fluid code. I suspect this will really be the last NSF machine that won't require significant rewrites, unless [snicker] the compiler writers [haha] find a way to have it just work [Bwahahahahahaha.... I want a pony]...
I'm an atmospheric scientist who has been using federal supercomputing hardware to better understand thunderstorms [orf.media] for years. Blue Waters is the current "Leadership Class" NSF-sponsored supercomputer. My Blue Waters allocation is currently winding down, and I can speak to how great it has been as a machine that has enabled (I know it's a cliche, but it's true) breakthrough science. A typical Blue Waters node contains 16 floating point AMD cores and 64 GB of memory. Many of the Blue Waters nodes contain a GPU, but it's miles behind the times since the machine was created about 7 years ago.
Frontera (for some reason the Canyonera theme song plays in my head) is the Phase 1 machine for the next Leadership Class supercomputer. The Phase 1 machine is supposed to come on line in 2019 and hold us over until 2024 when the next machine will come on line. When you look at how much money is being spent on Fronterra, and you compare it to Blue Waters, you realize that the vendor is being asked to create a much more powerful machine for a fraction of the price. What this will mean in practice, and what most of the scientific computing world is not ready for, is that a large bulk of the FLOPS on this new machine will be GPU flops. GPUs are not easy to use for doing heavy lifting (say, fluid dynamics solvers) using existing code. So a lot of people are going to have to decide whether to try to shoehorn their current MPI only (or MPI + some OpenMP) code to MPI, OpenMP + OpenACL (or nvidia CUDA), or to start from scratch (nobody wants to start from scratch). You have to remember that the vast majority of us scientists are NOT trained computer scientists, and most of us code for shit. I am off to a hackathon at NCSA in a couple weeks with some students to optimize some radiation code for GPUS... I spend half of my time doing computer stuff, and the other half doing science (and the other other half writing proposals, etc.).
So for those of you who aren't excited about new supercomputers, or don't understand their true power, I'm here to say that it's currently a very exciting time to be a numerical modeler, if you're willing to learn a bit on how to best wrestle these supercomputers into submission. I've spend over a decade just figuring out the most efficient way to write, organize, and analyze the TB-PB of data that a high resolution model can produce, and trying to make sense out of the firehose of data that these things can make. The hard-won benefits are crystal clear to me, but as always, tech is a moving target, so what works today might not work tomorrow...
The nice thing about supercomputers is they serve as a virtual lab for just about any field you can imagine. There are people in the humanities using supercomputers to do interesting things, beyond all the usual astrophysics, chemistry, and geophysical modeling.
I'm a long-time audiophile (it's kind of a disease; a fun one, if you can afford it). Never in a million years did I think I'd pay for a streaming service. My main objections are lossy encoding (MP3 or similar) and not having any product whatsoever, digital or otherwise (CD, vinyl). But now that services are coming along that offer CD quality (PCM, 44.1 kHz/16 bit - or perhaps higher) I finally broke down and subscribed to one of them (Tidal). What I like the most is being able to browse the catalog and play new stuff, remotely piloting a Squeezebox Touch that feeds a DAC that feeds "the good stereo". Twenty bucks a month for an unlimited CD quality catalog is a pretty good bargain if you are a voracious music consumer.
At the same time, I continue to buy a little new and used vinyl here and there, but mostly for the fun of it, as I'm just old enough to remember when records were the main media for music. So if you are a format fetishist, you can buy records and CDs will be out there for a long time (and used CDs are cheaper than dirt).
I do wonder how the artists will fare with streaming. I suspect poorly, as always, and that the people who will make the money will be the labels and the streaming services. I hope I'm wrong.
1. Two factor authentication, ALWAYS 2. People should stop using email for anything sensitive that you don't want read by your worst enemy. Use some P2P encrypted chat program or something. One would think Americans, at least, could see the value in something other than damned emails for sensitive communication.
If you are looking at the high end tornado figure, there appears to be a weak downward trend over several decades, but 2011 just rang and asked if El Reno, Joplin and Tuscaloosa wanted to come out and play.
You may turn CP off but you're sure as hell still parameterizing (microphysics!! And just how many choices do you have for those fun knobs!). And 4km is still pretty damned coarse for thunderstorms. But yeah there is clearly more to surface wind intensity than eyewall replacement, that's just an example of something that is thought to play a role and that is definitely handled better with finer meshes (tropical is not my expertise).
We're doing much better, but we've still got a lot of work to do to get hurricanes right - some of it is with the models, some of it (more, in my opinion) has to do with what we feed the models. Hoping GOES R will be a big help. Pray for a launch that doesn't blow up on the launchpad.
Good riddance to cumulus parameterization, when that day comes in everything but climate models (let's not get too carried away) I will dance with glee on reams of printouts of CP code. Microphysics is a big enough kludge but I don't think we're going to be following raindrops and snowlfakes around in our models for a long, long time.
The path of a hurricane is somewhat unpredictable (been known to turn 90 degrees for no apparent reason).
We've gotten much better at predicting the paths of hurricanes which are, to a first degree of approximation, steered by larger-scale winds that have gotten easier to predict with time because of improved observational data feeding models, as well as the fact that they are easier to resolve and are dictated by things that don't require lots of parameterization (like what you have with convective clouds and precipitation). What we continue to struggle with is the strength of the surface winds over time, which of course is highly desired at landfall. There are small scale processes going on within hurricanes (involving what meteorologists call eyewall replacement cycles) that modulate surface winds and that are less understood and much more difficult to forecast correctly. Basically, the smaller it is, the harder it is to forecast over long time periods.
If I had to choose between getting one or the other right, though, I'll choose path over surface wind strength. Getting out of the way of a Category 2 vs getting out of the way of a Category 5 is pretty much the same process!
You are basically describing ensemble forecasting, which is very powerful for providing statistically meaning forecasts where you can intelligently talk about the uncertainty of the forecast, something single deterministic forecasts cannot do.
In my research, I'm doing single deterministic forecasts to study what happens with tornadoes in supercell thunderstorms, where I am cranking up the resolution to capture flow that is otherwise unresolved. I get one version of a particular storm, which is good for studying certain aspects of storms, but not good at being able to generalize (that takes lots of simulations).
Both big deterministic simulations and ensembles have their place. Of course, today's big simulation can be the resolution of tomorrow's ensembles! Right now, you can do lots of good science with ensembles. Operationally (weather forecasting) this is basically the new paradigm, although forecasters are slow to change from just looking at the single deterministic GFS and NAM forecasts. The ensemble approach, once we start running hundreds of forecasts at higher resolution that we do today, will transform our forecasting accuracy (and precision). However it will be limited to the amount of good observational data we can feed the models (otherwise GIGO). This is where remote sensing comes in. GOES-R will be a big help.
It will indeed take people from atmospheric science, computer engineering, software engineering, etc. working together to best exploit exascale machines. NCSA understand this and that's what makes it (and other similar organizations) great.
One of the biggest problems of the current large scale HPC machines is users (like you but maybe not you specifically) are typically scientists/analysts who write software that does not scale well. There either needs to be better frameworks for you to work within that handle all the grunt work of doing efficient parallelization and message passing or every atmospheric physicist needs to be teamed with a computer scientist and a software engineer.
My main point is, just throwing more cores at "mostly MPI" weather models is not sustainable. We are going to need to be much smarter about how we parallelize.
I'm a weather guy - running cloud model code on Blue Waters, the fastest petascale machine for research in the U.S. I don't think we've managed to get any weather code run much more than 1 PF sustained - if even that. So it's not like you can compile WRF and run it with 10 million MPI ranks and call it a day. Ensembles? Well that's another story.
Exascale machines are going to have to be a lot different than petascale machines (which aren't all that different topologically than terascale machines) in order to be useful to scientists and in order to no require their own nuclear power plant to run. And I don't think we know what that topology will look like yet. A thousand cores per node? That should be fun; sounds like a GPU. Regardless, legacy weather code will need to be rewritten or more likely new models will need to be written from scratch in order to do more intelligent multithreading as opposed to mostly-MPI which is what we have today.
When asked at the Blue Waters Symposium this May to prognosticate on the future coding paradigm for exascale machines, Steven Scott (Senior VP and CTO of Cray) said we'll probably still be using MPI + OpenMP. If that's the case we're gonna have to be a hell of a lot more creative with OpenMP.
My only point is really undebatable. If, for instance, I want to listen to Wilderness Road's eponymous album, I have to listen to the vinyl, or somebody's copy of it. And it's a fucking great album and I don't need someone telling me that I'm an idiot for wanting to be able to listen to it.
Some real nutty music fans still spin 78s, can you believe it? Lot of music on those shellac platters never made it to any other format.
You cut yourself out of a surprisingly large amount of music (I'm talking about old stuff mostly) if you don't have a turntable.
I'm a self-proclaimed "audiophile" but not in the annoying, trust-my-ears-only way that plagues the hobby (I'm a scientist, dammit). I have a nice tube amp, great speakers, subwoofer, etc.... and I have a turntable as well (and a network enabled player + nice DAC). Anyhoo.... I can speak to the non-hipster side of things. Yes, some of the growth of vinyl has a faddish aspect to it. But, keep in mind, many musicphiles and audiophiles never stopped collecting and buying vinyl even through the meteoric rise of CD.
If you are a major music fan (and do not have an unlimited supply of pirated needledrops on the internet), a turntable is essential. A lot of obscure stuff was never released on CD. A lot of stuff that was released from the past on CD sounded (and continues to sound) dreadful due to the mad scramble to ride the CD wave; nth generation tapes, some equalized for vinyl, were used as the source material. Thankfully a lot of stuff these days that is selling is remastered versions of old stuff from original master tapes (not copies). You can be cyincal about this (say the major labels are just milking old warhorses) and you can also acknowledge that the digital audio technology has increased astoundingly since the late 80s and 90s. What does this have to do with vinyl? Well, vinyl can sound really good if done well. I won't argue that it is a better medium than digital; it simply isn't. But it has its own charms.
I have bought vinyl reissues that were mastered very well, and the vinyl was quiet, lacking surface noise - but about a third of the time I get burned with either lousy mastering (sibilance and related issues - and I have a very good microline cart) or more commonly, ticks and pops in shrinkwrapped new vinyl (and run through a we clean). This is the way it has always been and will always be with vinyl.
A primary motivation I have for buying new vinyl releases of new music is to acquire recordings that haven't been as dynamically squashed in the digital mastering process. While vinyl releases can be very dynamically compressed as well, as a rule, vinyl releases tend to be mastered with more dynamic range than the digital version (you could argue that this is partly, or mostly due, to physical limitations of the vinyl medium). And yes, I acknowledge that most vinyl is either digitally sourced or goes through an ADA conversion.
But mostly I continue to buy vinyl because it's fun - it's part of a hobby I enjoy very much. Spending hours just sitting "in the sweet spot" and listening to music (from any source - digital, tape, vinyl or whatever) is something I enjoy. So while people scoff at the vinyl "revival" I'm just glad to see there are more choices our there for getting good sounding music.
Most science is not breakthroughs; it's usually slow progress, with many failed experiments.
These computers are facilitating a lot of good science and increases like this in our computational infrastructure for research is great news. I do wonder how they are going to power this beast and what kind of hardware it will be made of. 300 PFlop is pretty unreal with today's technology.
As a visualization guy, it always makes me happy to see such a good use of visualization. Thanks for providing some extra technical details here! A couple of questions, though:
1) What grid type does your simulation code use? If it's regular grid, have you considered switching to something more adaptive like AMR or unstructured grids?
2) Since I/O is your main bottleneck, have you considered further decimating your output and visualizing in situ to fill in the gap? I suspect your visual analysis is too complicated for current in situ techniques to cover everything you want to do, but I'd like to hear your thoughts on it.
It's isotropic (delta x = delta y = delta z) for most of the storm, and then uses an analytical stretch function to determine the mesh outside of that region. I used the stretch technique of Wilhelmson and Chen (1982; Journal of the Atmospheric Sciences).
AMR has its benefits but adds a lot of complexity, and I tend to wish to go towards less complex, not more. I am more interested in these new heaxongal grids (see: NCAR's MPAS) which have very nice properties. I predict MPAS will be the Next Big Thing and will apply to both large-scale (say, climate) and mesoscale (like CM1) models eventually.
In-situ visualization was something we have played with. But I don't see a huge benefit to me, other than showing that it can be done. The simulation I reported on was not the first try and visualizing it with volume renderer during the simulation would have been cool but consider the fact that I've had the data on disk for over six months and have barely begun to mine it. So really, it's what you do with the data once it's on disk that matters... plus you can visualize it in all sorts of different ways.
I have other ways to see what's going on during the simulation that are near-real time. All I need to know during the simulation is that the storm is not drifting out of the box, and that it's doing something that looks reasonable. I do this with parsing the text output of the model (that spits out global statistics periodically) and I have a way to view slices of the data that was just written that works just fine.
I got about halfway through the video before the kids interrupted me (and it). So let me just ask:
Did your model take into account the energy gathering and discharge that would show a multi-amp, million-volt DC discharge? Because the energy implications of that are going to be enormous to the model.
Did it also have a mechanism that generated the lightning discharges of the storm? Because again, the lightning discharges are going to affect the electrical energy available to help / hinder the tornado.
No lightning in the model. I know of some research on modeling lightning in supercells, but I'm pretty sure they are one-way models; the lightning occurs based upon what we know about inductive and non inductive charge mechanisms, and flashes can happen - but they do not feed back into the storm. I don't think they probably feed back appreciably into real storms. Even though a lot of energy is released with lightning, so much more is released due to latent heating (phase changes between solid/liquid/gas) that really drive thunderstorms.
Trying to make a model that only focuses on the critical parts of a storm would be like trying to make a model of a car that only focused on one piston.
Good insight in all your comments. Actually, tornado modeling has a long history. Think of those vortex chambers you see at some museums etc... pioneers like Neil Ward studied tornadoes in this manner. Now, think of a numerical model of one of those chambers. This type of work has been done by folks like David Lewellen at West Virginia University.
What makes this new simulation special is that we've simulated the entire storm and allowed "things to evolve naturally" (for a very judicious interpretation of what "natural" is). One could argue that the tornado is actually not the most interesting part of the simulation, but that the processes leading to its formation, and what is going on in the supercell thunderstorm to support the tornado's long life, are the most interesting parts.
At the resolution I'm running at: decades, and this would require improvements in algorithms / hardware utilization, as well as finding a way to build and utilize machines on the exascale. Going from petascale to exascale is going to require methods/topologies that don't exist yet, I think (not just GPUs for example). And exascale power requirements are a real problem with existing hardware.
At "convection resolving" simulations (on the order of 1 km resolution - not 30 meters!) where a tornado-like vortex might form and serve as a proxy for a real tornado: That's much closer, and there are different efforts going on right now aiming towards that goal. The biggest hurdle I see is getting enough observational data to feed to the models; otherwise you end up with garbage in / garbage out. Remote sensing is the way to go with some quantites but with others there is really no substitue for in-situ measurements, and instrumentation at the scale we need is expensive.
Hacker News is very active - I suspect a lot of geeks are there. What does slashdot offer that is truly unique any more (not counting anon coward shenanigans)?
I publish in the field of atmospheric science. The American Meteorological Society journals are the "gold standard" amongst my research peers. While I am not happy with how much it costs to publish and subscribe to the AMS journals (the former is covered by NSF grants, the latter is "subsidized" by the university), all I know as a scientist is (a) I have to publish there to get maximum impact (b) I have to read everything in there to keep up and because the authors will likely be reviewing my future manuscript submissions. If, tomorrow, the AMS journals drastically changed their subscription model - making it open access, dropping the cost to subscribe and/or publish, etc., it would not make a single bit of difference to me, and to probably most other scientists publishing in AMS journals. Everyone would continue to submit and review, because that's where you publish, and if you don't publish there, the "right" people might not read your work.
A major change in the business model for journals would radically affect the journals themselves, and in atmospheric science in the US, the AMS itself which produces the journals. I don't think many of us weather-heads want to see the AMS fail because it does a ton of great stuff (advocacy, education, conferences, etc.), like similar organizations in other fields. But, without us scientists doing peer review for free, all "refereed" journals cease to exist, but so long as you have scientists willing to do the peer review, journals can certainly exist without much staff (many of whom are publishing scientists themselves). So we collectively wield a lot of power should we choose to use it but I personally don't know any other scientists who are willing to die on that particular hill, we're all mostly busy doing our science and trying to scrape some funding together in an extremely competitive environment.
I suspect governments themselves (as was mentioned upthread with the EU) will have to catalyze the real change. Currently AMS journal articles are open to the public after 1 year, with select articles available right away. If NSF decided tomorrow that it was not allowable to ask for more than [dollaramount] of page charges per article in NSF budgets, it would have a real, immediate impact on the problem addressed in the article.
Anyhow put me down as another academic type who definitely agrees that things should change, but also as someone who is not going to spend any time advocating for it simply because it's too big of an issue and I have grants and papers to write (sorry, just being honest here). Switching to a new (read: inferior) set of journals is not a viable option, and that is where the journals wield their real power - we need them (because all the important people read Important X Journal), and they need us to maintain their quality via the peer review process. It's probably going to take an outside force to break the current model; it's hard enough to get funded/tenured/promoted as it is, much less without deliberately publishing in the "wrong" journals.
Thanks for that - I am somewhat pleasantly surprised. I think the general scientific community that uses these things will be by and large happier - they seem to ramping up the clock speed on the CPUs (good ol' Moore) to get a bunch of their performance, and stuffing more cores on a node, etc... performance that will show itself w/out rewriting code. The accelerators will be there and I will be focusing on figuring out ways to exploit them in our fluid code. I suspect this will really be the last NSF machine that won't require significant rewrites, unless [snicker] the compiler writers [haha] find a way to have it just work [Bwahahahahahaha.... I want a pony]...
I'm an atmospheric scientist who has been using federal supercomputing hardware to better understand thunderstorms [orf.media] for years. Blue Waters is the current "Leadership Class" NSF-sponsored supercomputer. My Blue Waters allocation is currently winding down, and I can speak to how great it has been as a machine that has enabled (I know it's a cliche, but it's true) breakthrough science. A typical Blue Waters node contains 16 floating point AMD cores and 64 GB of memory. Many of the Blue Waters nodes contain a GPU, but it's miles behind the times since the machine was created about 7 years ago.
Frontera (for some reason the Canyonera theme song plays in my head) is the Phase 1 machine for the next Leadership Class supercomputer. The Phase 1 machine is supposed to come on line in 2019 and hold us over until 2024 when the next machine will come on line. When you look at how much money is being spent on Fronterra, and you compare it to Blue Waters, you realize that the vendor is being asked to create a much more powerful machine for a fraction of the price. What this will mean in practice, and what most of the scientific computing world is not ready for, is that a large bulk of the FLOPS on this new machine will be GPU flops. GPUs are not easy to use for doing heavy lifting (say, fluid dynamics solvers) using existing code. So a lot of people are going to have to decide whether to try to shoehorn their current MPI only (or MPI + some OpenMP) code to MPI, OpenMP + OpenACL (or nvidia CUDA), or to start from scratch (nobody wants to start from scratch). You have to remember that the vast majority of us scientists are NOT trained computer scientists, and most of us code for shit. I am off to a hackathon at NCSA in a couple weeks with some students to optimize some radiation code for GPUS... I spend half of my time doing computer stuff, and the other half doing science (and the other other half writing proposals, etc.).
So for those of you who aren't excited about new supercomputers, or don't understand their true power, I'm here to say that it's currently a very exciting time to be a numerical modeler, if you're willing to learn a bit on how to best wrestle these supercomputers into submission. I've spend over a decade just figuring out the most efficient way to write, organize, and analyze the TB-PB of data that a high resolution model can produce, and trying to make sense out of the firehose of data that these things can make. The hard-won benefits are crystal clear to me, but as always, tech is a moving target, so what works today might not work tomorrow...
The nice thing about supercomputers is they serve as a virtual lab for just about any field you can imagine. There are people in the humanities using supercomputers to do interesting things, beyond all the usual astrophysics, chemistry, and geophysical modeling.
Yay supercomputers, and yay NSF.
Hoarding is a mental health condition you should get help for. Personally I consider those that glorify the music industry fools.
Hoarding? Hardly. I ditched all my physical media (hundreds of CDs) except records, and I only have a couple hundred of those.
I'm a long-time audiophile (it's kind of a disease; a fun one, if you can afford it). Never in a million years did I think I'd pay for a streaming service. My main objections are lossy encoding (MP3 or similar) and not having any product whatsoever, digital or otherwise (CD, vinyl). But now that services are coming along that offer CD quality (PCM, 44.1 kHz/16 bit - or perhaps higher) I finally broke down and subscribed to one of them (Tidal). What I like the most is being able to browse the catalog and play new stuff, remotely piloting a Squeezebox Touch that feeds a DAC that feeds "the good stereo". Twenty bucks a month for an unlimited CD quality catalog is a pretty good bargain if you are a voracious music consumer.
At the same time, I continue to buy a little new and used vinyl here and there, but mostly for the fun of it, as I'm just old enough to remember when records were the main media for music. So if you are a format fetishist, you can buy records and CDs will be out there for a long time (and used CDs are cheaper than dirt).
I do wonder how the artists will fare with streaming. I suspect poorly, as always, and that the people who will make the money will be the labels and the streaming services. I hope I'm wrong.
I grew a big-assed tornado on a supercomputer. How fucking cool is that? Can't do that on an Apple ][.
1. Two factor authentication, ALWAYS
2. People should stop using email for anything sensitive that you don't want read by your worst enemy. Use some P2P encrypted chat program or something. One would think Americans, at least, could see the value in something other than damned emails for sensitive communication.
All it will take is one sustained power outage, and that's all she wrote.
The amount of power consumption required to keep things that cold must be enormous. What a waste.
Bullshit. Did you even look at the article?
http://www1.ncdc.noaa.gov/pub/...
Show me your trend. Please.
If you are looking at the high end tornado figure, there appears to be a weak downward trend over several decades, but 2011 just rang and asked if El Reno, Joplin and Tuscaloosa wanted to come out and play.
You may turn CP off but you're sure as hell still parameterizing (microphysics!! And just how many choices do you have for those fun knobs!). And 4km is still pretty damned coarse for thunderstorms. But yeah there is clearly more to surface wind intensity than eyewall replacement, that's just an example of something that is thought to play a role and that is definitely handled better with finer meshes (tropical is not my expertise).
We're doing much better, but we've still got a lot of work to do to get hurricanes right - some of it is with the models, some of it (more, in my opinion) has to do with what we feed the models. Hoping GOES R will be a big help. Pray for a launch that doesn't blow up on the launchpad.
Good riddance to cumulus parameterization, when that day comes in everything but climate models (let's not get too carried away) I will dance with glee on reams of printouts of CP code. Microphysics is a big enough kludge but I don't think we're going to be following raindrops and snowlfakes around in our models for a long, long time.
The path of a hurricane is somewhat unpredictable (been known to turn 90 degrees for no apparent reason).
We've gotten much better at predicting the paths of hurricanes which are, to a first degree of approximation, steered by larger-scale winds that have gotten easier to predict with time because of improved observational data feeding models, as well as the fact that they are easier to resolve and are dictated by things that don't require lots of parameterization (like what you have with convective clouds and precipitation). What we continue to struggle with is the strength of the surface winds over time, which of course is highly desired at landfall. There are small scale processes going on within hurricanes (involving what meteorologists call eyewall replacement cycles) that modulate surface winds and that are less understood and much more difficult to forecast correctly. Basically, the smaller it is, the harder it is to forecast over long time periods.
If I had to choose between getting one or the other right, though, I'll choose path over surface wind strength. Getting out of the way of a Category 2 vs getting out of the way of a Category 5 is pretty much the same process!
You are basically describing ensemble forecasting, which is very powerful for providing statistically meaning forecasts where you can intelligently talk about the uncertainty of the forecast, something single deterministic forecasts cannot do.
In my research, I'm doing single deterministic forecasts to study what happens with tornadoes in supercell thunderstorms, where I am cranking up the resolution to capture flow that is otherwise unresolved. I get one version of a particular storm, which is good for studying certain aspects of storms, but not good at being able to generalize (that takes lots of simulations).
Both big deterministic simulations and ensembles have their place. Of course, today's big simulation can be the resolution of tomorrow's ensembles! Right now, you can do lots of good science with ensembles. Operationally (weather forecasting) this is basically the new paradigm, although forecasters are slow to change from just looking at the single deterministic GFS and NAM forecasts. The ensemble approach, once we start running hundreds of forecasts at higher resolution that we do today, will transform our forecasting accuracy (and precision). However it will be limited to the amount of good observational data we can feed the models (otherwise GIGO). This is where remote sensing comes in. GOES-R will be a big help.
It will indeed take people from atmospheric science, computer engineering, software engineering, etc. working together to best exploit exascale machines. NCSA understand this and that's what makes it (and other similar organizations) great.
One of the biggest problems of the current large scale HPC machines is users (like you but maybe not you specifically) are typically scientists/analysts who write software that does not scale well. There either needs to be better frameworks for you to work within that handle all the grunt work of doing efficient parallelization and message passing or every atmospheric physicist needs to be teamed with a computer scientist and a software engineer.
Absolutely agree 100%!
You can say "one of" but you can't say "the fastest" petascale machines my friend
http://www.hpcwire.com/2012/11...
I should have added "on a college campus".
My main point is, just throwing more cores at "mostly MPI" weather models is not sustainable. We are going to need to be much smarter about how we parallelize.
Weather guys want this after NSA's done.
I'm a weather guy - running cloud model code on Blue Waters, the fastest petascale machine for research in the U.S. I don't think we've managed to get any weather code run much more than 1 PF sustained - if even that. So it's not like you can compile WRF and run it with 10 million MPI ranks and call it a day. Ensembles? Well that's another story.
Exascale machines are going to have to be a lot different than petascale machines (which aren't all that different topologically than terascale machines) in order to be useful to scientists and in order to no require their own nuclear power plant to run. And I don't think we know what that topology will look like yet. A thousand cores per node? That should be fun; sounds like a GPU. Regardless, legacy weather code will need to be rewritten or more likely new models will need to be written from scratch in order to do more intelligent multithreading as opposed to mostly-MPI which is what we have today.
When asked at the Blue Waters Symposium this May to prognosticate on the future coding paradigm for exascale machines, Steven Scott (Senior VP and CTO of Cray) said we'll probably still be using MPI + OpenMP. If that's the case we're gonna have to be a hell of a lot more creative with OpenMP.
chinook:% echo $0
bash
Now I'm totally confused. Zero dollars equals bash. Bash what? Bash head into keyboard? ORP BASH!
It is impossible to to be objective about a subjective experience.
No, it is not.
My only point is really undebatable. If, for instance, I want to listen to Wilderness Road's eponymous album, I have to listen to the vinyl, or somebody's copy of it. And it's a fucking great album and I don't need someone telling me that I'm an idiot for wanting to be able to listen to it.
Some real nutty music fans still spin 78s, can you believe it? Lot of music on those shellac platters never made it to any other format.
You cut yourself out of a surprisingly large amount of music (I'm talking about old stuff mostly) if you don't have a turntable.
I'm a self-proclaimed "audiophile" but not in the annoying, trust-my-ears-only way that plagues the hobby (I'm a scientist, dammit). I have a nice tube amp, great speakers, subwoofer, etc.... and I have a turntable as well (and a network enabled player + nice DAC). Anyhoo.... I can speak to the non-hipster side of things. Yes, some of the growth of vinyl has a faddish aspect to it. But, keep in mind, many musicphiles and audiophiles never stopped collecting and buying vinyl even through the meteoric rise of CD.
If you are a major music fan (and do not have an unlimited supply of pirated needledrops on the internet), a turntable is essential. A lot of obscure stuff was never released on CD. A lot of stuff that was released from the past on CD sounded (and continues to sound) dreadful due to the mad scramble to ride the CD wave; nth generation tapes, some equalized for vinyl, were used as the source material. Thankfully a lot of stuff these days that is selling is remastered versions of old stuff from original master tapes (not copies). You can be cyincal about this (say the major labels are just milking old warhorses) and you can also acknowledge that the digital audio technology has increased astoundingly since the late 80s and 90s. What does this have to do with vinyl? Well, vinyl can sound really good if done well. I won't argue that it is a better medium than digital; it simply isn't. But it has its own charms.
I have bought vinyl reissues that were mastered very well, and the vinyl was quiet, lacking surface noise - but about a third of the time I get burned with either lousy mastering (sibilance and related issues - and I have a very good microline cart) or more commonly, ticks and pops in shrinkwrapped new vinyl (and run through a we clean). This is the way it has always been and will always be with vinyl.
A primary motivation I have for buying new vinyl releases of new music is to acquire recordings that haven't been as dynamically squashed in the digital mastering process. While vinyl releases can be very dynamically compressed as well, as a rule, vinyl releases tend to be mastered with more dynamic range than the digital version (you could argue that this is partly, or mostly due, to physical limitations of the vinyl medium). And yes, I acknowledge that most vinyl is either digitally sourced or goes through an ADA conversion.
But mostly I continue to buy vinyl because it's fun - it's part of a hobby I enjoy very much. Spending hours just sitting "in the sweet spot" and listening to music (from any source - digital, tape, vinyl or whatever) is something I enjoy. So while people scoff at the vinyl "revival" I'm just glad to see there are more choices our there for getting good sounding music.
Take a look, there is some neat stuff going on with Blue Waters: https://bluewaters.ncsa.illino...
Most science is not breakthroughs; it's usually slow progress, with many failed experiments.
These computers are facilitating a lot of good science and increases like this in our computational infrastructure for research is great news. I do wonder how they are going to power this beast and what kind of hardware it will be made of. 300 PFlop is pretty unreal with today's technology.
As a visualization guy, it always makes me happy to see such a good use of visualization. Thanks for providing some extra technical details here! A couple of questions, though:
1) What grid type does your simulation code use? If it's regular grid, have you considered switching to something more adaptive like AMR or unstructured grids?
2) Since I/O is your main bottleneck, have you considered further decimating your output and visualizing in situ to fill in the gap? I suspect your visual analysis is too complicated for current in situ techniques to cover everything you want to do, but I'd like to hear your thoughts on it.
It's isotropic (delta x = delta y = delta z) for most of the storm, and then uses an analytical stretch function to determine the mesh outside of that region. I used the stretch technique of Wilhelmson and Chen (1982; Journal of the Atmospheric Sciences).
AMR has its benefits but adds a lot of complexity, and I tend to wish to go towards less complex, not more. I am more interested in these new heaxongal grids (see: NCAR's MPAS) which have very nice properties. I predict MPAS will be the Next Big Thing and will apply to both large-scale (say, climate) and mesoscale (like CM1) models eventually.
In-situ visualization was something we have played with. But I don't see a huge benefit to me, other than showing that it can be done. The simulation I reported on was not the first try and visualizing it with volume renderer during the simulation would have been cool but consider the fact that I've had the data on disk for over six months and have barely begun to mine it. So really, it's what you do with the data once it's on disk that matters... plus you can visualize it in all sorts of different ways.
I have other ways to see what's going on during the simulation that are near-real time. All I need to know during the simulation is that the storm is not drifting out of the box, and that it's doing something that looks reasonable. I do this with parsing the text output of the model (that spits out global statistics periodically) and I have a way to view slices of the data that was just written that works just fine.
I got about halfway through the video before the kids interrupted me (and it). So let me just ask:
Did your model take into account the energy gathering and discharge that would show a multi-amp, million-volt DC discharge? Because the energy implications of that are going to be enormous to the model.
Did it also have a mechanism that generated the lightning discharges of the storm? Because again, the lightning discharges are going to affect the electrical energy available to help / hinder the tornado.
No lightning in the model. I know of some research on modeling lightning in supercells, but I'm pretty sure they are one-way models; the lightning occurs based upon what we know about inductive and non inductive charge mechanisms, and flashes can happen - but they do not feed back into the storm. I don't think they probably feed back appreciably into real storms. Even though a lot of energy is released with lightning, so much more is released due to latent heating (phase changes between solid/liquid/gas) that really drive thunderstorms.
Trying to make a model that only focuses on the critical parts of a storm would be like trying to make a model of a car that only focused on one piston.
Good insight in all your comments. Actually, tornado modeling has a long history. Think of those vortex chambers you see at some museums etc... pioneers like Neil Ward studied tornadoes in this manner. Now, think of a numerical model of one of those chambers. This type of work has been done by folks like David Lewellen at West Virginia University.
What makes this new simulation special is that we've simulated the entire storm and allowed "things to evolve naturally" (for a very judicious interpretation of what "natural" is). One could argue that the tornado is actually not the most interesting part of the simulation, but that the processes leading to its formation, and what is going on in the supercell thunderstorm to support the tornado's long life, are the most interesting parts.
At the resolution I'm running at: decades, and this would require improvements in algorithms / hardware utilization, as well as finding a way to build and utilize machines on the exascale. Going from petascale to exascale is going to require methods/topologies that don't exist yet, I think (not just GPUs for example). And exascale power requirements are a real problem with existing hardware.
At "convection resolving" simulations (on the order of 1 km resolution - not 30 meters!) where a tornado-like vortex might form and serve as a proxy for a real tornado: That's much closer, and there are different efforts going on right now aiming towards that goal. The biggest hurdle I see is getting enough observational data to feed to the models; otherwise you end up with garbage in / garbage out. Remote sensing is the way to go with some quantites but with others there is really no substitue for in-situ measurements, and instrumentation at the scale we need is expensive.