Is Your Internet Connection Free From Bufferbloat? (blogspot.com)

How can I tell? by Snotnose · 2016-12-25 16:03 · Score: 1

Not like I'm writing router or server code, I'm just a clueless dude surfing the web. Bad stuff happens, "the network" is hosed.

Re: How can I tell? by mspohr · 2016-12-25 16:14 · Score: 1

Why is bufferbloat bad?

--
I don't read your sig. Why are you reading mine?
Re: How can I tell? by pem · 2016-12-25 16:40 · Score: 1

Because it was there.
Well, your sig was, anyway.
Re:How can I tell? by PopeRatzo · 2016-12-25 16:44 · Score: 1

I took the test and got an A in buffer bloat. Is that good or bad?

--
You are welcome on my lawn.
Re:How can I tell? by crashumbc · 2016-12-25 16:48 · Score: 1

bad you should immediately turn off you internet fo for 30 days to "reset" it....
Re:How can I tell? by PopeRatzo · 2016-12-25 16:55 · Score: 4, Funny

bad you should immediately turn off you internet fo for 30 days to "reset" it....
Say, I wasn't born yesterday. I know very well that if I just disconnect the cables and put the router in the microwave for 45 seconds at 50% power it'll do the same thing.

--
You are welcome on my lawn.
Re:How can I tell? by crashumbc · 2016-12-25 17:10 · Score: 1

OMG man, disconnect the cables!!!! NEVER, Skynet will detect that and launch! Joshua already had 8 of the launch codes! you need to make him play tic tac toe, enter zero players!
Re: How can I tell? by mtaht · 2016-12-25 17:42 · Score: 1

do you ever use skype, play games, surf the web, while someone or something else is more heavily using you connection?
Re:How can I tell? by mtaht · 2016-12-25 18:13 · Score: 1

It has nothing to do with writing code, but normal uses actually using the internet when contending for bandwidth.
Re: How can I tell? by fbobraga · 2016-12-26 07:17 · Score: 1

Why is bufferbloat bad?
causes high latencies (very bad for gaming)

Re:Does the Tin Man have a sheet metal cock? by Anonymous Coward · 2016-12-25 16:33 · Score: 1

Why the fuck would Tin Man have a woody?

Bufferbloat? Yes. But... by Anonymous Coward · 2016-12-25 17:13 · Score: 1

While bufferbloat was patched out, my router is still under the control of a cargo ship, which claims to actually be an aircraft carrier. What is to be done about blufferboat?

Go measure by mtaht · 2016-12-25 17:17 · Score: 1

Judging from the first 25 replies, the slashdot readership is suffering from an overdose of eggnog. Here's a link (which has links to results from every ISP), which shows latency under load often measured in seconds. http://www.dslreports.com/spee... The problem with this survey is that there are now plenty of folk that get sub-30ms latencies on their internet - which is what those using bufferbloat fixes get, and the question was if you or your isp was driving improved hardware to get those results. Problem seems to be 99% of the results are worse than that, still, 4+ years after the code to fix first arrived in Linux.

Re:Go measure by waveclaw · 2016-12-25 18:41 · Score: 5, Interesting

With dislreports and other aggregation tests, the bloat for download and upload may not be symmetric. So the resulting score might not be as good as it looks.
Paying for a commercial connection? Test for this kind of performance daily and scream as soon as it drops. Otherwise why bother to pay so much?
In the United States and other jurisdictions a home 'customer' user is not expected to run a "server" on their paid for Internet connection. Downloads may be finely tuned to low bloat. But upload may have significant bufferbloat, caps and gradual dropout. For financial reasons, of course.
This upload problem may get to be much worse in the future. More and more services push data from "client" devices in the home or office. Camera phone videos, twitch streams, shared google docs and your home automation spyware upend the upload/download assumptions of last-hop telcos. P2P is impacted now. The highly asymmetric buffering of uploads is detectable using protocols like bittorrent that don't have client-server separation.

--

"You cannot have a General Will unless you have shared experiences. You cannot be fair to people you don't know."
Re:Go measure by mtaht · 2016-12-25 18:50 · Score: 2

I am not huge on basic web tests, preferring the finer grained results we get from flent. (https://flent.org).
And I totally agree that the trendline is to ever more devices doing ever more stuff randomly when you least desire it. We need to have edge routers AND ISPs ready for this change in traffic patterns.
The article you cited was quite good, although it missed completely the outputs of the ietf aqm working group, of which both I and fred baker are members.
https://tools.ietf.org/wg/aqm/
Re:Go measure by wbr1 · 2016-12-26 01:44 · Score: 1

Dave, I think most of the /. readership is on the eggnog all the time. However, this is the type of thing a few of us still come to this site for. Thanks for your work on this, one can only hope more ISPs and equipment manufacturers implement it.
I see the effects of bufferbloat everyday. As a manager at a small MSP, we have many clients who have large scheduled 'cloud' backups that can saturate the upstream connection. Especially on DSL. Significant reduction of bufferbloat would mean that we could use more upstream bandwitch, even during peak hours with minimal detriment to the customer.
Keep up the work!

--
Silence is a state of mime.
Re:Go measure by wbr1 · 2016-12-26 01:48 · Score: 1

Cloud PC backups affect this as well. We offer this to business and home user clients, and those that are on smaller, non business, connections suffer from upstream bloat far more. Of course our primary DSL provider is Centurylink who is pretty terrible. Comcast seems to have been doing a bit better job, but not by much, and that may just be an artifact of a larger pipe.

--
Silence is a state of mime.
Re:Go measure by unixisc · 2016-12-26 03:19 · Score: 1

Why is bufferbloat something that's done at the routers, rather than in our browsers, w/ a variable buffer that we/the browser itself have an option of deleting? Why is it the job of a router to store all that garbage, rather than get that from the browsers themselves and do it?
Re:Go measure by ipb · 2016-12-26 05:05 · Score: 1

"Dave, I think most of the /. readership is on the eggnog all the time. However, this is the type of thing a few of us still come to this site for. Thanks for your work on this"
I second this.
After the first couple of pages I almost gave up reading because of the eggnog comments.
For the record, queue management in OpenWrt has done a lot to lower bufferbloat on the systems I use.
Re:Go measure by richb-hanover · 2016-12-26 06:13 · Score: 2

The router needs to manage the bottleneck link, since that's the place all the data gets queued up (in those buffers that are the topic of interest). The router is the only device that has visibility into the amount of data that's in transit "to the internet". Your browser doesn't know that your spouse's/kid's iPhone just decided to upload all new images to the cloud. Nor can your gaming system. Browsers are designed to send the data as fast as possible. Gaming systems are designed to send immediately after you click. It's the responsibility of good networking equipment to regulate all the flows of data so that *everyone* gets good performance. (And while you *could* spend your life optimizing qos rules, the beauty of fq_codel/cake is that they take one parameter (link speed) and they automate all the rest.)
Re:Go measure by Bengie · 2016-12-26 08:35 · Score: 1

+1 this up. Nailed it, and not the meme type. Most people, including network admins with years of working with QoS, are incapable of setting up QoS correctly, and only think they've set it up correctly not because of theoretical correctness but because they cannot even think of the edge cases to get empirical tests..

Re:Cute name, no tangible problem by Fly+Swatter · 2016-12-25 17:19 · Score: 1

The tangible problem is if you need low latency, or want to maintain the latency you have, when your upstream connection is saturated. At least I think that is what it means.

Forget BB, the plethora of ad-serving sites... by GerryGilmore · 2016-12-25 17:30 · Score: 4, Insightful

...is what slows my connection speed down. Fuck, I could have a gigabit connection and would spend 80% of my time waiting for the next version of ad.doubleclick.net, etc. Really? Bufferbloat? I wish!

Re:Forget BB, the plethora of ad-serving sites... by mtaht · 2016-12-25 17:36 · Score: 1

Oh, I strongly recommend ublock, too! I go around installing that on friends and family's computers every christmas. :) But this christmas, I reflashed a ton of routers, too.
Re:Forget BB, the plethora of ad-serving sites... by tlhIngan · 2016-12-25 19:48 · Score: 1

...is what slows my connection speed down. Fuck, I could have a gigabit connection and would spend 80% of my time waiting for the next version of ad.doubleclick.net, etc. Really? Bufferbloat? I wish!
Yeah, you'd think the folks at Alphabet (DoubleClick's parent company) would know a thing or two about how to optimize for the Internet.
On the other hand, now DoubleClick knows everything you did on other Alphabet sites, like Google, YouTube, etc.
Re:Forget BB, the plethora of ad-serving sites... by dinfinity · 2016-12-26 00:16 · Score: 1

I highly recommend DNS based blocking in your router. All smartphones and tablets using your network will also be rid of 99% of all that crap.
There's a package in OpenWRT (not in the main repository, though) that updates blocklists on a schedule (the scripts are very straightforward and DIYable, but it's nice to have a click and go solution):
https://github.com/openwrt/pac...
The only downside is that making (temporary) exceptions is not really an option.
Re:Forget BB, the plethora of ad-serving sites... by thegarbz · 2016-12-26 08:28 · Score: 1

...is what slows my connection speed down. Fuck, I could have a gigabit connection and would spend 80% of my time waiting for the next version of ad.doubleclick.net, etc. Really? Bufferbloat? I wish!
Two different problems. I don't randomly get ads in the middle of time sensitive UDP packets while video chatting or playing games.
There are two problems we can solve here.
Re:Forget BB, the plethora of ad-serving sites... by Lennie · 2016-12-27 01:04 · Score: 1

You really think it's just ads ?
Here is the Bufferbloat demo from 2013:
https://www.youtube.com/watch?...

--
New things are always on the horizon

Re:Nagle algorithm? by ShanghaiBill · 2016-12-25 17:37 · Score: 1

And how would that improve things?

It wouldn't. Nagle's algorithm doesn't cause congestion, it reduces it.

"Solving" a problem by going back to a probably worse one isn't really "solving it"

The first step in "solving" a problem is verifying that it is actually a problem. I am not convinced that "bufferbloat" (whatever that means) is a problem. Buffering can reduce latency, especially under heavy load, by better bandwidth utilization, and allowing faster retransmission of dropped packets. If it is slowing things down, then you should fix the buffering rather than eliminating it.

... and yes, I read TFA. It is a bunch of poorly labeled graphics that didn't make any sense to me, and seem to be designed to obfuscate rather than enlighten, although that may just be a result of Hanlon's Razor.

Re:Cute name, no tangible problem by NilesDonegan · 2016-12-25 17:38 · Score: 3, Interesting

DSL is unfortuantely the best internet connection in the small town I live in. The upload rate of these connections is really slow, and for large uploads, can saturate the connection. What this translates to in the real world is constant complaints from people about how their internet connection has just died for no good reason. What's happening in 99% of these cases is that some iPad in their house is backing up to iCloud, and bufferbloat from this upload is temporarily wiping out download speeds.

What I did was install the OpenWRT firmware on my TP-Link router, and install the SQM (Smart Queue Management) QoS application on it. This shapes uploads so that bufferbloat is greatly reduced. I tested all of this on DSLReport's Bufferbloat page, and it works great.

Re:Cute name, no tangible problem by mtaht · 2016-12-25 17:40 · Score: 1

which sqm mode are you using?

More data? by wizden · 2016-12-25 17:53 · Score: 1

The latency measurements in the article are meaningless. Reducing seconds of latency to milliseconds! Where is point a and b? The driver layer adds ten seconds of latency? None of this makes sense.

Re:More data? by mtaht · 2016-12-25 17:59 · Score: 4, Informative

If you are referring to the cake article, the baseline latency of the path is ~11ms. It grows to about 250ms under pressure from a tcp transfer with a "normal" cable modem, and to only 16ms or so with cake. See the bar graph... wifi could get much much worse. but we fixed it in the upcoming linux 3.10 release. Not that anybody seems to understand....
Re:More data? by mtaht · 2016-12-25 18:02 · Score: 2

As for the wifi article, yes, we have seen 10+ seconds of excess latency in the wifi stack. 1-2 seconds is typical with normal traffic at lower rates, as most protocols time out in that range.
Re:More data? by amacide · 2016-12-25 23:57 · Score: 2

Thank you for such excellent contributions to Linux kernel :-)
Re:More data? by wbr1 · 2016-12-26 01:50 · Score: 2

Few here can understand anymore. Admittedly it is at the edge of my skill and knowledge level, but I understand enough to respect it. I think most of the real engineers have gone from here.....

--
Silence is a state of mime.
Re: More data? by wizden · 2016-12-26 03:02 · Score: 1

Thanks for sharing the baseline latency. I work for a rather prominent wireless manufacturer and I just don't see the latency you're talking about. Voice over wifi would be impossible with that level of latency and we see customers deploy that everyday. Is this limited to Linux?
Re: More data? by mtaht · 2016-12-26 05:09 · Score: 1

These are "latency under load" measurements (using the dslreports and flent tools to stress out your link). If your network is otherwise idle voip is fine, but with people adding ever more devices to their network doing random things at random times, the bloat problem raises its ugly head.
(and yes, voip is frequently unusable when your ISP link is under stress from something else without out these queue management techniques in place there)
I tried to stress in the lwn article that first eliminating bloat from the ISP link will make your wifi a lot better, because wifi is usually not the bottleneck in many scenarios. But: the wifi work we just pushed upstream makes voip far more possible when wifi is contended.
Is it limited to linux? No - it seems to be deeply affecting the current crop of gateways supplied by ISPs, as well as nearly the entire 3rd party router market, except those who are deploying qos sanely (which is nearly everybody these days in third party firmware - "fq_codel" lies underneath many a rebranded qos system nowadays. "cake" is a possible successor.
The frustrating part is that wifi folk are often saying their stuff is fine, when it can be so deeply affected by the next hop up, and also tends to become poor anytime a second or third device is stressing the link (transferring files to a NAS, for example, screensharing for another). 802.11ac devices tend to have more latency than 802.11n, also, because they tend to use a fixed buffer size suitable for their highest rates, and not something that adjusts to the actual rate.
If you are interested in poking into these issues further, on your equipment, take a look at flent, and/or come on over for the discussions on the make-wifi-fast mailing list.
Re:More data? by mtaht · 2016-12-26 05:13 · Score: 1

I was a bit put off by the first 25 posts being basically trollish. I have tried to be helpful, merely, since.
Re: More data? by mtaht · 2016-12-26 05:16 · Score: 2

Many enterprise APs are pretty good, btw - and while I have not tested the current crop of stuff from eero, and google and so on, I'm pretty sure they've been paying attention to the work. (portions of the make-wifi-fast project were funded both by google and comcast research) So I hope you've been making your stuff great in the first place, and not having to deal with paying off all the technical debt we've been paying off here: https://docs.google.com/docume... But please go test for the things we are testing for and fixing!
Re:More data? by DamonHD · 2016-12-26 06:20 · Score: 1

That's gracious of you.
I've gone and read a bunch of your work, including blogs, and it is very interesting and definitely a public good if you pull it off, thank you.
I like smart distributed algorithms.
I am still baffled from an afternoon's reading round the subject if to be effective your anti-BB magic has to happen at (nearly) every edge device, or (nearly) every lossy (or speed-mismatched) network gap, or if BB can be fixed by judicious ISP infrastructure deployment, or would cumulatively benefit if multiple of those happened.
Rgds
Damon

--
http://m.earth.org.uk/
Re: More data? by wizden · 2016-12-26 07:51 · Score: 1

I guess my assumptions are tainted from running enterprise APs in my house and at my customers. Of course our hardware is awesome. :)
Re:More data? by mtaht · 2017-01-07 09:28 · Score: 1

Dear Damon:
I'm sorry, I tuned out of slashdot after a day.
"I am still baffled from an afternoon's reading round the subject if to be effecitive your anti-BB magic has to happen at (nearly) every edge device, or (nearly) every lossy (or speed-mismatched) network gap, or if BB can be fixed by judicious ISP infrastructure deployment, or would cumulatively benefit if multiple of those happened."
Better queue management everywhere would be good. Your second thought is closest to correct:
"(nearly) every lossy (or speed-mismatched) network gap" needs better queue management. That's a LOT (billions) of devices. The thing is, the queue management problem was known well before 1992, it's just that RED did not deploy very well, and FQ techniques were often kept as secret sauce. Things got out of hand as speeds went up and the potential speed mismatch variance between links went to 6 orders of magnitude, since 1992.
I (we) fully realize that the scope (billions of machines/year) of sticking solutions everywhere is hard, but it is never too late to start, (b and replacing dumb overbuffered fifos everywere with a couple hundred lines of code - considering the millions elsewhere seems simple!) and we've pursued developing an easily deployable solution (fq_codel primarily), as well as standardization efforts (ietf aqm working group). Things like systemd default to fq_codel, so do most third party linux router firmwares.
About the only major thing left (since fixing wifi) is actually getting this stuff into hardware and "big iron" like cmtss and BRASes.
... On devices themselves, we've worked on ripping out excessive buffering throughout the stack (BQL, things like TCP_NOTSENT_LOWAT (now the default in OSX), most recently pacing via the sch_fq qdisc and "TCP BBR") so that the tcp's and applications (mostly linux, but increasingly BSD), are not storing crazy amounts of data internally. There's been a lot of other changes, all my talks include a slide on the higher levels of stack and application issues.
... IF you have enough capacity, you don't see BB (except for microbursts, which couldbe quite bad before we started moderating TSO/GSO/GRO bursts). Certainly the core and a well designed infrastructure that never saturates removes the issue (except when it does happen!)
... At some point I need to sit down and write something definitive, instead of this vapor trail of 6 years worth of work all over all the gear and all of the stack(s).
-- Dave TÃht Let's go make home routers and wifi faster! With better software! http://blog.cerowrt.org/

Re:Nagle algorithm? by mtaht · 2016-12-25 18:11 · Score: 5, Informative

It is entirely probable we've been inside our own filter bubble so long (6 years) we cannot properly communicate with first time readers! some folk explaining the problem... the ietf video shows the benefit from fixing it. https://www.bufferbloat.net/pr... showing the extent: http://www.dslreports.com/spee... you have this entirely backwards: "Buffering can reduce latency, especially under heavy load, by better bandwidth utilization, and allowing faster retransmission of dropped packets. If it is slowing things down, then you should fix the buffering rather than eliminating it." You want enough buffering to absorb bursts, but any more just adds latency. Van Jacobson and kathie nichols calls this distinction good queue and bad queue: https://tools.ietf.org/html/dr... Less buffering (and fair queuing) allows for faster retransmission in particular.

Comment removed by account_deleted · 2016-12-25 21:04 · Score: 1

Comment removed based on user account deletion

Re: Cute name, no tangible problem by Anonymous Coward · 2016-12-25 21:14 · Score: 2, Interesting

Buffers are not a problem for latency, the growing internet is. Back in the early 2000s from a particular place in Europe to the west coast in US we averaged over 220ms RTT because it was going up to a satellite, landing in Newark and then traveling over 4 hops to the west coast. Around 2003 when we switched to fiber we got down to about 110ms, with the fiber going via two landing stations on the north shore of Africa, then via France via the Atlantic Ocean to Maine (or dalaware) then over land to the west coast. That was something like 12 to 14 hops.
As the years passed newer and faster fibers were put in place, but also more routers were added to branch the backbone more. Now the same geographic location in Europe to the west coast of US is again at 220ms RTT, because the hop count is around 36. Almost 3 times more routers today than 14 years ago. This is where the latency problem comes from - packet switching in the multitude of hops and MPLS tunnels that you don't even see, not from some imaginative buffers.

Yes by Bender+Unit+22 · 2016-12-25 23:07 · Score: 2

Now it is after I got my fiber connection it is all gone. My old *DSL connected at 50/10 mbit(errorfree) but I couldn't get anywhere near that(30mbit at most) and latency were way too high. Only place it caused me some problems was when I worked from home and the Citrix connection as I don't play online games.

Re:Yes by ciro2016 · 2016-12-26 08:57 · Score: 1

masa thatk you for this nice post

Re: Cute name, no tangible problem by Dagger2 · 2016-12-25 23:15 · Score: 4, Insightful

Badly managed buffers are a massive problem for latency. Just look at this graph from the article. You see the four ping time measurements on the right? You see how one of them is 100-250ms and the rest are more like 20ms? That's exactly the same link in all cases, but the first measurement has a giant pile of latency introduced purely by poor buffer management.

I'm not going to dismiss the problem you described, because I agree it's a problem. But it makes no sense to worry about 100ms on cross-Atlantic links and yet completely dismiss 200ms right on the first hop.

Re:This is completely a non-issue with the ISPs by Dagger2 · 2016-12-25 23:52 · Score: 1

Maybe in their core, but what happens when they try to fit that traffic down your 10 Mbit/s DSL link? There is going to be a buffer there.

Re:This is completely a non-issue with the ISPs by mtaht · 2016-12-26 00:11 · Score: 1

cut through routing works when there is no congestion. http://www.dslreports.com/spee...

Re:Not magic by adolf · 2016-12-26 00:24 · Score: 2

You're right, of course. The trouble is, the latency increases aren't reasonable for common consumer networks under load.

Two speedtests I just did on my lightly-loaded hardwired home network (30Mbsp cable from Time Warner):

With QoS

Without QoS

Throughput is less (rather surprisingly less -- I may want to check some things) with my QoS rules that group connections into individually-throttled categories, but bufferbloat is sane-ish (a brief peak at 250ms was observed, but otherwise under 100ms).

Without QoS, bufferbloat starts at around 1000ms (x10 increase!) and goes up from there.

I'm currently using Shibby's version of Tomato-USB on an overkill dual-core Asus router to accomplish this, though I have used other consumer-ish hardware with reasonable success (including the venerable WRT54G/L/GS) using similar software.

The trick, as I see it, is primarily to ensure that the cable modem (and whatever is directly upstream of it at the head-end) never see enough throughput for their buffers to begin filling by keeping all nearby bottlenecks under my own control.

The other benefit of QoS is that on heavily bandwidth-constrained networks, some tasks can be given higher priorities than other tasks, which is easy when we control the neck of the bottle.

I dated a girl for a bit who had the cheapest Internet she could get: 2Mbps down. Her kids hated it, and web browsing with tablets and phones and laptops was terrible for all of them if anyone was streaming a video (badly) or downloading (slowly). Loud banter over who was "hogging the Internet" and ruining gaming was common, and not unreasonable. It got worse when people would visit. It was really bad.

Best case: They were taking turns using the Internet. In 2014.

After observing this and suggesting she get faster Internet ("no, it's not important to me," she said) I gave her a router with Tomato, did some obvious QoS priorities that were tweaked for that particular situation, and voila: The games worked fine. Web browsing was always quite responsive. Youtube worked (worked meh, but worked), and downloads and BT didn't trash any of the above. Anyone could do whatever they wanted, and the inevitable slowdowns were graceful while responsiveness remained good. The gamer of the house didn't get upset anymore seemingly-randomly.

But that's just one success story. I've been doing tricks like this for over a decade on a myriad of non-enterprise networks, using cheap hardware and thoughtful software.

(Now it's time for someone to pop up and tell me that I've done it all wrong, and that my results are impossible. This always happens on /. when I write about using Tomato and QoS to solve real, practical problems. I'm ready.)

--
Kid-proof tablet..

Re:Not magic by wbr1 · 2016-12-26 01:53 · Score: 2

You've done it all wrong! Those results are impossible!

--
Silence is a state of mime.

Re:UBlock = inferior + inefficient vs. hosts by IWantMoreSpamPlease · 2016-12-26 02:10 · Score: 1

I have 32GB of ram in my system, 64mb barely registers on the radar. Nice try, thank you for playing.

--
So rise up, all ye lost ones, as one, we'll claw the clouds.

Rogers isn't by davecb · 2016-12-26 02:59 · Score: 1

At least in Tranna

--
davecb@spamcop.net

Re:Rogers isn't by davecb · 2016-12-26 03:04 · Score: 1

The sped test says and shows as a png

--
davecb@spamcop.net

Re:Not magic by adolf · 2016-12-26 03:04 · Score: 1

I'll need to see your CCNA before I can accept your retort.

--
Kid-proof tablet..

Re:Does the Tin Man have a sheet metal cock? by rotorbudd · 2016-12-26 03:55 · Score: 1

Steely Dan?

--
A bullet may have your name on it, but artillery is addressed to " Whom It May concern"

No problem here! by DaMattster · 2016-12-26 04:23 · Score: 2

I built my own router because I don't want any of these mass-produced, consumer piece of shit routers with more holes in them than swiss cheese.

Re:No problem here! by Lennie · 2016-12-27 01:06 · Score: 1

So what OS did you use ? Did you enable fq_codel or similar ?

--
New things are always on the horizon

Re:Not magic by mtaht · 2016-12-26 04:29 · Score: 1

Nice success story and the exact circumstances we were trying to make easier to solve with cake. (and the dream is more ISPs would just be doing it for you on their default supplied boxes)
I would like to benchmark more stuff like tomato's qos against cake, the equivalent (single!) command line for outbound would be:
tc qdisc add dev your_device root cake bandwidth 2mbit nat
which automatically applies per host fairness, qos, and queue length management.
inbound requires a slightly more complex setup but not much.

Re: Nagle algorithm? by Anonymous Coward · 2016-12-26 05:24 · Score: 2, Insightful

It doesn't matter how big or small the buffer is, what matters is why it's filling up to start with.
If you're buffering because of a transient traffic spike or network load, then the buffer helps. If it's constantly filling up and evicting then there's a deeper problem that won't be solved either by using, eliminating, or changing the buffering strategy.

Re: Not magic by Anonymous Coward · 2016-12-26 06:11 · Score: 1

Buffering is more useful for UDP, and is primarily intended to smooth out transient congestion on a network link or interface.
On a Carrier network, it's used to help deal with bursty traffic... TCP is a rather "slow" response mechanism which works fine in general, but can't handle congestion which comes and goes on millisecond or sub millisecond time scales.

For example, if an ISP has ten 100gig links running in a bundle, and a 1ms duration traffic burst fills one link up, the buffer will soak it up and allow for the load balancing to recalculate traffic flows across the other links. Without the buffer, those packets would drop, and all the TCP clients would react and throttle back, even though the overall bundle is still under 50% capacity, and even though the congestion has long since cleared before the packet times out.

The other place buffers are useful is when enforcing QoS on a network. They allow the router to evict higher priority traffic ahead of other traffic, so that when congestion hits you can still guarantee some traffic types.

Re:Not magic by richb-hanover · 2016-12-26 06:38 · Score: 1

Buffer bloat can't happen without congestion. Congestion is the real problem and talk of buffer bloat is a bit off-point. Sure, if you combat congestion with very large buffers (and hence significant queuing), you get increased latency due to the queuing. Reasonable increase in latency (say 20%) is not a huge hit on performance. Remember that you're trading that extra latency for lower probability of dropped packets.

You're correct that bufferbloat "only happens" when there's traffic. But I don't think you appreciate the current nature of internet traffic.

With web pages averaging 2 megabytes these days, you're "doing large file transfers" all the time. And if your iPhone kicks off an upload of its pictures, or your child starts watching videos, or your spouse starts their own web browsing/mail session, you're at the mercy of your router's queue management algorithm.

I don't think a "20% increase" in latency is reasonable, given that the Smart Queue Management (fq_codel, and soon cake) that's available in the Linux kernel (not to mention LEDE/OpenWrt/DD-WRT for your home routers) provide a "no-settings" way to limit lag/latency to an increase of only a few msec (or a couple dozen msec on a crummy DSL link).

Re:This is completely a non-issue with the ISPs by richb-hanover · 2016-12-26 06:49 · Score: 2

Well... I disagree that the "modern internet does not suffer from this problem." I have seen it at my house, and at measurements at many other places. (If you're only considering FTTH as "modern", I have still seen bufferbloat there...)

The ISP does have an opportunity to control buffering in two places: at both ends of the bottleneck (which is likely to be your cable/DSL/FTTH/etc.) link between your house and their facility.

a) Their "head end" gear might control queues for traffic going *to* you
b) Their Customer Premise Equipment (CPE) also would have the ability to control outgoing queues

If the ISP did both, then no one would have need to coin the term "bufferbloat". But the fact of the matter is that the vast majority of ISPs do *neither*.

Consequently, in late 2016, I believe it's prudent to provide my own solution and use one of the Smart Queue Management solutions (fq_codel, cake) that's available in LEDE/OpenWrt/DD-WRT so that I can get on with useful work.

Re:Nagle algorithm? by Bengie · 2016-12-26 07:14 · Score: 2

Buffering can reduce latency, especially under heavy load, by better bandwidth utilization

You have no idea what you're talking about. Buffering is one of the main causes of latency. Ever see a 1,000ms ping? That's not because the speed of light is too slow, that's because there is a backlog of packets in the buffer. With the speed of light through fiber, no one should ever see a ping above 300ms to anywhere in the world. The highest ping I see from Midwest USA to Australia, India, or China is about 220ms.

Buffers are not inherently bad, but "bufferbloat" is because buffers are too large. Too large of buffers actually reduce throughput because TCP takes longer to respond to changes in congestion. Even worse is when bufferbloat starts to get up into the 3second range, yes seconds not milliseconds, TCP treats it as a lost packet and resends the data. I regularly see bloated Linux ISO seeders with 2k-4kms pings resending nearly 50% of their packets, most of which were not actually lost but only highly delayed.

Good anti-bufferbloat AQMs like fq_Codel and Cake increase effective bandwidth, while isolating light traffic from heavy traffic and keeping latency almost idle-link low. Want 10ms pings while paying games and downloading/uploading torrents, I have that already.

Re:Nagle algorithm? by Bengie · 2016-12-26 07:18 · Score: 1

My response to this, it's not even wrong.

Re:Nagle algorithm? by Bengie · 2016-12-26 07:23 · Score: 2

The article talks about shaping download, which isn't possible at the endpoint. The traffic is already there and you have to deal with it. Dropping it will create retransmits for TCP and make the problem worse.

Wrong. Dropped packets signal congestion. If you don't signal congestion, the congestion will only get worse. You eventually have to drop a packet. The sooner you drop the packet after congestion has started, the less the congestion will be. The flip side is if you signal too early, you lose effective bandwidth. I shape my download and it has caused my average to go up because it stabilizes the flows.

With normal fifo buffers, once the buffer is full, you get a burst of lost packets. This is much worst than dropping a single packet earlier.

Re:UBlock = inferior + inefficient vs. hosts by fbobraga · 2016-12-26 07:29 · Score: 1

maintain a hosts file is a PITA

Re:Cute name, no tangible problem by Bengie · 2016-12-26 07:30 · Score: 1

The only real way to solve this is to timestamp packets as they enter a buffer and drop the ones that are too old.

You don't have to timestamp them to get the same effect. Codel and RED both effectively use time without timestamping. But yes, the "tracking time" is pretty much the only way.

GEO is 0.24 s round trip by tepples · 2016-12-26 07:40 · Score: 1

With the speed of light through fiber, no one should ever see a ping above 300ms to anywhere in the world.

Even to places where there's no fiber connection? In a lot of places, the only route to the Internet with a throughput greater than the 0.15 Mbps of IDSL is through a satellite in geostationary Earth orbit, 36,000 km up. An ICMP ECHO request from a subscriber to a satellite ISP, such as Exede, needs to go up to the satellite and down to the destination network, and its response needs to come out of the network and then go up to the satellite and back down to the subscriber. That's 0.12 light seconds for each of four legs, already nearly half a second, plus whatever latency is in the destination network.

Re:GEO is 0.24 s round trip by Bengie · 2016-12-26 08:20 · Score: 1

In a lot of places
At some point in your life, you should realize that "no one" mean very very rarely. Absolutes are never absolute. I question if my last statement makes any sense.

A testimonial by Anonymous Coward · 2016-12-26 07:53 · Score: 1

I've been using CeroWrt (https://www.bufferbloat.net/projects/cerowrt/wiki/ - the initial testbed for all of the bufferbloat work) for at least four years. For the majority of that time I had 1.5Mbps DSL service, but now I'm connected via a 12Mbps ADSL2+ link.

Prior to the installation of CeroWrt, it was painful for me to attempt to work remotely using an SSH tunnel if someone was watching a show via Netflix, but after setting up CeroWrt everyone was happy (me for not having to yell at my daughter and my daughter for being able to watch Netflix without me yelling).

With the 12Mbps link, it doesn't seem to be the ingress traffic that causes issues, but the egress traffic (at times, I upload large data sets). Without shaping the outbound traffic, I can see round-trip times in excess of 2 seconds which is just a bit excessive. ;-)

I recently installed LEDE (https://lede-project.org/) (an OpenWrt (https://openwrt.org/) fork) on a spare router (the same model as the CeroWrt router - WNDR3800) and it is obvious that the software continues to improve.

It appears that LEDE may be approaching its first stable release (https://forum.lede-project.org/t/criteria-for-first-lede-stable-release/552). If you have a spare router that is supported by LEDE, please consider installing a current build and report any issues found.

If you would like to learn more, here are a few random links to get you started:

Explaining RRUL Charts (https://www.bufferbloat.net/projects/bloat/wiki/RRUL_Chart_Explanation/)
The Cerowrt-devel Mailing List Archives (https://lists.bufferbloat.net/pipermail/cerowrt-devel/)
The Lede-dev Mailing List Archives (http://lists.infradead.org/pipermail/lede-dev/)
Does LEDE support my router? (https://lede-project.org/supported_devices)
The Make Wi-Fi Fast Wiki (https://www.bufferbloat.net/projects/make-wifi-fast/wiki/)
The Make-wifi-fast Mailing List Archives (https://lists.bufferbloat.net/pipermail/make-wifi-fast/)
Possible OpenWrt and LEDE merge (https://www.google.com/search?q=OpenWrt+LEDE+merge)
All of Dave's Patreon posts (https://www.patreon.com/dtaht/posts)

I feel that the work that Dave (and everyone else that is involved) is so important that I send a few coins his way every month via Patreon. Here's his most recent update: "Where your donations go" (https://www.patreon.com/posts/where-your-go-7564906).

Dave, a belated Merry Christmas to you and I'm looking forward to a New Year where all of the efforts to tame bufferbloat and make WiFi fast benefit everyone.

Re: Nagle algorithm? by Bengie · 2016-12-26 08:08 · Score: 1

If it's constantly filling up and evicting

This is actually normal for any congestion control algorithm that uses only packetloss to signal congestion. TCP? It keeps sending data until the buffer fill and drops "a packet", but we really know FIFO taildrop buffers drop bunches of packets. Then TCP backs off. But wait, there's more! You have many TCP flows going over the connection, so they are all fluctuation, keeping the buffer either in a state of steady full, which causes high latency and lots of dropped packets, or wildly swinging between empty an full because of global synchronization.

Re:Not magic by Bengie · 2016-12-26 08:42 · Score: 1

Remember that you're trading that extra latency for lower probability of dropped packets.

Not once you've gotten into the "bloated" range of buffer sizes. Increased latency from large buffers also increases the latency to signal to the sender that the route is congested. The sender will spend more time sending packets that will ultimately just get dropped. If the latency was lower, the sender would have known sooner to reduce its rate. Latency and loss go hand-in-hand once you get into unnaturally large buffers. I'm not sure the exact recommend buffer size, but I think it's around 10ms of the bandwidth. Many people are seeing 1,000ms+, which is 2 orders magnitudes above optimal.

Re:Nagle algorithm? by Bengie · 2016-12-26 11:11 · Score: 1

That is not the definition of "shaping", that is Cisco's definition for their own internal terminology. Regardless of what you want to call it, I can control the amount of bandwidth a flow or group of flows can use regardless of direction (ingress/egress), assuming they respond to normal loss, marked, or delayed packet. Most people calling this "shaping bandwidth", but you can call it whatever you want.

Re: Not magic by adolf · 2016-12-26 13:17 · Score: 1

I have a spare Asus RT-N16.

Where do I get started with Cake?

--
Kid-proof tablet..

Re:i dont get it. by Dagger2 · 2016-12-26 13:31 · Score: 2

It's pretty much not that at all. It's closer to:

* The provider is selling 100/100 Mbit/s to 20 people with a 1 Gbit/s uplink.
* You hook a WiFi router up to the 100/100 connection.
* While trying to VoIP/Skype on one WiFi device, somebody else starts watching Netflix on another.
* The latency on your WiFi (and thus your VoIP call) jumps up to 50-100ms due to bad buffer management on the WiFi.
* A third device starts trying to sync photos to a backup service, introducing another 100-250ms* of latency by tying up your upstream and generating another badly managed queue on your router.

That's a ton of unnecessary latency being generated right in your own house, by your own gear, and none of it will be helped by the ISP putting in more upstream bandwidth.

...on the topic of which, it would be insanely unnecessary to have 2 Gbit/s of bandwidth for twenty 100 Mbit/s users. You don't need enough bandwidth for every user to max out their connection simultaneously, because that never happens; you only need enough to cover whatever your actual peak traffic is without dropping any packets. When averaging over thousands of customers, this actually works out to needing something around 100 Kbit/s(!) per customer today.

Of course 20 is much less than "thousands" and the traffic profile of 20 customers will be much more peaky than the one of 1000 customers, but I suspect even then that 1000 Mbit/s would be enough to cover twenty 100 Mbit/s connections without dropping any packets. It certainly wouldn't be anywhere near having a "real uplink of 50 Mbit/s".

(*: Probably it wouldn't be this bad with a symmetric 100/100 connection; the graph I linked is for a 140/12 connection, but those are probably more common than symmetric connections anyway.)

HFSC to the rescue by HighBit · 2016-12-26 16:09 · Score: 1

bufferbloat is definitely still a thing.

I've been using this script for years to drop packets early to improve latency. it uses HFSC (built into linux since forever) and works great:

https://gist.github.com/eqhmco...

from that:

Congestion avoidance algorithms (such as those found in TCP) do a great job of allowing network endpoints to negotiate transfer rates that maximize a link's bandwidth usage without unduly penalizing any particular stream. This allows bulk transfer streams to use the maximum available bandwidth without affecting the latency of non-bulk (e.g. interactive) streams.

In other words, TCP lets you have your cake and eat it too -- both fast downloads and low latency all at the same time.

However, this only works if TCP's afore-mentioned congestion avoidance algorithms actually kick in. The most reliable method of signaling congestion is to drop packets. (There are other ways, such as ECN, but unfortunately they're still not in wide use.)

Dropping packets to make the network work better is kinda counter-intuitive. But, that's how TCP works. And if you take advantage of that, you can make TCP work great.

Re:i dont get it. by DamonHD · 2016-12-26 22:30 · Score: 1

There are definite peak hours for customer traffic, eg work hours for businesses, and evenings and weekends for home users, so even the very generous 2:1 contention ratio that you seem to be suggesting probably would still result in a saturated backhaul from time to time.

Thus shifting as much discretionary stuff away from that peak as possible will help, just as for power grids, but that's a separate topic.

Rgds

Damon

--
http://m.earth.org.uk/

Re:You're the one stalking/trolling by ac by IWantMoreSpamPlease · 2016-12-27 03:36 · Score: 1

You know, I'd love to use your program and do a comparison between it and a few other ones I use, but I do have a few questions.
Rather than clutter up this forum, drop me a line, let's talk, geek to geek

--
So rise up, all ye lost ones, as one, we'll claw the clouds.

No by allquixotic · 2016-12-27 05:36 · Score: 1

I have an LTE Verizon Jetpack as my primary Internet connection, and the firmware is proprietary and not user-modifiable, and of course they refuse to implement bufferbloat mitigations on their own. So, no, it's not free from bufferbloat.

Re:Ask here (I overcome objections directly) by IWantMoreSpamPlease · 2016-12-27 07:06 · Score: 1

Link to my homepage has my e.mail address there.

--
So rise up, all ye lost ones, as one, we'll claw the clouds.

Re:i dont get it. by Dagger2 · 2016-12-29 01:28 · Score: 1

It depends on the number of users involved. For this case of 20 users... yeah, you're probably right that you couldn't guarantee no packet loss all of the time, but you would probably get quite close. "It's a weekend" wouldn't be enough to saturate it; even when actively using the internet, most people's bandwidth use is short large spikes surrounded by lots of idle time. Torrents would be a better bet, but maxing out the link would require 10 users torrenting at their max line speed. There are people who will do that, sure, but your odds of having 10 of them at once in a building of 20 people are low.

At the main ISP level, where you're aggregating thousands of customers together, you can overprovision far more than 2:1 safely because customers average each other's traffic out and unusually high peaks become even rarer.

(I should point out that I have no operational experience running a network like this so a lot of this is educated guesswork, but the ~100 Kbit/s figure I gave in the last post comes from people who do have that experience.)

Slashdot Mirror

Is Your Internet Connection Free From Bufferbloat? (blogspot.com)

84 of 147 comments (clear)