Why I Hate the Apache Web Server
schon writes "Today's the last day of ApacheCon Europe; There was a hilarious presentation entitled 'Why I Hate the Apache Web Server' for anyone who has expressed frustration with the various inconsistencies and nuances of the Internet's favourite config file. And yes, it includes a comparison to Sendmail."
I guess that when performance, and not 'tradition' is your priority, you choose the appropriate tool for the job.
When you click on a pdf it displays a choice of opening it in a new tab, downloading it , or viewing it as HTML (which i think uses google, but i could be wrong because i never use this option)
It really saved me.
Slashdot Sig. version 0.1alpha. Use at your own risk.
I'd assume the presentation was originally a talk given with slides. The PDF file is obviously just each individual slide in PDF format.
Really your complaint is the presentation hasn't been translated to a text based format, not about PDF files themself. The Slide show is merely meant to complement the actual talk, and wasn't meant as the main source of information.
AccountKiller
Well, until they feel like adding a PDF warning, you should get the PDF Download extension for firefox. It's small, simple, and prevents PDF ambushes from raping your system's resources.
>And it's URI, not URL. No such thing as a URL.
There's a whole RFC here to prove you wrong:
http://www.ietf.org/rfc/rfc1738.txt
NeoThermic
Use my link above, or to view my server, NeoThermic.com
For some reason a copy of acroread.exe remains working even after closing all the PDF documents, and that's the one that locks up the browser.
To recover from that lock, all you need is to open the task manager window, and shut down that faulty acroread.exe.
The basic auth logout: yes, people have been asking for it for years, but it's HTTP itself that doesn't provide a mechanism for logging out users, it's not Apache's fault.
The lax syntax: hell no. That sort of thing leads to security holes. If I make a typo, I'm perfectly capable of going back and fixing it, should Apache notify me immediately. But if it misinterprets a typo as meaning something I didn't want, I won't know about it until it bites me in the ass. There is nothing wrong with strict syntax for config files.
The rest are relatively sane complaints and ones I've said myself in the past. Even if they are contained in a godawful PDF.
Put this in your userContent.css:
:link[target="_blank"]:after, :visited[target="_blank"]:after,
:link[target="_new"]:after, :visited[target="_new"]:after
a[href$=".pdf"]:after {
font-size: smaller;
content: " [pdf]";
}
I also find the following useful:
{
font-size: smaller;
content: " [new]";
}
a[href^="javascript:"]:after
{
font-size: smaller;
content: " [shite]";
}
>Well, it would be nice if it didn't stop responding when your access log hits 2GB, too...
o gs.html or http://httpd.apache.org/docs/programs/rotatelogs.h tml
One should never let the logs get larger than a few hundred MB. Consider rotating the logs, either monthly or weekly (or even more often if your logs gain size quickly).
See:
http://httpd.apache.org/docs-2.0/programs/rotatel
NeoThermic
Use my link above, or to view my server, NeoThermic.com
> I absolutely hate pdfs embedded in web browsers.
So (in Firefox) untick the box that opens it in a web browser.
(edit / preferences / downloads / plugins, or tools / options / downloads / plugins, and then click teh tick so that it isn't there any more).
Indeed. URLs are a subset of URIs. URIs = URLs + URNs. A URL is an indicator of where to find a resource while a URN is an unique name for a resource regardless of where or how many places it may be found. To add more confusion, sometimes URNs are also URLs.
Got that?
check out foxit reader... it stopped me from hating pdf files.
Why I Hate The Apache Web Server
Lessons learned from IRC - Rich Bowen
Note: Opinions expressed are those of our users, as expressed on IRC. The goal of this talk is to make people aware of things which those "outside" see as problems, but which we tend to be so used to that we don't see at all. If I get carried away, feel free to throw fruit.
Why do I hate thee? Let me count the reasons.
- Fragile
- Confusing
- Missing stuff that EVERYONE asks for
Fragile
- Breaks easily. Small changes have big results
The first of these forbids Indexes. The second one permits them. Huh?
... wow, don't get me started
Disclaimer
"But that's not supported syntax!"
Then it should throw an error and break, not do something utterly unexpected. Unfortunately, several major Linux distros ship with this broken-but-almost-looks-right configuration, or variants thereof
Example 2
Vhosts
That vhost is silently ignored. Yeah. That's intuitive.
...
Discussion
"But the docs say not to do that!" Yes, I know. I wrote that line in the docs. It's still really irritating.
Another
Unlike every other Apache config setting, "Require" is case sensitive, so that's not valid valid_user would be nice too. Oh, and "Require User" and "Require Group" don't work either.
Missing (asked daily on IRC)
- Can I set a variable and use it later?
- Can I have an if/else syntax?
- Can I please reload my configuration file without restarting my server?
- How do I make ServerTokens return "Bob's Handy Dandy HTTP Server"? (Yes, this is silly, but it would sure shut a lot of people up finally.)
What else? mod_imap: how many of you have actually used that module? How many of you who are not committers know what it does? Why is it on by default? Come on folks. Netscape added client-side image maps in 1995!
And while we're on the topic mod_cern_meta: Who even knows what this module does? For the record, yes, I do. But I doubt any of you have ever used it.
CONFUSING
NumServers ServerLimit ThreadLimit ThreadsPerChild StartThreads StartServers MaxSpareThreads MinSpareServers MaxSpareServers MinSpareThreads MaxClients MaxThreadsPerChild MaxRequestsPerChild MaxRequestsPerThread ThreadStackSize
Oh sweet God make it stop
What's that directive called?
RLimitMem, RLimitCPU, RLimitNProc? I have to look these up every time. Of course, since they don't seem to do what the docs say, maybe that's not a bad thing.
Am I running out of time yet?
- Why do I have to set up two separate vhosts for http://example.com/ and https://example.com/ when they're the same website?
- Why are dynamic vhosts so darned hard?
- Why doesn't the default configuration file match the "security tips" document?
mod_rewrite
I probably don't need to say anything more than just "mod_rewrite". But I will: "Voodoo" and "... flexibility of sendmail". The docs practically scream "GO AWAY!"
RewriteMap
Nice, but have you ever found an actual useful example? Oh, and the example script for generating db map files doesn't actually work. (Note: Paul fixed this 2 weeks ago. See httxt2dbd)
How about this?
If I want these two aliases to work, I have to:
Nothing is foolproof to a sufficiently talented fool.
Really, the version in KDE >=3.4 is incredible.
Game! - Where the stick is mightier than the sword!
It's not necessarily dumb. Latex-Beamer can produce, IMO, the nicest "PPT-like" presentations I've seen.
The TargetAlert extension for firefox places a nice little icon after links to PDFs or ones that open in a new window.
Just cause you feel it doesn't mean it's there.
Some FTP servers can do this too - e.g. try downloading a directory name with .tar.gz appended.
I've never found a link so compelling that I would click on it without knowing something about where I was about to go and what the file type was.
You don't know the file type. The file extension is meaningless. You can follow a link to some resource named foo.html and it might be a PDF, and you can follow a link to some resource named foo.pdf and it might be an HTML document.
Same goes for the location, unless you disable client-side scripting or manually copy & paste the URIs instead of clicking on the links.
Unless you manually perform an HTTP HEAD request, you don't know the type of file you are requesting.
Did you miss the bullet point that said "Yes, I know. I wrote that line in the docs. It's still really irritating"? As an Apache project member that primarily contributes documentation, I think he has RTFM -- in fact, he has WTFM. ;)
Most of the time, it is not a real lock-up.
Based on my experince, 99% of time Adobe has a dialog box hidden underneath other windows that prompt you to check for update. Once you disable the freaking Adobe update checking feature, all is good.
Ok, make your pdf with 10pt text (though I personally find that annoying to read at a comfortable distance*) and people that read it can hit the little plus sign next to the magnification precentage. done.
PDF is a document specification container. The whole point of pdf is to make your document look good and once your document looks good, to make it look the same on every machine it's displayed on. Thus ensuring that it looks good (at least according to your definition of looking good) on every screen and when printed.
If you want viewers to be able to edit your files at the expense of looking the same, or even good across platforms (and you're willing to put up with images getting strewn about willy nilly) you might as well use a word document. It's not going to look any worse than html.
*If you're making your pdfs with LaTeX then you've probably read some of the docs. A good deal of typesetting knowledge went into the design of TeX and much of it is explained in the documentation. Those docs mention something about a recomended number of characters per line for maximum comfort reading. At 10pt, you're either going to have far to many characters or you're going to have huge margins and/or double-spaced text. At which point you've negated the benefits of your 10pt text.**
**ok you could use a multicolumn format to get around that ugly fact, but then your document looks like a cheesy newsletter.
Can you be Even More Awesome?!
Guess you haven't upgraded to Tiger yet. Now, they open up in the browser window. I miss the old way (especially because it doesn't put a taskbar across the top so I can easily zoom in - have to right click, Zoom In several times).
Safari 2.0 works fine for me, although I'd appreciate a PDF toolbar.
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
Not a problem. The hated Apache's mod_gzip renders the point moot.
LaTeX defaults to 10pt fonts and 1.5 inch margins, which look damn pretty. The optimal number of characters on a line is 66. Even if you scale your margins so that you have 66 characters per line at 12pts, you're going to have fewer total characters on a page than at 10pts because of vertical spacing issues. Moving up to 12pt tends to look awful. Then again, as long as I'm paid by the page instead of the word. . .
After all, I am strangely colored.
Maybe you start to look for alternatives. KPDF has come a long way, and is amazing these days.
Ahhh... Serenity now.
I've seen IIS sites handle a /.ing fine, I've seen Apache dragged to the dirt. Why? Well /.ing kills sites one of two ways:
/.ing isn't at all impressive, it's expected. Any webserver worth it's shit should be able to had out massive amounts of data with little resource usage. It's other processing like PERL scripts, DB requests, SSL, etc that kill it, or simply overtaxing the available bandwidth.
1) Bandwidth. Whatever if being offered is large enough that the line it's on becomes highly over saturated and thus requests are processed very slowly, if at all.
2) CPU load due to dynamic content. Sites that use databases, or scripts to create their pages or something get overwhelemed because they don't have enough CPU to support all the requests.
The webserver itself isn't the problem. Either Apache or IIS can easily saturate a 100mb link with static content, even on a fairly old server.
When I worked for the school paper and we were linked, it was no problem at all. The line was 10mb, and the content was fairly small (say 300-500k total) and all static. Despite being a P2 300 the server didn't even break a sweat, load average was below 1. When the department I now work at was receantly linked for a comet simulator, it killed out webserver, despite the content being about 2k and it being a fiarly fast SPARC machine. The reason was each request required computation, so our load average was about 100.
Apache being able to survive a
Bandwidth is actually fairly common, many servers are run on small lines. I have a couple servers in my closet on my 768k up line. That is plenty for normal usage, people find the sites quite zippy. However Slashdot would easily overwhelm that bandwidth.
This was at an apache conference. He's written documentation for Apache. Chances are he uses it and knows more about it than you ever will. Dumbass.
Actually it is: How I learned to stop worrying and love the web. This is a direct link to the pdf rendered into html.
A bad analogy is like a leaky screwdriver.
Cool. I'll add them here.
What part of "gestalt" don't you understand?
Caudium rocks my world. Faster, easier to configure, GPL (or MPL2 if you prefer), built-in support for dynamic content, and more.
Uhm, that's not Apache's fault. It's your browser's.
Roses are #FF0000, violets are #0000FF, all my base are belong to you
As it turns out this isn't the whole truth (there can be heuristics) but what you are doing is rather dicey (read as: may give different results in different browsers or in the future). Make your charsets agree or you might cause yourself problems.
What is nice is that you have one syntax to access different kind of storage it svery powerfull.
"Use cases are fairy tales..." I. S. 2005
I don't want to detract from your main point (which is that configuration is hard) but can't they pay someone else to do the forking? Who says that everyone has to be programmer in order to have their own fork?
Having formerly dealt with MS IIS servers I've come to embrace the granularity of configuring an Apache server.
Virtual Hosts, Redirects, etc. are easy if you read throught he config file. Granted, it is getting a little long and should be separated into different config area files.
For the record, the talk was created in Keynote, which will export to a variety of formats. PDF seemed like the least problematic of those formats. I don't use PowerPoint.
Thanks for all the great feedback from folks about my complaints and examples. Yes, it was more than a little tongue-in-cheek, and several of these problems have in fact already been fixed - a couple of them right there are the conference.
And, I'm not *just* complaining. I fixed the "Require User" case sensitivity thing, and I'm rewriting the mod_rewrite docs.
Apache guy, Open Source enthusiast, runner