What Knowledge Gaps Do Self-Taught Programmers Generally Have?
BeardedChimp writes "I, like many others here, have learned to program by myself. Starting at a young age and learning through fiddling I have taught myself C++, Java, python, PHP, etc., but what I want to know is what I haven't learned that is important when taught in a traditional computer science curriculum. I have a degree in physics, so I'm not averse to math. What books, websites, or resources would you recommend to fill in the gaps?"
Being self taught myself, I think the biggest downside is some of the strategies and standards that are taught in the mainstream curriculum (IE: how to properly use the object-oriented model, etc.). Especially when I first started out, my code may get the job done, but it wasn't the cleanest or best approach. Luckily, I think it will come to you in time if you focus on improving your code.
testing and architecting (building frameworks) etc.
read your gamma and fowler...
design patterns
metageek
Although in practice most of the time advanced data structures and algorithms are not used, it is useful to study them and implement them yourself at least once. Dijkstra's algorithm, Prim's, Kruskal's, maximum flow, and other basic graph-operating algorithms are a good example.
Knowing exactly how it works, and I mean down to the falling/trailing edge of the voltage change.
Though most 'proper' trained developers have these days the same problem, especially if they never had to learn any assembler.
Creating a good database (or object) model, or code structure is not something trivial and usually self-made programmers are not the best in those fields.
As a self-taught programmer myself, learning standard C has given me a whole new outlook on how I code. I also thought it would be good to know where a lot of the more modern languages were derived from.
go to college sir, or at least try to follow the courses they have. before i went to college, i did embedded development(fiddled with my phone, did some java apps), php, perl , linux scripting, high level C++, complicated algorithms ( acm, ace.delos.com, etc.. ), but college really opened it up to me.
No, I'm not being a smart ass.
Others have slightly different styles and conventions, and ways of solving problems. Working on something like a large open source project could teach you about working on a team, where one person can't "own" a whole part of a program. (And cleaning up others code will greatly help you learn about documenting and formatting your own code.) One good team assignment we got, Each person work on a part of a program. Away from computers, we had to, on a whiteboard or whatever, decide inputs and outputs, etc between parts, then code separately. Grade on that assignment was how well the program behaved when the teacher, in front of the class, compiled the separate parts, and ran it for the first time combined.
What are we going to do tonight Brain?
Design Patterns: common "Template" solutions to regularly encountered problems/variations-on-that problem. Be careful when learning these that you don't fall victim to "when you have a hammer, everything is a nail". Also learn the Anti-patterns, wikipedia has a good list of anti-patterns.
Algorithms & Data Structures: Analysis, average running time Big O is most important, but understanding worst-case runtime is important too. Designing algorithms vs knowing when to leverage an existing one.
the C++ standard library provides a great many of these, it has a high efficiency sort (from ), it has good collection data structures (vectors, linked lists, maps, etc)
Objected Oriented Analysis And Design: Knowing when to make something an object, when and how to use inheritance and polymorphism, when to not make something an object. Plain old data objects. separation of responsibility: UI is not logic, logic is not UI.
Threading: proper thread synchronization techniques (mutexs, semaphores, conditions, etc), threading patterns such as Producer-Consumer, Inter-process communication
Automata & Computability: (Deterministic|Nondeterministic) Finite State Machines, Regular Languages, Turing Machines
Programming Languages: LL language parsing & rules authoring.
Computer Architecture: Processor design, pipelining, caching, function calling conventions, etc - how to use this knowledge to write more efficient programs
If you cannot keep politics out of your moderation remove yourself from the Mod Lottery.. NOW!
you can be a self taught coder.
but to be a Software Developer you need to study for years, and build upon it with experience.
like people before me said, design patterns, modelling, and algorithms.
I've worked with both, and you can tell.
Theory and practice of software development and computer science, plus whatever isn't covered by what i already mentioned.
I've learned more in the right job than from any book or course.
Experience helps, but the real killer deal is experience backed by a CS/Eng. degree.
Need an ISP in South Africa?
If you know your algorithms and data structures, and have a firm grasp of the architecture of modern computer systems, you'll be way ahead of a depressingly large proportion of people with degrees in CS that come past me in interviews.
The most informative and entertaining book I can recommend on algorithms is Bentley's "Programming Pearls".
"Skill shows through where genius wears thin." -Wittgenstein || Religion: uniting aviation and architecture.
I've been programming Visual Basic for years!
Good software engineering practices (this is my personal experience). This is a problem when managing middle-to-large projects.
What did you miss in school... practice. A student's full time job, for 4 years, is to practice CS technique and theory. That's it. Mystery solved.
I'd say you need to learn enough mathematics to get an appreciation for what goes into the discovery of an O(nlog[n]) algorithm -vs- its [naive] O(n^2) counterpart.
The problem is that you need to know just an incredible amount of graduate level mathematics to even get a sense of what the Fourier Transform is all about before you can have any sense of awe at how a Fast Fourier Transform would be an improvement over a [naive] "Slow" Fourier Transform.
Heck, I don't know if you can even get a good feeling for the structure of a fast sorting algorithm without having a really strong grounding in infinite series, and that probably requires at least a couple of semesters of Advanced Calculus.
Transactions and threads. People tend to think of these as boring concepts and therefore don't learn all about them and how they work. An engineer who hasn't internalized these concepts is dangerous.
Comment removed based on user account deletion
What gaps do schooled programmers have that self-taught programmers don't? While a self-taught programmer might go about getting the job done differently, I can almost always count on him to get it done. Programmers coming out of school often still have a horrible worth ethic, especially when compared to their self taught peers. Granted, I have a very limited experience, so I wouldn't cast that judgment over all, but I would be curious to here what others think.
Invexi - a Phoenix, AZ based web design and web development company.
I think my biggest problem with being a self-taught programmer, is not really knowing how to organise code into physical files so that it makes sense, and when is it a good time to commit ones code to version control, and should one branch to do this or do it on the main branch.
I guess I'm also lacking some of the standard algorithms, as a self taught programmer I started out by coding to solve some real world problem I had rather than some contrived exercises involving how to code the basic algorithms such as linked lists, stacked based recursion and recursive algorithms
I highly recommend Lewis' Elemtns of the Theory of Computation. Garey's Computers and Intractability seems to get quoted a lot as well. I'm not sure how important this stuff is in every day computing, but if you want to learn computability, these two cover everything.
Imagine if you weren't allowed to use roads because a bus company complained about your driving 3 times. --skunkpussy
I went to college to get a comp-sci degree after I already had several years of experience in the real world. The one thing that I took away from that education that I had not stumbled upon naturally was Big-O notation. Specifically, how it describes the time and space efficiency of different data structures and algorithms. It really comes in handy when trying to optimize. There have been several where I had become focused on optimizing the crap out of an inner loop, only to realize that if I switched to a different data structure I'd get much faster O(log n) performance.
Other than that, many people only experience low-level C and assembly programming while at school. But you mention that you have C++ experience, so you're probably already well familiar with segfaults and buffer overflows.
Lastly, Structure and Interpretation of Computer Programs (SICP) is a great read. The levels of recursion and abstraction that it employs will often blow your mind. Sadly, I can't use that kind of coding in the real world because I have to write "maintainable" code and most corporate programmers don't want to have their mind blown.
IMO, those who are not self-taught (and spent months in some "History of the Abacus" class) can search the Interweb for code samples and tinker around. One day, they'll fill the gap and catch up to us.
I am/was self taught as well, mostly web-based to start with... PHP, then javascript, then C# over many years and up to using OO in all three. A year ago I started taking c++ classes in college and I can say I picked up a few very helpful things.
1. C++ gave me a look into pointers and how stuff runs at a lower level than any of the other languages.
2. Object use. I used objects before, but I didn't leverage them quite as heavily as I could have, including inheritance and polymorphism. What I learned in C++ has greatly affected how I code in PHP and C#.
3. One of my C++ classes was about efficiency in algorithms... so I learned a bunch about how stuff is done deep down and how to write efficient code. Very helpful for some of the stuff I do in PHP and javascript when working with large data sets.
4. Code design is a big one... before, I just put all my methods into a single class and called them there. The class ended up being more of a repository for functions and data than a real class. Now I have a better grasp of breaking stuff into parts and either creating child classes or whatever is appropriate.
It's still a learning process. I'm only on my 4th c++ class and haven't even taken the data structure class yet, but I've learned a tremendous amount, even in just the first intro to c++ class. My suggestion... take at least intro to c++ at a community college.. you'll probably be surprised how much you'll learn. As an added bonus, being familiar with programming makes getting an A a breeze, so you can concentrate on actually writing decent code and learning stuff.
Satis clankiller.com
I have a CS degree from a major university. I have to disagree with most of the comments I've seen so far. Things like design patterns, proper object modeling, even advanced data structures and algorithms can be picked up on your own with a bit of effort as you need them, and experience building real production used software is the key to hone those skills.
...) so that I could accurately predict how it would behave under different conditions (especially when some of those conditions can't easily be tested). And an introduction to a large swath of computer science terms and facets, so that years later something comes up and I have a faint understanding of where to start looking.
IMHO there are two things that I got from school. How to properly analyze code (in terms of processing time, memory usage,
The code quality, design, and ability to apply [insert new hot term of the day] correctly all come from real world experience. And I do think you have to get that experience in a professional setting (I would consider much of the open source world profession, just FYI), hobbyist work just won't let you grow the way you need to.
I've been doing this for a few years and the one gap I'm seeing more and more of doesn't actually have anything to do with programming techniques, "design patterns" or anything else that's hugely technical. All of these things are pretty well-known and accepted by everyone, and you can always be sure that there'll be someone around pushing one or another of them as the be-all and end-all of Programming.
The one gap you might have as a self-taught programmer is in fact in the _history_ of computer science. There's a lot of stuff that has happened and in fact people keep finding and solving the same problems, never realizing that the problems have been encountered and solved many times. (An example that's particularly relevant to me at the moment has to do with extent-based file systems; ext4 has extents and so do a number of new file systems. Great idea, right, particularly for large file systems? Thing is, extent-based file systems have been used at least since the 70s in mainframe operating systems. Odd that it took 40 years to get it into Unix.)
But don't feel bad that your self-teaching has skipped the history of computing. It appears that most university computer science programs neglect that little bit of background as well, in favor of jumping straight into C++ or Java.
Maybe I'm an old fart but that half-semester of history I took back in 1981 made a small but significant improvement in my ability as a software engineer.
Ability to estimate memory and computing power consumed by different algorithms. Or maybe you learned it too. But most self-taught programmers I met just learned syntax.
I've found that self-taught programmers can actually be quite productive. However I've noticed (in general) the following deficiencies which I think are both rooted in the fact that the need to memorize seemingly arbitrary facts about a system is inversely proportional to deepness of understanding of that system (see graph):
-Design Patterns (noted earlier by others): There is a tendency of self-taught programmers to follow a design pattern more doggedly than others. This can be tied back to the fact that for the self-taught a particular design pattern represents what programming is to them. They memorize a series of facts that support the design pattern they use rather than understand the nature of a design pattern itself. They tend to have steeper learning curves when presented with new structures and design patterns because using a new design pattern requires the abandonment of the facts they've memorized and starting anew with memorizing a new set of facts.
-Adaptability: Self-taught programmers tend to reach a certain level of comfortableness with technology (ie: languages/libraries/etc.) and attach themselves to it. The thought of using a different language, library, or system is daunting (or even aggressively resisted) since, again, changing requires a new memorization of facts around the technologies (see graph).
Much of what you should learn formally from a CS degree is WHAT a programming language is or WHAT a design pattern is, not merely HOW to program or HOW to use a particular design pattern.
That said, there's nothing stopping a self-taught individual from learning these things on their own. It's just that when you're teaching yourself a trade you, naturally, immediately (and sometimes exclusively) focus on things that allow you to compete on a particular level or with a particular technology. Learning design patterns or what programming is in the abstract doesn't seem to have an immediate payoff (clients aren't going to ask you about those things). But they are skills which allow you to be competitive across technologies or design patterns which is especially important in the rapidly changing world of computers.
Faith is a willingness to accept something w/o complete proof and to act on it. Reason allows you to correct that faith.
Memory allocation and management. Design patterns. Data structures. Search and Big O calculations.
I just wanted to shout out to a fellow Physics major programmer.
I did happen to get a CS minor so I did get a chance to dabble in Big O, Algorithms and Datastructures. While you probably won't miss them normally you may run into a situation where you'll be wondering how you can do something faster or better and these would be your answers.
So it's not as much about the gaps in your knowledge as a gap in the way you think about things. the way you make simple models and use them to reason about basic properties of any system that you are proposing to build. You can get the general outline logically correct before you try to make the details perfect.
Having said that, the subjects that opened my mind were:
Functional languages - make you think differently. You realise that there is another way and maybe you realise when and why you'd choose it.
Complexity is the most obvious thing that I find non-CS people don't understand. They optimize rather than choosing a good algorithm. Understanding various algorithms and how they scale (even why) is very useful in all work that I did later on.
Automata - finite state machines, non -deterministic finite state machines etc etc. This has always helped me to think about very complicated bits of logic in real programs.
Security - I remember learning a notation that helped you calculate how much information you had given away to someone else in a communication. It helped you to analyse a protocol to see if you were exposing information that could be useful to an attacker before you had created adequate trust. This was awesome and it made me think a lot about how insecure everything really is and how you need many layers and grades of security because some are bound to be penetrated.
Architecture - we studied the MIPS architecture i enough detail to feel that we really knew what happened inside a CPU to a very fine level of detail. This was probably an illusion but it was still eye opening and is the basis of so many decisions that I make nowadays that it's frightening. Nothing direct - it's all background but it is immensely useful.
Persistent databases - made me impatient with Relational DBs and it's one reason why I never want to write another line of SQL or use a Relational DB ever again.
Cheers,
Tim
This is all just my personal opinion.
My problem wasnt with any part of the languages' syntax, it was more with the size of projects. Just looking at an even relatively small project filled me with dread
As a self-taught PHP and C# Developer, the biggest trouble has already been outlined as limited exposure to new concepts. The bigger question, however, is how to gain exposure.
#1 - User Groups I personally don't attend user groups because I have 2 jobs, and 2 kids, however, the Ruby community has shown again and again that it works, not just for the new stuff, but for the old stuff. They just overhauled Rails and as long as the community keeps talking, they'll do it again and again to perfection.
#2 - Contracting It's a large assumption, but if you have the time to learn a language, you've got time to find small contracts, and hopefully ones that will introduce you to knew people with difference foci (focuses?). Also, digging unto other people's code helps you think outside of the structures that you taught yourself - you might even get some extra cash. Check out craigs list, elance, etc
#3 - Open Source Not as good for your wallet up front, but if you think you have a unique perspective that is applicable to an existing project, donate some code. Bug fixes are just as valuable as new features.
#4 - Publications I use this in the loosest sense of the word possible. I "camp" PHP.net because there are new functions popping up all the time. Their search database is fairly decent, so when you're thinking like a PHP dev, put a word or two in and see what pops up. MSDN isn't too bad either, but the naming conventions vary, and it's so large that simply search for keywords is a challenge (They have an "OrderedDictionary, but not a UniqueList...?)
#5 - Inspiration (& Perspiration) Nothing develops with out the the will power and simply getting things done. Going back to #3, you can simply start your own project or feature. Lots of things are pluggable these days, and if your desired functionality doesn't exist, don't cry about it - build it! PHP doesn't have events, because events don't make a lot of sense on the Web.... HOWEVER, if you're writing a PHP-JS-AJAX framework, then they make a LOT of sense. Noone says you HAVE to release your code either... managing a repo is a lot of work. The point is to build something, find the pain points, then ask yourself "Is there a better way to do this?" Find the better way, build it, and make your life easier... then share it if you can.
Comment removed based on user account deletion
I'm about 3/4 self-taught, I had some CS courses in college (before there was even a CS minor in the school) and a couple of grad school courses (which I can't say I got all that much from.)
But here's my list, based on what I've experienced over the last 30 years:
analysis of algorithms, "Big O" and similar things. If you've read a basic data structures book, you -might have- seen this stuff. But it's really important theory to understand. I'd rate this as the #1 gap; people who don't have this knowledge have a real hard time reasoning about or discussing performance, etc.
AI - not an interest of mine, so I've never bothered to learn it. But that's a big hole in my own knowledge. That includes knowledge representation, reasoning systems, etc.
multiple programming paradigms - In this case I'm "OK", I've learned and applied several different programming languages (most of which are not popular/politically correct these days...) I've said I'd never hire a mono-lingual programmer, and that's because learning different languages has given me other ways to look at problems. (See http://en.wikipedia.org/wiki/Sapir-whorf - which applies to programming languages as much as natural languages!)
numeric analysis - That's one of 2 post-grad courses I took where I actually learned a lot, and it seems that numeric analysis is somewhat of a dying art. But a lot of what we do is still 'calculating', so this is important to know.
concurrency - Here I'm personally in good shape because it is an interest, and I've done a lot of work in Ada which provides a very strong concurrent programming model. But things like race conditions, deadlock, consensus algorithms, etc. are growing in importance due to multi-core CPUs. Nancy Lynch's "Distributed Algorithms" is The Book to read here, too bad it's so expensive (http://www.amazon.com/Distributed-Algorithms-Kaufmann-Management-Systems/dp/1558603484)
compiler theory - it's worth knowing how language processing tools work, and with compiler theory (parsers, etc), I'd add knowledge of machine languages. If you ever have to go down to assembly language to find a compiler optimization bug (something I've had to so several times - ain't fun), you MUST know this stuff.
optimization/operations research mathematics - I'm not sure how much this gets taught in CS departments, but I was fortunate enough to get -2- Operations Research courses as a math major (at 2 different schools...) Understanding linear programming, integer programming (things change when you can only deal in whole numbers), PERT/CPM, etc, has proven to be valuable for both hard-core programming and for systems design (and even project management.) I'd also add graph theory to this list, a lot of problems I've worked on are graph problems (they show up in compiler construction, too...)
I've learned -a lot- just by reading other people's code, too. Remember, a program is written once, but read many times during its lifetime. Reading code gives you both an appreciation for how others attack a problem, and the need to make your own code more understandable. This is where the Open Source movement has been a Godsend. (I'll note in passing the Ada community was very good about sharing source code back in the '80s when this wasn't as popular as it is today.)
Finally, a "meta-comment": Although my formal education in CS itself was weak, I've invested a lot in learning on my own. I'm sure my professors would argue they taught me how to learn, and there's some truth to that. But not everyone will snuggle up to a textbook on graph theory on a snowy winter night :-)
I'll second Finite State Machines.
More times than I can count, I've drawn a finite state machine on paper and gone on to implement it in a high level language. Though I feel awfully old fashioned when I do it, they're great for parsing stuff and handling incoming data streams. The results are typically high performance and bullet proof.
This is a must for programmers - And soooo few know how to do it. It's all fine and dandy that you can find your way around the interface that you designed, but you need to realize that not everyone thinks like you. I was fortunate enough to take a course in it while studying, and I think I was one out of a handful that didn't say "This is useless" and in some cases people were even outright hostile to the idea of usability. It applies not only to UIs, but also to APIs and code, too. Usability is about how you grease your work so it runs smoothly for everyone, not just you. In the case of coders it lets them become fluid in your API easier.
As well as a broad range of basic skills, there are a couple of Comp. Sci. skills that I like to see, and without these I tend to be more cautious: - Compiler Construction - Formal Methods - Parallel Programming These broad heading are the general indicator of what I am looking for, I think they give the kind of depth that you will come back to again and again in a career. So I look at what options my prospective employee has taken in their Comp. Sci. Degree. If I see optional courses taken that fall into the categories above, they go to the top of the list. If they have instead taken lots of multimedia, web or softer options instead I will be more cautious (unless of course I am looking for these particular skills)
My initial programming experience predated the Design Pattern concept. At the same time, I knew about pattern language, the concept behind design patterns, because I had heard about the architectural version by Christopher Alexander.
Think of a pattern language as being a conceptual construct that does a specific job, be it an entry way for a house, a menu for an application or the splash page for a web site. Certain activities occur in that 'space' and if you know what those activities are, based on past history, you can effectively design for them. You might even have a set of solutions that you can copy and modify, which can save time and money.
Once the specs have been made, you can build it with the appropriate tools, be they construction materials or bits and bytes.
Thinking about it, I would say that good programmer and analysts were already looking for design patterns long before Design Patterns became formalized.
Was fun even 15 years ago to see Pascal programs full of goto loops, from people that learnt plain basic by themselves and then "upgraded" to a more serious language. The very thinking of how is the flow of a program, independant from the language used, was wrong because that "programmer" never understood some basic programming concepts. Programming has evolved with time, and chances for doing it all wrong because missing or not fully understood key concepts are bigger now.
Depends on how you're self taught. School teaches you primarily from books. Self taught people, most of the time, learn things from books. The difference is that self taught people don't get the benefit from the great teachers with years of knowledge and wisdom. They also don't have to deal with idiotic teachers who don't even understand what they're teaching (and YES, they DO exist even at the masters level of instruction).
cynicism, hopelessness, futility, frustration, despair, indignation, indentured servitude, the gut wrenching emptiness that hits when a project you've poured your heart and soul into gets canned right when it's almost ready to release just because the ceo read some article about how everything should be done in some new sexy framework...
Here are some things I found usefull in university:
- study of algorithms (big-O notation with case studies on sorting algorithms); This one completely changed the way I view program efficiency
- formal languages / compiler theory (grammars and parsing have never been the same for me since). This is something you will look at when you write any low-level parsing/validation: XML, functional / expression editors and even program parameters parsing in some cases.
- language classes (this was not the actual name of the course and I don't remember what it was actually), but we went through query languages (SQL), unstructured languages (BASIC), procedural and functional (C, pascal), OOP (CPP, java) and declarative (prolog). Prolog was something that made me see differently how the language changes the way you think about programming.
All that said, the academical medium has never been accused of being very practical minded, and I learned at least as much in working in programming as I learned in university. Don't dismiss one in favor of the other as each will show you things the other simply doesn't.
Tie two birds together: although they have four wings, they cannot fly. (The blind man)
I find that many self-taught programmers learn by example. They copy a bit of code here, and modify it to suit their needs. The problem with this is they often do not understand why the code segments work the way they do. For example, I see a lot of Java programmers with the misconception that the 'import' statement is equivalent to C's #include statement (rather than the using statement). They know that if I 'import java.util.*' I can use the classes in the java.util package. Instead what they should say is: I can write the short form of classes in java.util.
For this same reason, I find a lot of beginner programmers confused on instance methods vs static or global methods. As Java does not have global methods, the simpler to understand static methods require an additional keyword to use. Many Java programmers will start out with applets, where the starting point is an instance method, and then try to transition to applications where the main method is static and wonder why they can't call instance methods from their main method.
Self-taught = passionate = "talent" not teachable in class I know who I'd rather hire
Self taught also, Mechanical Engineer by training. I've found my biggest gap to be fundamental CS stuff, I'm currently working my way through a book on algorithms: http://www.amazon.com/Introduction-Algorithms-Third-Thomas-Cormen/dp/0262033844/ref=sr_1_1?ie=UTF8&s=books&qid=1266595707&sr=8-1 While the practical implications of knowing this stuff is limited, it's always good to have a deeper understanding, and it will help if you're ever in an interview with hard core CS types.
I'd say that formally taught programmers may not have much experience with maintenance programming, especially with legacy systems that have been running for years. They are used to 'blank sheet' programming assignments that allow them to control the entire project from start to finish. They don't have to deal with code that has been modified dozens of times over the years, often without much documentation.
I got my professional start in programming doing maintenance programming under the supervision of a senior programmer who had spent years working with the systems as they evolved. While I had done some programming in college, all 'blank sheet' stuff, doing maintenance programming was much more educational. You had to make sure that things were done right, otherwise you could cause big, real world problems.
Wel many maths and physics people cross over to become programmers. Especialy in firms that work in a mathamatical of physical feild.
I have worked in a mining software firm that did this and can offer some insight. I have also taught tertiary students so I know roughly where the imprtant gaps are.
Firstly in terms of code quality.
-you may lack of expose to maintaining software and be acusted to math using single letter variables. The absolute importance of descriptive variable names for any thing not self explanitory (i, x, y) is often lost.
Other code quality issues like use of comments ect have not been forced in by experince.
Remember always that you are probably going to code complicated things on a subject matter that the rest of the programming staff only have a general knolede in. But it falls to the rest of the staff to maintain and debug your code. You must write all physics code with that in mind.
I found that a lot of code writend by non-programmer expers is very costly and difficult to maintain.
You may find that some pure comp sci stuff is missing. Can you do predicate calculus? work in lisp, train neural networks, optimise evolutionary algorithems, optimise code using a vast array of obscure data structurs.
Probably not. But most of the industry jobs don't require this.
Important knowledge gaps may also exist in code related sujects. Database design and SQL are a artform that self taught programmers will generaly not tough because it is not interesting. But it is important in the workforce.
Beyond that a tertiary education only deos so much. Time spent programming and industry experience will improve your code more than reading a few undergraduate level books.
The true strengh up a comp sci degre is often the amount of free time students have t practice programming.
Hope this helps.
I can't say that people graduating have a good grasp of the more abstract concepts of programing like polymorphism and recursion but they do get taught along with some of the more difficult data structures (B*Tree or Graphs). I could also point to standards documentation forms like UML relational diagrams, timing diagrams etc.
It really comes down to experience with real code for most things.
You almost certainly already have some grasp of Complexity Theory since it governs why e.g. mergesort is faster than bubblesort. I personally found it a somewhat dull topic but it is probably worth delving into a bit for "self improvement" purposes.
Functional programming is worth playing around with. US universities tend to focus on Lisp, I think. ML and Haskell are often used in the UK and have a very interesting type system (proponents say that it's about the most advanced one out there) that it's also worth being aware of. Haskell is also a lazy language, which is interesting although you're unlikely to encounter it anywhere else! Some of my ML programming course dealt with how to build lazy data structures without explicit language support, which was potentially a useful technique.
Others have mentioned design patterns. I guess it's worth looking at those since even though you might instinctively know some, it's easier in an interview if you can *name* them so they know you know what you're talking about.
As someone who was in exactly your position a few years back I might have some insight into this.
First off I have a BS in physics with a minor in Mathematics and my physics department was HEAVY into computational physics. My grad work is in a different field altogether! As an undergrad, we coded for almost every class after our intro courses but we were never required to take programming classes; most all of us were self taught. I had started to code on a Commodore as kid (self taught) so I was comfortable with programming, but programming is not computer science!
A few years back I took a position at a software company as a developer (still there, still coding, although now a little more senior) and my tech lead was a computer scientist (versus a programmer or engineer). I had the good fortune to work with him for a few years and I quickly learned that programming is not computer science.
As others have pointed out, you may have missed good OOP design (talked a lot about but rarely done), design patterns and big O notation (I had a numerical analysis course in college so I had seen that before); these are easy to learn as needed (e.g. The Art of Computer Programming, Google, etc.). Perhaps the biggest thing I had to learn was the whole development process; customer requirements, functional specifications, technical specifications, QA cycle, learning to do what is needed versus what is requested, defensive coding, future anticipation coding, etc. Another thing that you might be missing is a fundamental understanding of why a computer does what it does (e.g. why you shouldn’t compare two doubles for equality, race conditions in parallel work, etc). Again, my physics program was computationally heavy so I covered some of this but your history may be different.
I’d say that should cover what you’d find you’re missing (for your first 2-3 years as a professional at least). I might add to this the fact you under estimate how little new code you actually get to write; meetings, documents, technical support, design and maintenance take up a lot of your time (both low level and senior developers).
I think the biggest help was coming to terms with the famous quote “Computer science is no more about computers than astronomy is about telescopes.” (Edsger Dijkstra).
Don't overlook the knowledge to be gained by working with someone who's already been there. Try to find someone who does (or did) the same kind of programming that you do and open a dialog with them. Swapping war stories with someone who has 10 or 15 years more experience can help you decide what kind of things you still need to learn, and what direction you want your career to take.
As you, I am a self-taught programmer. I started at young age with a ZX Spectrum 48K, and after finishing high school, I had the good luck to have an opportunity to work at a software house as a programmer. I know now that what I knew about programming at that time was really close to nothing. Currently, I "write code" for a living as a freelance professional (though what I actually "sell" is always an end product or service to meet specific needs).
What I learned through the years by experience, is that the practice of project management is crucial when writing code and has an enormous impact on the end product's quality and meeting deadlines. The fact that a person has developed the habit of looking at a project (no matter how small) from a top level perspective makes him or her make better decisions when writing code or designing models.
Getting the work done and well done, in time and within the budget is always all that matters. That's why I believe that PM is the most important discipline for any programmer, much more than being highly specialized in a specific technology or area. It gives the individual the ability to see the whole, articulate better with other collaborators, understand the ramifications of his decisions and the purposiveness of his work.
"Computers and Intractability" is more like a list of problem-reductions among NP-complete problems + references.
If you want to write "Problem XYZ is NP-complete", it is a good book to quote, but it's not at all a book, that teaches you anything...
The MAFIAA is a bunch of mindless jerks who will be the first up against the wall when the revolution comes
One of the biggest ones I've seen is that self-taught programmers tend to not think about algorithm efficiency. Learn how to determine the big-O of your functions and learn how to code more scalable algorithms.
http://en.wikipedia.org/wiki/Algorithm_analysis
http://en.wikipedia.org/wiki/Big_O_notation
IMHO it's not going to be specific gap; the gap is going to be quantitative. Formally taught programmers will have fewer of them, because they're "forced" to work on stuff that doesn't interest them or otherwise wouldn't ever come up in their projects (or doesn't appear to them, to come up in their projects). You can be self-taught and have a shitload of professional experience, and yet that experience can be very narrow, whereas a CS graduate is going to have most of the bases covered.
So it's not so much a matter of self-taught people not knowing design patterns or algorithm complexity analysis; it's that they're just generally less likely to know them, and even less likely to know both. That's why I think a lot of peoples' suggestions here will ring true at first, except that for any of them you're also going to realize there are a lot of counter-examples, where self-taught programmer x actually does know about complexity analysis. But keep adding and adding to the list, and you'll see that the CS guys know almost all of it, and the self-taught ones know less of it. It's not this item or this item; it's the count.
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
There is a confounding variable in this which is the level of expertise needed for a particular job. A programmer doesn't need a degree. Someone that just writes code doesn't need advanced knowledge. Developers need some experience and knowledge. Software Engineers need a higher level of experience and knowledge. Even more so for Software Architects. In essence, the self taught programmer isn't missing anything. They know what they need to know. The self taught Software Architect is a whole other story. At this last level, it's important to be able to envision how you want a whole system to work, how others might want it to work, figure out which algorithms are important, what qualities it should possess, which patterns and anti-patterns are the solutions toward your goals, be able to re-envision an entire sub system and how it works to best fit your goals as various test implementations fail, how to combine problems to find a more universal solution, how to break down a problem to define the various different pieces of what you're dealing with, etc.
This all assumes the knowledge of Patterns, Anti-patterns, Algorithms, Data structures, OO, Threading, Computability, Automata, Language design, Usability, Business & Risk Management, and computer architecture (etc, etc) to have in your tool set to solve the problems of designing and coding a system. Gaining the knowledge of this tool set is very hard without formal training. The ability to think logically about these problems is also enhanced by the knowledge and experience of having to think about these things.
In my experience the biggest problems are caused by self-taught programmers who lack the humility to realize that computer science and computing a large field, in which they are not domain experts of all of it. Seriously, I'm saying this not to be an insult, but as a plea for self-taught programmers to take their blinders off, and admit to themselves there is a lot of knowledge about computer science and computing (or IT), which has a rich if relatively short history. The ones who get pass the chip-on-their-shoulder defense often become very good to great programmers. The ones who don't tend to become isolated, confrontational, unable to handle constructive feedback or criticism, and tend to be poor team players.
The other big weakness they can have, is they seem more apt to have problem with NIH-syndrome (not invented here). Anything they didn't do or create is crap. Again this seems to be a self-defense mechanism gone astray.
If these two psychological factors are dealt with, the technical knowledge gaps are in comparison trivial to deal with.
In your case, exposure to higher education including post secondary mathematics, helps with the building the experience of abstract thinking which is a excellent trait or training for programmers, to deal with programming in both the concrete and abstract terms.
A programmer who knows and understands the fundamentals of computer science, including data structures, algorithms, number systems, boolean logic, at least a basic understanding of computer architecture in my experience tends to be more flexible and adaptable to change in computing / IT in general as well as able to less stressful to change development environments including languages. Structure and Interpretation of Computer Programs, Structured Computer Organization, and Introduction to Algorithms are excellent resources for any self-taught programmer looking to fill gaps in the knowledge.
Also professional computing / IT society memberships might be worth considering (especially if your employer will pick up the tab), for example the ACM, and the IEEE Computer Society. Both have a bent towards academia, but they largely due to the self-interest of authors in academia to publish (for their own career success), as opposed to a conscience focus away from the "real-world" programming in the trenches.
For any new professional programmer, texts like The Pragmatic Programmer, Code Complete, Peopleware, and The Mythical Man-Month are strongly recommended reading matieral.
Computer programming is of little value without the ability to communicate.
I have to hire college graduates to change diapers at the school I run -- to ensure that when they do speak to the children, they do so with correct grammar.
Of my four years of formal education, two classes stand out:
Database design - If you don't know the formal procedure for database normalization, you shouldn't be designing a schema -- not because you'll necessarily use that procedure, but because you know the factors to take into consideration.
System architecture - I think the one most useful class I ever took was the one that stopped the CPU from being a black box. I had an old-school professor, and he went deep -- we started off with digital logic, Karnaugh maps and the like, and by our final could draw the extra data paths needed to be able to implement a new instruction on a MIPS chip.
...and then there are the opportunities one gets access to as a result of having been at school; much of what I know I learned at my internship (being one of the few userspace developers in MontaVista Software's sea of Linux kernel programmers around the tail end of the boom), not an opportunity I'd have acquired on my own.
I am about half self-taught and half college-taught. I am currently looking for jobs, and despite having 10 years programming experience and knowing more than a handful of languages, understanding OO and algorithms, I don't have the minimum requirements for most jobs I'm looking at. Why? I don't know SQL well. It's one thing if you're looking to work at a major, kickass place like Google or Microsoft, but a lot of the smaller shops are looking for people who can write client-server code in Java and SQL. If you don't have that, 80% of the jobs on Monster are off the table.
Being a computer scientist means you tell people how computers should work, not that you know how they actually work.
1. Learn how to use source code repositories (including branching, merging, and resolving conflicts), and how organizations do releases
2. Learn how to do code reviews, and how to adapt to local practices and style guides.
3. Learn how to be flexible.
4. Be humble.
Self-taught programmers frequently develop and religiously stick to their own favorite practices, code formatting, programming languages, IDEs, etc. When you're part of a team, though, your code needs to be readable, understandable and maintainable by other members of the team. This means learning how to be flexible and work effectively outside of your favorite language/environment. Some of this you can get by participating in some open source projects, and active participation in open source projects is something I look for on resumes when a candidate seems to be self-taught.
Heck, I think students who didn't opted for that RDBMS design class would design tables in complete different ways than one who does. For example, I tend to design and interpret tables with relationships (one-one, one-many, many-many) first, and then fill in the PK/FK later, while some colleagues just start with table and keys at the same time. Or, I can look at a set of tables designed by someone else and find that it makes no sense when you draw out the tables using "chicken feet" diagrams.
Both self-taught programmers and school-taught programmers have gaps. I feel like I got pretty good exposure to both scenarios, since I was self-taught but did go to college, where I had some of those gaps filled in (and also took plenty of courses where I learned precious little).
Some things that self-taught programmers often lack are,
There's others (compiler theory, AI, whatever), but in practical software development those are the two that come up most often. But I find that self-taught programmers are often better at what they do than school-educated ones, because they are sensitive to the gaps they have, and they know how to learn things on their own.
School-taught programmers on the other hand have far, far more gaps in my experience. They often lack
In my opinion the main problems with current Computer Science curricula are that the professors themselves are heavily isolated from real-world application development. Courses on version control (especially newer tools like git or mercurial as opposed to CVS) are virtually nonexistent. Courses that teach students how to structure code for others to use (ex., writing code as a library for others to link against) don't exist. Because Computer Science courses are isolated from real-world needs like maintenance and working with others, the only criteria students' code is judged on is whether or not it produces correct output.
Students learn one or two languages, but virtually always languages that share roughly the same syntax (e.g., C++ and Java). Very little time is spent on highly-productive scripting languages like Ruby, JavaScript, or Python. And students are never taught to adapt their solutions to the strengths of different languages. If they learn Java first, every program they write will try to solve it the "Java way". Not to pick on Java — the point is simply that nobody is taught to identify and use the strengths of languages, or learn their common idioms, so they just write everything like it's their Language of Choice.
No comment.
I'm learning C++ using Stroustrups's 2009 "Programming -- Principles and Practice Using C++" textbook. This book is huge, but well-written for both first-semester college students and self-taught programmers.
I opted for C++ instead of C because I wanted a multiparadigm language to help me learn different approaches to coding. The procedural paradigm is strong and useful to learn how computers work in low level, but I C++ provides with other useful paradigms as well which might come handy when you jump to another language. Afterall, learn C++ at an adequate level and then most other computer languages should be easy to comprehend.
Nevertheless, programming is expressing some ideas in code. To get these ideas and put them down in useful algorithms is another story. Perhaps in a formal Computer Science or Software Engineering course they learn about how to design the algorithms by practicing elegant abstraction techniques.
"Sum Ergo Cogito"
Having worked with great programmers who were university educated or self-taught I agree that advanced data structures and algorithms is an obvious gap. However I think there are some more general gaps:
1. Underlying theory. The more general case of your point I suppose.
2. A common vocabulary. For example I can say thats NP and a university grad is more likely to understand.
3. The ability to complete a long, boring and bureaucratic process. Seriously this is an asset, it demonstrates that one has the potential to stay with a project until completion. Some people coming from the self-taught camp bailed out of the more traditional university path because of the time and bureaucracy. That is a liability for some projects. How many open source projects founder because there are plenty of volunteers for the fun parts but few or none for the boring parts?
--
Perpenso Calc for iPhone and iPod touch, scientific and bill/tip calculator, fractions, complex numbers, RPN
It really depends on the person and the exposure. I've been writing code for more than 25 years and over that span I've developed projects in nearly every language/platform imaginable. The young people I mentor and mange are not getting much benefit from college as I far as I can tell. Most of the colleges seem to be behind in technology. Being taught by professors that either have never worked in the field or did so 20 years ago and have lost touch with today's scene. Most of my best employees have the hunger to learn. It's not what they're learning in school, but what they're teaching themselves. All anyone needs is a library card (or Google these days) and a strong desire to succeed.
Gaps between those that are self-taught and those who are university educated are primarily due to their study materials. Those going down the university path have an advantage due to broader and more in-depth coverage (particularly with the underlying theory and math). I've found solutions to problems coming from topics I never expected to be relevant and topics I may have never studied on my own. However I have a friend who is primarily self-taught and who is blessed with the intelligence and motivation to read a broad collection of university textbook level materials on his own. For such an individual there is really no gap.
... Doing so will vastly improve your odds of making it.
So if you want to be a video game programmer do not go to your local shopping mall's Barnes and Noble bookstore and buy books on C++ and OpenGL. Go to your local college/university bookstore and buy *textbooks* on Data Structures and Algorithms, Computer Architecture, Computer Graphics, Databases, Networking,
--
Perpenso Calc for iPhone and iPod touch, scientific and bill/tip calculator, fractions, complex numbers, RPN
Ours uses ADTs, Data Structures, and Problem Solving with C++ Second Edition by Larry Nyhoff. It may be too simplistic for an accomplished amateur, but it does go beyond basic control structures. You can get previous editions of used text books fairly cheap. There usually isn't much new in a new edition. It is mostly a scam to enrich text book producers.
"I'm not a quack, I'm a mad scientist! There's a difference." - Dr. Cockroach
There's way too much emphasis on Software Engineering in these comments. Multithreading? Self-taught programmers are the only people who ever have the time to study IA-32 and associated errata to the point where they're able to write a reliable threading library. Object-oriented design? Computer architecture? Design patterns? All of this is easily within the reach of a self-taught programmer.
Here's my answer: self-taught programmers will not learn computability theory, complexity (incl. real analysis), classical algorithms/data structures, discrete math, combinatorics or numerical methods. Other than that, a self-taught programmer will probably be much better at the purely mechanical part.
You could teach people all they need to know about big O and common algorithms in an afternoon.
Sorry, but I gotta call B.S. on that one.
You need YEARS of mathematical training to grok this stuff.
Have you ever tried teaching college level programming to recent American high school graduates?
I have had young adults [some already with bachelor's degrees who were coming back to school to brush up] who couldn't reliably compute anything in Base-16 [hexadecimal].
They need the better part of a decade's worth of intensive mathematics training to get to the point that they could really grok the difference between what goes into a "slow" O(n^2) algorithm and its "fast" O(nlog[n]) counterpart.
And let's face it, lots of people doing basic HTML or VBA [Visual Basic for Applications] probably don't have sufficiently high IQs to make that transition.
And even if they do have sufficiently high IQs, then summoning the self-discipline [not to mention just the spare time] to tackle this stuff is going to require a really formidable application of the will.
Which is not to say that it can't be done, but the odds are definitely stacked against them.
This is something even experts struggle with. As a long-time skeptic of OOP[1], I've tried to find a consensus about the "proper" way to architect an app using OOP so that I have something consistent to dissect and publicly analyze besides "toy" examples, but found the OOP author/proponents' answers either vary widely per practitioner or are too open-ended.
They pretty much say, "I know good OO when I see it, but I cannot write down solid rules for recognizing and/or creating it". It's difficult to dissect and analyze the benefits of techniques if they are a moving target or non-describable.
[1] It has its place, but for the most part is over-hyped. It's one of many tools
Table-ized A.I.
Take a look at Project Euler
Being self taught, one of the major revelations I had was that the more I tried to teach myself, the more I realized I knew just a FRACTION of what it took to become a good developer. In the beginning when I thought I understood what i was doing, writing oop programs, when in fact I was writing procedural code with objects scattered around it. Regular Expressions were (to me) among the arcane arts known by masters, engines were something to be licensed from those same masters, and I didn't even hear the words "design pattern" until I'd been trying to work professionally for over a year. Many of the things I know today, I wouldn't have even known to ask/research/learn about back then, because I couldn't conceive of its existence. Heck, I'll probably be saying that forever! To quote a friend of mine: "I cannot grasp the depth of my own ignorance." However, while doing the 'self taught' thing, I have seen other threads like this recommending the staple books like the Petzold book for windows programming, the O'Reilly series, the Stroustrup book for C++, and the "Head First Design Patterns" book. However since i still have google up when doing the vast majority of my coding, I've actually found that more and more I've been bookmarking blogs by developers more experienced and skilled than myself. Moreso than most books, the blogs I keep bookmarked tend to have prefaces at the beginning of each article explaining exactly what programming obstacle the blogger had encountered, thus providing a concrete application for the code to follow in that article.
I think one thing schools teach is to comment. In schools, comments are generally how you explain to your professor what you were doing. In the work place, comments is how you explain to your co-workers, and predecessors what was going on in your mind. It's also how you learn what your predecessors and co-workers were thinking when they wrote code you are inheriting. Generally speaking , people who learn by them selves don't comment, because they don't expect anyone else to read their code. Personally I find commenting a great way of explaining to myself how I am going to go about solving a problem as well.
The second big topic is efficiency. Anyone can write code that gets the job done, but how efficient is the code you write? Does your code complete in constant time, exponential, or logarithmic? With small, low volume projects this doesn't matter, but how well your application scales to serving large numbers of people or utilizes vast quantities of data it matters immensely. Google Big O for more information on this topic.
Finally at college you learn to use a lot of programming laguages, many of which you will never use again. While you might think this is a wasted effort, it helps you learn how to learn new programming languages. You seem to have a lot of programming languages under your belt, so you probably have this covered.
If you are doing basic bean counting software, you don't really need that much math. Calculus is overkill.
Heck, in some instances, you need to know how to interpret legal documents more than you need advanced math. I remember a case where I had to tweak an asset ledger system in order to help it meet IRS standards for depreciation after a change in ownership.
At the same time, if you are doing engineering or scientific programming, it can be extremely useful to know Advanced Calculus because that is what the engineers and scientists are using. You may be even better at the topic then they are because you have to translate the calculations into code.
One thing that you're going to be seriously lacking in knowledge is code management.
You've got to use something like CVS or Subversion to keep track of your code. (Both tools are free) If you aren't using a CMS, then you don't have code, you have a bunch of text files that happen to compile. You can check differences, make rollbacks, and use a lot of great tools to track what's been going on in your codebase. You can try something new and then either integrate it or dump it. I used TortoiseCVS as the front-end client for CVS for years. (When I moved into PLCs, I couldn't find a tool that would work, which was quite terrible.)
Nobody teaches you that, even if you go to school.
You'll also want to learn some scripting, like Tcl / Tk, so you can run automatic tests. Nothing shows your edge conditions like a mindless script.
Learn how to comment and be consistent in your code. Use good and obvious variable names with consistent syntax. If you want to use CamelCasing, that's fine. If you want to use ALL_CAPS_WITH_UNDERLINES, that's weird but okay. Here's the stumbling block: The code should obviously tell you WHAT it is doing. The comments should tell you why it's doing that. I wrote a Viterbi decoder that was about 30 lines of code. With the comments, it was closer to 130 because it was a weird and intricate system that required a lot of explaining. Trust me, when you go back in five years and wonder what in the hell you were thinking five years ago, those comments will save you a lot of grief. People who say they don't have to comment are either terrible programmers or terribly inexperienced, NO EXCEPTIONS.
---
ECHELON is a government program to find words like bomb, jihad, plutonium, assassinate, and anarchy.
"Happy families are all alike; every unhappy family is unhappy in its own way". Same for self (or partially) taught developers. Each will have a different gap. It depends on the developer's interests, project history, etc. A self-taught game developer may have a solid grasp of 3D and analytic geometry, while a self-taught web developer may have a solid grasp of database theory. Presumably, a developer who went through academic training will know at least something about both (and many other issues) - depending which school he went too, but that's another discussion. So, bottom line is, "it depends".
The longer you work at it, the more you will be exposed to, but you'll learn a bit more over the first few years. I'll list a few that come to mind.
Modeling--UML. You really should know sequence diagrams, they rock. Seriously. When you need them, you NEED them.
O notation. O1, On, ... Just helps you articulate some concepts that you intuitively know but might have trouble discussing with others.
Data Structures--what to the insides of a hash table or heap look like?
State Machines--They can replace threading in many scenarios, would you automatically know when and how to use one?
Low level crap--how are variables stored? how do references work? how do operating systems work? Could you dive in and write a parser for a DSL if it was prudent?
There are also many areas that a self-taught person would have more in-depth knowledge then someone who decided to specialize in computer science after a year or two of college just due to his passion.
A lot of it, too, is simply communication. If you take a class on Design Patterns, nearly everything they teach should be DAMN OBVIOUS to you, however if someone comes up and says that should use the Memento, Flyweight, Bridge or Builder pattern, would you know what they were saying? When you need a pattern you should be able to derive it just through being a programmer, but how long does it take you to describe it without knowing the names? Get a book on design patterns, by the way. One on Refactoring wouldn't hurt either.
I may be somewhat biased since I have a BS and MS in Computer Science. Computer Science gives you a good foundation to understanding what is going on behind the scenes. More understanding is better. There are libraries and built in functions for just about everything these days which anybody can use, but you are better able to diagnose problems when something goes wrong if you have a deeper understanding of how they work.
A recent CS grad and a self taught developer with 4 years of experience are not at the same level. The self taught person is generally going to be better at writing applications. However, in the long run a person with a CS degree will surpass the self taught person. This of course assumes equal levels of talent. About 10% of developers are just better than the rest by an order of magnitude or more.
I learned a lot in school that isn’t particularly useful for my job. For instance, no one has asked me to write an application to solve differential equations. I did have to write a quick sort in basic when I first graduated. I also ran into an issue recently where a client had some equations they wanted coded into an application for reporting. I had a self taught developer who just looked at it and had no clue. I coded the equations into a function for him so he could finish the report.
Nothing.. keep doing what you are doing.. I'm a self taught programmer myself and fiddling with the code seems to bee and will always be the bet way do learn...
because you are not just learning "what to write" (copy&paste like).. but you are learning "why to write".. what work and how it works.. and that will always give you a more deep understanding of the platforms and technologies..
Of course reading a bit about general concepts.. will help you generalize and will give you the tools to "abstractly" design and application.. but in the end it's always about implementation.. and that is what you need to know about.. And if it works.. well,fast and with out bugs.. who cares if is a "standard design" or just something you just cooked up.. if other is team dot understand.. then its them that need to get practical...
Not to seem to full of myself.. but if I compare with other guys in my company.. they may know all theory behind designs.. and may know a lot of buzz words.. but in the end.. I'm the one that does cleaner, simple working code.. and I'm the one, they all come to when they need to understand how the framework works or to make some kind of "tricky" code
There are two main types of computer people in this world - those who know theory and books and those who know from experience. If I was running a company, I'd want 100% self-taught and capable programmers and not a bunch of cookie-cutter designers. Sure, the formally taught employees might get the job done a bit faster most of the time, but thinking laterally and outside of the box as well as cleaning up and fixing problems and emergencies, well, that's something that self-taught people have to do or have done many times in the past. It's a different type of thinking, really. And it's something that a lot of college educated students just don't have skills in or have to spend years learning.
Perhaps another example would be those 100-in-1 electronics kits that a lot of us had growing up. We may replace components now instead of fixing the actual parts that broke, but there's a world of difference between an engineer or even a PC tech who understands and knows why the stuff works from using/making it themselves versus just what they were taught somewhere. One knows why it failed and will likely fail again, and the other knows how to use a screwdriver. (and yes they still make those kits, so get your kids one!)
Plus, most college courses require you to take tons of useless classes to GET to the more advanced stuff that you could just as easily go out and buy a book on Linux and save yourself 1-2 years of classes. Of course, your education matters, since if for no other reason than to get that interview, but programming should be something you know and learn and not necessarily what you actually got your degree in. IMO, the smart CIS major switches to Math, Physics, or Engineering instead and does programming as a skill they happen to know.
I'm a self-taught programmer.
I know algorithms, complexity theory, and lots of math: linear algebra, discrete maths, etc.
What does a good programmer need to know though? Fundamentals. Everything else is specialization. You'll need to know hardware, compilers, interpreters; abstractions, and a solid foundation in scientific principles. You need to be able to hypothesize, implement, test, and evaluate. A basic understanding of algorithms and complexity theory naturally comes into play when theorizing about computer problems which is why most courses integrate them into the curriculum. Beyond a very principled but fundamental understanding however, an in-depth and broad education in such subjects isn't necessary.
I taught myself a lot of maths because that's what interests me. I wanted to know how to write ray tracers, how to optimize them; how to use statistical analysis and discreet mathematics to sort through and find interesting facts about large sets of data. Do I need to know any of that to write a piece of software, compile it, and have it run efficiently? No. I learned that stuff for the kinds of programs I was writing.
The only real difference I find between myself and my university-trained colleagues is that I haven't suffered under the same financial burden as they for the knowledge I've acquired. Yet they talk about it like it's a serious advantage! Of course, there really isn't a serious difference between us that matters when the software ships. It's just a matter of perspective.
The stigma engendered to self-taught programmers however, is the hurdle I find most difficult to overcome and also easiest to ignore. Difficult because people assume things about you when you tell them that you learned on your own. Easy because once they realize that you are competent and capable, the stigma is removed. What many people don't seem to realize is that there are some great programmers out there who were self-taught. Just read through some of the interviews in "Coders at Work" by Peter Siebel. If the autodidacts among us are to be taken seriously, this generalization that self-taught programmers are backyard hacks has to stop. A bad programmer is a bad programmer whether they are self-taught or university taught.
After all, I could just turn around and generalize by saying that university trained programmers are privileged snobs who think "real-world" programming is beneath them. But that wouldn't be fair would it?
It takes all kinds and to each their own.
If you think you need to know something to solve the problem you're having, learn it. Move on.
If you grok the math enough to understand the difference, you would probably be working for vendors that create the hardware and software that everybody else uses. Or you would be doing software design and development for engineers or scientists. In either instance, you need to grok the math in order to write the software and design the hardware/firmware.
If you are just trying to learn enough to apply various algorithms, an afternoon or so would suffice. That or a cookbook on the 'best' algorithm for a given circumstance. It is kind of like driving a car. You don't have to know how the mechanical system works in order to get acceptable results. The results may not be as good as they could be though. You run into a trade off between the time spent finding the 'perfect' algorithm and the time it takes to get the job done.
It would be interesting to do a study on the costs of using the right algorithms versus the costs of using something that works. If it is something that is life and death, or you are dealing with big impacts in terms of time, money or other resources, you need the right algorithms. But if it is something relatively trivial, you need to ask yourself the question if the cost of implementing the 'right' algorithm is more than the cost of getting the job done quickly.
Understanding Linear Algebra and Calculus allows for Mathematical solutions that can drastically cut down on the amount of coded recursion, looping, and needless flow control (if-then-else, case,select,switch,etc...)
-=[ Who Is John Galt? ]=-
You can learn the theory, semantics and grammar of programming, you can learn everything in the books, and in actuality be a better programmer.
But here is what you miss. A culture and a type of group think that the university pound in to the students.
This is one where they are taught to produce mounds of myopic code. They are taught not to ask questions and to be OK not understanding the bigger picture and how it all comes together.
At the same time more attention is given to every dam minute detail of a language and with trivial logic puzzle during the interviews that these guys get through better. While at the same time, produce mostly marginal bloating and inefficient code that will almost never take in to account future growth paths.
Being self taught myself, I find it's getting increasingly harder to get in to corporate jobs because of this.
I have also run several startups and had many programmers working for me and have a hard time working with formally educated programmers unless they were self taught before they went to school.
This is again because with the self taught programmers I just have to share my goals and view, and they will plan with me.
With the "educated" ones, they just put out brittle code that lacks insight.
I guess it's like an English major vs. a best selling novelist who probably never needed to study English in college to get to where he is.
I am always doing that which I can not do, in order that I may learn how to do it. - Pablo Picasso
How full of bullshit your superiors are. I mean you'll deal with professors and administrators who are completely full of shit. This will prepare you for your bosses who will be so full of shit that the only way they could be more full of shit would require the use of a pump, a hose, and a bag of manure.
Did you know 80 to 90% of the moderators on slashdot wouldn't recognize a troll even if one dragged them under a bridge.
This U.C. Berkeley Data Structures class would help a lot:
http://www.youtube.com/view_play_list?p=4BBB74C7D2A1049C&search_query=berkeley+cs+61b&rclk=pti
You could probably go through all the lectures in a month and you would know a lot more than most programmers in the industry!
One interesting thing to learn is about algorithm which are O(n), O(n2), etc.
It's not difficult and it can help optimising a program..
Also the way to proof that the program is correct (there's a introduction about it in programming's pearls*), it's not used often but it can be very useful for 'tricky' algorithms.
*Funnily the sort program with the proofs has still bugs: it computes the middle of a range with middle:=(x + y)/2 which can overflow!!
Functional Programming, unless your etc. did include Lisp/Scheme or OCaml or Haskell.
Some assembly wouldn't hurt.
Why? Different ways of thinking about solutions and it will feed back positively into what you do with the imperative C-like languages you did list.
Algorithms and datastructures are an enormous gap in most autodidacts' knowledge sets. The NIST DADS is a great place to fight that, or alternatively an algorithms book like CLRS or Knuth 2.
StoneCypher is Full of BS
I've found most formal training fairly useless. This is also what I've heard recounted by most of my community and profession peers. I think this is the same whether it is CS, ElecEng (software) or my own personal choice, industrial design. This is from a sample of several hundred software developers, and web developers in the UK, USA and Australia. It has also been my experience as an developers being responsible for looking after new graduate employees. There is real value in learning how to learn. An in this sense post-grad courses can be of more practical use I'd say, though they still have potential to be as useless. I realise this is a broad brush stoke, and I would confess my own degree has given me useful skills in team working, usability and an understanding of what a waste of time formal education is, well 95% of it anyway.
Complexity analysis sometimes evades the college-educated, too, but it's a particular weakness among the self-taught programmers I have known. It's a little sad to see someone trying to micro-optimize the heck out of a very inefficient algorithm, when f they just used a BST or a hash table instead of a list, they'd have vastly better performance.
Get some good books on Algorithms, design patterns and data structures. This will give you more breadth to your knowledge. My first book was Knuthe's "Sorting and Searching Algorithms". I of course couldn't understand everything in it, but just reading about the different algorithms made me aware of all of the work that had been done that I could pull from if I needed something.
I always remember my first boss who was an engineer and a self taught programmer looking at my complicated and efficient quick sort program and telling me that it was ridiculous because you could write a sort program in 3 lines of code. ( I think he had seen a print out of a bubble sort in Byte magazine.) I of course wanted to use the system sort program built into the VMS on our VAX, but he wanted me to write a custom sort program.
I concur.
Recursion and inductive reasoning is typically not self-learned, but once you learn it, it's much easier to reason about your program's correctness when you write recursive functions than when you use for-loop or while-loop. Recursion is also key to understand divide and conquer algorithms. Not many people understand that modern computers are designed to run divide and conquer algorithms well. That's because the memory hierarchy exploits locality of reference, which is a property observed from running divide and conquer algorithms. That's because programmers tend to divide a problem into smaller subproblems, and focus on one subproblem at a time.
Furthermore, I would add data structure to this list as well. Many people advocate learning about running time using the big-O notation (you can analyze space usage such as stack depth using big-O as well), but data structure is a big motivation behind the big-O notation.
I once had a signature.
Freethinking is the most important thing to teach yourself: see all the "design patterns" hints here and people modding them +5. That's terrible "groupthink". Design patterns are mostly workaround 3GL defects, probably 75% of them yet who understands that? Who knows what a 3GL is? I'm self-taught and started with 2GL, eat that youngster ;)
When you read things like "Java doesn't support multiple inheritance", take it with a grain salt. Exercise freethinking. Is inheritance only "subclassing+subtyping inheritance"? Of course not. So maybe that multiple interface inheritance is inheritance after all: multiple subtyping inheritance + delegation is all that is needed to translate *any* OOA/D to OOP. etc.
Most books s*cks big fat azurean mega donk*y dick.
A book on design patterns that doesn't start by acknowledging that most design patterns are workarounds specific 3GL defects isn't worth your time.
When you read something "accepted" then think "oh, that's what they *think*, why do they think that?". Don't take what you read for granted.
Read Knuth if your math oriented. That's "all" you need to know. All the rest is just the latest fluff du jour and as long as you have sound basis there's not a single buzzword technology that cannot be easily mastered.
> I have taught myself C++, Java, python, PHP, etc
Depending on how deeply you "know" each of those languages, you may be lacking experience in some very different ways of doing things. You ought to at least add one functional language (Scheme or Haskell), and do some really low-level programming (in assembly language, or C). If you haven't looked into C++ templates yet, that is another very different paradigm worth learning.
...you ignore when learning things on your own. Seriously, if you teach yourself a language it's usually by developing a project of your choice (mostly for the fun of it, as it has been for me). What people do is, they go for the fun bit of hacking things together to get a working piece and pick one of two choice: * Try to use all the language features even if this means overkill for simple situations. * Go on to piece it together and leave a horrible mess. I used to do the latter >10 years ago in that situation, starting out in the mess that Visual Basic itself is, never really realising just how bad it was. I was proud of what I'd achieved! Looking back, I can't believe I published any of that. It's when people go for the first choice and start realising how software is meant to look like from the source that they learn to be a competent programmer. There's all these intricate details like Garbage-Collection, String-Manipulation, Floating-Point Math (the point being that it is unlike mat taught at school !) and many more things like Memory-Management and some such that one *can* get to work sloppily, but it's only when one realises how to utilise these things in the correct way that i would agree self-teaching is the way to go. I'm a self taught programmer, still in university (3rd year with an average mark of 1.8), employed as a software engineer and I'm 100% certain I wouldn't have been able to achieve 1 quarter of these things without teaching myself how to program. ....Yeah... it is at this point that people can rightfully say they *do* waste quite some time in university.
I'm guessing that working direction with hex is just optimal in some situations especially in graphics, eg:
a BYTE data type =0...255, whereas an int data type = -127...127 ...
a bitwise shift is especially useful when you're working with binary:
given BYTES (r,g,b): r16+g8+b = DWORD (hex_color_code)
Get a good book on Algorithms and Data Structures. Then look at some Computer Science curriculums and work from their. (You can read some classic books like SICP, these should help)
Once you've done that you need to show you understand these things, maybe a portfolio of example code could help you here.
However, you're basically trying to prove that you're a Computer Scientist without a CS degree, in the long run you may want to study for a CS degree just for the accreditation.
From a performance perspective, a program that works well on a standalone system or even a LAN might be a dog over the web. There is also the issue of scalability. When you start pumping huge amounts of data into an app, it should not grind to a halt if the algorithms are as efficient as possible. BTW, you can learn a lot about writing efficient code by working in assembler.
From a security perspective, you should not only be validating inputs and outputs based on the underlying infrastructure, but you should also provide the admins of your app with all the info you can to help them safely configure and maintain it.
Unfortunately, this is not limited to the self-taught. I have not met anyone who took a course devoted to secure programming. And the level of detail in various CS programs is all over the map.
Later . . . Jim
My list:
- complexity theory (and some 'concrete' mathematics)
- functional programming
- lisp
- domain specific language design
- concurrent systems design and implementation
- graph algorithms
- large-scale OO design (i.e. how to design a framework rather than a program)
- DFAs
That's after I eliminated the things I already knew I didn't understand when I graduated in 1993 (language parsing, compiler design and implementation, database theory, unit testing, ...).
What do you want - to be an effective programmer, or to understand CS theory that you may, depending on your specialty, never get to use?
I'm biased - I came into programming as an EE who landed a job as an embedded software engineer. I tend to work on low-level driver issues(race conditions, improper locking, buffering issues), but I've done contract work on VB6, python, C++, bash scripts, Java, OpenGL, digital video, etc... Beyond a good understanding of locking, buffering(be able to implement a circular buffer in your sleep), and how computers function(down to boolean logic, although I find comfort in knowing how to bias a transistor), I don't think you need most of what a CS guy spends his time on.
I barely know what turing complete means, and I couldn't give a shit about the towers of hanoi or a traveling salesman. None of that has ever come up in my line of work. If you don't already know about that stuff, chances are that you're not smart enough to land the type of job where you would use it.
Object oriented design took me a few years to really get. You just need to apply it, and apply it again, and apply it wrong, and see how other people have applied it wrong(no good definition of wrong here, but you'll know it when you see it). Eventually you'll get a better feel for it. I still can't architect as well as some CS friends of mine, but I consider those guys geniuses, in contrast with a lot of CS educated morons.
Write a multi-tasking OS. I wrote a simple one for the 68HC12 a few years back and it taught me a lot about what you need, and what you don't. You could probably find an emulator so you don't have the added challenge of bringing up a development board, but you really should be able to bring up a development board(write mem tests, exercise the hardware(serial ports, gpio, timers, etc)).
Do it all when you're young. Life catches up with you by the time you're 30.
http://www.masturbateforpeace.com/
Each individual is different and I've met some great self-taught programmers as well as poor programmers with with CS degree.
There are some general rules of thumbs though:
- a lot of the self-taught coders are actually college drop-outs that might be technically good but have a poor work ethic
- there are self-taught programmers with a degree from a different field (Electrical engineering for instance). Those can generally do a good job as coders if they spend enough time researching the new (computer science) field tackling coding problems
- open source participation is probably the best way to distinguish a good self-taught coder from a bad one - if a person demonstrably does a lot of work in OS projects he/she can easily become the most resourceful, enthusiastic and helpful colleague
To cover 95% of the work done by your average programmer just go to a local university and see if you can audit a data structures class. This is assuming you're already competent in general.
One thing I haven't seen mentioned is what the whole university experience is supposed to be fundamentally about learning how to learn. Yes, in Computer Science/Software Engineering you will get a broad brush of skills and training, from computing concepts, mathematics, and sciences. But you will also be exposed to other knowledge domains at a high level - economics, business, psychology, philosophy, languages - no, not programming but spoken, etc.
Some of the best programmers I have ever worked with had little to no real training in daily programming skills. Instead, they had degrees in other fields that allowed them to be free thinkers. This helps with solving complex problems, thinking "outside the box" and having a broad experience in things other than programming language details. Anyone can read books on advanced C++ techniques and other "learn how to program in 21 days" types of books and do a decent job at coding. However, I want people who have been through a difficult university program and not only survived, but excelled and had a broad experience. Sure, they might take a bit longer to get up to speed on the "design patterns" and "agile programming" methodology du jour, but they will probably be more likely to excel in a challenging, free thinking, problem solving work environment. Sure, some exceptional, self-taught people can also be amazing, but that's usually the exception.
If you just need coders that write php code to spec, stick to self-taught or community college grads - they are cheaper.
...was learning Haskell! Even if you do it just for the sake of learning it. (Especially since that great new book came out last year.)
My code is on a whole different level now, since I have seen the Matrix. :)
Next step: QED, by Richard Feynman. (Should be a piece of cake now. ^^)
Any sufficiently advanced intelligence is indistinguishable from stupidity.
After about 10 years of learning by just reading specs and figuring things out on my own I began a study. And that's when I heard about hash tables the first time. I would have never come up with hash tables on my own. It was like they were introducing in something occult.
Finding the desired record in giant amounts of data INSTANTLY on average.
It seems like too many people think encryption is only about scrambling data until it looks hard to recover. This isn't just a knock against self taught programmers, but I do tend to see it slightly more often in self taught individuals (including myself; when I was a kid I thought repeatedly XORing a password with plaintext was a great idea until I figured out how to automatically recover the passphrase from the ciphertext with statistical analysis a few years later). Anyone who's looked even slightly deeper knows that cryptography is seriously hard and involves knowing advanced mathematics and learning about every advanced published attack before attempting to design a cipher. Even if you pick trusted cryptosystems, there are timing attacks, watermarking attacks, man in the middle attacks, side channel attacks, and any number of other security problems that have to be addressed in the system using the cryptographic primitives. General security is hard, but secure cryptography is arguably even harder. You might think that a general programmer won't run into problems involving cryptography very often, but how many programs or web sites need to manage users and permissions? Store passwords that aren't vulnerable to rainbow tables? Transfer data over the Internet to another trusted host or user? Securely store and process credit card numbers?
Assuming 32-bit floats, the "small" array is 40 gigabytes. It's fairly uncommon to have that, but not entirely implausible.
The large array is 4 petabytes. There are very few computers in the world that can handle such a thing even today, never mind "a long time ago". The memory allocation would fail. For normal FORTRAN 77 arrays, the program wouldn't even start.
I guess I could see it with C++ using a sparse array class, but your "long time ago" physicist wouldn't be doing that. He'd be using FORTRAN 77 or older.
one example would be to display a table with a drop down for selecting colors, fetching the color list from the database for every row even if it's the same for all of them
Programmers often work in teams on large projects. They may even work for different employers, supplying libraries to a 3rd party who puts them together.
Programmer X does an array index operation to get the color name. He's assuming that this is really fast.
Programmer Y overloads operator[] so that he can internationalize the code. In all the usage examples he's ever seen, nobody looks up color names in a loop. He's assuming that this isn't done.
Perhaps I missed some comments due to my filter settings, but I didn't see anyone directly recommending Discrete Math/Finite Math. That's a pretty important topic -- I did see it sort of mentioned in passing as part of other topics, but having a deep understanding of discrete would help a lot. Boolean logic topics like DeMorgan's theorem & Carnot maps can help you considerably with optimization and even just reviewing the logic portions can help you avoid those "if/else/oops" scenarios wherein you forget to consider one of the cases of a logic statement.
Linear Algebra can also be extremely helpful for a fair number of problems that arise and it can also lead you into linear optimization topics.
Although more of an applied math topic rather than math itself, Digital Design can have a tremendous impact on your understanding of the system -- particularly when dealing with hardware or a compiler bug.
And to toss in some non-math, Operating Systems is another great topic -- as well as Compiler Design. You'll see lots of methods for doing things that you have to do regularly anyway while gaining a deeper understanding of the "behind the scenes" operations.
Discrete maths(1 book http://www.cimt.plymouth.ac.uk/projects/mepres/alevel/alevel.htm) and wikipedia(1 list http://en.wikipedia.org/wiki/List_of_computer_programming_topics). I also have a physics degree and taught myself programming. It's important to realize you'll never be a member of "the in crowd" until you have a piece of paper from an overpriced university. It's the only way unintelligent people can distinguish between programmers and it gives them an easy out if you suck.
To properly design a complex computer system, you need to know the science behind it: discrete mathematics, logic etc.
Most self-taught figure out the necessary craft, such as design patterns, but they often lack the science.
- Don't have the training to survive multiple 2-4hr meetings every week
- Have not been forced to work with coding morons
- No skills in creating weasel-word specifications or slide sets
- And the biggest of all - Will not be able to hang by the water cooler and compare the college you went to with the other geeks
Its not the years, its the mileage
I just hosted Richard Stallman at my university (http://csee.wvu.edu/rms video coming soon). After the lecture was finished a student asked him "What can I do to be a better programmer" to which he responded "Learn LISP."
I can say from my experience with the language, that RMS, ESR, and PG are all right. Learn LISP. It will change how you think.
A job recruiter would say, "You don't have at least 7 years experience in Microsoft SQL Server 2008... I'm sorry but you're not a fit at this organization. Best of luck!"
Applies to both self and institution taught. I came through the self taught route and have been developing software (and employing developers) for 20 years. Theory is great but a natural developer will absorb that even if self taught - and I do believe developers are born rather than made. Many of the mediocre developers are simply doing the wrong job and will never excel because they don't have the right mindset. In the real world I am far more interested in someone who will deliver the required solution than what formal training they have had. And that (usually) means somebody with a track record of successful delivery
What Knowledge Gaps Do Self-Taught Programmers Generally Have?
The same ones us taught programmers have. The ones we need and haven't learned yet.
if only
This wasn't my experience but I guess I'm atypical. I'm self-taught, but in 1987 I bought the 3 extant volumes of Knuth's Art of Computer Programming. You don't need calculus to understand the math, just limits. I can't imagine its taking YEARS for the average person to learn the concept of limits
Off the top of my head - as a first-order approximation - I'd guess that you have an IQ of 140+ [certainly at least 135; very doubtful that it's any lower than 130].
We're talking about trying to teach this stuff to people with IQs down around 120 - maybe even as low as 115 [in a worst-case scenario].
Those folks can't teach themselves Knuth - they have to have it spoonfed to them, over the course of many, many semesters.
And even then, it's doubtful that they'll ever get much beyond HTML or Visual Basic.
'Nuff said.
-- Cerebus
Many self-taught programmers lack even a basic understanding of algorithmic complexity. This is the single biggest issue I run into in industry and should be one that isn't too hard to fix. Again and again I find people have written some algorithm that just doesn't scale to larger amounts of data because it is n^3 or n^4 instead of linear. After that I would say it is a good grasp of some of the ideas in functional programming particularly recursion. There are a lot of other things I run into but I never noticed that self-taught programmers were more prone to them than others.
In my experience, self-taught programmers lack humility, in that they tend to think they know more than they do, and generally assume that they are not missing out on anything that a formal education in computer science / electrical engineering could have taught them. They also tend to perform poorly in a team environment. This second observation is most troublesome, as there is not a complex software project that can be successful without a strong team. Some of these problems may be more of a reflection on the chip they carry around on their shoulders if they have to work with people with more credentials on their resume. Or they may really be as neurotic as you perceive them to be. If they perceive they might be shown up they become overly defensive of their ideas, even though all good engineers know there is usually more than one way to solve any given problem. They also tend to seek out positions of authority, even though they usually have no business managing people. In short, good luck working for them. They tend to be everywhere.
Oh, and the algorithm crap is a complete waste of time. Most software projects have so many more important problems to deal with every day than saving a few clock cycles on a modern 4 GHz quad core. I'm not saying go ahead and write 'tard code and make stuff that is slow just because you're lazy. It's just not as important as making sure you allocated enough buffer space and actually validated that user input stream. And what's the point of saving those extra clock cycles if you continuously have to fix buggy code that was rushed out the door and that wasn't needed in the first place because some poor excuse for a product manager keeps asking for the wrong features from the engineers? Software sucks. If you are motivated enough to self-teach, get out your math texts and get a real job doing anything other than software.
Get the book "Code Complete." I started reading it after a survey in ComputerWeek or TechRepublic asked what the best programing book was. They chose that one.
It covers real world programing, issues over commenting and variable names and standards and conventions vs getting things done vs maintenance down the road.
That book does two things that I found revelatory:
1) Exposes you to functional programming and important features thereof (such as referential transparency) from the very beginning.
2) Starts with a very high-level, functional model of computation (the Scheme language, which is essentially lambda calculus), and proceeds toward increasingly low-level models of computation, culminating in a Scheme evaluator based on a traditional stack machine. Many programming books tell that story in reverse, starting with memory and registers and instruction sets, and then explaining how high-level language constructs map onto those low-level concepts. By starting at the functional level and elaborating increasingly detailed models of physical computation, Abelson and Sussman drive home the point that computation is an abstraction that can be implemented, and thought about, in lots of different ways.
It really is a mind-blowing experience. I read it after 15 years as a professional programmer in languages from assembly through C to Lisp and Prolog, and I learned A LOT from it.
Normalization Theory.
Seriously, the percentage self-taught programmers who slap together databases without the faintest clue is very close to 99.99999% I have seen far too many databases "designed" by these folk with data being duplicated, triplicated, split into 2, 3 or 5 tables (I kid you not!) - collecting their money and walking away. Then some poor slob (like me) has to go in and try to untangle the mess and rescue the data.
Sigh.
Another excellent read is A Discipline of Programming (Dijkstra), which goes to the core of the issues in programming.
In fact, the above List on Wikipedia may give you some more ideas.
anecdote:
via maxxed PDP-8/i and the free-for-the-asking DEC PDP8 & 11 books I learned programming around 1975.
But at first I couldn't properly conceive of nested loops. "linear thinking" ?
So I wrote an klugey/bruteforce/hack sort algorithm as part of a particular dice game for ASR33, using just one BASIC FOR/NEXT loop.
It's in the creative computing magazine jan/feb 1978. In case you have one of those lying around, please feel free to
scan/post/criticize it and perhaps consider me a Slightly Less Anonymous Yet Still Embarassed Coward.
Formally-schooled programmers tend to think 'bigger-picture' and code for wider issues, whereas self-taught take pleasure in the engineering, will adopt minimalist solutions and tend to re-invent the wheel in fragments (as opposed to methods from a large library). As such, the self-taught solutions are likely to be more 'magical' and innovative - not necessarily good for deployment, but could be useful for prototyping and proof-of-concept.
I come from a very similar background to yours and am largely self-taught as well.
As far as Computer Science goes, the important things in addition to what you have already studied are Algorithms (Knuth, Sedgewick) and Functional Programming (scheme is nice and simple and mature). I wish I'd known about Functional Programming 20 years ago...
In the world of work, if you understand and can implement basic algorithms in C and C++ you will be miles ahead of 90% of people.
What's really important for work is good engineering practice. You should learn about design and testing. Test Driven Development is a really powerful technique and quite easy to pick up if you are a hands-on learner. Your code will be so much better as a result. Again. you'll be miles ahead of most other people.
Learn about Combinatorial Testing. See the NIST website.
Do code reviews. Review code written by people better and more experienced than yourself. That's a very powerful way to learn what works and good style.
Befriend some grey-beards and heed their words.
Continuous integration, refactoring, ... it's a lifetime's challenge but it's great fun and very rewarding.
Finally, knowing Bourne shell (or bash) scripting is as very useful skill to have. It opens many doors and can make you a very valuable team member since many people only know GUI tools (Visual Studio).
Don't over-emphasise the PHP. It leads to dead-end jobs that are way beneath you.
Stick Men