Searchable C/C++ DB surpasses 275 million lines
Sembiance writes "I've been working on a C/C++ source code search database for the past year. It has recently surpassed 275 million lines of searchable open source C/C++ code. The search engine is C/C++ syntax aware so you can search for specific elements such as functions, macros, classes, comments, etc. The site is built upon many open source products including: MySQL and Lucene for the database, CodeWorker to parse the code, PHP and Apache for the website and GeSHi for syntax highlighting. I'm currently looking for suggestions on what sort of 'interesting statistics' I could create from 275+ million lines of open source C/C++ code."
i got first!
I write code. I have found that correctly using OO techinques reduces the lines of code and while it is more challenging to write than procedural code it often results in better management.
How is is possible for you to maintain and develop close to a quater billion lines of code efficiently? This seems like a logistic nightmare. I am curious to excactly how you amassed this many lines of code for a database program, was the goal to develope the worlds biggest program?