Out of the 100k passwords how many were unique? Could we have a graph of how many passwords were used how many times? Something that could be analysed to say that in your case about 85% of people used a unique password and 10% used a password in the top 10 or top twenty whatever. This could be used to compare to other datasets to extract a level of cluelessness/cluefulness.
We agree one needs at the very least two environments. When you have lots of money -- or lots of VMs:-) you can have lots of environments. You need a developer free-for-all environment where developers can play, this could be their own machine but sometimes you'll need dedicated machines (hardware, expensive licences, whatever). Then you need testing/QA machines where normally you would not permit developers, and production machines where you do not permit anyone more than necessary.
The packing needs to be done because when you have few people authorized to do installs, then the people who do have that authorization must be able to do any and all installations, and it forces handoff to Operations. I work in a place where we we have so many different applications we need a database to keep track of who is responsible for which part of which application, and it would be (even more of) a nightmare if something had been installed in some unknown or non-standard way.
But there are two reasons for these rules (at least!).
Most people here are taking this from an engineering reliability aspect, and that is a valid concern, but in many companies the rules separating environments are also motivated by security and confidentiality, and are often even based in law and contract agreements. One might hire a team of contractors/temps to develop something, but not only must they have no chance of inserting malicious code anywhere, they must never ever even *see* production data, only dummy data! The classical example is a bank or a hospital, but this could/should also apply to ISPs (mail...), anything that stores SSNs or credit card numbers or passwords, etc.
[It's] a human who then asks you all of the same questions as the automated system that I really hate.
I have a supplier whose automated system asks for contract number and system ID's and the like. Once, my system was totally down and the different numbers I had were refused by the supplier's IVR. I remembered hearing that some IVR systems detect swearing. I quite deliberately swore a few times at the system, and it beeped and asked "Are you currently experiencing a severity-1 production outage, press one". I did and got a human immediately. I'll never again complain about their system . .
I find it much easier to understand that CentOS 6.1 is a newer version than CentOS 6.0, for example, than trying to remember that "Killer Kangaroo" is newer than "Sloppy Sloth".
Well, you shouldn't try to remember that, since Ubuntu names in alphabetical order, just like Android. That will roll around in some half a dozen years, but Ubuntu also has YY.MM version numbers, so you know immediately that version 08.04 is over four years old. It's better than Debian where the name is not given alphabetically, but Debian also has a version number when you need it. Geeks make the OS. Geeks like the wacko names. Deal with it.
If your aim is to use identity as an index, work out a way in which you can translate an identity into an an index or hash value (i.e. one way). This is not going to be perfect (there will be about a million "John Smith"s out there), but if you have a consistent pair such as name and phone number, turn that into a hash and use it as data index.
Bad idea when you get a hash collision. Account numbers do not have to be seen by the user, but there aren't (m)any useful ways of avoiding their use internally.
If OP is storing data for analysis and not for immediate reuse, there are some often overlooked but stupidly easy things to do like making sure that the user-facing machines collecting the data only have append/insert access to the data (no read, no modify). Analysing the data would be done from another machine/subnet/database account whatever.
I tried running a free de-dup program, but it ran for a week straight and was still 'processing' when I finally gave up on it.
Maybe you're not naming the free de-dup program in question out of politeness, but I'd like to know... Or leave a message with the author of said program?
Verdun isn't really a good example. When talking of surrendering in the context of the French Foreign Legion one can only refer the French-hater to http://en.wikipedia.org/wiki/Camerone
First horseman: conqueror Second horseman : war Third horseman : the economic oppressor Fourth horseman : Death (and Hell followed with him, killing with sword, famine, disease, beasts)
Just one "SF" novel, "Kallocain", written eight years before Orwell's 1984. Definitely worth reading for the day when technology can easily detect lies and/or force people to speak the truth.
Well I like the Sudoku Solver at sudokuwiki.org, but it didn't find a single number before saying "Run out of known strategies. Use 'Solution Count' to check to see if the puzzle has only one solution."
I did, and after 2068781 recursions it said OK, there's just one solution.
Cloud + Hadoop + Big Data are serious issues these days. I'm gaining my chops in that area (main emphasis is performance engineering), and there is a LOT of interest in anyone with "Big Data" (Hadoop + MapReduce) type of experience.
MPLS is cool. However in an MPLS network, you have 1) P routers (core routers), 2) CPE ("client premise equipment"), and 3) PE routers (that connect the P routers and the CPEs). The CPE doesn't need to be and indeed shouldn't be an MPLS-capable router. In a five-console library a sub-$1000 Cisco 1800 for CPE would be a perfect fit.
Our Cisco vendor had bid on the same job and not only came in with a far, far more expensive piece of equipment but told us that we would need a CCN-something for five hours to program the thing.
Sure, but if you have 1000 of them you'll need at least one competent full-time tech anyway, and I can guarantee that someone experienced (i.e. knowing Cisco) who has 1000 client routers will prefer the 1000 routers to be Cisco. That's not the question, actually.
To get back on topic, he'd prefer $500 Ciscos, or $1000 Ciscos. The $22,000 Ciscos can go route the university campuses they were designed for instead of taking up space and ELECTRICITY in 1-5 console libraries. Say two or even four each, but I doubt there are 250 universities in WV, am I right?
Some guy interviewed linked from the TFA said it was buying Lamborghinis. The comparison is not quite correct. Instead of buying bog-standard five-seater consumer cars, these people bought new 18-wheeler long-haul Kenworth trucks with three seats and paid extra to have two beds in the back of the coach, justifying the choice by saying maybe one day they'd need more luggage space. Maybe they got a good price for what they bought, but it was a little bit overkill for a thousand parents to bring their kids to school.
Honestly, $22K isn't that bad for a decent edge router. It looks to me like a Cisco 7603, so with a service contract, that's not really that bad of a deal.
Article says Cisco 3945, which at least is marketed as a client-side router. If they're supposed to go to Gbps fiber, a case could be made. It would be full of holes, of course.
State officials requested that the devices include a "T1 interface card" that would allow schools, libraries and other sites to use the high-capacity routers with their existing copper-wire T1 broadband connections -- while waiting to hook up to fiber optic cable.
The adapter cards added $1.08 million to the purchase price.
Instead of, say, keeping the old routers, and buying a Cisco 1800 for less than $1000? There HAS to be a illegal commission somewhere in there... $22 million stupid?
Hold a gun to everyone's head and force them to vote.
Gun? Bad idea. Fines are better, 20-dollar fine in Australia for example. See http://en.wikipedia.org/wiki/Compulsory_voting
Yeah. It actually /is/ good. Except if you happen to know something about computers, unfortunately.
'nuff said
Good luck on getting Netapp/IBM/HDS/EMC agree to cost-free replacement on site in less than four hours.
Out of the 100k passwords how many were unique? Could we have a graph of how many passwords were used how many times? Something that could be analysed to say that in your case about 85% of people used a unique password and 10% used a password in the top 10 or top twenty whatever. This could be used to compare to other datasets to extract a level of cluelessness/cluefulness.
We agree one needs at the very least two environments. When you have lots of money -- or lots of VMs :-) you can have lots of environments. You need a developer free-for-all environment where developers can play, this could be their own machine but sometimes you'll need dedicated machines (hardware, expensive licences, whatever). Then you need testing/QA machines where normally you would not permit developers, and production machines where you do not permit anyone more than necessary.
The packing needs to be done because when you have few people authorized to do installs, then the people who do have that authorization must be able to do any and all installations, and it forces handoff to Operations. I work in a place where we we have so many different applications we need a database to keep track of who is responsible for which part of which application, and it would be (even more of) a nightmare if something had been installed in some unknown or non-standard way.
But there are two reasons for these rules (at least!).
Most people here are taking this from an engineering reliability aspect, and that is a valid concern, but in many companies the rules separating environments are also motivated by security and confidentiality, and are often even based in law and contract agreements. One might hire a team of contractors/temps to develop something, but not only must they have no chance of inserting malicious code anywhere, they must never ever even *see* production data, only dummy data! The classical example is a bank or a hospital, but this could/should also apply to ISPs (mail...), anything that stores SSNs or credit card numbers or passwords, etc.
[It's] a human who then asks you all of the same questions as the automated system that I really hate.
I have a supplier whose automated system asks for contract number and system ID's and the like. Once, my system was totally down and the different numbers I had were refused by the supplier's IVR. I remembered hearing that some IVR systems detect swearing. I quite deliberately swore a few times at the system, and it beeped and asked "Are you currently experiencing a severity-1 production outage, press one". I did and got a human immediately. I'll never again complain about their system . .
I find it much easier to understand that CentOS 6.1 is a newer version than CentOS 6.0, for example, than trying to remember that "Killer Kangaroo" is newer than "Sloppy Sloth".
Well, you shouldn't try to remember that, since Ubuntu names in alphabetical order, just like Android. That will roll around in some half a dozen years, but Ubuntu also has YY.MM version numbers, so you know immediately that version 08.04 is over four years old. It's better than Debian where the name is not given alphabetically, but Debian also has a version number when you need it. Geeks make the OS. Geeks like the wacko names. Deal with it.
If your aim is to use identity as an index, work out a way in which you can translate an identity into an an index or hash value (i.e. one way). This is not going to be perfect (there will be about a million "John Smith"s out there), but if you have a consistent pair such as name and phone number, turn that into a hash and use it as data index.
Bad idea when you get a hash collision. Account numbers do not have to be seen by the user, but there aren't (m)any useful ways of avoiding their use internally.
If OP is storing data for analysis and not for immediate reuse, there are some often overlooked but stupidly easy things to do like making sure that the user-facing machines collecting the data only have append/insert access to the data (no read, no modify). Analysing the data would be done from another machine/subnet/database account whatever.
OP wrote:
I tried running a free de-dup program, but it ran for a week straight and was still 'processing' when I finally gave up on it.
Maybe you're not naming the free de-dup program in question out of politeness, but I'd like to know... Or leave a message with the author of said program?
The recent lost
Shouldn't that be "loss" ?
Unless we're talking about a kind of lawsuit called a "recent" . . .
Verdun isn't really a good example. When talking of surrendering in the context of the French Foreign Legion one can only refer the French-hater to http://en.wikipedia.org/wiki/Camerone
War (well, the sword) is definitely a classic.
First horseman: conqueror
Second horseman : war
Third horseman : the economic oppressor
Fourth horseman : Death (and Hell followed with him, killing with sword, famine, disease, beasts)
Just one "SF" novel, "Kallocain", written eight years before Orwell's 1984. Definitely worth reading for the day when technology can easily detect lies and/or force people to speak the truth.
Well I like the Sudoku Solver at sudokuwiki.org, but it didn't find a single number before saying "Run out of known strategies. Use 'Solution Count' to check to see if the puzzle has only one solution."
I did, and after 2068781 recursions it said OK, there's just one solution.
Cloud + Hadoop + Big Data are serious issues these days. I'm gaining my chops in that area (main emphasis is performance engineering), and there is a LOT of interest in anyone with "Big Data" (Hadoop + MapReduce) type of experience.
Second that. Want to learn a new language? Pig.
Go back to the backpack, and wear it all the time.
I have seen a thief unzip a backpack on its owner's back, steal what was on top, and take off running.
MPLS is cool. However in an MPLS network, you have 1) P routers (core routers), 2) CPE ("client premise equipment"), and 3) PE routers (that connect the P routers and the CPEs). The CPE doesn't need to be and indeed shouldn't be an MPLS-capable router. In a five-console library a sub-$1000 Cisco 1800 for CPE would be a perfect fit.
Our Cisco vendor had bid on the same job and not only came in with a far, far more expensive piece of equipment but told us that we would need a CCN-something for five hours to program the thing.
Sure, but if you have 1000 of them you'll need at least one competent full-time tech anyway, and I can guarantee that someone experienced (i.e. knowing Cisco) who has 1000 client routers will prefer the 1000 routers to be Cisco. That's not the question, actually.
To get back on topic, he'd prefer $500 Ciscos, or $1000 Ciscos. The $22,000 Ciscos can go route the university campuses they were designed for instead of taking up space and ELECTRICITY in 1-5 console libraries. Say two or even four each, but I doubt there are 250 universities in WV, am I right?
Some guy interviewed linked from the TFA said it was buying Lamborghinis. The comparison is not quite correct. Instead of buying bog-standard five-seater consumer cars, these people bought new 18-wheeler long-haul Kenworth trucks with three seats and paid extra to have two beds in the back of the coach, justifying the choice by saying maybe one day they'd need more luggage space. Maybe they got a good price for what they bought, but it was a little bit overkill for a thousand parents to bring their kids to school.
TFA says 1064 routers.
Honestly, $22K isn't that bad for a decent edge router. It looks to me like a Cisco 7603, so with a service contract, that's not really that bad of a deal.
Article says Cisco 3945, which at least is marketed as a client-side router. If they're supposed to go to Gbps fiber, a case could be made. It would be full of holes, of course.
Incidentally, searching for cisco 3945 on the net gives https://supportforums.cisco.com/thread/2146460 which seems to be the reporter behind TFA looking for background.
State officials requested that the devices include a "T1 interface card" that would allow schools, libraries and other sites to use the high-capacity routers with their existing copper-wire T1 broadband connections -- while waiting to hook up to fiber optic cable.
The adapter cards added $1.08 million to the purchase price.
Instead of, say, keeping the old routers, and buying a Cisco 1800 for less than $1000? There HAS to be a illegal commission somewhere in there... $22 million stupid?
Filmed while driving and texting
They should have called Arthur Weasley! He's quite the expert on regurgitating toilets
As the passenger, of course. Only fools drive and text *wink* wink*.
Walk and text and you might get eaten by a grue. Or at least by a bear: http://www.youtube.com/watch?v=WmhvdtX72eQ