Re:statistically, cyclists don't hit pedestrians
on
How Safe Is Cycling?
·
· Score: 1
The comment you're reacting to was using that case as an example for: "should cycling accidents doing off-road or downhill count?". Everybody agrees, i think, that hobby-off-road-cycling should not count towards the "bad rep" for cycling, whereas cycling to work, maybe "on the sidewalk" litterally is "off-road", but should be counted as a cycling-transport-related-incident.
Many people have a screen that's (way) wider than it is high. And they browse "full-screen". I like to multi-task: keeping an eye on other things on my screen besides my browsing. So my browser window will be something like 800x1024 when I'm browsing on a 1280x1024 screen.
Anyway, with that background, having even more "screenspace" that is dedicated to the sidebar is annoying. The sidebar is useful at the top (it has things), but then becomes empty next to most of the articles. So now I have just "half" the screen that's useful, the other half being blank means I bought a big screen with many pixels all for nothing....
I disagree. I'm pretty confident that canonical builds the binaries in a safe way. Even though there is a thirty year old "proof of principle" article on how to hide a backdoor somewhere almost impossible to detect.
The problem is that an average Unix system has many bugs at any one time. Debian and Canonical have given up on updating the kernel on every privilege escalation bug. You might forget to run apt-get upgrade for a few days.
When the NSA needs your data, they will for sure have a big list of things-to-try to get into your system. They will most likely succeed. It's the same with every other directed adversary. He'll be able to get in.
Note that I'm toning down what I read in the media a bit. When the facts state: "NSA was able to hack into some phones", the reports will change from "NSA hacked into phones" into "NSA can hack all phones" and "NSA has access to all data on your phone". They don't. Once they are interested in you, they will try to hack your phone and get what they need.
It's different for internet monitoring. Traffic analysis and automated wiretapping is something they are apparently into. So the big internet providers have automated that and they can't do much to verify the requests they get. So when Snowden says he could tap anyone anywhere, he means he could issue a "wiretap warrent" from his desk, which would reach, say yahoo as: "NSA wants access to the traffic from IP x.y.z.w, you're not allowed to ask why." And then Yahoo might try to fight that a few times, but they have already lost. So nowadays that is processed immediately, and/or automatically.
Moore's law predicts that the "factor-of-33" will be bridged in about 10 years. There is only a factor of 20 to the "peak performance", so about a year before that, peak performance might topple the exabyte "barrier". (Some people plug in different constants in Moore's law. I use factor-of-1000 for every 20 years. That's 30 every 10, 2 every 2, and about 5 every five. This has never failed me: it always works out).
No. I'd gladly pay for a phone that is not 7.6mm thick but say 12mm, but then has a 3x better batery life. It is the autonomy and the "I don't have to remember to charge it up every time" that matters to me.
A cellphone works a normal amount of time on a battery that holds 1500mAh at 3.7V, or 20kJ. Moving that amount of energy in 20 seconds takes 20kJ / 20 s = 1kW. This is quite in the realm of a residential power socket.
The problem however is that you'd need contacts that can handle 250A if you keep the voltage below 4V as the current battery uses. Or you can use higher voltages... To keep the current under 1A (as the a normal phone connector uses) you would need to up the voltage to about 1000V.
In both cases the connector would almost need to be bigger than a modern phone.
Of course these are a bit extreme. Say 10A at 30V would give you a charging time of a minute which is still way better than what we have now.
But in all honesty.... I don't think that she'll be able to match the energy density of a modern lipo.
As a privacy concern the fact that "the network" knows where you are is a problem. It does not matter whether the network knows your location to 3m (your phone sends the current GPS location), several tens of meters (your phone reports the strenghts of several towers it "hears"), several hundreds of meters/ a few km (which cell) or several tens of km's (the LA).
Track the LA's of my phone and you'll know when I visited my mom last week. Sure, you'd have to guess that I actually visited my mom and not someone else in that town. But if someone develops an interest in your whereabouts, the LA will already give away lots of information.
There MUST be a timeout on an "inactive" phone. Otherwise phones that are dropped into a toilet would remain ghost-phones in the LA where they dropped (pun intended) off the network.
With the argument that it is expensive for phones to transmit, they will transmit as little as possible already. If the timeout is known to be 1 hour, they will transmit after 55 minutes. (provided the protocol sends and ACK from the tower, otherwise you have to account for the possibility of a lost packet and transmit for example every 25 minutes).
The cell-phone transmit is "expensive" in that it drains the battery. You can optimize the electronics all you want but if you have to transmit a 1W burst for 0.01 seconds to indicate that you're still there, the energy expense of that burst is fixed and cannot be reduced. This is apparently a trick to reduce the number of transmissions from the phone to the towers so that the battery can last longer. I didn't know that. Thanks for the update!
If you want to receive calls or SMSes, you need to leave the phone on and transmitting:
When a call for your number comes in, the incoming call is NOT transmitted nationally. Only in the GSM-cell that you are actually in is the signal transmitted. So, the system has to know in which cell you are to be able to "call" your phone. If you properly turn it off, the phone will tell the GSM network it is going off. So when a call comes in, it will go to voicemail immediately. If you yank the battery, the system will assume you are still in that cell where you last had the phone on, but it will probably time you out if it doesn't hear from your phone for a while. (which happens naturally if for example you drive out of range).
Chess will go this route. No Master of any rank will be allowed to exceed their 'reasonable' ability.
no.... This incident is a wakeup call. People showing such a sudden over-achievement should not play games that have a live broadcast of their moves. Or maybe live broadcasts of moves should be stopped altogether.
What we suspect from this incident is that he cheated. If he cheated, the "live broadcast on the internet" helped. Cut that route off. Consider other routes. e.g. people in the crowd doing a private "live internet broadcast".
So: We now know that advanced technological cheating might be going on. So tournament organizers should take apropriate (better) precautions.
It's a bit like some rat-research in the nineteenthirties. People were teaching rats (or mice?) to go to the right or wrong portal with the cheese behind. Turns out that the rats were a) smelling the cheese. b) hearing the cheese by listening to their feet hitting the floor of the arena. There was a widely ignored article about how the rats cheated, NOT solving the puzzle the humans had set up but going for the cheese anyway. Knowing THAT cheating might be going on and knowing some of the routes that this might be done helps in providing a fair competition.
I am not sure where you got this, but chess is easily solvable in finite time.
Where did YOU get the idea that "finite time" has something to do with NP or not?
There are NP complete problems. They are "known to explode" and other NP complete problems can be transformed into them such that solving one solves the other NP complete problems. Chess is not NP complete: The chess board is finite, which means that it is impossible to transform say a large "traveling salesman" problem onto a chessboard.
NP is nonderterministic Polynomial. Say the traveling salesman problem has to be reformulated a bit. The "easily understood" form says: what's the shortest route. The formal question is: "Can you traverse these cities with less than xxx km?". Once you come up with a route that solves it in xxx km, you can easily check if the condition is satisfied.
A formulation of "the" chess question: "does white win?" is likely not even NP. The NP answer would be: Look if you do these moves, you always win. With games like "tictactoe" you can do that sort of thing.
Looking at it from another point-of-view: Chess is a finite game. The board is finite. So if there is a strategy, it is bounded by the size of the board. Chess can be solved in constant time. The problem is: the constant is quite humongous.
The FSF is controlled by a certain person known as RMS, who has certain extremistic views on free software. Any and all closed-source software is evil and must not be touched. Fine. He is entitled to HIS opinion, I'm entitled to mine.
What HE wants to "clarify" about the GPL does not concern me. I have the licence and I get to interpret it (within legal limits) as the text allows. We KNOW that RMS wants to "tighten" the GPL to make it more contagious etc etc. RMS would like it very much that when microsoft the company ran any GPL application on any of their computers, they could be forced to open up their software. That's what he really wants. It's not legally feasable to try this right now, but that would be the end goal for him.
When I write code, it gets licenced under GPL V2. No "or any later version". What if RMS suddenly wants to make GPL V4 say: "An author who has released an application under GPL, is required to put all sourcecode he writes on the internet."
I will provide sourcecode to anbody who has (bought) the binary.
The fundamental problem is that sometimes an error is an error to the calling program, but sometimes it is not.
For example, when you issue: open "$HOME/.myconfig", the inability to find the file does not mean there is an error. Just that the optional config file is not there. But when you try to open the source file for an operation, the open-error really IS an error.
This duality happens at most levels. A library wrapping "open" will have the same problem. Does the caller consider this a fatal error or not?
Similarly, sometimes errors should result in telling the user and then quitting. But for a gui application it's better to show a graphical message and continue, even if the error is more or less "fatal".....
Evolution works that way: In good times, a big population is generated that has great genetic variety. When bad times come along, the bad genetic variations will be removed from the population.
Suppose for instance that suddenly tomorrow all oaktrees had pollen that is deadly to most humans. The genetic variations builtup over the last 200 years might have provided a (possibly small) percentage of the population that is resistant to the deadly pollen. The result would be that a small group survives and starts working on a new gene-pool.
Yes, genetically we have been living in "good times" the last generations. More and more "slight defects" in the genetic pool are able to survive into mature ages.
A friend is totally colorblind. A genetic disadvantage, you'd say? Nope, his "grayvision" is a LOT better than that of most of us. Apparently he can spot camouflaged army-material from way further away than us normal people. When suddenly THAT becomes a winning trait (i.e. those that don't have it die), his descendants will form a larger part of the population.
This expansion of the gene pool also allows for combinations. Suppose the guy with the super-vision marries the gal with the super hearing?
The point in photoshop skills is to be able to touch up bad photos. If the students are allowed to take their own pictures, who says they won't just take a good picture to start with?
After doing tens of programs with pointers and stuff, the intro-to-computer-programming had a task to turn in a program that used pointers to sort a list of items (using bubble sort).
Compare the items, and when they need swapping, swap them. SImple. Well, somehow, my swapping routine with the pointers always messed up. So instead of swapping the pointers, I swapped the whole records. Kind of against the point of the excercise. But the teaching assitant didn't spot it. It was just a stupid excercise way below the level I was at, at the time, and I faked the result because it was pointless.
(to illustrate the level op programming I was at, a later course, had me starting to do the excercises a few days after they opened the practical side of the course up. When I had turned all the excercises 2 days later in it turned out I was the second one to complete all excercises. Apparenly someone was doing 1 a day while I was doing 3 a day, but he beat me by 20 minutes by starting 3 days earlier......)
without mentioning it, the author of the article assumes that you will completely drain the battery on every commute, for both the velomobile, and the electric car. With that completely rediculous assumption, he comes to the figure of 25% of the windturbines.
Anybody watch the video and agree with me that he seemed severely hypoxic in his reactions to messages from ground control?
29 release seatbelt.... no reaction
From then on, he's quite unresponsive. "say Roger if... " he responds with "roger", but that could just be a response to the "say roger" and not the part after the "if"....
I don't know if they are actually going to do it this way, but a voice call also causes data to be transferred back and forth to/from the phone.
So by just charging for the data, people get more choices: Chose a lower bitrate and pay less. Send an SMS for incredibly little, etc etc. Download a HQ video from Youtube: Pay a lot.
A botnet is a nuisance because it DOES annoying things.
If the botnet is instructed to send spam, you can detect computers sending "too much" Email. If the botnet is instructed to DDOS a certain host, you can detect it sending the malicious requests.
If a host in a botnet is a "sleeper", it doesn't matter much if you firewall it off. But the hosts doing the malicious, detectable stuff should be firewalled off.
The problem is that if a botnet consists of 2 million computers, and the spammer wants to send off 2 million spams, Then each computer need only send one Email. That could/should not be considered as "too much".
As the author of "same", I was going to post the above suggestion.
Last time I used "same", 4.2 million files was peanuts. Of course, running through 4.8Tb of data is going to take some time.
People above are doing suggestions like doing CRCs of the files. Checking filesizes. Etc etc. Same does all of this:
First a list is compiled of the files to be handled. Then each file is stat-ed to determine its size. Then only same-size files are considered candidates for being the same. Next if the filesizes are the same, the CRCs are compared. The CRCs are calculated on an "as needed" basis. This means that most big media files will never need to be read entirely unless a duplicate is going to be found. Anyway. When the CRCs are the same, the files are compared bit-for-bit and if THAT comes out good, the files are hardlinked together.
The hardlinking means that you can further process the results. You can use find to eliminate say all duplicate files in a directory called "backup", provided that they ARE duplicates. Now you'll be left just with the Uniqe files in that directory.
I'm not sure if all of this will easily run on windows: It's a Unix program. On the other hand, it uses simple calls and should easily be ported using the cygwin suite.
The comment you're reacting to was using that case as an example for: "should cycling accidents doing off-road or downhill count?". Everybody agrees, i think, that hobby-off-road-cycling should not count towards the "bad rep" for cycling, whereas cycling to work, maybe "on the sidewalk" litterally is "off-road", but should be counted as a cycling-transport-related-incident.
Many people have a screen that's (way) wider than it is high. And they browse "full-screen". I like to multi-task: keeping an eye on other things on my screen besides my browsing. So my browser window will be something like 800x1024 when I'm browsing on a 1280x1024 screen.
Anyway, with that background, having even more "screenspace" that is dedicated to the sidebar is annoying. The sidebar is useful at the top (it has things), but then becomes empty next to most of the articles. So now I have just "half" the screen that's useful, the other half being blank means I bought a big screen with many pixels all for nothing....
I disagree. I'm pretty confident that canonical builds the binaries in a safe way. Even though there is a thirty year old "proof of principle" article on how to hide a backdoor somewhere almost impossible to detect.
The problem is that an average Unix system has many bugs at any one time. Debian and Canonical have given up on updating the kernel on every privilege escalation bug. You might forget to run apt-get upgrade for a few days.
When the NSA needs your data, they will for sure have a big list of things-to-try to get into your system. They will most likely succeed. It's the same with every other directed adversary. He'll be able to get in.
Note that I'm toning down what I read in the media a bit. When the facts state: "NSA was able to hack into some phones", the reports will change from "NSA hacked into phones" into "NSA can hack all phones" and "NSA has access to all data on your phone". They don't. Once they are interested in you, they will try to hack your phone and get what they need.
It's different for internet monitoring. Traffic analysis and automated wiretapping is something they are apparently into. So the big internet providers have automated that and they can't do much to verify the requests they get. So when Snowden says he could tap anyone anywhere, he means he could issue a "wiretap warrent" from his desk, which would reach, say yahoo as: "NSA wants access to the traffic from IP x.y.z.w, you're not allowed to ask why." And then Yahoo might try to fight that a few times, but they have already lost. So nowadays that is processed immediately, and/or automatically.
Moore's law predicts that the "factor-of-33" will be bridged in about 10 years. There is only a factor of 20 to the "peak performance", so about a year before that, peak performance might topple the exabyte "barrier".
(Some people plug in different constants in Moore's law. I use factor-of-1000 for every 20 years. That's 30 every 10, 2 every 2, and about 5 every five. This has never failed me: it always works out).
No. I'd gladly pay for a phone that is not 7.6mm thick but say 12mm, but then has a 3x better batery life. It is the autonomy and the "I don't have to remember to charge it up every time" that matters to me.
A cellphone works a normal amount of time on a battery that holds 1500mAh at 3.7V, or 20kJ. Moving that amount of energy in 20 seconds takes 20kJ / 20 s = 1kW. This is quite in the realm of a residential power socket.
The problem however is that you'd need contacts that can handle 250A if you keep the voltage below 4V as the current battery uses. Or you can use higher voltages... To keep the current under 1A (as the a normal phone connector uses) you would need to up the voltage to about 1000V.
In both cases the connector would almost need to be bigger than a modern phone.
Of course these are a bit extreme. Say 10A at 30V would give you a charging time of a minute which is still way better than what we have now.
But in all honesty.... I don't think that she'll be able to match the energy density of a modern lipo.
Yes. And I was joking, not serious. Duh!
To be perfectly honest the proof that the gap between consecutive integers doesn't grow forever is pretty simple. It stays 1.
On second thought:
As a privacy concern the fact that "the network" knows where you are is a problem.
It does not matter whether the network knows your location to 3m (your phone sends the current GPS location), several tens of meters (your phone reports the strenghts of several towers it "hears"), several hundreds of meters/ a few km (which cell) or several tens of km's (the LA).
Track the LA's of my phone and you'll know when I visited my mom last week. Sure, you'd have to guess that I actually visited my mom and not someone else in that town. But if someone develops an interest in your whereabouts, the LA will already give away lots of information.
There MUST be a timeout on an "inactive" phone. Otherwise phones that are dropped into a toilet would remain ghost-phones in the LA where they dropped (pun intended) off the network.
With the argument that it is expensive for phones to transmit, they will transmit as little as possible already. If the timeout is known to be 1 hour, they will transmit after 55 minutes. (provided the protocol sends and ACK from the tower, otherwise you have to account for the possibility of a lost packet and transmit for example every 25 minutes).
The cell-phone transmit is "expensive" in that it drains the battery. You can optimize the electronics all you want but if you have to transmit a 1W burst for 0.01 seconds to indicate that you're still there, the energy expense of that burst is fixed and cannot be reduced. This is apparently a trick to reduce the number of transmissions from the phone to the towers so that the battery can last longer. I didn't know that. Thanks for the update!
If you want to receive calls or SMSes, you need to leave the phone on and transmitting:
When a call for your number comes in, the incoming call is NOT transmitted nationally. Only in the GSM-cell that you are actually in is the signal transmitted. So, the system has to know in which cell you are to be able to "call" your phone. If you properly turn it off, the phone will tell the GSM network it is going off. So when a call comes in, it will go to voicemail immediately. If you yank the battery, the system will assume you are still in that cell where you last had the phone on, but it will probably time you out if it doesn't hear from your phone for a while. (which happens naturally if for example you drive out of range).
The problem with having to enter the code on the PC is that malware running on the PC will be able to get the key.
WARNING!
The thieves come back after a week or two to get all your new stuff you bought with the insurance money!!!
Chess will go this route. No Master of any rank will be allowed to exceed their 'reasonable' ability.
no.... This incident is a wakeup call. People showing such a sudden over-achievement should not play games that have a live broadcast of their moves. Or maybe live broadcasts of moves should be stopped altogether.
What we suspect from this incident is that he cheated. If he cheated, the "live broadcast on the internet" helped. Cut that route off. Consider other routes. e.g. people in the crowd doing a private "live internet broadcast".
So: We now know that advanced technological cheating might be going on. So tournament organizers should take apropriate (better) precautions.
It's a bit like some rat-research in the nineteenthirties. People were teaching rats (or mice?) to go to the right or wrong portal with the cheese behind. Turns out that the rats were a) smelling the cheese. b) hearing the cheese by listening to their feet hitting the floor of the arena. There was a widely ignored article about how the rats cheated, NOT solving the puzzle the humans had set up but going for the cheese anyway. Knowing THAT cheating might be going on and knowing some of the routes that this might be done helps in providing a fair competition.
I am not sure where you got this, but chess is easily solvable in finite time.
Where did YOU get the idea that "finite time" has something to do with NP or not?
There are NP complete problems. They are "known to explode" and other NP complete problems can be transformed into them such that solving one solves the other NP complete problems. Chess is not NP complete: The chess board is finite, which means that it is impossible to transform say a large "traveling salesman" problem onto a chessboard.
NP is nonderterministic Polynomial. Say the traveling salesman problem has to be reformulated a bit. The "easily understood" form says: what's the shortest route. The formal question is: "Can you traverse these cities with less than xxx km?". Once you come up with a route that solves it in xxx km, you can easily check if the condition is satisfied.
A formulation of "the" chess question: "does white win?" is likely not even NP. The NP answer would be: Look if you do these moves, you always win. With games like "tictactoe" you can do that sort of thing.
Looking at it from another point-of-view: Chess is a finite game. The board is finite. So if there is a strategy, it is bounded by the size of the board. Chess can be solved in constant time. The problem is: the constant is quite humongous.
The FSF is controlled by a certain person known as RMS, who has certain extremistic views on free software. Any and all closed-source software is evil and must not be touched. Fine. He is entitled to HIS opinion, I'm entitled to mine.
What HE wants to "clarify" about the GPL does not concern me. I have the licence and I get to interpret it (within legal limits) as the text allows. We KNOW that RMS wants to "tighten" the GPL to make it more contagious etc etc. RMS would like it very much that when microsoft the company ran any GPL application on any of their computers, they could be forced to open up their software. That's what he really wants. It's not legally feasable to try this right now, but that would be the end goal for him.
When I write code, it gets licenced under GPL V2. No "or any later version". What if RMS suddenly wants to make GPL V4 say: "An author who has released an application under GPL, is required to put all sourcecode he writes on the internet."
I will provide sourcecode to anbody who has (bought) the binary.
The fundamental problem is that sometimes an error is an error to the calling program, but sometimes it is not.
For example, when you issue: open "$HOME/.myconfig", the inability to find the file does not mean there is an error. Just that the optional config file is not there. But when you try to open the source file for an operation, the open-error really IS an error.
This duality happens at most levels. A library wrapping "open" will have the same problem. Does the caller consider this a fatal error or not?
Similarly, sometimes errors should result in telling the user and then quitting. But for a gui application it's better to show a graphical message and continue, even if the error is more or less "fatal".....
Absolutely true!
Evolution works that way: In good times, a big population is generated that has great genetic variety. When bad times come along, the bad genetic variations will be removed from the population.
Suppose for instance that suddenly tomorrow all oaktrees had pollen that is deadly to most humans. The genetic variations builtup over the last 200 years might have provided a (possibly small) percentage of the population that is resistant to the deadly pollen. The result would be that a small group survives and starts working on a new gene-pool.
Yes, genetically we have been living in "good times" the last generations. More and more "slight defects" in the genetic pool are able to survive into mature ages.
A friend is totally colorblind. A genetic disadvantage, you'd say? Nope, his "grayvision" is a LOT better than that of most of us. Apparently he can spot camouflaged army-material from way further away than us normal people. When suddenly THAT becomes a winning trait (i.e. those that don't have it die), his descendants will form a larger part of the population.
This expansion of the gene pool also allows for combinations. Suppose the guy with the super-vision marries the gal with the super hearing?
The point in photoshop skills is to be able to touch up bad photos. If the students are allowed to take their own pictures, who says they won't just take a good picture to start with?
After doing tens of programs with pointers and stuff, the intro-to-computer-programming had a task to turn in a program that used pointers to sort a list of items (using bubble sort).
Compare the items, and when they need swapping, swap them. SImple. Well, somehow, my swapping routine with the pointers always messed up. So instead of swapping the pointers, I swapped the whole records. Kind of against the point of the excercise. But the teaching assitant didn't spot it. It was just a stupid excercise way below the level I was at, at the time, and I faked the result because it was pointless.
(to illustrate the level op programming I was at, a later course, had me starting to do the excercises a few days after they opened the practical side of the course up. When I had turned all the excercises 2 days later in it turned out I was the second one to complete all excercises. Apparenly someone was doing 1 a day while I was doing 3 a day, but he beat me by 20 minutes by starting 3 days earlier......)
without mentioning it, the author of the article assumes that you will completely drain the battery on every commute, for both the velomobile, and the electric car. With that completely rediculous assumption, he comes to the figure of 25% of the windturbines.
Anybody watch the video and agree with me that he seemed severely hypoxic in his reactions to messages from ground control?
29 release seatbelt. ... no reaction
From then on, he's quite unresponsive. "say Roger if... " he responds with "roger", but that could just be a response to the "say roger" and not the part after the "if"....
I don't know if they are actually going to do it this way, but a voice call also causes data to be transferred back and forth to/from the phone.
So by just charging for the data, people get more choices: Chose a lower bitrate and pay less. Send an SMS for incredibly little, etc etc. Download a HQ video from Youtube: Pay a lot.
A botnet is a nuisance because it DOES annoying things.
If the botnet is instructed to send spam, you can detect computers sending "too much" Email. If the botnet is instructed to DDOS a certain host, you can detect it sending the malicious requests.
If a host in a botnet is a "sleeper", it doesn't matter much if you firewall it off. But the hosts doing the malicious, detectable stuff should be firewalled off.
The problem is that if a botnet consists of 2 million computers, and the spammer wants to send off 2 million spams, Then each computer need only send one Email. That could/should not be considered as "too much".
As the author of "same", I was going to post the above suggestion.
Last time I used "same", 4.2 million files was peanuts. Of course, running through 4.8Tb of data is going to take some time.
People above are doing suggestions like doing CRCs of the files. Checking filesizes. Etc etc. Same does all of this:
First a list is compiled of the files to be handled. Then each file is stat-ed to determine its size. Then only same-size files are considered candidates for being the same. Next if the filesizes are the same, the CRCs are compared. The CRCs are calculated on an "as needed" basis. This means that most big media files will never need to be read entirely unless a duplicate is going to be found. Anyway. When the CRCs are the same, the files are compared bit-for-bit and if THAT comes out good, the files are hardlinked together.
The hardlinking means that you can further process the results. You can use find to eliminate say all duplicate files in a directory called "backup", provided that they ARE duplicates. Now you'll be left just with the Uniqe files in that directory.
I'm not sure if all of this will easily run on windows: It's a Unix program. On the other hand, it uses simple calls and should easily be ported using the cygwin suite.