Slashdot Mirror


Edward Snowden Kills Team Trump's Conspiracy Theory By Explaining How The FBI Can Quickly Comb Through Email (geekwire.com)

FBI director James Comey told Congress Sunday that the further investigation of emails related to Hillary Clinton didn't turn up anything that would cause the bureau to recommend charges against her. The FBI had reviewed over 650,000 emails under nine days. Upon hearing this, GOP presidential candidate Donald Trump and his supported started to question whether the FBI could go through all those emails in such a short period of time. We will never know for sure until the FBI explains its process to us all (which is unlikely to happen), so people turned to Edward Snowden over the weekend for answers. And Mr. Snowden didn't disappoint. From a report on GeekWire: How easy would it be to cull out the duplicate emails? Outspoken journalist Jeff Jarvis posed that question to Snowden in a tweet, and got a quick response: "Drop non-responsive To:/CC:/BCC:, hash both sets, then subtract those that match. Old laptops could do it in minutes-to-hours."

7 of 488 comments (clear)

  1. Bingo by PvtVoid · · Score: 3, Informative

    It would be very easy via automation to tag the emails which are dupes of ones already in the data set.

    Which, apparently, was all of them. No shit, Sherlock.

  2. Re:He didn't do shit by 110010001000 · · Score: 3, Informative

    Um, if you "deduped" the list using hashing the resulting list was zero because there were no fucking new emails. It would literally take less than 5 minutes to run the algorithm.

  3. Email Threading and DeDupe by dave562 · · Score: 5, Informative

    I work for an organization that is heavily involved in electronic discovery processing for large corporations, law firms and the United States government.

    Email threading, and duplication detection / dedupe are standard tasks that are performed on a daily basis on huge datasets. (As part of the Processing phase of the EDRM model.)

    It is not at all unfeasible that the FBI could have used standard, off the shelf software to identify duplicates and generate an exception report for all 'new' emails that were not in the previously collected datasets.

    https://en.wikipedia.org/wiki/...

  4. Re:removing dupes is easy... by 110010001000 · · Score: 5, Informative

    There were no "non-dups". There were no new emails. Not too hard to grasp.

  5. Re:This just in by smooth+wombat · · Score: 4, Informative

    Well apparently Trump and his supporters aren't high school level then.

    Trump does poll well with the uneducated.

    --
    We will bankrupt ourselves in the vain search for absolute security. -- Dwight D. Eisenhower
  6. Re:Drone Snowden's ass already by __aaclcg7560 · · Score: 5, Informative

    That you elected a lying bitch instead of a lying asshole?

    As one Republican consultant said in a Politico article, "Given a choice between crooked and crazy, the American people will always vote for crooked."

  7. Re:Unless we know the number of non-dupes. by Anonymous Coward · · Score: 2, Informative

    He had no difficulty in declining to release FBI's conclusion that Russians were behind the DNC hack, because it was "too close to the election day". That was about a week before he decided to make a very vague, inconclusive announcement that FBI was "looking into" these emails allegedly tied to HRC. In other words, he clearly picked a side in deciding that something that might hurt HRC was fit to release right before the election but something that was even barely connected to Trump was not.