Domain: yahoo.com
Stories and comments across the archive that link to yahoo.com.
Stories · 5,662
-
Wikileaks Founder Arrested In London
CuteSteveJobs writes "The founder of WikiLeaks, Julian Assange, has been arrested by London police on behalf of Swedish authorities on allegations of rape. Assange has admitted that he is exhausted by the ongoing battle against authorities. The Swiss Government has confiscated $37K in his Swiss Bank account. PayPal and Mastercard have frozen WikiLeaks' accounts, hampering WikiLeaks from raising any more funds." -
Consumer Reports Gives AT&T Lowest US Carrier Rank
tekgoblin writes "Consumer Reports has just released results for consumer satisfaction across all US cell phone carriers. The survey covered around 58,000 Consumer Reports subscribers. Over half of the respondents who used AT&T used the iPhone when taking the survey. According to Consumer Reports, iPhone users were less satisfied with AT&T than other users with different phones. An AT&T spokesman responded by citing independent speed tests, as well as higher subscriber numbers and a dropped call rate within 0.1% of the industry leader." Update: 12/07 01:49 GMT by S : Corrected last sentence to indicate the 0.1% dropped call rate statistic is the difference between AT&T and another carrier, not 0.1% overall. -
Explosive-Laden California Home To Be Destroyed
wiredmikey writes with this snippet from an AP report: "Neighbors gasped when authorities showed them photos of the inside of the Southern California ranch-style home: Crates of grenades, mason jars of white, explosive powder and jugs of volatile chemicals that are normally the domain of suicide bombers. ... Now authorities face the risky task of getting rid of the explosives. The property is so dangerous and volatile that they have no choice but to burn the home to the ground this week in a highly controlled operation involving dozens of firefighters, scientists and hazardous material and pollution experts. ... Some 40 experts on bombs and hazardous material from across the country and at least eight national laboratories are working on the preparations. They have analyzed wind patterns to ensure the smoke will not float over homes beyond the scores that will be evacuated. They have studied how fast the chemicals can become neutralized under heat expected to reach 1800 degrees and estimate that could happen within 30 minutes, which means most of the toxins will not even escape the burning home." -
Aquarium Uses Eel Powered Christmas Lights
A Japanese aquarium is using the greenest energy possible to power the lights on its Christmas tree, an electric eel. From the article: "Each time the eel moves, two aluminum panels gather enough electricity to light up the 2-meter (6 ft 6 in) tall tree, decked out in white, in glowing intermittent flashes." -
UK Asks News Outlets Not To Publish WikiLeaks Bombshell, US Prepares For Fallout
Stoobalou writes "The UK government has issued Defense Advisory Notices to editors of UK news outlets in an attempt to hush up the latest bombshell from whistle-blowing web site WikiLeaks. DA Notices, the last of which was issued in April 2009 after sensitive defense documents were photographed using a telephoto lens in the hand of Assistant Commissioner Bob Quick as he arrived at No 10 Downing Street for a briefing, are requests not to publish, and therefore not legally enforceable." This news comes alongside a raft of articles detailing the US government's preparations for the release. Officials are warning allies that the documents will be more damaging than previous releases, to the point of potentially damaging diplomatic relations with countries like Turkey. The Vancouver Sun wonders if this will lead to a change in the way diplomats communicate. -
The Problem With the Top500 Supercomputer List
angry tapir writes "The Top500 list of supercomputers is dutifully watched by high-performance computing participants and observers, even as they vocally doubt its fidelity to excellence. Many question the use of a single metric — Linpack — to rank the performance of something as mind-bogglingly complex as a supercomputer. During a panel at the SC2010 conference this week in New Orleans, one high-performance-computing vendor executive joked about stringing together 100,000 Android smartphones to get the largest Linpack number, thereby revealing the 'stupidity' of Linpack. While grumbling about Linpack is nothing new, the discontent was pronounced this year as more systems, such as the Tianhe-1A, used GPUs to boost Linpack ratings, in effect gaming the Top500 list." Fortunately, Sandia National Laboratories is heading an effort to develop a new set of benchmarks. In other supercomputer news, it turns out the Windows-based cluster that lost out to Linux stumbled because of a bug in Microsoft's software package. Several readers have also pointed out that IBM's Blue Gene/Q has taken the top spot in the Green500 for energy efficient supercomputing, while a team of students built the third-place system. -
Stuxnet Virus Now Biggest Threat To Industry
digitaldc writes "A malicious computer attack that appears to target Iran's nuclear plants can be modified to wreak havoc on industrial control systems around the world, and represents the most dire cyberthreat known to industry, government officials and experts said Wednesday. They warned that industries are becoming increasingly vulnerable to the so-called Stuxnet worm as they merge networks and computer systems to increase efficiency. The growing danger, said lawmakers, makes it imperative that Congress move on legislation that would expand government controls and set requirements to make systems safer." -
White House Edited Oil Drilling Safety Report
bonch writes "The Interior Department inspector general has released a report stating that the White House edited a drilling safety report by reordering paragraphs to make it appear as though a seven-member panel of independent experts supported the six-month ban on offshore drilling. The IG report states, 'The White House edit of the original DOI draft executive summary led to the implication that the moratorium recommendation had been peer-reviewed by the experts,' but the panel had only reviewed a draft of safety recommendations and not a drilling ban. The White House has issued a statement saying that there was 'no intentional misrepresentation of their views.' This follows complaints from scientists and environmentalists that the administration has not been holding to its promise of policy guided by science and not ideology." -
Dentists Offer Halloween Candy Buyback Program
Kids in Pennsylvania have an option for all the Smarties they collected this Halloween other than a trash can thanks to two dentists. Nalin and Arpan Patel will donate $1 to troops in Afghanistan and Iraq for every pound of candy they receive. From the article: "The dentists are accepting candy from November 1-5, and hope what is collected will cheer US troops overseas. 'Now during these hard times, the spirit of giving and helping is needed more than ever,' they said." -
The Empire Strikes Back Vader Costume For Sale
Now is your chance to own an original Darth Vader costume from the best of the Star Wars movies. Christie's auction house plans on putting it up for sale on Nov. 25 and it would be unwise to underestimate the value of this costume. From the article: "The jet-black helmet, mask and armor worn by the intergalactic villain are expected to sell for between 160,000 pounds and 230,000 pounds ($250,000 and $365,000) at a sale of pop culture memorabilia next month." -
Bible.com Investor Sues Company For Lack Of Profit
The board of Bible.com claims that it is easier for a camel to pass through the eye of a needle, than to make money on the domain name, but an angry shareholder disagrees. From the article: "James Solakian filed the lawsuit in Delaware's Chancery Court against the board of Bible.com for breaching their duty by refusing to sell the site or run the company in a profitable way. The lawsuit cites a valuation done by a potential purchaser that estimated bible.com could be worth more than dictionary.com, which recently sold for more than $100 million." -
Apple's Long Road To $300
itwbennett writes "Apple shares inched over $300 for the first time Wednesday, nearly 30 years after Apple's initial public offering in December 1980. But it hasn't been a steady climb. In fact, says blogger Chris Nurney, 'Apple's stock history can be divided into two clear periods — the early years, from the IPO through Steve Jobs's long absence from the company after losing a power struggle in 1985, and the modern Jobs era, which began on September 16, 1997.' The bottom line: 'If you had purchased $10,000 of Apple stock the same month that Jobs again began leading the company, your shares would be worth $554,000 today. Not a bad return on the investment.'" -
Dogs Can Be Pessimistic
Not that it will change anything, but researchers at Bristol University say that your dog might be a gloom-monger. In addition to the downer dogs, the study also found a few that seemed happy no matter how uncaring the world around them was. "We know that people's emotional states affect their judgments and that happy people are more likely to judge an ambiguous situation positively. What our study has shown is that this applies similarly to dogs," said professor Mike Mendl, an author of the study and head of animal welfare and behavior at Bristol University. -
At Commonwealth Games, the World's Largest Aerostat
GillBates0 writes "Last weekend saw the world's largest aerostat of its kind featured at the opening ceremony of the 2010 Commonwealth Games in Delhi. The helium balloon or aerostat measures 40x80x12 meters, contains 20,000 cubic meters of helium and features lights, mirrors, cameras, a 360 degree projection screen, projectors and a reflective underbelly. During the show, it was raised 25 meters off the ground and transformed into a giant Bodhi Tree and a meditating Buddha, with giant puppets to complement the cultural performances beneath. These slideshows tell the story." -
Copyrights and CD-Rs Endanger Audio History
SEWilco writes "A study by the Library of Congress has found that many audio recordings are being lost due to copyright restrictions and temporary media. Old audio recordings are protected by various US state copyrights, so it's hard for preservationists to get and copy material. Recent data is threatened by being put on writable CDs, because CD-Rs begin to lose data after a few years, so recordings from as recently as 9/11 and the 2008 elections are already at risk." -
Senate Votes To Turn Down Volume On TV Commercials
Hugh Pickens writes "Ever since television caught on in the 1950s, the FCC has been getting complaints about blaring commercials but concluded in 1984 there was no fair way to write regulations controlling the 'apparent loudness' of commercials. Now the AP reports that the Senate has unanimously passed a bill to require television stations and cable companies to keep commercials at the same volume as the programs they interrupt using industry guidelines on how to process, measure and transmit audio in a uniform way. Senator Charles Schumer (D-NY), a co-sponsor, says it's time to stop the use of loud commercials to startle viewers into paying attention. 'TV viewers should be able to watch their favorite programs without fear of losing their hearing when the show goes to a commercial.' The House has already passed similar legislation, so before the new measure becomes law, minor differences between the two versions have to be worked out when Congress returns to Washington after the November 2 election." -
Safety Commission To Rule On Safety of Rulers In Science Kits
The Consumer Product Safety Commission has been trying to decide for weeks if science kits designed to teach children are safe enough for children to use without vigorous testing. It's not just the chemicals or sharp items in the kits that trouble them, however. They are also concerned about the dangers of paper clips, magnets, and rulers. From the article: "Science kit makers asked for a testing exemption for the paper clips and other materials. The commission declined to grant them a blanket waiver as part of the guidance the agency approved Wednesday on a 3-2 vote." To be fair, paper clips can cause a lot of damage: just look at what Clippy did to Microsoft Office. -
Protesters Brick Up Bank Entrance
Upset over a lack of lending, a group of protesters bricked up the entrance to a Barclays bank in Bournemouth, England. From the article: "Property developer Cameron Hope told Sky News Online that 'things are getting worse' for bank customers and businesses, and the financial institutions 'aren't fit for purpose.'" I'm no expert in fair lending practices, but I'm pretty sure getting into the bank is an important first step in the loan process. -
Swedes Cast Write-In Votes for SQL Injection, Donald Duck
An anonymous reader writes "The Swedish elections were held recently (the third Sunday of September to be exact) and it seems that a few people tried to interfere with the election by voting for parties which were in effect named to be SQL injection attacks or similar. Clever stuff! Little Bobby Tables in real life." That wasn't the only oddity of the election; reader MZeroOne writes: "The Swedish Election Authority published the results of last Sunday's general election and even though the current prime minister retained power, the candidate who got the most individual handwritten votes was Disney's Donald Duck." Maybe the existence of the Hard Alcohol Party (237 votes) helps explain why the Pirate Party didn't have a better showing. -
Hundreds of Crocodiles Escape Mexican Refuge After Hurricane
At least 280 crocodiles escaped a Mexican refuge after Hurricane Karl caused heavy flooding in the area. Authorities are sending crocodile experts to the area in an attempt to round up the escaped reptiles. From the article: "The governor of Veracruz told reporters about 280 crocodiles were missing from the reserve in La Antigua, although some media put the number of reptiles at closer to 400." I can't wait for the movie makers at Syfy to hear about this. -
Man Claims Caffeine Made Him Kill
caffiend666 writes "'A Kentucky man accused of strangling his wife is poised to claim excessive caffeine from sodas, energy drinks and diet pills left him so mentally unstable he couldn't have knowingly killed his wife, his lawyer has notified a court.' 'Dr. Roland Griffiths, a professor of behavioral biology at Johns Hopkins University has noted in an unrelated study that there is a diagnosis for "caffeine intoxication," which includes nervousness, excitement, insomnia and possibly rambling speech.' Personally, I just blame 'dark roasts.'" -
Child Abuse Verdict Held Back By MS Word Glitch
An anonymous reader writes "Last week several defendants including one high-profile TV presenter were sentenced in Portugal in what has been known as the Casa Pia scandal. The judges delivered on September 3 a summary of the 2000-page verdict, which would be disclosed in full only three days later. The disclosure of the full verdict has been postponed from September 8 to a yet-to-be-announced date, allegedly because the full document was written in several MS Word files which, when merged together, retained 'computer related annotations which should not be present in any legal document.' (Google translated article.) Microsoft specialists were called in to help the judges sort out the 'text formatting glitch,' while the defendants and their lawyers eagerly wait to access the full text of the verdict." -
US Military Eyes the Glow of Fireflies
GarryFre writes "According to the AP: 'Someday, the secrets of fireflies or glowing sea plankton could save an American soldier in battle, a Navy SEAL on a dive, or a military pilot landing after a mission. That's the hope behind a growing field of military-sponsored research into bioluminescence, a phenomenon that's under the microscope in laboratories around the country. This phenomenon is noteworthy because this produces light without wasting energy because it does not generate any heat. A possible military use of bio-luminescence would be creating biodegradable landing zone markers that helicopters can spot even as wind from their rotors kicks up dirt.'" -
Scientists Cut Greenland Ice Loss Estimate By Half
bonch writes "A new study halves the estimated rate of ice loss from Greenland and West Antarctica. Published in the journal Nature Geoscience, the study takes into account a rebounding of the Earth's crust called glacial isostatic adjustment, a continuing rise of the crust after being smashed under the weight of the Ice Age. 'We have concluded that the Greenland and West Antarctica ice caps are melting at approximately half the speed originally predicted,' said researcher Bert Vermeersen." -
University Offers Class In Zombie Studies
Young people at The University of Baltimore will be able to study the zombie condition thanks to the newly available English 333. Students in the class will watch 16 classic zombie films and read zombie comics. Instead of writing a final research paper they may write a script or draw storyboards for their own zombie movie. Unfortunately the class doesn't seem to cover brain appreciation. -
Senate Candidate Sued By Copyright Troll
The Iso writes "Las Vegas based company Righthaven found two articles from the Las Vegas Review-Journal about Republican Senate candidate Sharron Angle reprinted on her web site without permission, so it did what it always does: bought the rights to the articles from the Review-Journal and sued the alleged infringer, seeking unspecified damages." -
Firm Can't Fire Man For 1.8 Cent Theft
An anonymous reader writes "A German company that fired a man for the theft of 1.8 euro cents (two US cents) worth of electricity had no grounds for sacking him, a court ruled, dismissing the firm's appeal against his reinstatement. Network administrator Oliver Beel lost his job after charging his Segway, a two-wheeled electric vehicle, at work in May 2009. After he connected the vehicle to the firm's power source for 1-1/2 hours, his boss asked him to remove it. Twelve days later Beel found himself without a job." -
Old People Enjoy Reading Negative Stories About Young
A study by Dr. Silvia Knobloch-Westerwick and co-author Matthias Hastall suggests that your grandma's self-esteem gets a boost when she hears about the stupid things young people do. "Living in a youth centered culture, they may appreciate a boost in self-esteem. That's why they prefer the negative stories about younger people, who are seen as having a higher status in our society," said Dr. Silvia Knobloch-Westerwick. From the article: "All the adults in the study were shown what they were led to believe was a test version of a new online news magazine. They were also given a limited time to look over either a negative or positive version of 10 pre-selected articles. Each story was also paired with a photograph depicting someone of either the younger or the older age group. The researchers found that older people were more likely to choose to read negative articles about those younger than themselves. They also tended to show less interest in articles about older people, whether negative or positive." -
3 Drinks a Day Keeps the Doctor Away
Nzimmer911 writes "Heavy drinkers outlive non-drinkers according to a 20-year study following 1,824 people. From the article: 'But a new paper in the journal Alcoholism: Clinical and Experimental Research suggests that - for reasons that aren't entirely clear - abstaining from alcohol does actually tend to increase one's risk of dying even when you exclude former drinkers. The most shocking part? Abstainers' mortality rates are higher than those of heavy drinkers.'" -
GPS Tracking Without a Warrant Declared Legal
jnaujok writes "The Ninth Circuit court has declared that attaching a GPS tracker to your car, as it sits in your driveway, or by extension on a public street, and then using it to monitor every one of your movements, is totally legal, and can be performed by the police without needing a warrant. So, if you live in the Western United States, big brother has arrived." -
The Strange Case of Solar Flares and Radioactive Decay Rates
DarkKnightRadick writes "Current models for radioactive decay have been challenged by, of all sources, the sun. According to the article, 'On Dec 13, 2006, the sun itself provided a crucial clue, when a solar flare sent a stream of particles and radiation toward Earth. Purdue nuclear engineer Jere Jenkins, while measuring the decay rate of manganese-54, a short-lived isotope used in medical diagnostics, noticed that the rate dropped slightly during the flare, a decrease that started about a day and a half before the flare.' This is important because the rate of decay is very important not just for antique dating, but also for cancer treatment, time keeping, and the generation of random numbers. This isn't a one time measurement, either. 'Checking data collected at Brookhaven National Laboratory on Long Island and the Federal Physical and Technical Institute in Germany, they came across something even more surprising: long-term observation of the decay rate of silicon-32 and radium-226 seemed to show a small seasonal variation. The decay rate was ever so slightly faster in winter than in summer.'" -
Los Angeles Unveils $578 Million Public School
An anonymous reader writes with this excerpt from an Associated Press report on next month's opening of the Robert F. Kennedy Community Schools in Los Angeles: "With an eye-popping price tag of $578 million, it will mark the inauguration of the nation's most expensive public school ever. The K-12 complex to house 4,200 students has raised eyebrows across the country as the creme de la creme of 'Taj Mahal' schools, $100 million-plus campuses boasting both architectural panache and deluxe amenities. ... At RFK, the features include fine art murals and a marble memorial depicting the complex's namesake, a manicured public park, and a state-of-the-art swimming pool. 'There's no more of the old, windowless cinderblock schools of the '70s where kids felt, "Oh, back to jail,"' said Joe Agron, editor-in-chief of American School & University, a school construction journal. 'Districts want a showpiece for the community, a really impressive environment for learning.' ... Critics note that nearly 3,000 teachers have been laid off over the past two years, the academic year and programs have been slashed, the district faces a $640 million shortfall and some schools persistently rank among the nation's lowest performing." -
Study Says Your Personality Doesn't Change After 1st Grade
A study authored by Christopher Nave, a doctoral candidate at the University of California, says that our personalities stay pretty much the same from early childhood all the way through old age. From the article: "Using data from a 1960s study of approximately 2,400 ethnically diverse schoolchildren (grades 1 - 6) in Hawaii, researchers compared teacher personality ratings of the students with videotaped interviews of 144 of those individuals 40 years later. They examined four personality attributes - talkativeness (called verbal fluency), adaptability (cope well with new situations), impulsiveness and self-minimizing behavior (essentially being humble to the point of minimizing one's importance)." This must explain my overriding need to be first captain when we pick kickball teams at the office. -
Researchers Pinpoint Cause of Gluten Allergies
An anonymous reader writes "When patients with celiac disease consume foods containing gluten — a protein present in wheat, barley and rye — their immune systems send out an alarm, triggering a response that can damage their intestines and prevent them from absorbing certain nutrients. Now, scientists have pinpointed the culprits most responsible for this harmful reaction: three small fragments within the gluten protein that spark chaos in the gut." -
Beautiful Data
eldavojohn writes "Beautiful Data: The Stories Behind Elegant Data Solutions is an addition to six or so other books in the 'Beautiful' series that O'Reilly has put out. It is not a comprehensive guide on data but instead a glimpse into success stories about twenty different projects that succeeded in displaying data, oftentimes in areas where others have failed. While this provides, for the most part, disjointed stories, it is a very readable book compared to most technical books. Beautiful Data proves to be quite the cover-to-cover page turner for anyone involved in building interfaces for data or the statistician at a loss for the best way to intuitively and effectively relay knowledge when given voluminous amounts of raw data. That said, it took me almost two months to make it through this book, as each chapter revealed a data repository or tool I had no idea existed. I felt like a child with an attention deficit disorder trying my hand at nearly everything. While the book isn't designed to relay complete theory on data (like Tufte), it is a great series of short success stories revolving around the entire real world practice of consuming, aggregating, realizing and making beautiful data." Keep reading for the rest of eldavojohn's review.
Beautiful Data: The Stories Behind Elegant Data Solutions
author: Edited by Toby Segaran and Jeff Hammerbacher
pages: 384
publisher: O'Reilly Media, Inc.
rating: 9/10
reviewer: eldavojohn
ISBN: 978-0-596-15711-1
summary: A collection of twenty essays and chronicles from the implementers of successful projects revolving around real world data processing and display.
Since the individual articles in this book are essentially a series of what to do and what not to do, this review is more like a list of notes that were my personal rewards from each chapter.
Given my background, these notes will be very specific to my interests and responsibilities for web development whereas a statistician, academic or researcher might pull a completely different set from the book. The book also has a nice colorized insert that allows the reader to get a better sense of the interfaces discussed throughout the book. One potential problem with these "case studies" is that they will most certainly become dated, and in our world that happens quite quickly. It's very easy for me to think that specific information about colocation facility usage by social networking sites (Chapter Four) will always be useful and relevant. The sad fact of the matter is that because of the unforeseen nature of hardware advancements and language evolution, many of these stories could become irrelevant blasts from the past in one or two decades. I think the audience that stands to benefit the most from this book is low level managers and people in charge of large amounts of data that they don't know what to do with. The reason for this is that while there are a few chapters that deal with low level implementation details, it mostly consists of overviews of popular and successful mentalities surrounding data. One other audience that might be a target for this book would be young college students with interests in math, statistics or computer science. Had I picked this book up as a freshman in college, no doubt the number of projects I worked late into the night on would have multiplied as would my understanding of how the real world works.
Chapter One deals with two projects done by grad students: Personal Environmental Impact Report (PEIR) and your.flowingdata (YFD). This chapter starts out slow, describing how the system harnesses personal GPS devices, a common trend in phone development these days. After clearing the basics, the chapter reveals a lot about the iterative developments the author took to select and include a map interface to effectively and quickly display several routes that a user has driven, with intuitive visual cues to indicate which was the most environmentally expensive. Trying to stick with "green means good and red means bad" proved difficult, and they employed an inverted map of mostly shades of gray to avoid clashing with the natural colors on a regular map. The final part of PEIR discussed a Facebook application that simply paired you up against friends also using PEIR. This gave the user a relative value basis for otherwise incomprehensible numbers surrounding their environmental impact. YFD focuses more on an interface for accumulating Twitter data from a user to help them track sleeping and weight loss.
The second chapter deals entirely with constructing a very simple survey that has a variable length depending on what answer you give to an earlier question. While this seems to be a very simple task, the chapter does a great job of explaining how you can make it better and why doing this makes it better. A great quote from this chapter is "The key method for collecting data from people online is, of course, through the use of the dreaded form. There is no artifact potentially more valuable to a business, or more boring and tedious to a participant." The chapter points out that for every action you require the user to make, the user may decide the survey is not worth their time. Yes, clicking "Next" on a multi-page form only gives the user another chance to decide this isn't worth it. Furthermore, many pages might cause the user to be unsure of the real length of the survey. So they decided against this and instead made the survey branch from one page, so that the page would continually get a little larger depending on how you answered the questions. Knowing the targets for the surveys were older made large-font copy mandatory, as 72% of Americans report vision impairment by the time they are age 45. This chapter dealt more with collecting the data, respecting the source of data and building trust with the participants than displaying the data they provided.
Chapter Three deals with the recently disabled Phoenix lander on Mars and precisely how its image collection was done. While it might seem like the wrong place to do it, pre-processing and compression were actually done on board the lander before transmission to Earth. This chapter tackles interesting issues long thought to be extinct in computer science: resources are constrained, and radiation bombardment keeps the CPU more modest than the one in your average desktop. Do you process the image in place in memory or make a copy so that the original image can be retained during processing? These are familiar issues to embedded developers but stuff I haven't touched since college. While the author details the situation on all fronts down to the cameras being used, it's largely a blast from the past as far as resource aware computing is concerned. Then again, I doubt any of my code will ever be flight certified by NASA.
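To make the in-place versus copy question concrete, here is a minimal sketch of the trade-off. It is not the lander's flight code; the function names and the simple threshold operation are hypothetical stand-ins for whatever pre-processing was actually performed on board.

```python
def threshold_in_place(pixels, cutoff):
    """Overwrite the buffer itself: no extra memory, but the original is lost."""
    for i, value in enumerate(pixels):
        pixels[i] = 255 if value >= cutoff else 0
    return pixels


def threshold_copy(pixels, cutoff):
    """Build a second buffer: the original survives, at twice the memory."""
    return bytearray(255 if value >= cutoff else 0 for value in pixels)


# Hypothetical 4-pixel grayscale "image".
raw = bytearray([10, 200, 130, 40])

processed = threshold_copy(raw, 128)   # raw is still intact afterwards
same_buffer = threshold_in_place(raw, 128)  # raw now holds the processed data
```

On a memory-starved device the in-place version may be the only one that fits, which is exactly why the choice has to be made deliberately.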
Chapter Four has a very interesting analysis and description of Yahoo!'s PNUTS system for serving up data in complex environments, such as tackling latency across the world when dealing with social networking. The chapter does a decent job of explaining how issues are resolved when replicated servers across the United States fall out of sync, and of describing the resolution strategy. The chapter ends on an even more interesting note, explaining why Yahoo! deviated from Google's BigTable, Amazon's Dynamo, Microsoft's Azure and other existing implementations. This tale of well thought out design is a stark contrast to Chapter Five, which centers on a Facebook 'data scientist' who, instead of explaining the solution as a well planned finalized implementation, tells of the trial and error approach of a very small team of developers treading into waters unknown with data sets of Sisyphean proportions. It was tempting for me to read this chapter and chastise the author for not foreseeing what numbers could come with making it big in social networking. But the chapter has a lot of value in a "lessons learned" realm. It may even prepare some of you who are writing web applications with a potentially explosive or viral user base. While it's popular to hate Facebook and in turn transfer that hate to the developers, no one can argue against them being one of the most successful social networking sites, and any information about their (sometimes flawed) operations certainly proves to be interesting.
Chapter Six was completely unengaging for me. The chapter covers geographing: more specifically, the efforts to take pictures of Britain and Ireland and map/display them geographically. The images would aim to cover a large area, and then users could tag them with what they see (tree, road, hill, etc). Unfortunately it never really registered with me why someone would want to do this and what end goal they were aiming for. Instead they managed to produce some pretty heinous and very difficult to digest heat maps or "spatial tree maps." By embedding coloration and lines into the treemaps the authors hoped to convey intuitive information to the reader. Instead my eyes often glazed over and sometimes I flat out disagreed with their affirmation that this is how to display data beautifully. You're welcome to try to convince me that geographing has some sort of merit other than producing pretty mosaics of large image sets but it took a lot of effort for me to continue reading at points in this chapter.
Chapter Seven sets the book back on track in "Data Finds Data" where the writers cover very important concepts and problems surrounding federated search and instead offer up directories with some semantic metadata or relationship data that makes keyword searching possible over billions of documents. For anyone dealing with large volumes of data, this chapter is a great start to understanding the options you have for processing your data when you first get it (and only once) versus searching for that data just in time and paying for it in delay. While the former incurs much more disk space cost, Google has proven that this paradigm shift definitely has merit.
Chapter Eight is about social data APIs and pushes gnip heavily as the de facto social endpoint aggregator for programmers. The chapter mentions WebHooks as an up and coming HTTP POST event transmission project but doesn't offer much more than a wake up call for programmers. Traditional polling has dominated web APIs and has led to fragile points of failure. This chapter is a much needed call for sanity in the insane world of HTTP transactional polling. Unfortunately, the community seems to be so in love with the simplicity of polling that they use it for everything, even when a slightly more complicated eventing model would save them a large percentage of transactions.
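The transaction savings of an eventing model over polling can be sketched with a small in-memory simulation. Nothing here comes from the book or from any real API: the `EventSource` class, its methods and the tick-based clock are hypothetical stand-ins for an HTTP service.

```python
class EventSource:
    """Hypothetical service that supports both polling and push callbacks."""

    def __init__(self):
        self.events = []
        self.subscribers = []
        self.poll_count = 0   # transactions initiated by a polling client
        self.push_count = 0   # transactions initiated by the service

    def subscribe(self, callback):
        self.subscribers.append(callback)

    def publish(self, event):
        self.events.append(event)
        for callback in self.subscribers:
            self.push_count += 1
            callback(event)

    def poll(self, since):
        self.poll_count += 1
        return self.events[since:]


def simulate(ticks, event_ticks):
    """Run one poll per tick while also delivering each event by push."""
    source = EventSource()
    pushed, polled = [], []
    source.subscribe(pushed.append)
    for tick in range(ticks):
        if tick in event_ticks:
            source.publish(f"event@{tick}")
        polled.extend(source.poll(len(polled)))
    return source.poll_count, source.push_count, polled == pushed


print(simulate(100, {10, 50}))  # (100, 2, True)
```

Both clients end up with the same two events, but the poller spends 100 transactions to get them while the push model spends 2.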
Chapter Nine is a tutorial on harvesting data from the deep web. What they mean by this is that — given proper permission — one can exploit forms on websites to access database data and then index that instead of merely being relegated to static HTML pages. In my opinion, this is a fragile and often frowned upon approach to data collection but as this chapter (and many others) illustrates, sometimes data is locked up due to lack of resources to expose it. This means that if a repository of information is meant to be available to you through a simple submission form, you can tease that information out of "the deep web" and into your system with the tricks mentioned in this chapter.
Chapter Ten is the story of Radiohead's open sourced "data" music video of "House of Cards" and the collection process, from the kinds of devices used to the methodology of collecting that data to the attitude they took when treating the data. This chapter is a sort of key for understanding what data you have with Radiohead's offerings and I heavily recommend it for anyone interested in taking a stab at this video. The most interesting things I found in this chapter were their method for collection and, more importantly, their decision to actually degrade the data and not to texture Thom Yorke's face when displaying it, citing artistic choice. This chapter gave me one very amazing display tool that I am embarrassed to admit I had no knowledge of prior to this book: Processing.
Chapter Eleven is the story of a few people who chose to do something about serious crime problems in Oakland. The city was compiling reports of crimes weekly but wasn't opening up the data. You could do a search and get a very minimal display on a map of crimes that had happened. This gave rise to Oakland Crimespotting. At first they were forced to graphically scrape and estimate crime locations so their own system could offer the data back to citizens in more intuitive and useful ways, letting those citizens take action. In the end the city government came to its senses and began offering them the data in a far more open format. From browsing the site now, you can get an idea of the tale this chapter tells, which chronicles the evolution of that end product.
Chapter Twelve centers on sense.us, a potentially powerful product that aims to empower users to analyze and create notations on graphs that might relay correlations between factors inside US Census data. The only disappointment with this chapter is that sense.us isn't live for us to use. The tool shows powerful abilities in collaborative analysis of census data but is also a double edged sword. There's nothing that stops this tool from being used for political and monetary ideals instead of purely academic revelations. They used tools like Colorbrewer and prefuse to dynamically generate graphs and charts that were pleasing to the eye. Then they used 'geometric annotation' (a vector graphic approach to recording users' doodling and annotations) in order to facilitate collaboration. The notes the researchers took on the collaboration between their pilot users are probably more intriguing than their actual approach to displaying good graphics. Each user seemed to take a natural progression from annotation producer to annotation crawler and then bounce between them as other user annotations gave them ideas for more annotations to create. While not exactly ideal collaboration, it's interesting to hear what users do in the wild when left to their own devices.
Chapter Thirteen, "What Data Doesn't Do," is a very short chapter with a set of ten or so rules that are intended to remind you that data doesn't predict, more data isn't always better or easier, probabilities do not explain, data doesn't stand alone, etc. This chapter felt sort of like a pause-and-remember waypoint through the book. Just when you've gone through these great stories of success, the book reels you back into reality with this chapter. In other chapters you'll be reminded to avoid pitfalls like the narrative fallacy but this chapter just reminds you quite literally what data doesn't do automatically for you. It's an indicator that you need to shore up these things that data doesn't magically do when you present data.
Chapter Fourteen is Peter Norvig's "Natural Language Corpus Data" and does not disappoint. Once the reader is empowered with the code and the data in this chapter, it almost seems like one could solve several problems using ngrams, Bayes' theorem and natural language analysis. As you read this chapter, Norvig lays out how to tackle several problems with ease: decoding encryption levels up to WWII, spelling correction, machine translation and even spam detection. In just 23 pages, Norvig conveys a tiny bit of the power of a corpus of documents coupled with the willingness to be a little dirty (total probabilities summing to more than one, dropping ngrams below a threshold, etc). It's clear why he's employed at Google.
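To give a flavor of the corpus-driven approach, here is a minimal spelling-correction sketch in that spirit. It is not the chapter's code: the toy corpus, the function names and the restriction to a single edit are assumptions, and a real system would use counts from a far larger corpus.

```python
from collections import Counter

# Toy corpus standing in for a real word-frequency table (hypothetical data).
CORPUS = "the quick brown fox jumps over the lazy dog the fox likes the dog".split()
COUNTS = Counter(CORPUS)
TOTAL = sum(COUNTS.values())
ALPHABET = "abcdefghijklmnopqrstuvwxyz"


def probability(word):
    """Unigram probability estimated from raw counts."""
    return COUNTS[word] / TOTAL


def edits1(word):
    """All strings one delete, transpose, replace, or insert away from word."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in ALPHABET]
    inserts = [a + c + b for a, b in splits for c in ALPHABET]
    return set(deletes + transposes + replaces + inserts)


def correct(word):
    """Return the most probable known word within one edit, else the word itself."""
    if word in COUNTS:
        return word
    candidates = [w for w in edits1(word) if w in COUNTS]
    return max(candidates, key=probability) if candidates else word


print(correct("teh"))  # the
```

The whole "model" is a table of counts; the quality of the corrections depends almost entirely on the size and relevance of the corpus behind it.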
Chapter Fifteen takes a drastic turn into one of Earth's oldest data stores: DNA. As the chapter so coyly notes, programmers can view DNA as a simple string: char(3*10^9) human_genome; The chapter gives you a brief glimpse of DNA analysis but focuses more on the data storage involved in facilities that are currently working to harvest data from many subjects. As of the writing of this chapter, one facility was generating 75 terabits per week in raw data. Most interesting to me from this chapter was ensemble.org, a site to find DNA data, genome data and also collaborate with other researchers on annotating and commenting on certain parts and regions of DNA.
Similar to the previous chapter, Chapter Sixteen focuses briefly on chemistry and describes how data was collected "to predict the solubility of a wide range of chemicals in non-aqueous solvents such as ethanol, methanol, etc." Having a very minimal chemistry background, I never found it revealed what purpose this data collection serves, but nonetheless the chapter explains a lot of challenges in this environment that are similar to those in other chapters. The interesting aspect of this chapter is that the team used open notebook science to collect this data and therefore faced the challenge of cleaning crowd-sourced data. A constantly recurring problem in these chapters is how one represents data, and chemistry apparently has many standards, some more open than others. This book makes a very good argument for selecting open standards when one witnesses the screen scraping, licensing issues and costs researchers face when unifying data even for something as old as the representations of chemicals.
Chapter Seventeen is the case study of FaceStat, a statistically more ambitious Hot-or-Not effort from researchers. The site would allow anyone to upload a photo of a person and then allow users to rate them and tag them. After collecting this data, the researchers used the ubiquitous R statistical language to do some feature extraction on the data. Of course, the chapter first deals with cleaning the data and catching bad user input. While this chapter sounds like vanilla run-of-the-mill feature extraction, it also includes some interesting display examples as well as the very interesting yet controversial stereotype analysis. This ranges from taboo topics like attractiveness vs age line fitting to the sexism of tags to using k-means in order to establish stereotype clusters in the data. While other chapters risked offense through possible privacy concerns, this chapter reveals more about the callow stereotypes that internet users inflict upon each other.
Chapter Eighteen looks at the San Francisco Bay Area housing market over a very interesting selection of recent years. What differentiates this chapter from so many of the others (we collect, clean and process the data) is that it needed to break the data down by neighborhood to find the really interesting features of the data. The neighborhoods could then be grouped into six different groups according to how their house prices rose and then declined. Only one group had one neighborhood that showed no decline (Mountain View). Unfortunately for this chapter and the next one, by the time the reader arrives they appear to be straightforward replications of ideas from other chapters. Chapter Nineteen is a brief chapter on statistics inside politics. Aside from revealing five or six interesting correlations in voting, this chapter merely relays what we already know: politicians employ statistics to a sometimes harmful degree (gerrymandering).
The last chapter is, appropriately, about the many sources of data exposed on the internet and the problems everyone faces in matching entities from one data source to another. The idea of using a URI to describe a movie hasn't really seemed to catch on. And if that wasn't enough, even words like "location" used to describe a column could mean drastically different things between houses and genomes. The chapter lists out a number of sources where data is available to download and tinker with (most already listed in the book) and proceeds to analyze an algorithmic (collective reconciliation) way for a system to differentiate between two movies with the same name. Naturally the author of this chapter worked on Freebase, which was recently (and predictably) acquired by Google. Although a short chapter, it speaks to problems that all online data communities face and what prohibits mashups from automagically happening between two disparate data sources holding data that is actually related.
With the exception of chapter six, every chapter offered me something that I won't forget. More importantly, most chapters offered a data source or data processing tool that expanded my toolbox of things to use when programming. The only reason this book misses a perfect 10/10 from me is chapter six and a couple of the later chapters feeling like weaker ideas from earlier chapters rehashed into a different domain. A worthwhile book if you work with data — whether you be a consumer or producer.
You can purchase Beautiful Data: The Stories Behind Elegant Data Solutions from amazon.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page. -
Suspected Mariposa Botnet Creator Arrested
mehemiah writes "The writer of the Mariposa Botnet has been arrested through international effort. The FBI said this arrest and the arrests of three alleged operators in February were the result of a two-year joint investigation into the Mariposa Botnet, which may have infected as many as eight million to 12 million computers around the world." -
Churchill's Dentures Sold At Auction
A partial set of dentures that once had the honor of sitting inside Winston Churchill's mouth was sold at auction Thursday for 15,200 pounds ($23,723). From the article: "The upper dentures, one of several sets specially made for the wartime prime minister, were used to maintain his distinctively slurred speaking style. They were bought by a British collector of Churchill memorabilia at an auction in England at three times the estimated price." -
World of Warcraft Can Boost Your Career
Hugh Pickens writes "Forbes reports that although videogames have long been thought of as distractions to work and education rather than aids, there is a growing school of thought that says game-playing in moderation, and in your free time, can make you more successful in your career. 'We're finding that the younger people coming into the teams who have had experience playing online games are the highest-level performers because they are constantly motivated to seek out the next challenge and grab on to performance metrics,' says John Hagel III, co-chairman of a tech-oriented strategy center for Deloitte. Elliot Noss, chief executive of domain name provider Tucows, spends six to seven hours a week playing online games and believes World of Warcraft trains him to become a better leader." -
China Warns Tourists About 'Forced Shopping' in Hong Kong
China's National Tourism Administration has put out an advisory for tourists warning them that Hong Kong tour guides might insult them or force them to shop. From the article: "'An undated video clip currently circulating on the Internet shows a Hong Kong tour guide allegedly abusing a group of visitors from the Chinese mainland and forcing them to shop, triggering a backlash from the mainland public,' the Xinhua news agency said Saturday. Budget Chinese tour packages often try to pad out profits by taking tourists to shops which return a percentage of the sales revenue to the agency. The practice is common both in mainland China and on overseas tours offered by Chinese agencies." -
The Demographics of Web Search
adaviel sends a link to work out of Yahoo Research indicating that demographics can help Web searches; e.g. a woman searching for "wagner" probably wants the 19th-century German composer, while for men in the US "wagner" is a paint sprayer. The Yahoo researchers claim that by taking user demographics into account, "they managed to get the chosen link to appear as the top-ranked result 7 per cent more often than in the standard Yahoo search." New Scientist mentions this research and two other innovative adjuncts to current search practice: following the mouse cursor as a proxy for eye tracking, and taking back bearings on online criminals by studying the searches they make. (The latter raises disturbing privacy questions: would you want Google trolling through your search data? How about governments?) -
Stem Cells Curing Burn-Induced Blindness
mcgrew writes "The AP (via Yahoo) is reporting that Italian researchers can now cure blindness caused by chemical burns using the patient's own stem cells. 'The treatment worked completely in 82 of 107 eyes and partially in 14 others, with benefits lasting up to a decade so far. One man whose eyes were severely damaged more than 60 years ago now has near-normal vision.' Previously, this kind of injury needed either a corneal transplant or stem cells from someone else, both of which are plagued by problems with tissue rejection. Unfortunately, this only works for damaged corneas — so far." -
Pakistan To Scour Google, Yahoo For Blasphemy
sv_libertarian sends in this excerpt from an AP report: "Pakistan will start monitoring seven major websites, including Google, Yahoo, and Amazon, for sacrilegious content, while blocking 17 other, lesser-known sites it deems offensive to Muslims, an official said Friday. The moves follow Pakistan's temporary ban imposed on Facebook in May that drew both praise and condemnation in a country that has long struggled to figure out how strict a version of Islam it should follow. ... 'If any particular link with offensive content appears on these websites, the (link) shall be blocked immediately without disturbing the main website,' [said Pakistan Telecommunication Authority spokesman Khurram Mehran]." -
Finance, Scientific Users Get ActivePython Updates
jcasman sends along this clip from PCWorld: "ActiveState has added three open source mathematics libraries to its ActivePython Python distribution that might interest financial and scientific computing markets, the company announced Thursday. The packages are being added, in part, to anticipate the demand that may arise from new proposed rules for the US financial community brought about by the US Securities and Exchange Commission. ... In April, the government agency posted a set of proposed rules for handling asset-backed securities that called for financial firms to disclose, along with their prospectus filings, the source code of the programs that generated the filings, as rendered in Python. The government agency will be accepting input about the proposed rule until August 2. The three libraries that are being added to the ActivePython package are NumPy, SciPy, and matplotlib." -
Louisiana Federal Judge Blocks Drilling Moratorium
eldavojohn writes "In the ongoing BP debacle, the Obama administration imposed a six-month moratorium on offshore drilling and a halt to 33 exploratory wells going into the Gulf of Mexico. Now a federal judge (in New Orleans, no less) is unsatisfied with the reasons for this and stated, 'An invalid agency decision to suspend drilling of wells in depths of over 500 feet simply cannot justify the immeasurable effect on the plaintiffs, the local economy, the Gulf region, and the critical present-day aspect of the availability of domestic energy in this country.' The state's governor agrees on the grounds that blocking drilling will cost the state thousands of lucrative jobs." The government quickly vowed to appeal, pointing out that a moratorium on 33 wells is unlikely to have a devastating impact in a region hosting 3,600 active wells. And reader thomst adds this insight on the judge involved in the case: "Yahoo's Newsroom is reporting that the judge who overturned the drilling moratorium holds stock in drilling companies. You can view his financial disclosure forms listing his stock holdings online at Judicial Watch (PDF)." -
For-Profit, Illegal Movie Download Sites Threaten MPAA
vossman77 writes that BitTorrent is no longer the MPAA's enemy number one. They are now more concerned about illicit, for-profit movie download sites. This reader adds, "Just a thought, but maybe if the studios offered a low-cost, for-profit, legitimate download site without DRM, they could receive the profits at the expense of the cyberlockers." "Movie fans downloading free pirated films are no longer Hollywood's worst nightmare, but that's only because of a newer menace: cheap, and equally illegal, subscription services. Foreign, often mob-run, businesses aggregate illegally obtained movies into 'cyberlockers.' Cyberlocker-based businesses operate from Russia, Ukraine, Colombia, Germany, Switzerland, and elsewhere. ... Hollywood movies are made available via illegal for-profit sites within days of theatrical release, while the advent of global releasing now allows the proliferation of individual titles into an array of language dubs within the first month of a theatrical debut. ... When movies are released on DVD and Blu-ray disc, the sites upgrade the quality of video offered from camcorded images to pristine digital copies. 'Sometimes these sites look better than the legitimate sites,' Huntsberry said. 'That's the irony.'" -
Romania Now Taking Donations
The Romanian government is taking an unusual approach to fixing its economic problems; it has created a donation box. Everyone except legal entities can donate money to the newly formed "solidarity fund." From the article: "Officials said the new fund is aimed at public officials who earn additional income on top of regular wages by serving on administrative boards of companies entirely or partially owned by the state. Prime Minister Emil Boc has also said he will donate his wages to the fund, but the account is open to anyone who wishes to contribute. Donations can be made by bank transfer to a special account and a list of donations will subsequently be published on the ministry's website." -
Smart Underwear Designed For Military
A team of scientists at the University of California San Diego, led by nano-engineering professor Joseph Wang, has designed some high-tech underwear that may save lives. Sensors in the waistband can monitor a person's blood pressure, heart rate, and other vital signs. The designers also hope that one day the underwear can release drugs to relieve pain and treat wounds. From the article: "But the technology's range of application goes beyond the military. 'We envision all the trend of personalized medicine for remote monitoring of the elderly at home, monitoring a wide range of biomedical markers, like cardiac markers, alerting for any potential stroke, diabetic changes, and other changes related to other biomedical scenario,' said Wang. Wearable biosensors can also provide valuable information to athletes or even measure blood alcohol levels." -
Scientists Use Calvin Klein Cologne to Lure Big Cats
Biologists can't speak on the effectiveness of Calvin Klein Obsession for Men on the cougars at your local bar, but they do know that jaguars love it. Rony Garcia and Jose Moreira from the Wildlife Conservation Society's (WCS) Jaguar Conservation Program use the cologne to attract jaguars in the jungles of Guatemala. "The method we are using to study the jaguars here in Guatemala is a non-invasive method which is based on photographing the individuals by using camera traps," Moreira says. "It has been very useful using Obsession (for Men) to get the jaguars in front of these camera traps ... and that allows us to estimate with greater confidence the genders and the numbers that live in each studied site." -
Restaurant Tells Diners To Eat Everything On Their Plate
Chef Yukako Ichikawa will offer a 30% discount to patrons who eat all the food they have ordered, and will kindly ask those who don't clean their plates to not come back. "Finishing your meal requires that everything is eaten except lemon slices, gari (sushi ginger), and wasabi," says the menu. "Please also note that vegetables and salad on the side are NOT decorations; they are part of the meal too."