Domain: princeton.edu
Stories and comments across the archive that link to princeton.edu.
Stories · 137
-
DuckDuckGo CEO: 'Google and Facebook Are Watching Our Every Move Online. It's Time To Make Them Stop' (cnbc.com)
An anonymous reader shares a report from CNBC, written by Gabriel Weinberg, CEO and founder of DuckDuckGo: You may know that hidden trackers lurk on most websites you visit, soaking up your personal information. What you may not realize, though, is 76 percent of websites now contain hidden Google trackers, and 24 percent have hidden Facebook trackers, according to the Princeton Web Transparency & Accountability Project. The next highest is Twitter with 12 percent. It is likely that Google or Facebook are watching you on many sites you visit, in addition to tracking you when using their products. As a result, these two companies have amassed huge data profiles on each person, which can include your interests, purchases, search, browsing and location history, and much more. They then make your sensitive data profile available for invasive targeted advertising that can follow you around the Internet.
[...]
So how do we move forward from here? Don't be fooled by claims of self-regulation, as any useful long-term reforms of Google and Facebook's data privacy practices fundamentally oppose their core business models: hyper-targeted advertising based on more and more intrusive personal surveillance. Change must come from the outside. Unfortunately, we've seen relatively little from Washington. Congress and federal agencies need to take a fresh look at what can be done to curb these data monopolies. They first need to demand more algorithmic and privacy policy transparency, so people can truly understand the extent of how their personal information is being collected, processed and used by these companies. Only then can informed consent be possible. They also need to legislate that people own their own data, enabling real opt-outs. Finally, they need to restrict how data can be combined including being more aggressive at blocking acquisitions that further consolidate data power, which will pave the way for more competition in digital advertising. Until we see such meaningful changes, consumers should vote with their feet. -
DuckDuckGo CEO: 'Google and Facebook Are Watching Our Every Move Online. It's Time To Make Them Stop' (cnbc.com)
An anonymous reader shares a report from CNBC, written by Gabriel Weinberg, CEO and founder of DuckDuckGo: You may know that hidden trackers lurk on most websites you visit, soaking up your personal information. What you may not realize, though, is 76 percent of websites now contain hidden Google trackers, and 24 percent have hidden Facebook trackers, according to the Princeton Web Transparency & Accountability Project. The next highest is Twitter with 12 percent. It is likely that Google or Facebook are watching you on many sites you visit, in addition to tracking you when using their products. As a result, these two companies have amassed huge data profiles on each person, which can include your interests, purchases, search, browsing and location history, and much more. They then make your sensitive data profile available for invasive targeted advertising that can follow you around the Internet.
[...]
So how do we move forward from here? Don't be fooled by claims of self-regulation, as any useful long-term reforms of Google and Facebook's data privacy practices fundamentally oppose their core business models: hyper-targeted advertising based on more and more intrusive personal surveillance. Change must come from the outside. Unfortunately, we've seen relatively little from Washington. Congress and federal agencies need to take a fresh look at what can be done to curb these data monopolies. They first need to demand more algorithmic and privacy policy transparency, so people can truly understand the extent of how their personal information is being collected, processed and used by these companies. Only then can informed consent be possible. They also need to legislate that people own their own data, enabling real opt-outs. Finally, they need to restrict how data can be combined including being more aggressive at blocking acquisitions that further consolidate data power, which will pave the way for more competition in digital advertising. Until we see such meaningful changes, consumers should vote with their feet. -
Over 400 of the World's Most Popular Websites Record Your Every Keystroke (vice.com)
An anonymous reader quotes a report from Motherboard: The idea of websites tracking users isn't new, but research from Princeton University released last week indicates that online tracking is far more invasive than most users understand. In the first installment of a series titled "No Boundaries," three researchers from Princeton's Center for Information Technology Policy (CITP) explain how third-party scripts that run on many of the world's most popular websites track your every keystroke and then send that information to a third-party server. Some highly-trafficked sites run software that records every time you click and every word you type. If you go to a website, begin to fill out a form, and then abandon it, every letter you entered in is still recorded, according to the researchers' findings. If you accidentally paste something into a form that was copied to your clipboard, it's also recorded. These scripts, or bits of code that websites run, are called "session replay" scripts. Session replay scripts are used by companies to gain insight into how their customers are using their sites and to identify confusing webpages. But the scripts don't just aggregate general statistics, they record and are capable of playing back individual browsing sessions. The scripts don't run on every page, but are often placed on pages where users input sensitive information, like passwords and medical conditions. Most troubling is that the information session replay scripts collect can't "reasonably be expected to be kept anonymous," according to the researchers. -
What Happens When Geoengineers 'Hack The Planet'? (thebulletin.org)
Dan Drollette shares an article by an Oxford physics professor who's concerned about the popularity of radical new proposals to fight global warming. The Christian Science Monitor wonders if it's time to re-engineer our climate. MIT's Technology Review basically thinks the answer is "yes," having described it earlier as "cheap and easy." The Atlantic seems quite smitten with Economist writer Oliver Morton's vision of remaking the planet, which geoengineering booster Jane Long breathlessly called "geopoetry." The idea received recent coverage (much of it favorable) by New Scientist, NBC, and in TED talks; I myself have recently participated in an NPR panel discussion on the subject... But what has really catapulted the idea into the public eye is Harvard's reckless plan for a privately-funded field trial testing some of the key elements needed... Proceeding to field experimentation crosses a thin red line beyond which lies the slippery slope down to ever-larger field trials and ultimately deployment.
Harvard's experiment -- which is partially funded by Bill Gates -- is "subject to no governance save what Harvard chooses to impose upon itself," according to the article. The experiment involves "putting something in the atmosphere to reflect more sunlight back out into space," which the article warns will create "enduring" effects -- and require humanity to commit to maintaining the same atmospheric conditions forever. -
Researchers Send Information Using a Single Particle of Light (vice.com)
An anonymous reader quotes a report from Motherboard: According to research published Thursday in Science, physicists at Princeton University have designed a device that allows a single electron to pass its quantum information to a photon in what could be a big breakthrough for silicon-based quantum computers. The device designed by the Princeton researchers is the result of five years of research and works by trapping an electron and a photon within a device built by HRL laboratories, which is owned by Boeing and General Motors. It is a semi-conductor chip made from layers of silicon and silicon-germanium, materials that are inexpensive and already widely deployed in consumer electronics. Across the top of this wafer of silicon layers were laid a number of nanowires, each smaller than the width of a human hair, which were used to deliver energy to the chip. This energy allowed the researchers to trap an electron in between the silicon layers of the chip in microstructures known as quantum dots. The researchers settled on photons as the medium of exchange between electrons since they are less sensitive to disruption from their environment and could potentially be used to carry quantum information between quantum chips, rather than within the circuits on a single quantum chip. The ability to scale up this device would mean that photons could be used to pass quantum information from electron to electron in order to form the circuits for a quantum computer. "We now have the ability to actually transmit the quantum state to a photon," said Xiao Mi, a graduate student in Princeton's Department of Physics. "This has never been done before in a semiconductor device because the quantum state was lost before it could transfer its information." -
Princeton Researchers Announce Open Source 25-Core Processor (pcworld.com)
An anonymous reader writes: Researchers at Princeton announced at Hot Chips this week their 25-core Piton Processor. The processor was designed specifically to increase data center efficiency with novel architecture features enabling over 8,000 of these processors to be connected together to build a system with over 200,000 cores. Fabricated on IBM's 32nm process and with over 460 million transistors, Piton is one of the largest and most complex academic processors every built. The Princeton team has opened their design up and released all of the chip source code, tests, and infrastructure as open source in the OpenPiton project, enabling others to build scalable, manycore processors with potentially thousands of cores. -
NASA's Planet Hunter Spots Record 1,284 New Planets, 9 In A Habitable Zone (networkworld.com)
coondoggie quotes a report from Network World: NASA's planet hunting space telescope Kepler added a record 1,284 confirmed planets to its already impressive discoveries of extraterrestrial worlds. [This batch of planets is the largest single account of new planets since Kepler launched in 2009 and more than doubles the number of confirmed planets realized by the space telescope so far to more than 2,300.] The discoveries were a result of an automated technique implemented in a publicly available custom software package called Vespa, which lets scientists analyze thousands of signals Kepler has identified to determine which are most likely to be caused by planets and which are caused by non-planetary objects such as stars. "Vespa computed the reliability values for over 7,000 signals identified in the latest Kepler catalog which identified 4,302 potential planets and verified the 1,284 planets with 99% certainty," said the Princeton researchers that developed Vespa. NASA said, based on their size, nearly 550 of the validated planets could be rocky like Earth. Nine of which orbit in their sun's habitable zone. -
CV of Failures: Princeton Professor Publishes Resume of His Career Lows (theguardian.com)
An anonymous reader shares a Guardian report: A professor at Princeton University has published a CV listing his career failures (PDF), in an attempt to "balance the record" and encourage others to keep trying in the face of disappointment. Johannes Haushofer, who is an assistant professor of psychology and public affairs at the university in New Jersey, posted his unusual CV on Twitter last week. The document contains sections titled Degree programs I did not get into , Research funding I did not get and Paper rejections from academic journals. Haushofer writes: Most of what I try fails, but these failures are often invisible, while the successes are visible. I have noticed that this sometimes gives others the impression that most things work out for me. As a result, they are more likely to attribute their own failures to themselves, rather than the fact that the world is stochastic, applications are crapshoots, and selection committees and referees have bad days. This CV of Failures is an attempt to balance the record and provide some perspective. He added another section called "Meta-Failures" to his resume, writing, "This darn CV of Failures has received way more attention than my entire body of academic work." -
Coding Styles Survive Binary Compilation, Could Lead Investigators Back To Programmers (princeton.edu)
An anonymous reader writes: Researchers have created an algorithm that can accurately detect code written by different programmers (PDF), even if the code has been compiled into an executable binary. Because of open source coding repositories like GitHub, state agencies can build a database of all developers and their coding styles, and then easily compare the coding style used in "anti-establishment" software to detect the culprit. Despite all the privacy implications this research may have, the algorithm can also be used by security researchers to track down malware authors. We also discussed an earlier phase of this research. -
White House Names Ed Felten As Deputy U.S. Chief Technology Officer
New submitter bird writes: Ed Felton, Director of Princeton University's Center for Information Technology Policy (CITP) and well-known and outspoken consumer advocate, has been appointed deputy US chief technology officer. His is a voice of reason that needs to be heard when tech policy is made. The press release says: "We are excited to announce that Dr. Ed Felten is joining the White House Office of Science and Technology Policy as Deputy U.S. Chief Technology Officer. Ed joins a growing number of techies at the White House working to further President Obama’s vision to ensure policy decisions are informed by our best understanding of state-of-the-art technology and innovation, to quickly and efficiently deliver great services for the American people, and to broaden and deepen the American people’s engagement with their government." -
White House Names Ed Felten As Deputy U.S. Chief Technology Officer
New submitter bird writes: Ed Felton, Director of Princeton University's Center for Information Technology Policy (CITP) and well-known and outspoken consumer advocate, has been appointed deputy US chief technology officer. His is a voice of reason that needs to be heard when tech policy is made. The press release says: "We are excited to announce that Dr. Ed Felten is joining the White House Office of Science and Technology Policy as Deputy U.S. Chief Technology Officer. Ed joins a growing number of techies at the White House working to further President Obama’s vision to ensure policy decisions are informed by our best understanding of state-of-the-art technology and innovation, to quickly and efficiently deliver great services for the American people, and to broaden and deepen the American people’s engagement with their government." -
How To Increase the Number of Female Engineers
HughPickens.com writes: Lina Nilsson writes in an op-ed piece in the NY Times that she looks with despair at estimates that only about 14 percent of engineers in the work force are women. But there may be a solution to the disparity that is much simpler than targeted recruitment efforts. "An experience here at the University of California, Berkeley, where I teach, suggests that if the content of the work itself is made more societally meaningful, women will enroll in droves," writes Nilsson. "That applies not only to computer engineering but also to more traditional, equally male-dominated fields like mechanical and chemical engineering." Nilsson says that Blum Center for Developing Economies recently began a new program that, without any targeted outreach, achieved 50 percent female enrollment in just one academic year. In the fall of 2014, UC Berkeley began offering a new Ph.D. minor in development engineering for students doing thesis work on solutions for low-income communities. They are designing affordable solutions for clean drinking water, inventing medical diagnostic equipment for neglected tropical diseases and enabling local manufacturing in poor and remote regions.
According to Nilsson, women seem to be drawn to engineering projects that attempt to achieve societal good. She notes that MIT, the University of Minnesota, Penn State, Santa Clara University, Arizona State, and the University of Michigan have programs aimed at reducing global poverty and inequality that have achieved similar results. For example, at Princeton, the student chapter of Engineers Without Borders has an executive board that is nearly 70 percent female, reflecting the overall club composition. "It shows that the key to increasing the number of female engineers may not just be mentorship programs or child care centers, although those are important," concludes Nilsson. "It may be about reframing the goals of engineering research and curriculums to be more relevant to societal needs. It is not just about gender equity — it is about doing better engineering for us all." -
NJ Museum Revives TIROS Satellite Dish After 40 Years
evanak writes TIROS was NASA's Television Infrared Observation Satellite. It launched in April 1960. One of the ground tracking stations was located at the U.S. Army's secret "Camps Evans" Signals Corps electronics R&D laboratory. That laboratory (originally a Marconi wireless telegraph lab) became the InfoAge Science Center in the 2000s. [Monday], after many years of restoration, InfoAge volunteers (led by Princeton U. professor Dan Marlowe) successfully received data from space. The dish is now operating for the first time in 40 years! The received data are in very raw form, but there is a clear peak riding on top of the noise background at 0.4 MHz (actually 1420.4 MHz), which is the well-known 21 cm radiation from the Milky Way. The dish was pointing south at an elevation of 45 degrees above the horizon. -
Many DDR3 Modules Vulnerable To Bit Rot By a Simple Program
New submitter Pelam writes: Researchers from Carnegie Mellon and Intel report that a large percentage of tested regular DDR3 modules flip bits in adjacent rows (PDF) when a voltage in a certain control line is forced to fluctuate. The program that triggers this is dead simple — just two memory reads with special relative offset and some cache control instructions in a tight loop. The researchers don't delve deeply into applications of this, but hint at possible security exploits. For example a rather theoretical attack on JVM sandbox using random bit flips (PDF) has been demonstrated before. -
Massive Volcanic Eruptions Accompanied Dinosaur Extinction
schwit1 writes: A careful updating of the geological timeline has shown that massive volcanic eruptions aligned with the extinction event that wiped out the non-avian dinosaurs 66 million years ago (abstract). "A primeval volcanic range in western India known as the Deccan Traps, which were once three times larger than France, began its main phase of eruptions roughly 250,000 years before the Cretaceous-Paleogene, or K-Pg, extinction event, the researchers report in the journal Science. For the next 750,000 years, the volcanoes unleashed more than 1.1 million cubic kilometers (264,000 cubic miles) of lava. The main phase of eruptions comprised about 80-90 percent of the total volume of the Deccan Traps' lava flow and followed a substantially weaker first phase that began about 1 million years earlier.
The results support the idea that the Deccan Traps played a role in the K-Pg extinction, and challenge the dominant theory that a meteorite impact near present-day Chicxulub, Mexico, was the sole cause of the extinction. The researchers suggest that the Deccan Traps eruptions and the Chicxulub impact need to be considered together when studying and modeling the K-Pg extinction event." -
Be True To Your CS School: LinkedIn Ranks US Schools For Job-Seeking Programmers
theodp writes "The Motley Fool reports that the Data Scientists at LinkedIn have been playing with their Big Data, ranking schools based on how successful recent grads have been at landing desirable software development jobs. Here's their Top 25: CMU, Caltech, Cornell, MIT, Princeton, Berkeley, Univ. of Washington, Duke, Michigan, Stanford, UCLA, Illinois, UT Austin, Brown, UCSD, Harvard, Rice, Penn, Univ. of Arizona, Harvey Mudd, UT Dallas, San Jose State, USC, Washington University, RIT. There's also a shorter list for the best schools for software developers at startups, which draws a dozen schools from the previously mentioned schools, and adds Columbia, Univ. of Virginia, and Univ. of Maryland College Park. If you're in a position to actually hire new graduates, how much do you care about applicants' alma maters? -
Scientists Seen As Competent But Not Trusted By Americans
cold fjord writes The Woodrow Wilson School reports, "If scientists want the public to trust their research suggestions, they may want to appear a bit 'warmer,' according to a new review published by Princeton University's Woodrow Wilson School of Public and International Affairs. The review, published in the Proceedings of the National Academy of Sciences (PNAS), shows that while Americans view scientists as competent, they are not entirely trusted. This may be because they are not perceived to be friendly or warm. In particular, Americans seem wary of researchers seeking grant funding and do not trust scientists pushing persuasive agendas. Instead, the public leans toward impartiality. 'Scientists have earned the respect of Americans but not necessarily their trust,' said lead author Susan Fiske, the Eugene Higgins Professor of Psychology and professor of public affairs. 'But this gap can be filled by showing concern for humanity and the environment. Rather than persuading, scientists may better serve citizens by discussing, teaching and sharing information to convey trustworthy intentions.'" -
Breakthrough In LED Construction Increases Efficiency By 57 Percent
Zothecula writes: With LEDs being the preferred long-lasting, low-energy method for replacing less efficient forms of lighting, their uptake has dramatically increased over the past few years. However, despite their luminous outputs having increased steadily over that time, they still fall behind more conventional forms of lighting in terms of brightness. Researchers at Princeton University claim to have come up with a way to change all that by using nanotechnology to increase the output of organic LEDs by 57 percent. -
US Revamping Its Nuclear Arsenal
FreedomFirstThenPeac writes: As a former Cold Warrior (both launch officer side and staff analytical mathematician side), I now appreciate the bitterness I saw in former WW2 warriors when they would see a Japanese car. According to the NY Times, a new assembly plant in Kansas is "part of a nationwide wave of atomic revitalization that includes plans for a new generation of weapon carriers. This expansion comes under a president who campaigned for 'a nuclear-free world' and made disarmament a main goal of American defense policy." Mind you, Mutual Assured Destruction is a dangerous path, and one we managed to negotiate only because we were lucky (and we were) and because we were careful (and we were).
As a strategy, it only works with rational people (e.g., world powers with lots to lose) who might have irrational expectations that they will win in the long run. (The rapid fall of imperialist Russia was helpful — I have seen blackboard talks on this as a mathematical result in game theory. This speed minimized the time we spent in the high-risk regions while transiting from MAD to where we were in the 1990s). The Times article says, "The original idea was that modest rebuilding of the nation’s crumbling nuclear complex would speed arms refurbishment, raising confidence in the arsenal’s reliability and paving the way for new treaties that would significantly cut the number of warheads. Instead, because of political deals and geopolitical crises, the Obama administration is engaging in extensive atomic rebuilding while getting only modest arms reductions in return." -
Solar-Powered Electrochemical Cell Used To Produce Formic Acid From CO2
Zothecula writes Rising atmospheric CO2 levels can generally be tackled in three ways: developing alternative energy sources with lower emissions; carbon capture and storage (CCS); and capturing carbon and repurposing it. Researchers at Princeton University are claiming to have developed a technique that ticks two of these three boxes by using solar power to convert CO2 into formic acid. With power from a commercially available solar panel provided by utility company Public Service Electric and Gas (PSE&G), researchers in the laboratory of Princeton professor of chemistry Andrew Bocarsly, working with researchers at New Jersey-based start-up Liquid Light Inc., converted CO2 and water to formic acid (HCOOH) in an electrochemical cell. -
Study Finds US Is an Oligarchy, Not a Democracy
An anonymous reader writes "Researchers from Princeton University and Northwestern University have concluded, after extensive analysis of 1,779 policy issues, that the U.S. is in fact an oligarchy and not a democracy. What this means is that, although 'Americans do enjoy many features central to democratic governance,' 'majorities of the American public actually have little influence over the policies our government adopts.' Their study (PDF), to be published in Perspectives on Politics, found that 'When the preferences of economic elites and the stands of organized interest groups are controlled for, the preferences of the average American appear to have only a minuscule, near-zero, statistically non-significant impact upon public policy.'" -
Searching For Dark Matter From Deep Under an Italian Mountain
Zothecula writes "Like the Higgs Boson, dark matter is one of those things in the Universe that evidence points to, but is very difficult to pin down. A team of researchers is looking to verify the existence of this most elusive of ingredients that is thought to make up 23 percent of the Universe using powerful detectors buried deep in an Italian mountain. The DarkSide-50 project is an international collaboration between Italian, French, Polish, Ukrainian, Russian, and Chinese institutions, as well as 17 American universities. The project team spent last (Northern hemisphere) summer assembling the detector in a laboratory deep within the Gran Sasso mountain, which is accessed via an exit off a six-mile (9.6 km) long highway tunnel in Italy." -
Unpublished J. D. Salinger Stories Leaked On Bittorrent Site
192_kbps writes "Catcher in the Rye author J. D. Salinger wrote the short story The Ocean Full of Bowling Balls and left depository copies with a few academic libraries with the understanding that the work would not see mass distribution until the mid-21st century. The only authorized place to read the story is in a special reading room at Princeton where electronics are not allowed and a librarian continuously babysits the reader. A PDF of the story, as well as two other unpublished stories, appeared on private bittorrent site what.cd where a huge bounty had been placed for the work. Incredibly, the uploader (or someone connected to the uploader) bought an unauthorized copy on eBay for a pittance. The file, Three Stories, is making the bittorrent rounds but can also be read on mediafire." -
Mathematicians Team Up To Close the Prime Gap
Hugh Pickens DOT Com writes "On May 13, an obscure mathematician garnered worldwide attention and accolades from the mathematics community for settling a long-standing open question about prime numbers. Yitang Zhang showed that even though primes get increasingly rare as you go further out along the number line, you will never stop finding pairs of primes separated by at most 70 million. His finding was the first time anyone had managed to put a finite bound on the gaps between prime numbers, representing a major leap toward proving the centuries-old twin primes conjecture, which posits that there are infinitely many pairs of primes separated by only two (such as 11 and 13). Now Erica Klarreich reports at Quanta Magazine that other mathematicians quickly realized that it should be possible to push this separation bound quite a bit lower. By the end of May, mathematicians had uncovered simple tweaks to Zhang's argument that brought the bound below 60 million. Then Terence Tao, a winner of the Fields Medal, mathematics' highest honor, created a 'Polymath project,' an open, online collaboration to improve the bound that attracted dozens of participants. By July 27, the team had succeeded in reducing the proven bound on prime gaps from 70 million to 4,680. Now James Maynard has upped the ante by presenting an independent proof that pushes the gap down to 600. A new Polymath project is in the planning stages, to try to combine the collaboration's techniques with Maynard's approach to push this bound even lower. Zhang's work and, to a lesser degree, Maynard's fits the archetype of the solitary mathematical genius, working for years in the proverbial garret until he is ready to dazzle the world with a great discovery. The Polymath project couldn't be more different — fast and furious, massively collaborative, fueled by the instant gratification of setting a new world record. 'It's important to have people who are willing to work in isolation and buck the conventional wisdom,' says Tao. Polymath, by contrast, is 'entirely groupthink.' Not every math problem would lend itself to such collaboration, but this one did." -
Soda Makes Five-Year-Olds Break Your Stuff, Science Finds
Daniel_Stuckey writes "Shakira F. Suglia and co-authors surveyed 2,929 mothers of five-year-olds (PDF) and found that 43 percent of the kids consumed at least one serving of soft drinks per day. About four percent of those children (or 110 of them), drank more than four soft drinks per day, and became 'more than twice as likely to destroy things belonging to others, get into fights, and physically attack people.' In the past, soda and its various strains have been related to depression, irritability, aggression, suicidal thoughts, and delusions of sweepstake-winning grandeur. Of course, this study didn't find out what types of soda the children had consumed." -
J.K. Rowling Should Try the Voting Algorithm
Frequent contributor Bennett Haselton proposes a new use for online, anonymous voting: helping sort skill from luck in the cheek-by-jowl world of best-selling (and would-be best-selling) authors: "J.K. Rowling recently confirmed that she was the author of a book she had published under a pseudonym, which spiked in sales after she was outed as the true author. Perhaps she was doing an experiment to see how much luck had played a role in propelling her to worldwide success, and whether she could recreate anything close to that success when starting from scratch. But a better way to answer that question would be to strike a deal with an amateur-fiction-hosting site and use the random-sample-voting algorithm that I've written so much about, to test how her writing stacks up against other writers in the same genre." Read on for more. Update: 07/20 01:23 GMT by T : Note: An editorial goof (mine) swapped out the word "confirmed" for "revealed" (above) in an earlier rendering of this story.Rowling confirmed (after the information leaked accidentally) that she had authored a new book, The Cuckoo's Calling, under the male pseudonym Robert Galbraith, which went on to sell only about 1,500 copies before she announced that she was the real author and sales of the book spiked 150,000%.
Stephen King actually tried something similar in the 1970s, publishing a series of books under the pseudonym "Richard Bachman," which he later said was partly an attempt to answer the question of whether his success was due to talent or luck. (The Richard Bachman books sold 10 times as many copies after King was revealed as the author.) Rowling has not said whether she was attempting a similar experiment, having issued a statement that before the revelation, it had been "wonderful to publish without hype or expectation, and pure pleasure to get feedback under a different name."
But if either J.K. Rowling or Stephen King really wants to find the answer to the question of talent vs. luck, the solution lies in the random-sample voting algorithm that I've been advocating in occasional articles for years now, going back to "Censorship By Glut" in 2006. Here's how the experiment could work, for evaluating the quality of fiction writing:
-
Rowling or King could approach a pre-established amateur fiction hosting site with a large number of registered users. Or they could create their own fiction hosting site and announce it to the world for the purpose of running the experiment, which would almost certainly attract a large number of users to sign up. (The experiment only works if the site has a large number of users, for reasons that will become clear.)
-
When a user submits a new short story to the site, the site randomly selects a small subset of other users on the site (say, 20 other users), emails them a link to the new story, and invites them to read it and rate its content. There are several ways you could incentivize those users to read the link and rate the story on a scale of 1 to 10. You could bill it as the "civic duty" of registered users of the site (in the same way that it's the civic duty of registered Wikipedia editors to maintain the quality of articles, even though the editors are working for free). You could require registered users to read and rate any stories that are emailed to them (although of course there'd be no way to stop someone from lazily submitting a rating without even reading the story). You could actually require payments from users who submit stories, and then use that money to distribute small payments to the raters as compensation for reading the story (although that seems like it would be the biggest headache, since you'd have to jump through legal and logistical hoops to set it up, and it would attract cheaters who would try to abuse the system just for the free small payments). But in any case, you don't need every user who gets emailed a story, to actually click through to read the story and rate it. All that matters is that out of those 20 users, enough of them click through to read the story, that you get a statistically representative sample of what users think of the quality.
Optionally, the story raters could also submit written feedback about why they liked or did not like a story. But the important part is collecting the numeric ratings so that they can be averaged into a single overall rating for the piece of content.
-
If a story gets a high enough average rating in the first round of voting, then it gets emailed out to a larger random sample of voters, say, 200. The ratings given by this larger sample can be used to distinguish the very best stories from the merely good. (We expect that for good stories, the ratings would tend to cluster around the high end of the scale, so with that smaller variance, it would take a larger sample size to find a statistically significant difference between the quality of two stories.)
-
The stories that get the highest ratings can be featured on the front page of the site, so that everybody can have the benefit of enjoying the "best" stories. Meanwhile, J.K. Rowling and Stephen King have the benefit of finding out how their stories compare against stories written by unpublished amateur writers.
It all sounds deceptively simple, but the important feature is that you've taken the arbitrariness out of the outcome. As long as your sample sizes are large enough, the rating that a story obtains in this system, will be approximately equal to the average rating it would get from all users across the site. "Luck" is no longer a factor, because you could re-run the experiment twice with the same set of stories, and get approximately the same outcome.
This is important, because numerous experiments and real-world studies have shown that in any environment where users can recommend content to each other and browse content that is already known to be popular — in other words, how most of us discover content in the real world — luck plays a much greater role in which content becomes wildly successful. The generally accepted explanation is that an initial stroke of luck can have a self-reinforcing snowball effect — if a few key influencers happen to discover and recommend a piece of content at the same time, their friends and followers will be drawn to that content as well, and once it crosses that threshold, the content has now become "popular" enough that even more users will be drawn to it just because it's popular.
This is also why any of the existing fiction-rating sites would not work for this experiment — because most such sites allow authors to invite their friends to sign up and give high ratings to their stories, or to form cliques that all give high ratings to each other's writings. It's usually in the site's best interests to allow these tricks, because it gives authors the incentive to promote the site to their friends in order to get them to sign up. But it also means that (a) authors can easily game the system, and the highest-rated stories may not be the highest-quality ones but the ones whose authors simply play the game the best, and (b) even without "gaming the system", the fact that users can see other users' ratings and can seek out "most popular" or "trending" stories, creates the snowball effects discussed above, and introduces a huge amount of arbitrariness into the process.
Duncan Watts' excellent book Everything Is Obvious (Once You Know The Answer) is an excellent introduction to the arbitrariness phenomenon, but if you don't have time to read the whole book, just read about the Matthew Salganik 'many-worlds' experiment", which Watts co-authored and which I've linked to in pretty much every other article I've written about the random-sample-voting algorithm. The gist was that if you divide users into multiple artificial "worlds," where users can recommend content only to other users within those worlds, and seed all artificial worlds with the same content (in this case, songs), then songs which become wildly popular in some worlds will become duds in others.
The whole of Everything is Obvious is at least as insightful as anything ever written by Malcolm Gladwell, and would appeal to the same people, but it never became a bestseller, because — well, probably because we live in one of the many possible worlds of a Salganik experiment, and in the world we happen to live in, the luck of the draw meant that book didn't take off.
But back to the proposed experiment. It is true that the votes of the average users would not tell us anything about whether the winning stories were "artistically" good, however you define that. But in King's case, he was not trying to answer questions about artistic merit. he was trying to find out if his bestselling-author status was due to talent or luck, so the average rating from regular readers would be quite on point. Rowling said that she wanted to write without any hype and receive honest feedback, and it's hard to imagine a better place to do that than writing under a pseudonym for a fiction site that distributes your content directly to the public.
Both King and Rowling deserve some credit for even addressing the question of whether their success was due to talent or luck. It would have been easy for them to assume that their global success was due to their innate skill and hard work, and 99% of the world would have accepted that explanation, so it took no small amount of courage to even raise the question of how luck might have played a role. (We all know plenty of successful people who take umbrage if you even mention "the L word".)
But King did say that he thought he was outed too early to obtain any conclusive results from the experiment (and Rowling also said she wished she could have kept writing under the pseudonym, although she didn't say whether she had any similar "experiment" in mind). The random-sample-voting algorithm would provide instant feedback, not just to King and Rowling, but to any other writer who wanted to see how their writings would stack up against others in the field, from unpublished amateurs to worldwide bestselling authors.
My prediction, if such an experiment is ever conducted: King's and Rowling's writings would be rated very good, but so would many other writers' stories, including struggling writers who have never been published. Or as economist Daniel Kahneman put it: "success = talent + luck; great success = a little more talent + a lot of luck." (That took a certain amount of modesty on his part too, having achieved "great success" himself in the form of a Nobel Prize.) If J.K. Rowling or Stephen King ever launched such an experiment, the biggest favor they'd be doing for the world would not be to boost the egos of a few struggling writers, but to call more attention to the role that luck plays the world.
It's not as if their own egos would have to be bruised in the process. Donald Trump, the last person in the world that I would have guessed to have uttered these words, actually said that "Everything in life is luck," but it didn't seem to deflate his opinion of himself. You don't have to be a jerk like Trump, but just because some unpublished author's story gets a higher rating than yours, doesn't mean you have to let him come live in your mansion.
-
-
Facebook's Complaint Process Is Arbitrary — But So Is Campaigning
Bennett Haselton writes "After initial abuse reports failed to shut down some anti-women and pro-rape pages on Facebook, a wider lobbying campaign succeeded in prompting a Facebook policy change. This has been alternately hailed as a vindication of the campaigner's cause, or derided as proof that Facebook can be cowed by humorless feminists. In reality, the success of the campaign was most likely the outcome of a mostly arbitrary and random process that required a lot of luck, just as the initial abuse reports didn't succeed because they didn't have the necessary luck on their side. Neither result should be taken to reflect on the merits of the campaigner's actual points." Read on for the rest of Bennett's thoughts. On May 28th, Facebook released a statement acknowledging that it had not responded effectively to complaints against pages containing "gender-based hate speech" (e.g. "Slapping hookers in the face with a shoe." and several much worse examples glorifying rape or violence). The decision came at the end of the "#fbrape" campaign by feminist groups to pressure advertisers whose ads had been appearing on the most offensive pages; major advertisers like Nissan announced that they were withdrawing advertisements from Facebook until they could be assured their ads would not appear on the pages in question (most of which were ultimately shut down by Facebook).
I've written before about the arbitrariness of Facebook's abuse-report process, and in particular how it can be abused by convening a "flash mob" of users to file abuse reports about pieces of content that they want removed, even when that content doesn't violate Facebook's terms. The solution I proposed, briefly, was for Facebook to sign up, say, 100,000 volunteers (or even paid users) to review "abuse reports." and when an abuse report is received, have the report evaluated by a random subset of 100 of those volunteers, to vote on whether the report is legitimate. The decision whether to remove the content can be based on what percent of those 100 users vote that it violates the terms of service. The nice property of this system is that it can't be manipulated by conscripting a "flash mob" of users to file complaints all at once — no matter how many mobsters you have filing abuse reports, if your complaints don't have merit, they won't pass the random-sample review (unless you manage to control a significant proportion of the 100,000 users that the 100 users are randomly selected from, but that would be a very tall order).
This also means that no abuse complaint would be ignored because too few people submitted it — any abusive content that was reported, would trigger the 100-user review. (Or if you thought cranks would waste too much of the reviewers' time by filing phone abuse reports, you could only trigger the 100-user review after, say, 3 people had complained about a given page. Or you could start ignoring complaints from users after they had filed a certain number of complaints that were all rejected by the 100-user review process.) Readers suggested various improvements to the algorithm and pointed out potential problems, but I think the basic idea is still sound.
Some of the abusive pages cited by the #fbrape campaigners, are truly graphic and offensive, certainly in violation of Facebook's "community standards" against "hate speech." If they had been reviewed by a 100-user random sample, they probably would have been removed. As it is, the complaints probably landed in the lap of some $1-an-hour grunt worker who ignored them (Facebook's opacity in regards to its review process gives us little more information than that). If the complainers had been luckier, perhaps the abuse reports might have gotten noticed by someone more proactive.
So even if the #fbrape campaigners didn't put it in these terms, their gripe was essentially that the Facebook complaint review process leads to arbitrary outcomes, and the complaints didn't gain traction because luck wasn't on their side.
But what about the #fbrape campaign itself, to bring the pages to Facebook's attention through media action, after the initial abuse reports were ignored?
This is probably an example of what could be called the "Salganik Effect." Matthew Salganik is a Princeton University professor who in 2006 conducted a study examining how certain songs became popular in simulated worlds in which users could rate songs and recommend them to their friends. In his simulation, he divided users into eight artificial "worlds" in which the users in a given world could only see the ratings and recommendations from other users within that world. Then each world was seeded with the same set of songs to see which songs grew in popularity. His team found that the set of songs which became "successful." varied wildly between worlds — such that within any given world, although the very worst songs never came popular, the set of songs that did become popular were essentially a random selection from among those that were merely "good enough."
Online movements gain traction through such a similar process — users 'liking' a page or recommending it to friends, recommendations radiating out from the popular elite according to Malcolm Gladwell's "Law of the Few -- that this suggests the success of a campaign like #fbrape could have been the result of an arbitrary process dominated by luck, just like the success of a song in one of Salganik's artificial worlds. We can never know for sure, since we can't divide real-life Facebook users into multiple artificial worlds, or re-run history to see how often the outcome would be different. But you should read Everything Is Obvious* (Once You Know The Answer), a book written by Duncan J. Watts, one of the co-authors of Salganik's study. The book argues that many of the outcomes that seem like foregone conclusions in hindsight, such as the success of a product, twitter meme, company, idea, or person, are really the result of an arbitrary process that is impossible to predict, much less control. If you liked Freakonomics or Thinking, Fast And Slow, you should add Everything Is Obvious to your reading list right away.
In the case of the #fbrape campaign, the strong form of the conclusion would be to say that the success of the campaign is probably the outcome of a random process. But everyone should at least agree with the weak form of the conclusion, which is that the success and failures of online campaigns could be a random process — and that it's a mistake to say that the success of a campaign definitely is determined by the merits of the campaign's ideas or by the efforts of the campaign organizers. If we can't prove how much luck has to do with it, we have to acknowledge that it could be quite a lot.
That doesn't mean the #fbrape campaign didn't have merit. Like the songs in Salganik's artificial worlds that were "good enough" to succeed if given the chance, the #fbrape campaign organizers did have a point. But we shouldn't take the phenomenal success of the campaign to mean that they had that much more of a valid point than many other campaigns which fizzled out due to bad luck. (Thus I think that articles like this one by Sandy Garossino, even if they're right about the problem of pro-rape content, are missing the point insofar as they imply that the movement's success was due to the hard work of the "smartest feminists on the planet." It's a bit early to declare that "On May 27th, women won the Internet.")
The initial complaints failed because of an arbitrary process, and then the #fbrape campaign succeeded because of an equally arbitrary process. The next such awareness campaign, even if it has merit, might not have luck on its side.
The arbitrariness in both of these processes can be fixed. For the first process — abuse reports submitted to Facebook — the fix is easy: have each complaint reviewed by a random subset of volunteers or employees who are signed up to review such content, as described above. This makes the outcome dependent on the attributes of the content itself (as it should be), rather than luck and/or the size of the mob that wants something removed.
The arbitrariness in the second process — the process by which memes "catch fire" and spill over into mainstream media and broader awareness — is a taller order, but I think it can be fixed by essentially the same algorithm. What would be required would be for a site that has the power to make new memes through its sheer dominance, like Google+ or Reddit, to implement the random-sample-voting algorithm for memes and calls-to-action. Any user can submit an argument — very broadly, any type of exhortation that "we" should do "something" -- and these arguments could be reviewed by a random sample of, say, 20 other users on the site. Those arguments that have the highest percentage of "Yes" votes would get promoted on the front page. (This is the algorithm I was pushing in a previous article, Censorship By Glut.)
The system sounds deceptively simple, but note what's missing: You can't manipulate the voting by rallying your friends to vote for your idea (or by creating multiple "sockpuppet" accounts to vote up your own post). You don't even have the accidental Salganik effects, where friend-to-friend recommendations result in a chaotic feedback loop where certain ideas race ahead of others due to random factors that have nothing to do with the idea's merits. You've taken the arbitrariness out of the process, so that the fate of the idea is a function only of the attributes of the idea itself, which determines the percentage of randomly sampled users who vote it up. (This is not quite the same as rewarding the ideas with the most "merit" — rather, it's the ideas that the population being sampled perceive to have the most merit — but at least the outcome is not random, and the system cannot be gamed.)
Meanwhile, I hope that Facebook won't err too far on the side of abolishing sexist humor where the humor is in proportion to the offensiveness. In Women, Action & the Media's list of examples of "gender-based hate speech." they included a Facebook page titled "Hope you have pet insurance because I'm about to destroy your pussy." which I would optimistically like to think refers to enthusiastic sex and not rape. (The humor really derives from the fact that the words appear next to a physically unattractive man, which is one group that feminists never seem to get riled up about defending.) And what about jokes about anti-male violence, which were left out of WAM's examples? A friend of mine likes posting things on his Facebook like "I was trying to remember the name of Rihanna's ex, and then it hit me," which I thought was funny, but which some WAM supporters probably would have reported as "abusive content." I wonder how many of those same people would have filed a report if he'd said, "I was about to say the name of Lorena Bobbitt's ex, but I got cut off." -
Facebook's Complaint Process Is Arbitrary — But So Is Campaigning
Bennett Haselton writes "After initial abuse reports failed to shut down some anti-women and pro-rape pages on Facebook, a wider lobbying campaign succeeded in prompting a Facebook policy change. This has been alternately hailed as a vindication of the campaigner's cause, or derided as proof that Facebook can be cowed by humorless feminists. In reality, the success of the campaign was most likely the outcome of a mostly arbitrary and random process that required a lot of luck, just as the initial abuse reports didn't succeed because they didn't have the necessary luck on their side. Neither result should be taken to reflect on the merits of the campaigner's actual points." Read on for the rest of Bennett's thoughts. On May 28th, Facebook released a statement acknowledging that it had not responded effectively to complaints against pages containing "gender-based hate speech" (e.g. "Slapping hookers in the face with a shoe." and several much worse examples glorifying rape or violence). The decision came at the end of the "#fbrape" campaign by feminist groups to pressure advertisers whose ads had been appearing on the most offensive pages; major advertisers like Nissan announced that they were withdrawing advertisements from Facebook until they could be assured their ads would not appear on the pages in question (most of which were ultimately shut down by Facebook).
I've written before about the arbitrariness of Facebook's abuse-report process, and in particular how it can be abused by convening a "flash mob" of users to file abuse reports about pieces of content that they want removed, even when that content doesn't violate Facebook's terms. The solution I proposed, briefly, was for Facebook to sign up, say, 100,000 volunteers (or even paid users) to review "abuse reports." and when an abuse report is received, have the report evaluated by a random subset of 100 of those volunteers, to vote on whether the report is legitimate. The decision whether to remove the content can be based on what percent of those 100 users vote that it violates the terms of service. The nice property of this system is that it can't be manipulated by conscripting a "flash mob" of users to file complaints all at once — no matter how many mobsters you have filing abuse reports, if your complaints don't have merit, they won't pass the random-sample review (unless you manage to control a significant proportion of the 100,000 users that the 100 users are randomly selected from, but that would be a very tall order).
This also means that no abuse complaint would be ignored because too few people submitted it — any abusive content that was reported, would trigger the 100-user review. (Or if you thought cranks would waste too much of the reviewers' time by filing phone abuse reports, you could only trigger the 100-user review after, say, 3 people had complained about a given page. Or you could start ignoring complaints from users after they had filed a certain number of complaints that were all rejected by the 100-user review process.) Readers suggested various improvements to the algorithm and pointed out potential problems, but I think the basic idea is still sound.
Some of the abusive pages cited by the #fbrape campaigners, are truly graphic and offensive, certainly in violation of Facebook's "community standards" against "hate speech." If they had been reviewed by a 100-user random sample, they probably would have been removed. As it is, the complaints probably landed in the lap of some $1-an-hour grunt worker who ignored them (Facebook's opacity in regards to its review process gives us little more information than that). If the complainers had been luckier, perhaps the abuse reports might have gotten noticed by someone more proactive.
So even if the #fbrape campaigners didn't put it in these terms, their gripe was essentially that the Facebook complaint review process leads to arbitrary outcomes, and the complaints didn't gain traction because luck wasn't on their side.
But what about the #fbrape campaign itself, to bring the pages to Facebook's attention through media action, after the initial abuse reports were ignored?
This is probably an example of what could be called the "Salganik Effect." Matthew Salganik is a Princeton University professor who in 2006 conducted a study examining how certain songs became popular in simulated worlds in which users could rate songs and recommend them to their friends. In his simulation, he divided users into eight artificial "worlds" in which the users in a given world could only see the ratings and recommendations from other users within that world. Then each world was seeded with the same set of songs to see which songs grew in popularity. His team found that the set of songs which became "successful." varied wildly between worlds — such that within any given world, although the very worst songs never came popular, the set of songs that did become popular were essentially a random selection from among those that were merely "good enough."
Online movements gain traction through such a similar process — users 'liking' a page or recommending it to friends, recommendations radiating out from the popular elite according to Malcolm Gladwell's "Law of the Few -- that this suggests the success of a campaign like #fbrape could have been the result of an arbitrary process dominated by luck, just like the success of a song in one of Salganik's artificial worlds. We can never know for sure, since we can't divide real-life Facebook users into multiple artificial worlds, or re-run history to see how often the outcome would be different. But you should read Everything Is Obvious* (Once You Know The Answer), a book written by Duncan J. Watts, one of the co-authors of Salganik's study. The book argues that many of the outcomes that seem like foregone conclusions in hindsight, such as the success of a product, twitter meme, company, idea, or person, are really the result of an arbitrary process that is impossible to predict, much less control. If you liked Freakonomics or Thinking, Fast And Slow, you should add Everything Is Obvious to your reading list right away.
In the case of the #fbrape campaign, the strong form of the conclusion would be to say that the success of the campaign is probably the outcome of a random process. But everyone should at least agree with the weak form of the conclusion, which is that the success and failures of online campaigns could be a random process — and that it's a mistake to say that the success of a campaign definitely is determined by the merits of the campaign's ideas or by the efforts of the campaign organizers. If we can't prove how much luck has to do with it, we have to acknowledge that it could be quite a lot.
That doesn't mean the #fbrape campaign didn't have merit. Like the songs in Salganik's artificial worlds that were "good enough" to succeed if given the chance, the #fbrape campaign organizers did have a point. But we shouldn't take the phenomenal success of the campaign to mean that they had that much more of a valid point than many other campaigns which fizzled out due to bad luck. (Thus I think that articles like this one by Sandy Garossino, even if they're right about the problem of pro-rape content, are missing the point insofar as they imply that the movement's success was due to the hard work of the "smartest feminists on the planet." It's a bit early to declare that "On May 27th, women won the Internet.")
The initial complaints failed because of an arbitrary process, and then the #fbrape campaign succeeded because of an equally arbitrary process. The next such awareness campaign, even if it has merit, might not have luck on its side.
The arbitrariness in both of these processes can be fixed. For the first process — abuse reports submitted to Facebook — the fix is easy: have each complaint reviewed by a random subset of volunteers or employees who are signed up to review such content, as described above. This makes the outcome dependent on the attributes of the content itself (as it should be), rather than luck and/or the size of the mob that wants something removed.
The arbitrariness in the second process — the process by which memes "catch fire" and spill over into mainstream media and broader awareness — is a taller order, but I think it can be fixed by essentially the same algorithm. What would be required would be for a site that has the power to make new memes through its sheer dominance, like Google+ or Reddit, to implement the random-sample-voting algorithm for memes and calls-to-action. Any user can submit an argument — very broadly, any type of exhortation that "we" should do "something" -- and these arguments could be reviewed by a random sample of, say, 20 other users on the site. Those arguments that have the highest percentage of "Yes" votes would get promoted on the front page. (This is the algorithm I was pushing in a previous article, Censorship By Glut.)
The system sounds deceptively simple, but note what's missing: You can't manipulate the voting by rallying your friends to vote for your idea (or by creating multiple "sockpuppet" accounts to vote up your own post). You don't even have the accidental Salganik effects, where friend-to-friend recommendations result in a chaotic feedback loop where certain ideas race ahead of others due to random factors that have nothing to do with the idea's merits. You've taken the arbitrariness out of the process, so that the fate of the idea is a function only of the attributes of the idea itself, which determines the percentage of randomly sampled users who vote it up. (This is not quite the same as rewarding the ideas with the most "merit" — rather, it's the ideas that the population being sampled perceive to have the most merit — but at least the outcome is not random, and the system cannot be gamed.)
Meanwhile, I hope that Facebook won't err too far on the side of abolishing sexist humor where the humor is in proportion to the offensiveness. In Women, Action & the Media's list of examples of "gender-based hate speech." they included a Facebook page titled "Hope you have pet insurance because I'm about to destroy your pussy." which I would optimistically like to think refers to enthusiastic sex and not rape. (The humor really derives from the fact that the words appear next to a physically unattractive man, which is one group that feminists never seem to get riled up about defending.) And what about jokes about anti-male violence, which were left out of WAM's examples? A friend of mine likes posting things on his Facebook like "I was trying to remember the name of Rihanna's ex, and then it hit me," which I thought was funny, but which some WAM supporters probably would have reported as "abusive content." I wonder how many of those same people would have filed a report if he'd said, "I was about to say the name of Lorena Bobbitt's ex, but I got cut off." -
Higgs Data Could Spell Trouble For Leading Big Bang Theory
ananyo writes "Paul Steinhardt, an astrophysicist at Princeton University in New Jersey, and colleagues have posted a controversial paper on ArXiv arguing, based on the latest Higgs data and the cosmic microwave background map from the Planck mission, that the leading theory explaining the first moments of the Big Bang ('inflation') is fatally flawed. In short, Steinhardt says that the models that best fit the Planck data — known as 'plateau models' because their potential-energy profiles level off at relatively low energies — are far less likely to occur naturally than the models that Planck ruled out. Secondly, he says, the news for these plateau models gets dramatically worse when the results are analyzed in conjunction with the latest results about the Higgs field coming from CERN's Large Hadron Collider. Particle physicists working at the LHC have calculated that the Higgs field is likely to have started out in a high-energy, 'metastable' state rather than in a stable, low-energy configuration. Steinhardt likens the odds of the Higgs field initially being perched in the precarious metastable state as to those of dropping out of the sky over the Matterhorn and conveniently landing in a 'dimple near the top,' rather than crashing down to the mountain's base." -
Excel Error Contributes To Problems With Austerity Study
quarterbuck writes "Many politicians, especially in Europe, have used the idea that economic growth is impeded by debt levels above 90% of GDP to justify austerity measures. The academic justification came from a paper and a book by Kenneth Rogoff and Carmen Reinhart. Now researchers at U Mass at Amherst have refuted the study — they find that not only was the data tainted by bad statistics, it also had an Excel error. Apparently when averaging a few GDP numbers in an excel sheet, they did not drag down the cell ranges down properly, excluding Belgium. The supporting website for the book, 'This time it is different,' has lots of financial information if a reader might want to replicate some of the results." The Excel error is making the rounds as the cause of the problems with the study, but it's actually a minor component. The study also ignores some post-WWII data for countries that had a high debt load and high growth, and there's some fishy weighting going on: "The U.K. has 19 years (1946-1964) above 90 percent debt-to-GDP with an average 2.4 percent growth rate. New Zealand has one year in their sample above 90 percent debt-to-GDP with a growth rate of -7.6. These two numbers, 2.4 and -7.6 percent, are given equal weight in the final calculation, as they average the countries equally. Even though there are 19 times as many data points for the U.K." -
Why Did It Take So Long To Invent the Wheel?
Hugh Pickens writes "Wheels are the archetype of a primitive, caveman-level technology, and we tend to think that inventing the wheel was the number one item on man's to-do list after learning to walk upright. But LiveScience reports that it took until the bronze age (3500 BC), when humans were already casting metal alloys and constructing canals and sailboats, for someone to invent the wheel-and-axle, a task so challenging archaeologists say it probably happened only once, in one place. The tricky thing about the wheel isn't a cylinder rolling on its edge, but figuring out how to connect a stable, stationary platform to that cylinder. 'The stroke of brilliance was the wheel-and-axle concept,' says David Anthony, author of The Horse, the Wheel, and Language. To make a fixed axle with revolving wheels, the ends of the axle have to be nearly perfectly smooth and round, as did the holes in the center of the wheels. The axles have to fit snugly inside the wheels' holes, but not too snug, or there will be too much friction for the wheels to turn. But the real reason it took so long is that whoever invented the wheel would have needed metal tools to chisel fine-fitted holes and axles. 'It was the carpentry that probably delayed the invention until 3500 BC or so, because it was only after about 4000 BC that cast copper chisels and gouges became common in the Near East.'" -
Google: Best Adaptation of a Novel To a Patent?
theodp writes "The USPTO's Thursday publication of Google's patent application for Inferring User Interests was nicely-timed, coinciding with what ZDNet called Google's privacy policy doomsday. The inventors include Google Sr. Staff Research Scientist Shumeet Baluja, the author of The Silicon Jungle, a cautionary tale of data mining's promise and peril, which Google's Vint Cerf found 'credible and scary.' No doubt some will feel the same about Beluja's patent filing, which lays out plans for mining 'user generated content, such as user interests, user blogs, postings by the user on her or other users' profiles (e.g., comments in a commentary section of a web page), a user's selection of hosted audio, images, and other files, and demographic information about the user, such as age, gender, address, etc.'" -
Browser History Sniffing Is Back
An anonymous reader writes "Remember CSS history sniffing? The leak is plugged in all major browsers today, but there is some bad news: in a post to the Full Disclosure mailing list, security researchers have showcased a brand new tool to quickly extract your history by probing the cache, instead. The theory isn't new, but a convincing implementation is." -
Bug Forces Android Devices Off Princeton Campus Network
pmdubs writes "A major bug in the Android DHCP implementation has forced network administrators to (effectively) ban the use of such devices on the Princeton campus. In the last few months, Princeton has had to kick more than 400 Android devices off the campus network for using IP addresses well beyond the allotted DHCP lease (to the detriment of other users), sending invalid DHCPREQUEST messages after lease expiration, and a variety of other wacky behaviors. The link provides a clearly documented explanation of the buggy behavior, as does this largely neglected bug report. Without doubt, this buggy behavior is affecting other, less vigilant networks, and disrupting Wi-Fi traffic for Android and non-Android devices alike." -
Zoho Don't Need No Stinking Ph.D. Programmers
theodp writes "When it comes to tech academic credentials, Zoho CEO Sridhar Vembu has The Right Stuff: a Ph.D. in EE from Princeton. But Vembu has eschewed Google's Army-of-Ph.D.s approach to software development in favor of tapping into the ranks of high school grads who would not normally go to college for Zoho. Seeing his youngest brother succeed at programming without a college degree convinced Vembu that others could follow that example with the proper training and guidance. And studying the best employees in his own company led to another epiphany: 'What if the college degree itself is not really that useful?' thought Vembu. 'What if we took kids after high school, train them ourselves?'" -
High Fructose Corn Syrup Causes Bigger Weight Gain In Rats
krou writes "In an experiment conducted by a Princeton University team, 'Rats with access to high-fructose corn syrup gained significantly more weight than those with access to table sugar, even when their overall caloric intake was the same.' Long-term consumption also 'led to abnormal increases in body fat, especially in the abdomen, and a rise in circulating blood fats called triglycerides.' Psychology professor Bart Hoebel commented that 'When rats are drinking high-fructose corn syrup at levels well below those in soda pop, they're becoming obese — every single one, across the board. Even when rats are fed a high-fat diet, you don't see this; they don't all gain extra weight.'" -
Questionable "Best Effort" Copyright Enforcement
pmdubs writes "Princeton University Professor Michael Freedman, creator of CoralCDN, discusses how he received around 100 pre-settlement letters in one month from various copyright agencies after invalid BitTorrent tracker requests were issued through CoralCDN's proxies. Interestingly, the participating agencies made no effort whatsoever to verify that the Coral nodes were actually running BitTorrent, which they weren't! He questions just how much effort agencies take to reduce false positives when it comes to DMCA notices. Considering the credence that network operators give to such notices (they'll often cut your service upon receipt), it would seem that the answer is 'not enough.'" -
Princeton Boasts Its Kindle Project Is Noblest
theodp writes "Mirror, mirror, on the wall, what's the noblest Amazon Kindle DX project of all? While other universities announced similar programs, Princeton is boasting its project is unique in that it will focus on sustainability by reducing the amount of electronic-reserve course materials that students print. Under the pilot program, $60,000 will reportedly be used to provide 50 lucky Princeton students with $489 Kindle DX devices loaded with materials for three courses. In a FAQ, students are told not to worry about 'this time of severe economic constraints' — Princeton and Amazon have managed to tap into a fund specifically endowed to support sustainability projects to provide Kindles at no cost. In addition to a $30,000 grant from the High Meadows Foundation, which is headed by Princeton alum Carl Ferenbach (who, coincidentally, serves on the Board of Trustees of the Environmental Defense Fund with the wife of Amazon Director John Doerr), a matching amount will be provided by Princeton alum Jeff Bezos' Amazon. The E-reader Pilot Program has more information." -
Princeton Boasts Its Kindle Project Is Noblest
theodp writes "Mirror, mirror, on the wall, what's the noblest Amazon Kindle DX project of all? While other universities announced similar programs, Princeton is boasting its project is unique in that it will focus on sustainability by reducing the amount of electronic-reserve course materials that students print. Under the pilot program, $60,000 will reportedly be used to provide 50 lucky Princeton students with $489 Kindle DX devices loaded with materials for three courses. In a FAQ, students are told not to worry about 'this time of severe economic constraints' — Princeton and Amazon have managed to tap into a fund specifically endowed to support sustainability projects to provide Kindles at no cost. In addition to a $30,000 grant from the High Meadows Foundation, which is headed by Princeton alum Carl Ferenbach (who, coincidentally, serves on the Board of Trustees of the Environmental Defense Fund with the wife of Amazon Director John Doerr), a matching amount will be provided by Princeton alum Jeff Bezos' Amazon. The E-reader Pilot Program has more information." -
Princeton Boasts Its Kindle Project Is Noblest
theodp writes "Mirror, mirror, on the wall, what's the noblest Amazon Kindle DX project of all? While other universities announced similar programs, Princeton is boasting its project is unique in that it will focus on sustainability by reducing the amount of electronic-reserve course materials that students print. Under the pilot program, $60,000 will reportedly be used to provide 50 lucky Princeton students with $489 Kindle DX devices loaded with materials for three courses. In a FAQ, students are told not to worry about 'this time of severe economic constraints' — Princeton and Amazon have managed to tap into a fund specifically endowed to support sustainability projects to provide Kindles at no cost. In addition to a $30,000 grant from the High Meadows Foundation, which is headed by Princeton alum Carl Ferenbach (who, coincidentally, serves on the Board of Trustees of the Environmental Defense Fund with the wife of Amazon Director John Doerr), a matching amount will be provided by Princeton alum Jeff Bezos' Amazon. The E-reader Pilot Program has more information." -
If We Have Free Will, Then So Do Electrons
snahgle writes "Mathematicians John Conway (inventor of the Game of Life) and Simon Kochen of Princeton University have proven that if human experimenters demonstrate 'free will' in choosing what measurements to take on a particle, then the axioms of quantum mechanics require that the free will property be available to the particles measured, or to the universe as a whole. Conway is giving a series of lectures on the 'Free Will Theorem' and its ramifications over the next month at Princeton. A followup article strengthening the theory (PDF) was published last month in Notices of the AMS." Update: 03/19 14:20 GMT by KD : jamie points out that we discussed this theorem last year, before the paper had been published. -
Cheap Scanners Can "Fingerprint" Paper
carusoj writes "Researchers at Princeton University and University College London say they can identify unique information, essentially like a fingerprint, from any blank sheet of paper using any reasonably good scanner. The technique could be used to crack down on counterfeiting or even keep track of confidential documents. The researchers' paper on the finding is set to be presented at an IEEE security conference in Oakland, Calif., in May." Update: 03/10 22:43 GMT by T : J. Alex Halderman, Associate Professor of Electrical Engineering and Computer Science at the University of Michigan and one of the authors of the study, writes with more: "My group has just put up a site about the work and a copy of the full paper, and we will probably add a video later tonight." -
Censorship By Glut
Frequent Slashdot contributor Bennett Haselton writes "A 2006 paper by Matthew Salganik, Peter Dodds and Duncan Watts, about the patterns that users follow in choosing and recommending songs to each other on a music download site, may be the key to understanding the most effective form of "censorship" that still exists in mostly-free countries like the US It also explains why your great ideas haven't made you famous, while lower-wattage bulbs always seem to find a platform to spout off their ideas (and you can keep your smart remarks to yourself)." Read on for the rest of Bennett's take on why the effects of peer ratings on a music download site go a long way towards explaining how good ideas can effectively be "censored" even in a country with no formal political censorship.
In a country where you're free to say almost anything in the political arena, I think the only real censorship of good ideas is what you could call "censorship by glut". If you had a brilliant, absolutely airtight argument that we should do something -- indict President Bush (or Barack Obama), or send foreign investment to Chechnya, or let kids vote -- but you weren't an established writer or well-known blogger, how much of a chance do you think your argument would have against the glut of Web rants and other pieces of writing out there? Especially if your argument required people to read it and think about it for at least an hour? Perhaps your situation could be compared to that of a brilliantly talented band submitting a song for Matthew Salganik's experiment.
What Salganik and his co-authors did was recruit users through advertisements on Bolt.com (skewing toward a teen demographic) to sign up for a free music download site. Users would be able to listen to full-length songs and then decide whether or not to download the song for free. Some users were randomly divided into eight artificial "worlds" in which, while a user was listening to a song, they could see the number of times that the song had been downloaded by other users in the same world -- but only by other users within their own world, not counting the downloads by users in other worlds. The test was to see whether certain songs could become popular in some worlds while languishing in others, despite the fact that all groups consisted of randomly assigned populations that all had equal access to the same songs. The experiment also attempted to measure the "merit" of individual songs by assigning some users to an "independent" group, where they could listen to songs and choose whether to download them, but without seeing the number of times the song had been downloaded by anyone else; the merit of the song was defined as the number of times that users in the independent group decided to download the song after listening to it. Experimenters looked at whether the merit of the song had any effect on the popularity levels it achieved in the eight other "worlds".
The authors summed it up: "In general, the 'best' songs never do very badly, and the 'worst' songs never do extremely well, but almost any other result is possible." They also noted that in the "social influence" worlds where users could see each others' downloads, increasing download numbers had a snowball effect that widened the difference between the successful songs and the unsuccessful: "We found that all eight social influence worlds exhibit greater inequality -- meaning popular songs are more popular and unpopular songs are less popular -- than the world in which individuals make decisions independently." Figures 3(A) and 3(C) in the paper show that the relationship between a song's merit and its success in any given world -- while not completely random -- is tenuous. And if you're a talented musician and you want to get really depressed about your prospects of hitting the big time, Figures 3(B) and 3(D) show the relationship between a song's measured merit and its actual number of sales in the real world. (Although those graphs may cheer you up if you're a struggling musician who hasn't made it big yet -- maybe it's not you, it's just the roll of the dice.)
As the Richard Thaler and Cass Sunstein put it in their all-around fascinating book Nudge , where I first read about the Salganik study:In many domains people are tempted to think, after the fact, that an outcome was entirely predictable, and that the success of a musician, an actor, an author, or a politician was inevitable in light of his or her skills and characteristics. Beware of that temptation. Small interventions and even coincidences, at a key stage, can produce large variations in the outcome. Today's hot singer is probably indistinguishable from dozens and even hundreds of equally talented performers whose names you've never heard. We can go further. Most of today's governors are hard to distinguish from dozens or even hundreds of politicians whose candidacies badly fizzled.
Is the blogosphere, or the "marketplace of ideas" in general, any different? If a random sample of bloggers were rated based on some independent measure of merit -- for example, independent ratings from a random sampling of blog readers, who were looking at the bloggers' writing samples for the first time, analogous to users in Salganik's "independent" world -- and then correlate that with the bloggers' traffic or some other measure of success, it's not hard to imagine the results would be similar to those of the 8-worlds experiment: the best often rise to the top, the very worst rarely do, but success in the vast middle would be close to random. In fact, while music listeners would have no logical reason to like a song just because others did, users in the blogosphere and other public forums would have several rational reasons to cluster around writers who are already popular: (1) errors are more likely to have been spotted and pointed out by someone else; (2) as an extension of that, others are more likely to have provided comments and other value-added content; (3) if you are the first person to spot an error, it's more important on a popular blog to point out the error and stop the misinformation from spreading, than on a minor blog that nobody has ever heard of. So the "snowball effect" of popularity in the blogosphere would be even more pronounced.
Then why do so many people believe in what Thaler and Sunstein call the "inevitability" of success based on merit, in domains like music, politics, and writing? I think it's because the belief is what scientists call an unfalsifiable one -- if the "best" acts are assumed to be the ones that end up on the top of the pile, then the marketplace has always sorted the "best" content to the top, by definition. Since the definition is circular, the premise could never be disproved by any amount of counter-evidence -- even if an act that used to be popular, suddenly falls under the radar, that could be seen as "proof" that they lost whatever magic touch they used to have, not as evidence of the arbitrariness of the market! The only disproof would be an artificial experiment like Salganik's, showing that once you get beyond a certain threshold of quality, commercial success has little relationship to independently measured merit -- but such experiments, which in Salganik's case required the cooperation of over 14,000 users, don't come along very often. And as long as most people don't realize how arbitrary the existing marketplaces are, there isn't enough demand to justify building a system that could work better -- indeed, to even justify asking the question of whether a system could be designed that would work better.
And that, I think, is how "censorship by glut" really works. It's not just the sheer amount of written content that censors small voices -- if you happen to know about a particular writer that you consider a fount of wisdom, then the existence of a billion other Web pages won't stop you from reading that writer's content. And it's not as if there aren't plenty of people who realize that success can be highly arbitrary. The problem is that as long as most people assume that the existing marketplace of ideas does a good job of sorting the best content to the top, then they'll be more inclined to stay with the most popular news sites and blogs, and even the minority who know that it's largely a lottery, will have no effective way of finding the best content among everything else, so they'll end up sticking with the most popular sites as well. Worse, as a secondary effect, most people with something useful to contribute won't even bother, if they don't already have a large built-in audience. I know plenty of people who could write insightful essays about social and technological issues, essays that would give most readers a new perspective such that they would definitely say afterwards: "That was worth my time to read it." But it wouldn't be worth it to the writers, because they know that their content isn't going to get magically sorted into its deserved place in the hierarchy.
(My own favorite blog that nobody's ever heard of is Seth Finkelstein's InfoThought, which is usually logical and insightful and is only about 25% of the time about how "nobody ever reads this blog, so what's the point". His Guardian columns are also good and usually don't have that subtext, perhaps because it's considered impolite to use a newspaper's column-inches (column-centimeters?) to complain that you have no voice.)
So can this problem be avoided, or is inequality and arbitrariness just a permanent part of the marketplace for content and ideas? You could create an artificial world that would sort user-submitted content according to some other algorithm -- and even if it didn't give good writers the fame that they theoretically deserved in the larger world, it might still provide them with enough of an audience within the artificial universe, to make it worth their time to keep writing. One option would be to use Salganik's "independence" world model, where users would read content without being able to see the ratings that other people had given to it, or without even seeing recommendations from similarly-minded friends within the system. The trouble is that without any information about what other readers liked, without any starting point to sort good content from bad content, it may not be worth the reader's time to read through all the dreck to find the occasional buried treasure. I believe about as strongly as a person can believe, that the existing marketplace for content is far from meritocratic, for example that there are probably thousands of songs on iTunes that I've never heard of but would nonetheless love -- but even I don't spend time listening to the 30-second clips of random songs on iTunes, because it takes too long to find the stuff I would like.
But I submit there is a solution -- a variant of an argument that I've suggested for stopping cheating on Digg, or building Wikia search into a meritocratic search engine, or helping the best writers rise to the top on Google Knol. The solution is sorting based on ratings from a random sample of users. The remainder of this speculation will be very theoretical, and will at times seem like a Rube-Goldberg approach to what should be a simple problem. But at each juncture, the complications to the algorithm are motivated by an argument that anything simpler would not work. At many points along the way, it will be tempting to throw up one's hands and say, "Why go to all this trouble, the existing system works well enough." But this statement is hard to quantify with any actual evidence -- unless you're just using the circular definition above, that whatever rises to the top is automatically the "best".
For music listeners, the gist of the algorithm is: When an artist submits a new song in the alt-rock category for example, the song is distributed to a random sample of 20 users who have indicated an interest in that genre. If the average rating from those users is high enough, the song gets recommended to all of the site's users who are interested in alt-rock. If the average rating is not high enough, then the artist receives a notification, perhaps with a list of comments from the listeners suggesting what to improve. As long as the initial random sample of users is large enough that the average rating is indicative of what the rest of the site's alt-rock fans would think, the good content will get to be enjoyed by all of the site's alt-rock customers, while the bad content would fizzle after only wasting the time of 20 people. If it turns out that a random selection of 20 users are typically too lazy to rate the songs that are submitted to them, you could even make artists submit $10 to have their songs rated by the focus group, and pay each of the 20 raters $0.50 each for their trouble. Artists can't withhold payment as revenge for a bad rating, so the average ratings should still be proportional to the song's actual quality.
At this point, you might object that this system suffers from the same unfalsifiable, circular reasoning as the belief that the marketplace rewards the "best" content, if the best content is the content that wins in the marketplace. If I define the "best" content to be the content that gets the highest average score in a random focus group, then of course this algorithm sorts the best content to the top, because that's how "best" was defined! But this system does actually have a non-trivial property: If you implement the system in multiple separate "worlds" (similar to those that Salganik created), then provided your focus groups are large enough to provide representative random samples, the same content should rise to the top in each of the worlds, unlike the results in Salganik's experiment.
This actually wouldn't be the case if the initial focus groups were not big enough -- then random variations in a few voters' opinions could cause many songs to succeed in one world and fail in another. So it's a non-trivial property that is not automatically true, and would not be true if you made an error in designing the system, like making the focus groups too small. But the larger the size of the random sample, the smaller the variance in the expected value of the average of their ratings, and the greater agreement you would expect between the results from different worlds.
As Salganik pointed out to me, this system does under-reward songs that might require repeated listenings over time to gain an appreciation of their qualities. But even this, strictly speaking, can be modeled in exchange for cash -- I'll pay 20 users $2 each if they listen to my song once today, once in three days, and once again a week after that (the site could stream the song to them to provide at least some likelihood that the users weren't cheating). This assumes some things, such as that repeated exposure has the same growing-on-you effect even if the exposure is forced -- but in the real world, songs often grow on you from repeated listenings that are "forced" anyway, if they're played in the doctor's office or on the radio when you don't bother to change the channel. And this might be more complicated than necessary -- often when a song grows on you, it at least interests you enough the first time you hear it, that you'd give it a positive rating on the first listen, which is all that the site requires for the song's success.
However, if you try to adapt this trick to a meritocracy for written content, you run into different problems. With a song, if you poll a random sample of users, the odds are very small that anyone being polled will be a vested interest in the success of the song, like one of the band members or one of the song's producers (assuming the population of users is large enough, and the song's producers have not been able to create a huge number of "sockpuppet" accounts to manipulate the voting). So you can assume the ratings will be free of any prior bias. But with a political post, for example, if you write a pro-Bush or anti-Bush essay, it's quite likely that among a random sample of users, there will be people who are biased to vote up (or vote down) any post that has anything good to say about the President. The essays voted to the top may not be the best-written ones, but simply the ones that pander to the most popularly held opinions.
But if the "best" essays are not the ones that receive the highest percentage of positive votes, even when polling a random sample of independent users -- which I was advocating as the gold standard for measuring merit -- then how do you define what makes the "best" essays, anyway? There are many possible answers, but I suggest: A necessary condition for being among the "best" essays would be to convince the most people of something that they didn't believe before, without resorting to tricks such as blatantly fabricating statistics or attributing made-up quotes. This is not a sufficient condition for merit -- maybe the point of view that you're convincing people of, is still wrong -- but I submit that if you're not at least changing some people's minds, then there's no point. An essay that changes a lot of people's minds in a random focus group, is usually worth reading, if only to see why it has that effect.
Unfortunately, this doesn't suggest a better way to poll users about the merit of an essay, because if you ask users, "Were you a Bush supporter before reading this essay?" and "Were you a Bush supporter afterwards?", Bush supporters are eventually going to figure out that the way to give the essay a high score on the mind-changing scale, would be to (falsely) say that they were not a Bush supporter before reading the essay, but they were one afterwards. So you'd still end up rewarding the essays that reinforce pre-existing opinions instead of the ones that change people's minds.
From here the counter-measures and counter-counter-measures get increasingly complicated. For each category of essays that a user wants to rate, such as Bush opinion pieces, you could require new users to enter their current opinion: either pro-Bush or anti-Bush. Then if they were asked to rate a pro-Bush essay, they would only be able to vote that the essay "changed their minds" by switching their registered opinion from "anti-Bush" to "pro-Bush". But Bush supporters could sign up initially as anti-Bush, just in the hopes of being part of a random focus group so they could cast their mind-changing vote for a Bush essay by changing their registration to "pro-Bush"! However, each user would only be able to do that once -- or do you allow users, after they've switched from anti-Bush to pro-Bush, to "reload" by spontaneously switching back to anti-Bush for no reason at all, so they're all set to cast a mind-changing vote for the next pro-Bush essay? Or would they only be allowed to switch back to anti-Bush, by casting a mind-changing vote as part of a random focus group for an anti-Bush essay -- thus giving a boost to an anti-Bush screed, as part of the price they pay for the next vote they cast for a pro-Bush piece? Then users could still game the system, by switching to "anti-Bush" when casting a vote for a very poorly written anti-Bush essay that they don't think anybody else will vote for anyway, and then switching back to "pro-Bush" only for the good essays that have a shot, hoping that their votes will coalesce around the decently-written pro-Bush essays and push them to the front page...
Am I over-thinking this? I submit this is an area where there's been too much under-thinking. Haven't we all been tempted to believe that the marketplace of ideas -- not to mention bands, blog posts, and business ventures -- efficiently sorts content to the place in the hierarchy of rewards that it deserves, without having any real evidence for this, except the circular definition of "quality" as being proportional to success? And the more people believe this, the more that marginalized voices will effectively be censored, even when they have something brilliant to contribute. We should at least think about ways that we could do better. Or else, prove logically that it can't be done (a logical proof can only approximate the real world, but it could show that such a pure meritocracy would be very improbable, or wouldn't work well). However I think the ideas above make it seem unlikely that a meritocracy is logically impossible. Maybe they're a step in the right direction. Maybe someone else's ideas would be better. The important thing is that a meritocratic algorithm be judged by something other than a circular definition, which simply decrees by fiat that the winning content is the best. -
Censorship By Glut
Frequent Slashdot contributor Bennett Haselton writes "A 2006 paper by Matthew Salganik, Peter Dodds and Duncan Watts, about the patterns that users follow in choosing and recommending songs to each other on a music download site, may be the key to understanding the most effective form of "censorship" that still exists in mostly-free countries like the US It also explains why your great ideas haven't made you famous, while lower-wattage bulbs always seem to find a platform to spout off their ideas (and you can keep your smart remarks to yourself)." Read on for the rest of Bennett's take on why the effects of peer ratings on a music download site go a long way towards explaining how good ideas can effectively be "censored" even in a country with no formal political censorship.
In a country where you're free to say almost anything in the political arena, I think the only real censorship of good ideas is what you could call "censorship by glut". If you had a brilliant, absolutely airtight argument that we should do something -- indict President Bush (or Barack Obama), or send foreign investment to Chechnya, or let kids vote -- but you weren't an established writer or well-known blogger, how much of a chance do you think your argument would have against the glut of Web rants and other pieces of writing out there? Especially if your argument required people to read it and think about it for at least an hour? Perhaps your situation could be compared to that of a brilliantly talented band submitting a song for Matthew Salganik's experiment.
What Salganik and his co-authors did was recruit users through advertisements on Bolt.com (skewing toward a teen demographic) to sign up for a free music download site. Users would be able to listen to full-length songs and then decide whether or not to download the song for free. Some users were randomly divided into eight artificial "worlds" in which, while a user was listening to a song, they could see the number of times that the song had been downloaded by other users in the same world -- but only by other users within their own world, not counting the downloads by users in other worlds. The test was to see whether certain songs could become popular in some worlds while languishing in others, despite the fact that all groups consisted of randomly assigned populations that all had equal access to the same songs. The experiment also attempted to measure the "merit" of individual songs by assigning some users to an "independent" group, where they could listen to songs and choose whether to download them, but without seeing the number of times the song had been downloaded by anyone else; the merit of the song was defined as the number of times that users in the independent group decided to download the song after listening to it. Experimenters looked at whether the merit of the song had any effect on the popularity levels it achieved in the eight other "worlds".
The authors summed it up: "In general, the 'best' songs never do very badly, and the 'worst' songs never do extremely well, but almost any other result is possible." They also noted that in the "social influence" worlds where users could see each others' downloads, increasing download numbers had a snowball effect that widened the difference between the successful songs and the unsuccessful: "We found that all eight social influence worlds exhibit greater inequality -- meaning popular songs are more popular and unpopular songs are less popular -- than the world in which individuals make decisions independently." Figures 3(A) and 3(C) in the paper show that the relationship between a song's merit and its success in any given world -- while not completely random -- is tenuous. And if you're a talented musician and you want to get really depressed about your prospects of hitting the big time, Figures 3(B) and 3(D) show the relationship between a song's measured merit and its actual number of sales in the real world. (Although those graphs may cheer you up if you're a struggling musician who hasn't made it big yet -- maybe it's not you, it's just the roll of the dice.)
As the Richard Thaler and Cass Sunstein put it in their all-around fascinating book Nudge , where I first read about the Salganik study:In many domains people are tempted to think, after the fact, that an outcome was entirely predictable, and that the success of a musician, an actor, an author, or a politician was inevitable in light of his or her skills and characteristics. Beware of that temptation. Small interventions and even coincidences, at a key stage, can produce large variations in the outcome. Today's hot singer is probably indistinguishable from dozens and even hundreds of equally talented performers whose names you've never heard. We can go further. Most of today's governors are hard to distinguish from dozens or even hundreds of politicians whose candidacies badly fizzled.
Is the blogosphere, or the "marketplace of ideas" in general, any different? If a random sample of bloggers were rated based on some independent measure of merit -- for example, independent ratings from a random sampling of blog readers, who were looking at the bloggers' writing samples for the first time, analogous to users in Salganik's "independent" world -- and then correlate that with the bloggers' traffic or some other measure of success, it's not hard to imagine the results would be similar to those of the 8-worlds experiment: the best often rise to the top, the very worst rarely do, but success in the vast middle would be close to random. In fact, while music listeners would have no logical reason to like a song just because others did, users in the blogosphere and other public forums would have several rational reasons to cluster around writers who are already popular: (1) errors are more likely to have been spotted and pointed out by someone else; (2) as an extension of that, others are more likely to have provided comments and other value-added content; (3) if you are the first person to spot an error, it's more important on a popular blog to point out the error and stop the misinformation from spreading, than on a minor blog that nobody has ever heard of. So the "snowball effect" of popularity in the blogosphere would be even more pronounced.
Then why do so many people believe in what Thaler and Sunstein call the "inevitability" of success based on merit, in domains like music, politics, and writing? I think it's because the belief is what scientists call an unfalsifiable one -- if the "best" acts are assumed to be the ones that end up on the top of the pile, then the marketplace has always sorted the "best" content to the top, by definition. Since the definition is circular, the premise could never be disproved by any amount of counter-evidence -- even if an act that used to be popular, suddenly falls under the radar, that could be seen as "proof" that they lost whatever magic touch they used to have, not as evidence of the arbitrariness of the market! The only disproof would be an artificial experiment like Salganik's, showing that once you get beyond a certain threshold of quality, commercial success has little relationship to independently measured merit -- but such experiments, which in Salganik's case required the cooperation of over 14,000 users, don't come along very often. And as long as most people don't realize how arbitrary the existing marketplaces are, there isn't enough demand to justify building a system that could work better -- indeed, to even justify asking the question of whether a system could be designed that would work better.
And that, I think, is how "censorship by glut" really works. It's not just the sheer amount of written content that censors small voices -- if you happen to know about a particular writer that you consider a fount of wisdom, then the existence of a billion other Web pages won't stop you from reading that writer's content. And it's not as if there aren't plenty of people who realize that success can be highly arbitrary. The problem is that as long as most people assume that the existing marketplace of ideas does a good job of sorting the best content to the top, then they'll be more inclined to stay with the most popular news sites and blogs, and even the minority who know that it's largely a lottery, will have no effective way of finding the best content among everything else, so they'll end up sticking with the most popular sites as well. Worse, as a secondary effect, most people with something useful to contribute won't even bother, if they don't already have a large built-in audience. I know plenty of people who could write insightful essays about social and technological issues, essays that would give most readers a new perspective such that they would definitely say afterwards: "That was worth my time to read it." But it wouldn't be worth it to the writers, because they know that their content isn't going to get magically sorted into its deserved place in the hierarchy.
(My own favorite blog that nobody's ever heard of is Seth Finkelstein's InfoThought, which is usually logical and insightful and is only about 25% of the time about how "nobody ever reads this blog, so what's the point". His Guardian columns are also good and usually don't have that subtext, perhaps because it's considered impolite to use a newspaper's column-inches (column-centimeters?) to complain that you have no voice.)
So can this problem be avoided, or is inequality and arbitrariness just a permanent part of the marketplace for content and ideas? You could create an artificial world that would sort user-submitted content according to some other algorithm -- and even if it didn't give good writers the fame that they theoretically deserved in the larger world, it might still provide them with enough of an audience within the artificial universe, to make it worth their time to keep writing. One option would be to use Salganik's "independence" world model, where users would read content without being able to see the ratings that other people had given to it, or without even seeing recommendations from similarly-minded friends within the system. The trouble is that without any information about what other readers liked, without any starting point to sort good content from bad content, it may not be worth the reader's time to read through all the dreck to find the occasional buried treasure. I believe about as strongly as a person can believe, that the existing marketplace for content is far from meritocratic, for example that there are probably thousands of songs on iTunes that I've never heard of but would nonetheless love -- but even I don't spend time listening to the 30-second clips of random songs on iTunes, because it takes too long to find the stuff I would like.
But I submit there is a solution -- a variant of an argument that I've suggested for stopping cheating on Digg, or building Wikia search into a meritocratic search engine, or helping the best writers rise to the top on Google Knol. The solution is sorting based on ratings from a random sample of users. The remainder of this speculation will be very theoretical, and will at times seem like a Rube-Goldberg approach to what should be a simple problem. But at each juncture, the complications to the algorithm are motivated by an argument that anything simpler would not work. At many points along the way, it will be tempting to throw up one's hands and say, "Why go to all this trouble, the existing system works well enough." But this statement is hard to quantify with any actual evidence -- unless you're just using the circular definition above, that whatever rises to the top is automatically the "best".
For music listeners, the gist of the algorithm is: When an artist submits a new song in the alt-rock category for example, the song is distributed to a random sample of 20 users who have indicated an interest in that genre. If the average rating from those users is high enough, the song gets recommended to all of the site's users who are interested in alt-rock. If the average rating is not high enough, then the artist receives a notification, perhaps with a list of comments from the listeners suggesting what to improve. As long as the initial random sample of users is large enough that the average rating is indicative of what the rest of the site's alt-rock fans would think, the good content will get to be enjoyed by all of the site's alt-rock customers, while the bad content would fizzle after only wasting the time of 20 people. If it turns out that a random selection of 20 users are typically too lazy to rate the songs that are submitted to them, you could even make artists submit $10 to have their songs rated by the focus group, and pay each of the 20 raters $0.50 each for their trouble. Artists can't withhold payment as revenge for a bad rating, so the average ratings should still be proportional to the song's actual quality.
At this point, you might object that this system suffers from the same unfalsifiable, circular reasoning as the belief that the marketplace rewards the "best" content, if the best content is the content that wins in the marketplace. If I define the "best" content to be the content that gets the highest average score in a random focus group, then of course this algorithm sorts the best content to the top, because that's how "best" was defined! But this system does actually have a non-trivial property: If you implement the system in multiple separate "worlds" (similar to those that Salganik created), then provided your focus groups are large enough to provide representative random samples, the same content should rise to the top in each of the worlds, unlike the results in Salganik's experiment.
This actually wouldn't be the case if the initial focus groups were not big enough -- then random variations in a few voters' opinions could cause many songs to succeed in one world and fail in another. So it's a non-trivial property that is not automatically true, and would not be true if you made an error in designing the system, like making the focus groups too small. But the larger the size of the random sample, the smaller the variance in the expected value of the average of their ratings, and the greater agreement you would expect between the results from different worlds.
As Salganik pointed out to me, this system does under-reward songs that might require repeated listenings over time to gain an appreciation of their qualities. But even this, strictly speaking, can be modeled in exchange for cash -- I'll pay 20 users $2 each if they listen to my song once today, once in three days, and once again a week after that (the site could stream the song to them to provide at least some likelihood that the users weren't cheating). This assumes some things, such as that repeated exposure has the same growing-on-you effect even if the exposure is forced -- but in the real world, songs often grow on you from repeated listenings that are "forced" anyway, if they're played in the doctor's office or on the radio when you don't bother to change the channel. And this might be more complicated than necessary -- often when a song grows on you, it at least interests you enough the first time you hear it, that you'd give it a positive rating on the first listen, which is all that the site requires for the song's success.
However, if you try to adapt this trick to a meritocracy for written content, you run into different problems. With a song, if you poll a random sample of users, the odds are very small that anyone being polled will be a vested interest in the success of the song, like one of the band members or one of the song's producers (assuming the population of users is large enough, and the song's producers have not been able to create a huge number of "sockpuppet" accounts to manipulate the voting). So you can assume the ratings will be free of any prior bias. But with a political post, for example, if you write a pro-Bush or anti-Bush essay, it's quite likely that among a random sample of users, there will be people who are biased to vote up (or vote down) any post that has anything good to say about the President. The essays voted to the top may not be the best-written ones, but simply the ones that pander to the most popularly held opinions.
But if the "best" essays are not the ones that receive the highest percentage of positive votes, even when polling a random sample of independent users -- which I was advocating as the gold standard for measuring merit -- then how do you define what makes the "best" essays, anyway? There are many possible answers, but I suggest: A necessary condition for being among the "best" essays would be to convince the most people of something that they didn't believe before, without resorting to tricks such as blatantly fabricating statistics or attributing made-up quotes. This is not a sufficient condition for merit -- maybe the point of view that you're convincing people of, is still wrong -- but I submit that if you're not at least changing some people's minds, then there's no point. An essay that changes a lot of people's minds in a random focus group, is usually worth reading, if only to see why it has that effect.
Unfortunately, this doesn't suggest a better way to poll users about the merit of an essay, because if you ask users, "Were you a Bush supporter before reading this essay?" and "Were you a Bush supporter afterwards?", Bush supporters are eventually going to figure out that the way to give the essay a high score on the mind-changing scale, would be to (falsely) say that they were not a Bush supporter before reading the essay, but they were one afterwards. So you'd still end up rewarding the essays that reinforce pre-existing opinions instead of the ones that change people's minds.
From here the counter-measures and counter-counter-measures get increasingly complicated. For each category of essays that a user wants to rate, such as Bush opinion pieces, you could require new users to enter their current opinion: either pro-Bush or anti-Bush. Then if they were asked to rate a pro-Bush essay, they would only be able to vote that the essay "changed their minds" by switching their registered opinion from "anti-Bush" to "pro-Bush". But Bush supporters could sign up initially as anti-Bush, just in the hopes of being part of a random focus group so they could cast their mind-changing vote for a Bush essay by changing their registration to "pro-Bush"! However, each user would only be able to do that once -- or do you allow users, after they've switched from anti-Bush to pro-Bush, to "reload" by spontaneously switching back to anti-Bush for no reason at all, so they're all set to cast a mind-changing vote for the next pro-Bush essay? Or would they only be allowed to switch back to anti-Bush, by casting a mind-changing vote as part of a random focus group for an anti-Bush essay -- thus giving a boost to an anti-Bush screed, as part of the price they pay for the next vote they cast for a pro-Bush piece? Then users could still game the system, by switching to "anti-Bush" when casting a vote for a very poorly written anti-Bush essay that they don't think anybody else will vote for anyway, and then switching back to "pro-Bush" only for the good essays that have a shot, hoping that their votes will coalesce around the decently-written pro-Bush essays and push them to the front page...
Am I over-thinking this? I submit this is an area where there's been too much under-thinking. Haven't we all been tempted to believe that the marketplace of ideas -- not to mention bands, blog posts, and business ventures -- efficiently sorts content to the place in the hierarchy of rewards that it deserves, without having any real evidence for this, except the circular definition of "quality" as being proportional to success? And the more people believe this, the more that marginalized voices will effectively be censored, even when they have something brilliant to contribute. We should at least think about ways that we could do better. Or else, prove logically that it can't be done (a logical proof can only approximate the real world, but it could show that such a pure meritocracy would be very improbable, or wouldn't work well). However I think the ideas above make it seem unlikely that a meritocracy is logically impossible. Maybe they're a step in the right direction. Maybe someone else's ideas would be better. The important thing is that a meritocratic algorithm be judged by something other than a circular definition, which simply decrees by fiat that the winning content is the best. -
Scientists Discover Proteins Controlling Evolution
Khemisty writes "Evolutionary changes are supposed to take place gradually and randomly, under pressure from natural selection. But a team of Princeton scientists investigating a group of proteins that help cells burn energy stumbled across evidence that this is not how evolution works. In fact, their discovery could revolutionize the way we understand evolutionary processes. They have evidence that organisms actually have the ability to control their own evolution." -
Damning Report On Sequoia E-Voting Machine Security
TechDirt notes the publication of the New Jersey voting machine study, the attempted suppression of which we have been discussing for a while now. The paper that the Princeton and Lehigh University researchers are releasing, as permitted by the Court, is "the same as the Court's redacted version, but with a few introductory paragraphs about the court case, Gusciora v. Corzine." What's new is the release of a 90-minute evidentiary video — the researchers have asked the court for permission to release a shorter version that hits the high points, as the high-res video is about 1 GB in size. See TechDirt's article for the report's executive summary listing eight ways the AVC Advantage 9.00 voting machine can be subverted. -
Judge Suppresses Report On Voting Systems
Irvu writes "A New Jersey Superior Court Judge has prohibited the release of an analysis conducted on the Sequoia AVC Advantage voting system. This report arose out of a lawsuit challenging on constitutional grounds the use of these systems. The study was conducted by Andrew Appel on behalf of the plaintiffs, after the judge in the case ordered the company to permit it. That same judge has now withheld it indefinitely from the public record on a verbal order." -
Turn an iPhone Into a Pocket Theremin
Earyauteur writes "The Unofficial Apple Weblog (TUAW) is running a story on an interesting motion-controlled iPhone application which uses the iPhone's 3-axis accelerometer to control a digital synthesizer. The musical instrument is played much like a theremin with the added ability to perform music using different musical scales. TUAW also links to a YouTube video which shows a performer demonstrating the iPhone instrument." -
Ivy League Computer Science Curricula Exposed
Doug Treadwell writes "Many people have wondered what the difference is between the Computer Science education given in the average public university versus one given in an Ivy League university (or a top level public university). There have also been discussions here on Slashdot about whether any Computer Science curriculum gives students the knowledge they need for the working world. As a computer science student both questions are very important to me, so I decided to answer them for myself and build a website to share what I found. I was able to find the required reading for hundreds of courses at Stanford, Princeton, Carnegie Mellon, and Berkeley; along with some other institutions. This should also help answer some of those 'What should I read?' questions." -
Cold Boot Attack Utilities Released At HOPE Conference
An anonymous reader writes "Jacob Appelbaum, one of the security researchers who worked on the cold boot attacks to recover encryption keys from memory even after reboot, has announced the release of the complete source code for the utilities at The Last HOPE in New York City. The hope (obligatory pun) is that the release of these tools will help to improve awareness of this attack vector and enable the development of countermeasures and mitigation techniques in both software and hardware. The full research paper (PDF) is also available."