Slashdot Mirror


Tracking the Congressional Attention Span

Turismo writes "Ars Technica covers a new research project that uses computers to look at 70 million words from the Congressional Record. The project's goal was to track what our representatives were talking about at any given time, and researchers were able to do it without human training or intervention. From the article: '...researchers found, for instance, that "judicial nominations" have consumed steadily more Congressional attention between 1997 and 2004. In fact, the topic produced the most number of words published in a single "day" of the Congressional Record: 230,000 on November 12, 2003.' It looks like automated topic analysis has truly arrived."

12 of 89 comments (clear)

  1. Or Maybe Not.. by vjmurphy · · Score: 5, Funny

    "It looks like automated topic analysis has truly arrived."

    Not according to my in-depth research. Looks like "automated topic analysis" isn't arriving at all.

    http://www.google.com/search?hl=en&q=%22automated+ topic+analysis%22&btnG=Google+Search

    --
    Vincent J. Murphy
    Spandex Justice
  2. I'm not sure this is the best metric... by PixelPirate · · Score: 5, Insightful

    Think about it: "Who thinks we should elect Joe Six-Pack"
    Lots of talk, chit-chat, chatter, etc...

    "Okay, now who would want to oppose the True American, Patriot, Love, Peace Act*"
    Cricket! Cricket!
    *And of course this Act happens to have about thirty-thousand ridders attached to it...

  3. Opposite Side by BigNumber · · Score: 4, Funny

    So what scored the lowest? Individual freedoms? Constitutional Rights? Fair use?

  4. TheyWorkForYou.com by Bogtha · · Score: 4, Informative

    Even with a large team of grad students at their disposal, researchers find it difficult to tag more than a small subset of the speeches in question

    Are there really that many speeches? TheyWorkForYou.com offer a similar service for the UK's Houses of Parliament, except it's done manually, and there's only a dozen volunteers working on it.

    --
    Bogtha Bogtha Bogtha
  5. Tracking the Congressional Attention Span by DrLang21 · · Score: 5, Funny

    The conclusion. Congress has ADD, just like me.

    --
    I see the glass as full with a FoS of 2.
  6. They're asking the wrong question by MarkusQ · · Score: 4, Funny

    Great. Now we know what congress has been talking about.

    Big deal.

    Wake me up when you can tell me what in the hell they were thinking.

    --MarkusQ

    P.S. Other than how to make sure that they--and Joe Lieberman--get re-elected I mean.

  7. Re:Pro-Gress vs Con-Gress by AltGrendel · · Score: 4, Interesting
    I would think that both "follow the money" and this type of record analysis would be the best thing. Think of it as the money as the input and the speeches as the output.

    Correlate the two and you'd really have something.

    No, not that. What I meant was who outside of Congress is trying to push buttons, and who inside Congress is helping them. Also, you'd be able to watch for what you may consider important topics to see how they are dealt with.

    --
    The simple truth is that interstellar distances will not fit into the human imagination

    - Douglas Adams

  8. Process Process Process by LaughingCoder · · Score: 4, Interesting

    That disease that has so infected business - talking about process (how) rather than products (what) - is readily apparent in Congress as well. I added up the percentages of the "Procedural [HouseKeeping]" categories (egads, there were 6 different line items - not sure what the distinctions were), and it was 50%!!! So, for half the time Congress is talking about *how* they are going to talk about things. Ugggh. I suppose, as one who believes that the less the government does, the better, I should be happy. But oh, the global warming from all that hot air!

    --
    The more you regulate a company, the worse its products become.
  9. Reading the Record???? by jackb_guppy · · Score: 5, Insightful

    The congressional record is a false document of what happened in congress. Watch C-Span one day and hear each person request "Unamious support to change or extend". This allows 30 second comment say to begainst the bill to become a 2 hr speech to supporting the bill WITHOUT editing marks.

    This program may count time on paper but can not count time that congress is actually spending.

  10. The CR is anything but accurate by sgtrock · · Score: 5, Informative

    30 years ago, I learned in my high school civics class that any Senator or Representative can insert anything he or she wants into it at any time. Examples that were pointed out to us were speeches on the floor of the Senate that were never made, modifications to committee meetings, etc. The CR is by no means an accurate measure of anything. Except maybe the size of their combined egos.

  11. Congressional Record vs. what's actually said... by jejones · · Score: 4, Informative

    They know, don't they, that a representative can have arbitrary text inserted in CR as if it had been read?

    Also, if you watch CSPAN while Congress is in session, in the evenings you'll see long stretches with just a few people who are delivering their rants into a nearly empty room. Can that be separated from the rest of the text?

  12. See also: Clustering senators by votes & topic by Anonymous Coward · · Score: 5, Interesting

    You might also be interested in another topic model that not only automatically discovers topics, but also automatically discovers topic-specific groupings of the senators by their votes. http://www.cs.umass.edu/~mccallum/papers/grouptopi c_linkkdd05.pdf "Group and Topic Discovery from Relations and Text."

    It uses not only word data (from the text of 16 years worth of bills voted on in the U.S. Senate), but also the senator's voting records.

    For example, you can see that Sen. Chafee (R-RI) (who was mentioned on this morning's NPR as a "liberal Republican") actually does fall into a cluster of Democrats, not fellow Republicans. When automatically discovering topics using word data alone (without the votes, as does the wustl.edu paper above) the topics on this Senate data are reasonably coherent, but the topics created by this "Group-Topic" new model are even more interesting because their discovery is driven by the need to predict the votes as well as the words. For example, "Social Security" doesn't appear in the old model, but pops out clearly in the new model because it has such a distinct voting pattern.

    Some of the other results are also pretty interesting---on Education and Domestic policy the Republicans are more split than the Democrats (forming 3 groups, to the Democrats 1 group). On other topics, the split is the other way around.

    Using the same technique, there is also an analysis of 60 years worth of voting records from the U.N. On the topic of "human rights", Nicaragua, Papua, Rwanda, Swaziland and Fiji all get clustered together---ouch!