Archive for the ‘stats’ Category

The Puzzling Nature of Success in Cultural Markets

Saturday, April 26th, 2008

Matthew Salganik gave a talk in the EE department with this title. He got his PhD in sociology last year under Duncan Watts, studying the (un)predictability of hits, blockbusters, best-sellers, etc. You probably read about it. If not, the basic idea is that they set up a website where people would come and listen to music and examined the influence of popularity on people’s listening habits. We’re not talking millions of songs here, just 48 chosen pretty much at random from unknown bands on PureVolume. Users could listen to any song and then after listening to it had the opportunity to download it.

As users arrived, they were assigned to one of eight completely separate “worlds”. In seven of these worlds, the users could see how many times each song had been downloaded, the last world served as a control in that users couldn’t see how popular each song was. The punchline is that in the worlds where people could influence each other, popular songs were downloaded a lot, but different songs became popular in each world. In the control group, some songs were still downloaded more than others, but the difference wasn’t as striking.

Popularity vs quality

The graph from the talk that really stuck with me was this one, taken from their Science paper. It shows the marketshare of each song in the control world versus its marketshare in each of the seven influence worlds. The marketshare in the control world is taken as an un-influenced measure of quality, while the marketshare in the influence worlds are taken as measures of popularity. What you can see is a triangular shape indicating that the “bad” songs were unpopular in all worlds, while the “good” songs were only popular in some of the worlds. Sagalnik said that this agreed with what people in hit-based industries told them, that it’s easy to predict what won’t be a hit, but hard to predict what will.

Zipcode demographics

Tuesday, February 12th, 2008

During my meeting with Dan this week, we stumbled upon a website that will tell you more that you ever wanted to know about any zipcode. Mine, 10001, seems to have a very interesting breakdown of population by age and gender, as can be seen above. It seems that 10001 is full of women in their early 20s and men in their 30s and 40s. On reflection, that would probably be the women at the Fashion Institute of Technology and the men of Chelsea. Funny thing is that only 1.7% of households self-reported as being gay. Other college towns have interesting demographic breakdowns as well, e.g. Ithaca, NY and Wellesley, MA.

Violence

Tuesday, April 24th, 2007

The world is a violent place and American society is violent. Being at a shooting and hearing about other shootings, these are the thoughts that make me the most sad. According to the violence policy center, in 2003, there were 12,000 firearm homicides in the US, not to mention 17,000 firearm suicides. This averages out to about 33 homicides per day. One Virginia Tech massacre every day is the average.

Not only is there violence in reality, there’s violence in all of the usual media outlets, bleeding and leading, etc. Violence infects social institutions and behaviors. People are violent to each other in dating, in sports, especially in business. The “free market” seems like a good excuse for violence against allies and competitors alike, negotiating down your soon-to-be employee’s salary, taking just a little bit more for yourself. The distinction between violence and good old fashioned competition seems to revolve around respect and the golden rule.

Why do so many people get shot? Because of the gun lobby, you say? Who is this gun lobby? The NRA? Who is the NRA? Now that’s a good question. Who is the NRA? It’s not at all clear from the news or their website. They don’t seem to publish a list of donors, individual or corporate. Are they like the Cigarette lobby, the Drug lobby, the Trial Lawyers’ lobby, funded by people and companies with a direct financial interest in the issues at hand? Is the NRA supported by Smith and Wesson and 4 million rifle-loving patsies? Or are they like the anti-abortion lobby, with no one standing to make much money from proposed legislation?

My fear is that violence is the natural order of human “civilization.” We’ve been killing each other for tens of thousands of years, why stop now? Since time immemorial, it seems that warlords have ruled with bands of thugs. Some have been more powerful than others, e.g. Sulla’s march on Rome, followed by many others, but on a pessimistic day it doesn’t seem difficult to describe world history in such terms. I would like to believe that the 21st century will usher in an era of peace and cooperation, that structural and technological changes have fundamentally changed the ways that people interact with each other, but then violence stares me in the face. Maybe I’ve just spent too much time watching Godfather movies and the news from Afghanistan.

Birth Control Stats

Friday, March 10th, 2006

Yesterday Graham was wondering about the popularity of various birth control methods. I found a paper about it on the CDC’s website. I got these numbers from table 6 in the paper. These are just the most popular methods (and whether birth control is used at all). Note that not having sex is filed under not using birth control. In general, I didn’t find the numbers very suprising, except for the number of women having sex and the rarity of unprotected sex.

age 15-44 15-19 20-24 25-29 30-34 35-39 40-44
using contraception 61.9 31.5 60.7 68.0 69.2 70.8 69.1
not using contraception 38.1 68.5 39.3 32.0 30.8 29.2 30.9
pill 18.9 16.7 31.9 25.6 21.8 13.2 7.6
female sterilization 16.7 0.0 2.2 10.3 19.0 29.2 34.7
condom 11.1 8.5 14.0 14.0 11.8 11.1 8.0
male sterilization 5.7 0.0 0.5 2.8 6.4 10.0 12.7
other contraceptive 9.5 6.3 12.1 15.3 10.2 7.3 6.1
no intercourse in 3 months before interview 18.1 56.2 17.9 8.9 7.6 9.1 10.8
had intercourse 7.4 69.0 8.4 8.0 7.0 7.7 6.7

How do People Die?

Friday, January 6th, 2006

This whole mine collapse incident got me wondering, rather morbidly, in what ways and how many people die in the US every year? Really what I’m concerned with are deaths not from disease but from accidents, homicides, or suicides. From the CDC website that collates such facts, I got the following information for 2002 (these categories are not mutually exclusive):

Cause Number of deaths
Injury Deaths 161,269
Unintentional Injury Deaths 106,742
Motor Vehicle Traffic Deaths 44,065
Suicides 31,655
Firearm Deaths 30,242
Poisoning Deaths 26,435
Homicides 17,638
Work Related Deaths 5,307

What Do People Eat?

Thursday, December 29th, 2005

It took a bit of digging, but I found stats on what people all over the world eat. These numbers come from the Food and Agriculture Organization of the United Nations. Specifically, I got them from the Food Supply sections of the Agricultural data in the FAOSTATS database. It would be nice to do it by country and put it all into map form using something like DIY Map, but for now I’ll just provide a table listing the data by continent.

This table shows the percentage of average daily calories per capita that come from various foods. The top section shows the breakdown between animal and vegetable foods, while the bottom section breaks it down even further into specific foods. The FAO db will let you break it down even further, but I didn’t want the table to be too big. So, where do people on the various continents get their calories?


world usa europe oceania near east south america asia africa
vegetal products 0.83 0.72 0.70 0.70 0.89 0.79 0.86 0.93
animal products 0.17 0.28 0.30 0.30 0.11 0.21 0.14 0.07
cereals – excluding beer 0.47 0.22 0.28 0.24 0.56 0.32 0.55 0.50
vegetable oils 0.09 0.17 0.13 0.11 0.09 0.10 0.08 0.08
sugar & sweeteners 0.09 0.18 0.11 0.13 0.09 0.17 0.06 0.06
meat 0.08 0.12 0.12 0.14 0.04 0.11 0.07 0.03
starchy roots 0.05 0.03 0.04 0.07 0.02 0.05 0.04 0.14
milk – excluding butter 0.04 0.10 0.09 0.09 0.04 0.06 0.03 0.03
alcoholic beverages 0.02 0.04 0.05 0.04 0.00 0.02 0.02 0.02
animal fats 0.02 0.03 0.06 0.04 0.01 0.02 0.02 0.01

Crime in NYC

Tuesday, December 27th, 2005

I thought that I’d seen a cool GIS visualization of crime in New York’s various police precincts, but it appears to be MIA. I’ll have to settle for text versions. To wit, the stats for south Manhattan:

1990 1995 1998 2001 2004
murder 124 46 26 24 27
rape 207 168 114 93 125
robbery 14866 6441 4193 2701 2127
fel assault 3997 2854 2281 1714 1405
buglary 16090 9168 5326 3720 3100
grand larceny 44811 24847 20131 16673 15160
gr larc auto 9446 4011 2526 1457 1025

Calorie Stats

Sunday, November 6th, 2005

I recently read here, citing Marion Nestle’s book Food Politics, that the food industry produces 3,800 calories per American per day. This is a very interesting way to measure how much people eat on average. Assuming that we’re not amassing vast stores of ring-dings in wearhouses in New Jersey, it would make sense that people should consume all of those calories, give or take what ends up in the trash or under the sofa. It seems that people are getting fatter because food companies want to sell more food.

Romance Novels

Wednesday, October 19th, 2005

Reading through the NYTimes book review, I noticed an add from the “Romance Writers of America” with some interesting statistics:

  • romance fiction totaled $1.2 billion in sales for 2004
  • 54.9% of all paperback sales in 2004 was romance — more than half
    of all paperback fiction sold
  • 39.3% of all fiction sold in 2004 was romance.
  • 64.6 million American’s read at least one romance novel in the
    past year

What do people do?

Wednesday, October 19th, 2005

As a graduate student, taking classes on interesting but nevertheless esoteric topics, conversation often turns to “What do real people actually do for a living?” Fortunately, the US Department of Labor, specifically the Bureau of Labor Statistics, is on the ball and has collected gobs of statistics on everything you’d ever want to know about the subject. Here, collected for your viewing pleasure, are the top 10 professions in these United States:

Occupation Employed (Millions)
Retail Salespeople 3.965
Cashiers 3.339
Office Clerks, general 2.674
General and Operations Managers 2.222
Registered Nurses 2.190
Food Preparation and Serving Workers 2.160
Laborers and Hand Movers 2.121
Janitors and Cleaners, except Maids 2.083
Waiters and Waitresses 2.009
Customer Service Representatives 1.908