Cash 4 Cars

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Friday, 25 November 2011

Genetic Data and Economics: Problems in Drawing Inferences

Posted on 04:00 by Unknown
A number of datasets that have economic and demographic information are also starting to have genetic information about the participants: in the U.S., some examples include National Longitudinal Study of Adolescent Health, the Wisconsin Longitudinal Study, and the Health and Retirement Survey. It is becoming possible, in other words, to look for connections between a person's genes and their education, income, and other economic outcomes. In the Fall 2011of my own Journal of Economic Perspectives, Jonathan Beauchamp, David Cesarini, and a host of co-authors tackle the issue of drawing inferences from this data in "Molecular Genetics and Economics."

The fundamental problem in these studies is that humans have a lot of genes.To be more specific, each person has about 3 billion "base pairs" of DNA material, and "genes" are combinations of these base pairs. However, the human genome includes more than just genes and DNA; there is also RNA and all sorts of other stuff. Figuring out the interactions between DNA, RNA, various proteins, and other ingredients is exciting and cutting-edge work in the life sciences.

For social scientists, working with this data is tricky. Current technologies create data on about 500,000 possible individual differences at the base-pair level in genes; before long, it will be a million and more. To those marinated in a bit of statistics, the problem can be phrased this way: If you have 500,000 independent variables in a least-squares regression, a whole lot of them will be statistically "significant" at conventional levels just by chance. For those to whom that statement carried no particular meaning, think of it this way:

When social scientists look at data, they are always trying to distinguish a real pattern from a pattern that could have happened by chance. To understand the difference, imagine watching a person flip a coin 10 times, and get "heads" every time. The odds of getting "heads" 10 times in a row with a fair coin is .5 raised to the power of 10, or .0009766--which is roughly one in a thousand. If you see a pattern that happens by chance only one time in a thousand, you would strongly suspect something is going on. Maybe it's a two-headed coin? But now imagine that you start off with 500,000 people each flipping a coin. After they have all flipped a coin 10 times, on average 488 of them will have gotten 10 straight heads. In this context, observing 10 straight heads is just what happens a certain amount of the time because of random chance when you start with very large numbers of people.

Bottom line: When you observe a particular event in a fairly small group, you can have some confidence (never complete certainty!) as to whether it occurred by chance. But if you see the same event happen for a small proportion of those in a really big group, then it certainly could have happened by chance. When you have 500,000 pieces of genetic data, it's like a big group, and any connections you see can happen by chance.

What's to be done? Beauchamp, Cesarini, and their co-authors suggest three steps.

First, a researcher who is working with 500,000 variables need to demand a much more extreme event before concluding that a connection is real. If I'm starting with 500,000 people flipping coins, I want to see someone flip heads maybe 100 times in a row before I conclude that something other than random chance is happening here. There are statistical methods for making this kind of correction, but they are still a work in progress. Research has found 180 different base-pairs that seem to be associated with height, but perhaps many more need to be considered as well, and perhaps considered all at once, not one at a time.

Second, it becomes extremely important to do the same calculation with multiple different datasets, to see whether the results are replicated. In their JEP article, they look at genetic determinants of education in two different datasets--and fail to replicate the results.

Third, if you're going to have really large numbers of variables, it's useful to have really large populations in your data, which isn't yet true of most of the datasets in this area.

In the same issue, Charles Manski also offers a comment on  this research in "Genes, Eyeglasses, and Social Policy."  Manski offers several useful  insights on this research. For example:

First, a finding that genes cause an effect is totally different from deciding about appropriate social policy. It seems likely that genes are highly correlated with poor eyesight, for example, but that genetic condition is easily and cheaply remedied with corrective lenses. Social policy should be about costs and benefits, not about whether something is "caused" by genes.

Second, it's important to be cautious about interactions of genes, environment and outcomes. If one looked at genetic patterns and the propensity to eat with chopsticks, for example, one might find a statistical correlation. But the obvious reason is that many of those with the common genetic pattern are also living in a common society, and it's society rather than genes which is causing correlation with chopsticks. In addition, certain traits like height are definitely highly inheritable, but they can still shift substantially over time as the environment alters--as in the way that average human heights have increased in the last century.

Third, Manski expresses some doubt that brute-force statistical calculations with hundreds of thousands of possible explanatory variables will ever yield solid inferences about causality. Instead, he suggests that over time, biologists, medical researchers and social scientists will develop better insights about how genes and all the rest of the activity in the human genome affects various traits. It will then be somewhat easier--if never actually easy--to understand cause and effect.


Email ThisBlogThis!Share to XShare to Facebook
Posted in econometrics, genetics | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • High Food Prices and Political Unrest
    Marco Lagi, Karla Z. Bertrand and Yaneer Bar-Yam of the New England Complex Systems Institute have a working paper up about "The Food C...
  • The Dispute over "Core Inflation"
    Is there a danger of inflation taking off? When the price of gasoline and food shoot through the roof, it seems like it. But central bank of...
  • Bruce Yandle on environmental economics
    David A. Price of the Richmond Fed has an interview with Bruce Yandle . On the difference between a “systems approach” and a “process approa...
  • Africa's Prospects: Half Full or Half Empty?
    There has been a flurry of articles recently with optimistic economic news about sub-Saharan Africa. For example, the December 3 issue of th...
  • Endorsing Association 3E: Ethics, Excellence, Economics
    I would like to take this opportunity to heartily endorse Association 3E: Ethics, Excellence, Economics. I discovered this organization last...
  • Spring 2011 Journal of Economic Perspectives On-line
    I'm the managing editor of the Journal of Economic Perspectives , published by the American Economic Association. It's an academic j...
  • Asian Century or Middle Income Trap?
    Will Asia come to dominate the global economy during the 21st century? The Asian Development Bank published a thoughtful report on the subje...
  • World Economic Forum Ranks U.S. Competitiveness
    The World Economic Forum is an independent organization that has been around since the early 1970s. It's perhaps best-known for the annu...
  • Sky-High Textbook Prices--And My Suggested Solution for Intro Economics
    High textbook prices are modest problem in the context of soaring costs of higher education, but many of the costs of tuition and room and b...
  • The Kuznets Curve and Inequality over the last 100 Years
    The Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel first started being given in 1969, the backlog of worthy economis...

Categories

  • Africa
  • aging
  • agriculture
  • American dream
  • annuities
  • articles
  • banking
  • behavioral
  • biofuels
  • biomedical
  • brain science
  • budget deficits
  • capital flows
  • China
  • choice
  • cities
  • climate
  • column
  • convergence
  • credit rating agencies
  • crime
  • currency
  • debt
  • deficit
  • demand
  • demand and supply
  • deposit insurance
  • deregulation
  • development
  • disability insurance
  • drug policy
  • econometrics
  • economics in life
  • economists
  • education
  • employment
  • energy
  • environment
  • euro
  • Europe
  • exchange rates
  • exports
  • externalities
  • fdi
  • financial crisis
  • fiscal
  • fisfcal
  • food
  • food prices
  • free
  • game theory
  • gender
  • gender equality
  • genetics
  • geyser
  • globalization
  • gold
  • grades
  • Great Depression
  • Great Recession
  • growth
  • health
  • health care
  • higher education
  • history
  • households
  • housing
  • immigration
  • inequality
  • inflation
  • information
  • infrastructure
  • innovation
  • interest
  • international
  • international finance
  • international trade
  • interview
  • ipo
  • JEP
  • jobs
  • journals
  • Keynes
  • Krugman
  • labor
  • Labor Day
  • labor market
  • labor markets
  • long-term care
  • macro
  • macroeconomics
  • Medicare
  • microfinance
  • middle east
  • migration
  • minimum wage
  • monetary
  • monetary policy
  • moral hazard
  • Noriel Roubini
  • oil
  • olive oil
  • opportunity cost
  • payday loans
  • pension funds
  • policy evaluation
  • ponzi
  • population
  • postal service
  • poverty
  • price bubbles
  • price regulation
  • quotation
  • recovery
  • redistribution
  • regulation
  • resources
  • retirement
  • safety
  • Scrooge
  • social security
  • sociology
  • sunk costs
  • tax expenditures
  • tax policy
  • tax rates
  • taxes
  • teaching
  • teaching company
  • technology
  • textbooks
  • tourism
  • tradeoffs
  • transportation
  • unemployment
  • unions
  • usury
  • weak ties
  • WTO

Blog Archive

  • ▼  2011 (207)
    • ►  December (25)
    • ▼  November (28)
      • Too Much Imprisonment
      • Credit Rating Agencies
      • How Alexander Del Mar (Who?) Scooped Milton Friedman
      • International Travel: Boosting America's Biggest E...
      • Brain Science and Economics
      • Genetic Data and Economics: Problems in Drawing In...
      • Turkey Demand and Supply, and the Thanksgiving Din...
      • Underpurchasing of Annuities
      • True Love and Other Times When Monetary Incentives...
      • Long-Term Care Insurance in the U.S.
      • Fall 2011 issue of Journal of Economic Perspectives
      • The "Chermany" Problem of Unsustainable Exchange R...
      • Are U.S. Banks Vulnerable to a European Meltdown?
      • Unexpected Economics: My New Teaching Company Course
      • Heavier Cars Kill
      • Martin Shubik's Dollar Auction Game
      • Grade Inflation and Choice of Major
      • Job Openings, Labor Turnover, and the Beveridge Curve
      • An Alternative Poverty Measure from the Census Bureau
      • A State-Level Gold Standard?
      • Independence and Depression: Economics of the Amer...
      • Costs of Air Pollution in the U.S.
      • The Diminishing Gender Wage Gap in the U.S.
      • "Big Oil"--Actually Small and Vulnerable
      • Recognizing Non-formal and Informal Learning
      • What if Country Size Was Relative to Population? A...
      • Lorenz curves and Gini coefficients: CBO #3.
      • Federal Redistribution is Dropping: CBO #2
    • ►  October (27)
    • ►  September (29)
    • ►  August (29)
    • ►  July (28)
    • ►  June (32)
    • ►  May (9)
Powered by Blogger.

About Me

Unknown
View my complete profile