The GiveWell Blog

What I wrote in the Gates Foundation survey

Hey, the Gates Foundation is finally showing some interest in what others think. Here’s what I wrote in response to “Please share any comment or opinions you have about our web site.”

Not enough information. Not even close to enough information. Where are your evaluations & technical reports? Where’s your evidence for whether you’ve saved lives, how many lives you’ve saved, whether you’ve made any traction on education, what you think works in education? All I see is a list of grantees with 2-sentence descriptions (no account of how you choose them over other applicants) and a bunch of generalizing, salesy publications that don’t get specific about what was observed, how it was observed, and what specific practices you advocate. Glad you’re finally inviting feedback, because there are other people in existence trying to figure out where to give. As the leading foundation, you have an opportunity to create dialogue about how to help people as well as possible, and affect others’ giving as well as your own. This website does a great job burying that opportunity. Also, the look&feel is pretty drab.

Eat a peanut, save a child!!

What would you do to make the world a better place?

Would you …

Of course you would. And of course you can’t.

If people buy from (or look at ads for) companies based on their donations to charity, I promise the prices will rise by however much they’re donating. When sponsors give to charity for every home run the White Sox hit, they have a range in mind – and you can bet that if they end up owing way more than expected, somehow this is going to come out in next year’s pledge (even if it’s through their insurance company’s quote, etc., etc. …)

I’ve debated the specifics of various schemes along these lines before. Right now, I want to drop my usual preference for concreteness, step back from all the mechanics, and just be as general as I can be.

If someone tells you you can make a difference without either giving up anything valuable or doing anything useful, they are wrong.

When trying to figure out which schemes work and which don’t, it seems that’s about all you need to know.

As for the notion that these kinds of schemes “raise awareness” … man, I’m sick of hearing about “raising awareness.” If you’re not, just – read this.

I’m surprised by how many otherwise intelligent people get pulled in by promises of saving the world by yawning. Sure, they’re tempting, and it can be very complicated and confusing to figure out precisely why they don’t work … but the fundamental problem couldn’t be more obvious. Please don’t fall for this stuff. That’s all.

What’s so hard about rigorous self-evaluation?

I’m not trying to be a jerk. I honestly want to know.

Most of the self-evaluation we see from charities looks at clients, but doesn’t compare them to (otherwise similar) non-clients. So it’s probably effective at preaching to the choir, but not at winning over skeptics. When we bring up the issues with it, we constantly hear things like “Of course we’d love to do a randomized/longitudinal study, but those are expensive and difficult and the funding isn’t there.”

This is how I imagine an interested charity could evaluate itself rigorously:

  1. Use a lottery to pick clients. Many/most humanitarian charities don’t have enough money to serve everyone in need. (And if they do have enough, there’s an argument that they don’t need more.) Instead of dealing with this issue by looking for the “most motivated” applicants, or using first-come first-serve (as most do), get everyone interested to sign up, fill out whatever information you need (including contact info), etc. – then roll the dice.

    Unethical? I don’t think so. I don’t see how this is any worse than “first-come first-serve.” It could be slightly worse than screening for interest/motivation … but (a) it’s also cheaper; (b) I’m skeptical of how much is to be gained by taking the top 60% of a population vs. randomly selecting from the top 80%; (c) generating knowledge of how to help people has value too.

    Cost: seems marginal, cheaper than using an interest/motivation screen. You do have to enter the names into Excel, but for 1000 people, that’s what, 5 hours of data entry?
    Staff time: seems marginal.

  2. Get contact information from everyone. This is actually part of step 1, as described. These days you can get on the Web in a library, homeless people blog, and I’m guessing that even very low-income people can and do check email. Especially if you give them a reason to (see below).

    Cost: see above
    Staff time: none

  3. Follow up with both clients and randomly selected non-clients, offering some incentive to respond. Incentives can vary (many times, with clients, providing information can be a condition of continuing support services). But if push comes to shove, I don’t think a ton of people would turn down $50.

    Cost: worst case, sample 100 clients and 100 non-clients per class and pay them $50 ea. That’s a decent amt of sample size. Follow up with each cohort at 2, 5, and 10 years. That’s a total of 600 people followed up with = $30k/yr, absolute maximum.
    Staff time: you do have to decide how to follow up, but once you’ve done that, it’s a matter of sending emails.

  4. Check and record the followup responses. If possible, get applicants to provide proof (of employment, of test scores) for their $50. Have them mail it to you, and get temps to audit it and punch it into Excel.

    Cost: assuming each response takes 30min to process and data entry costs $10/hr, that’s $3k/yr.
    Staff time: none.

  5. And remember: Have Fun! Did you think I was going to put something about rigorous statistical analysis here? Forget it. Data can be analyzed by anyone at any time; only you, today, can collect it. When you have the time/funding/inclination, you can produce a report. But in the meantime, just having the data is incredibly valuable. If some wealthy foundation (or a punk like GiveWell) comes asking for results, dump it on them and say “Analyze it yourself.” They’re desperate for things to spend money on; they can handle it.

    (Also, I’m not a statistics expert, but it seems to me that if you have data that’s actually randomized like this, analyzing it is a matter of minutes not hours. Writing up your methodology nicely and footnoting it and getting peer-reviewed and all that is different, but you don’t need to do that if you’re just trying to gauge yourself internally.)

The big key here, to me, is randomization. Trying to make a good study out of a non-randomized sample can get very complicated and problematic indeed. But if you separate people randomly and check up on their school grades or incomes (even if you just use proxies like public assistance), you have a data set that is probably pretty clean and easy to analyze in a meaningful way. And as a charity deciding whom to serve, you’re the only one who can take this step that makes everything else so much easier.

I tried to be generous in estimating costs, and came out to ~$35k/yr, almost all of it in incentives to get people to respond. Nothing to sneeze at, but for a $10m+ charity, this doesn’t seem unworkable. (Maybe that’s what I’m wrong about?) And this isn’t $35k per study – this is $35k/yr to follow every cohort at 2, 5, and 10 years. That dataset wouldn’t be “decent,” it would be drool-worthy.

And the benefits just seem so enormous. First and foremost, for the organization itself – unless its directors are divinely inspired, don’t they want to know if their program is having an impact? Secondly, as a fundraising tool for the growing set of results-oriented foundations. Finally, just for the sake of knowledge and all that it can accomplish. Other charities can learn from you, and help other people better on their own dime.

The government can learn from you – stop worrying about charity replacing government and instead use charity (and its data) as an argument for expanding it. In debates from Head Start to charter schools to welfare, the research consensus is that we need more research – and the charities on the ground are the ones who are positioned to do it, just by adding a few tweaks onto the programs they’re already conducting.

So what’s holding everyone back? I honestly want to know. I haven’t spent much time around disadvantaged people and I’ve never run a large operation, so I could easily be missing something fundamental. I want to know how difficult and expensive good self-evaluation is, and why. Please share.

Cause 1: Where we stand

Now that Holden and I have finished drafting reviews for Cause 5 (to be made public in a couple of weeks), we’ve moved our focus to Cause 1: Help people in Africa avoid death and extreme debiliation.

Unlike Cause 5, in which organizations roughly followed the same model to help people, organizations applying for a Cause 1 grant take wildly divergent approaches. And, in most cases they are taking not just one approach, but are running a huge set of projects that don’t always have a clear overarching theme or approach. This obviously presents a large challenge, and makes it impossible for us to compare organizations as directly and quantitatively as we did for Cause 5.

Here’s what we’re thinking so far. Mostly, our applicants fall into the following broad categories: comprehensive community aid, providing lots of different kinds of services to a small group of a people; distribution, getting lots of small, inexpensive items to many people; corrective surgery, providing a relatively expensive but life-changing surgery to those with congenital deformities; and mammoths, which do just about everything for everyone everywhere.

Comprehensive community aid. These organizations go into a village and attempt to provide everything for the village including primary health services (for childbirth, pneumonia, etc.), distributing necessary medicine/products (bed-nets, ORS, de-worming pills), education about hygiene and protected sex, economic aid including farming technology, and much more. This is the model with the most intuitive appeal to us. When you’re trying to help people thousands of miles away from a culture you’ll probably never fully understand, it seems smart to work intensely with one group of people and document all the ways in which their lives change – that way you’re more likely to catch unintended consequences, adapt to changing problems, etc., and make sure you’re actually changing their lives for the better. (This seems far superior to deciding in advance on one problem, like AIDS, and attacking it furiously while leaving other problems unaddresed.)

But that documentation is essential – immersion doesn’t equal understanding, and if a charity isn’t measuring and reporting life change, we aren’t going to bet on it. So far (though we’re still working on it), we haven’t been able to get a real picture of the life change effected by an organization using this approach. Organizations do many activities for which they often offer no evidence of the eventual impact (e.g., they tell us how many families attended their HIV/AIDS awareness campaign, but don’t offer evidence for what effect they expect that to have).

Distributors. These organizations distribute cheap and potentially life-saving items: ORS to treat diarrhea, vitamin A supplements to prevent malnutrition and blindness, bednets to fight malaria, condoms to prevent HIV/AIDS, etc. The advantage of this approach, it seems, is the potential cost-effectiveness: by focusing on the cheapest, simplest diseases to treat, you can treat a lot more people.

We estimate that the stronger candidates in this area are saving lives for around $50-120 a pop … but this estimate is based largely on combining reports on number of units sold/distributed with academic research on the effects of these units (e.g., studying the effects of Vitamin A in developed-world hospitals,), plus a lot of guesswork about utilization (i.e., it’s one thing to sell or distribute condoms – but how much are they actually getting used?) Organizations only sometimes monitor the utilization of the products they distribute, and they rarely, if ever, measure the change in actual disease prevalence for the people they serve. So charities in this area may be helping a lot of people, but it’s hard for us to be confident in their effects.

Corrective surgery. These organizations perform a very specific procedure (or set of procedures) for people suffering from an ailment. They perform surgeries to correct debilitating deformities (cleft palate, severe burns, etc.) or correct vision impairment. The best of these organizations can tell us how many surgeries they perform and what conditions they correct, which along with their total expenses gives us a picture of how many lives they’re changing for each dollar they spend. These organizations don’t tell us a lot about how debilitating the conditions they fix truly are (leaving us to question the impact they’re having).

This model is attractive because each surgery affects a specific person. Knowing how many surgeries each organization performs tells how many people’s lives have been changed – there’s not a lot of doubt. But, this model doesn’t come close to achieving the cost-effectiveness of the (albeit somewhat theoretical) distribution model, with $/life impacted at 10-20x the cost. And, without a clear picture of the debilitation these surgeries prevent (to what degree they are somewhat cosmetic), we worry that their impact is even lower.

UNICEF. UNICEF does everything, everywhere. They distribute, perform surgeries, and somtimes just focus on providing all services to a set group of people. We won’t be able to evaluate the entirety of UNICEF’s programs, but we may be able to evaluate their Accelerated Childhood Survival and Development Program, which has an approach similar to “comprehensive community care” above, and which appears to be slated for a very large and growing role in UNICEF’s programming.

So, where are we going from here? At the moment, the only organizations that we can confidently say are helping people are those peforming corrective surgeries. But, we’re waiting on more information from distributors which, we hope, gives us more confidence that people are actually using the products they receive. We’re disappointed in what we’ve seen thus far from comprehensive community care. Even though it makes a lot of sense to provide everything to a small group of people (even at higher costs), we’re not convinced of the impact these programs (which are mostly more convential Africa-aid organizations) are having.

One more note: with so many different approaches to helping people, there’s no way that they’re going to be close in terms of cost-effectiveness. There’s no reason to think that an organization distributing inexpensive items across a continent is in the same ballpark as an organization providing corrective surgeries to a few thousand children each year. Donors need to understand what they’re getting for their dollar.