Cross-country Experiments Summary
At a Glance
This page summarizes our experiments in finding the best approach for training cross-country poverty estimation models for the following DHS countries: Philippines (PH), Cambodia (KH), Timor (TL) Leste, and Myanmar (MM).
Our main goal for these experiments to validate our approach on running models for countries without DHS ground truth data.
We find that training a model with both features and wealth index labels scaled per-country give us the most promising results.
\(r^2\) | ||||||
---|---|---|---|---|---|---|
Train | Test | Test single-country model | raw features and wealth index |
recomputed wealth index | scaled wealth index | scaled features and wealth index |
KH, MM, TL | PH | 0.57 | -0.19 | -0.01 | 0.25 | 0.48 |
PH, MM, TL | KH | 0.7 | 0.62 | -1.46 | 0.59 | 0.58 |
KH, PH, MM | TL | 0.6 | 0.37 | -2.76 | 0.34 | 0.54 |
KH, PH, TL | MM | 0.5 | 0.46 | 0.01 | 0.45 | 0.51 |
Overal Average | 0.59 | 0.32 | -1.06 | 0.41 | 0.52 |
Experiments
We trained four main types of models:
- Cross-country regressor with raw wealth index
- Cross-country regressor with wealth index recalculated using the pooled data for the four countries
- Cross-country regressor with both wealth index and input features scaled per-country
Table 1.1 summarizes the model performance for each experiment. Each experiment was evaluated using a Leave One Country Out strategy, wherein the model was trained on three countries and tested on the fourth country. For example, we train on KH, MM, and TL, and tested on PH. We trained a model for every possible split (4 in total) and calculated the mean \(r^2\) across all splits. This strategy emulates the situation wherein we train the poverty estimation model on data with ground truth data and rollout on countries without.
Cross-country regressor with raw wealth index
Learnings
- Training the model on raw features and wealth index produces inconsistent results which are highly dependent on the left-out country.
Cross-country regressor with wealth index recalculated using the pooled data for the four countries
Idea
Recomputing the wealth index from pooled country data from all countries will help the model predict better across countries
Experiment Performed
Combine raw country data and recompute the wealth index using PCA on a base set of features: drinking water, number of rooms, electrification, mobile telephone, radio, television, cars/trucks, refrigeration, motorcycle, floors, toilet
Learnings
- Aligning features across different countries makes it very difficult to include all features
- Recomputed index has poor correlation with original index
- Index would likely improve if we recompute using a larger dataset
Cross-country regressor with both wealth index and input features scaled per-country
Idea
Scaling the features and the DHS wealth index from absolute to relative values corrects country-level variations - Ex. 10 mbps download speed -> top 10% speed in that country
Experiment Performed
Trained a model using a) scaled features, b) scaled wealth index, and c) both, using different scaling methods
Learnings
- Scaling gives us the best cross-country results and works best when both features and labels are scaled
- Simpler method, but is scalable and already reported in other literature
- Model only gives us purely relative predictions without an absolute value