-
Notifications
You must be signed in to change notification settings - Fork 72
Allowing inclusion of multiple 'sites' as treatment group for synthetic control #165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @kashramli, thanks for this suggestion. I think this is something we could do relatively easily. As far as I understand, the way how this is done is to create a new aggregate unit which is simply the sum of the individual treated units. So one solution would be to do that as a manual pre-processing step. Though I can imagine that might result in an aggregate treated unit with values much higher than the individual untreated units. That may cause problems with interpolation (i.e. the synthetic control weights summing to 1). I had a quick look on the GeoLift website but didn't have luck finding anything specific on this issue of multiple treated units. Would you be able to point me in the right direction so I can look at the dataset and approach they take? |
Hi @drbenvincent, I can't find any explanation for how Meta's Geolift does it specifically, but I have found some papers that have introduced the concept of having multiple treatments. I'm not quite confident enough to evaluate and determine which approach (if any) is best. Extending synthetic control method for multiple treated units: an application to environmental intervention Examination of the synthetic control method for evaluating health policies with multiple treated units(pg. 1519) The inclusive synthetic control method. Inference for Synthetic Control Methods with Multiple Treated Units I'll keep looking and will report back if I find something more concrete. |
Hello again.....so I just asked them directly on their FB group, and they were super helpful in talking through their solutions. Here is the post: https://www.facebook.com/groups/fbgeolift/posts/1578515999315617/?comment_id=1581575829009634¬if_id=1676376929860659¬if_t=group_comment_mention |
Thanks @kashramli. Sorry for the delayed reply. I think their answer (of using the mean of the treated units) perhaps answers most (or all?) questions. Do you feel that your problem can be solved yourself by manually calculating a new mean-of-treated-units? It seems like that would be sufficient for simple synthetic cases, but perhaps there's more to it? Or do you think that it would be highly useful for CausalPy to include some specific functionality to do that? |
I'm also trying to mimic the Meta GeoLift approach (I'd much prefer a Bayesian answer with interpretable confidence intervals!). If you like I can try to code some functionality for detecting when an average of multiple units is needed. But I'm getting a different issue when using the mean of the treated units and I'm not sure what the problem is; the error I get is |
Quick update @kashramli / @pamant22. I know it's been a while, but we're finally getting around to focusing on geo testing. I have a work in progress PR which will demonstrate how to analyse data with multiple treatment methods. It will demonstrate 2 approaches. The first is a pooled approach which simply aggregates the treatment geo's and then proceeds to use the current functionality (i.e. the I've set it so that closing #338 will also close this issue. But I realise that this topic is quite rich. If there are more specific aspects of multi-cell geo lift testing that you'd like to be worked on a bit, then feel free to create a new issue - perhaps after #338 is done. |
I'm not sure if this is already possible and I haven't worked it out yet, but it would be great to have a way to easily include multiple 'sites' to make up the treatment group. In the example of geolift experiments, sometimes you would want to include multiple cities as the treatment group.
This is something easily done in Meta's GeoLift, so it would be great to see it here as well.
And if it is already possible....it would be great to have instructions included in the documentation
Thanks
The text was updated successfully, but these errors were encountered: