Is it possible to generate only "patients"? #1622
-
| Dear Synthea team For a project I am using you tool to generate large populations of patient (i.e., persons with a specific condition). I do not need healthy controls. What I am doing now is generating a VERY large population, both with patients and healthy controls, and filtering them while keeping only patients. However, this can become inefficient for diseases with low prevalence, if you need a very large patient population. Another question: I would like to create a new disease module, but I struggle to understand where "observation.csv" file is created. I admit I did not read the wiki to the last details, is it explained there? Or, is it there a place where I can find this piece of information? Many thanks | 
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 9 replies
-
| You can use a "keep module" to specify which patients you want to keep. You can specify the "keep module" on the command line. See https://github.com/synthetichealth/synthea/wiki/Keep-Patients-Module The observations.csv file is only created if you enable the CSV export. The file will be in the  | 
Beta Was this translation helpful? Give feedback.
-
| Hi Synthea Team, Congratulations on the great tool you’ve created! I have a question about the logic behind patient generation. I understand how an individual disease module works — for example, I’ve carefully studied the dementia and hypertension modules — but I’m not sure how the system decides which subjects enter which pathology modules. Is the assignment to modules based on randomness, or do all subjects created go through all the disease modules? And in that case, how are healthy subjects generated? Thank you very much for your help! | 
Beta Was this translation helpful? Give feedback.
-
| Thank you! One last question: is it possible to prevent the creation of certain observations, such as QALY, DALY, or any others considered not relevant for a specific project, in order to reduce the size of the observations.csv file? Thank you again for your support! | 
Beta Was this translation helpful? Give feedback.
The keep switch does not filter out other diseases, it just specifies what you keep.... I suppose you can create logic in the keep module that explicitly excludes patients if they have any other ActiveConditions... but that would be a lot of work to list every disease other than Hypertension.
I'm not sure why your patients don't all have hypertension, without examining everything deeply. They probably do, it just doesn't show up in the data. It could be related to the "years of history" setting (search issues and discussions here)... or maybe something else.
The goal is to create patients. Patients typically have comorbidities. The world is like that, especially people with hypertension. …