Add files via upload #1
I corrected unterminated strings in the values for most PD RFQ-U variables and dx_criteria_application. I also changed all instances of "Don’t Know" (mostly in PD RFQ-U and some MERQ variables) to "Unknown", to be consistent with the rest of the GP2 data.
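For reference, the two fixes described above can be sketched roughly as follows. This is only an illustration: the exact layout of the values column is not shown in this thread, so the cell format (`0="No", 1="Yes", ...`) and the function name `normalise_values` are assumptions, not the actual dictionary format or the code used for this pull request.

```python
import re


def normalise_values(cell: str) -> str:
    """Apply both fixes to one 'values' cell (assumed format: 0="No", 1="Yes", ...)."""
    # Replace "Don't Know" with "Unknown", matching straight or curly apostrophes.
    cell = re.sub(r"Don[’']t Know", "Unknown", cell)
    # An odd number of double quotes means the last label was never closed,
    # so close it. This assumes the missing quote is always the final one.
    if cell.count('"') % 2 == 1:
        cell += '"'
    return cell


fixed = normalise_values('0="No", 1="Yes", 2="Don’t Know')
```

A cell that is already well formed passes through unchanged, so the function could be applied to the whole column.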
We can do this once a medications interest/project group is formed in GP2; one is currently pending. Once we receive the dictionary they propose, we can compare it and modify where needed.
GP2_Data_Dictionary_ver1.1-3.csv
My previous pull request regarding the unterminated strings for MERQ has not been addressed yet. I also corrected some other issues that I had been patching in the cohort harmonisation notebooks (in section 3B, which autopopulates the coding sheet). Please see below for the list of changes made to the data dictionary.
Summary of data dictionary corrections
1. Values-string fixes
I have added some additional medications variables and drafted a medications list, based on the unique medication names found in the datasets I have harmonised for GP2, which can be used to assign the medications variable from the medication name. Lietsel mentioned that Datatecnica is generating a list of medication classes using UKB data, and I believe another team is working on something similar within GP2, although I am not sure whether the aim is a comprehensive mapping for free-text medication names or a project-focused one covering only a subset of medications. We should combine efforts if possible; otherwise, my list can be used and extended in the interim for data harmonisation until it is replaced by the comprehensive mapping being developed in those projects.
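To illustrate how such a list could be used during harmonisation, here is a minimal sketch of a name-based lookup. The entries in `DRAFT_MEDICATION_LIST` and the variable names they map to are hypothetical placeholders, not the contents of the draft list or real GP2 variable names.

```python
from typing import Optional

# Hypothetical draft list: normalised free-text medication name -> medications variable.
DRAFT_MEDICATION_LIST = {
    "levodopa": "hypothetical_var_levodopa",
    "carbidopa/levodopa": "hypothetical_var_levodopa",
    "pramipexole": "hypothetical_var_dopamine_agonist",
}


def assign_medication_variable(free_text: str) -> Optional[str]:
    """Return the medications variable for a free-text name, or None if unmapped."""
    # Lower-case and collapse whitespace so "  Levodopa " matches "levodopa".
    key = " ".join(free_text.lower().split())
    return DRAFT_MEDICATION_LIST.get(key)
```

Names that return `None` are not in the list yet and would need manual review before being added.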