Update: See reply — this issue has been resolved. Links to our analyses and code have been changed to archived versions in this post.
MEDI is a publicly-available indication resource standardized to ICD9/UMLS concepts for diseases and RxNorm ingredients for drugs. The accompanying publication clearly and concisely presents the analysis, which follows a rational, resourceful, and thorough methodology .
Table 2: Number of unique medications, ICD9 codes, and indication pairs extracted from each resource
Medications (% of total)
ICD9 codes (% of total)
Indication pairs (% of total)
Union of all resources
Our analysis found different resource-specific counts. The comparison is complicated since the resource to numeric identifier mapping is unknown:
We will reach out to the MEDI authors for assistance. Currently the discrepancy seems to have a negligible effect on the high-precision subset.
After contacting Dr. Wei-Qi Wei, we located the cause of the discrepancy. The integer values in the MENTIONEDBYRESOURCES column of MEDI_01212013_0.csv and MEDI_01212013_UMLS.csv refer to how many resources reported the indication. We had incorrectly assumed that this column referred to which resources reported the indication. Therefore, it appeared that each indication was only reported by a single resource.
Resource-specific indications data is not available from the MEDI website. However, the true counts for each resource combination are provided in manuscript Figure 2:
We would like to thank the authors for their prompt response and clarification.
Daniel Himmelstein: The authors do not plan on releasing the resource-specific indication data for the current MEDI database. However, they will consider doing so for future releases.