R/data_mesh_frequencies.R
data_mesh_frequencies.RdBaseline frequencies for MeSH descriptors computed from a local PostgreSQL
mirror of PubMed (April 2026). For each descriptor, counts reflect the
number of distinct PMIDs indexed with that term; proportions use the full
PubMed corpus of 39,703,112 PMIDs as denominator. Descriptor UI and
canonical name are joined from the NLM MeSH thesaurus.
Intended as a baseline for mesh_keyness against arbitrary
PubMed subsets.
data_mesh_frequenciesA data.table with 30,521 rows and 4 columns:
MeSH descriptor unique identifier (e.g.,
D000001)
Canonical MeSH descriptor name
Number of distinct PubMed records indexed with this descriptor
Proportion of all 39,703,112 PubMed PMIDs indexed with this descriptor
Computed from mesh_descriptor table in a local PubMed
PostgreSQL mirror; descriptor metadata from the NLM MeSH Thesaurus
(April 2026).