Data Sources

Data Sources

The All of Us Research Program aims to enroll one million or more people across the U.S. to help build one of the largest, most diverse health databases in history. By engaging people and communities who have been left out of medical research in the past and making their data available to a diverse set of researchers, the program seeks to improve understanding of health and disease and reduce health disparities.

Participants share data from multiple sources, including surveys and electronic health records (EHRs). Participants also contribute biosamples, which we use to generate genomic data. Some participants also share data from wearable devices. As the program grows, additional data sources will become available.

The All of Us Research Program’s Data and Research Center curates and validates these data as part of the data collection process. Anyone can explore summaries of the Data Snapshots and Data Browser. Registered researchers can access more in-depth data and a variety of research tools to conduct a wide range of studies through the Researcher Workbench.

Find out more about our Study & Data Methodology, including our use of the OMOP Common Data Model.

Electronic Health Records

All participants are invited to share their electronic health records (EHRs) with the program. All of Us removes direct identifiers from participants’ EHR data before adding this information to the Researcher Workbench. The All of Us Data Dictionary (Registered Tier | Controlled Tier) provides more detailed information about the data. We use the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) to standardize all EHR data. Learn more about OMOP CDM on our Data Methods page.

Biosamples And Bioassays

Some participants contribute blood and/or saliva, and urine samples that can be used for a wide variety of bioassays. Biosamples are processed and stored in our biobank at the Mayo Clinic. DNA extracted from blood or saliva is sent to our genome centers for genomic analysis, including whole genome sequencing, and genome-wide genotyping. Blood serum samples were analyzed for COVID-19 antibody testing.

An overview of the genomic data available can be seen on the Data Browser. Individual-level genomic data are available to researchers registered for Controlled Tier access in the Researcher Workbench.

Surveys

All participants are invited to complete surveys on the secure All of Us Participant Portal. Initial surveys include questions on health and lifestyle. Additional surveys on more specific subjects are added regularly. You can view survey questions and learn more about where the questions come from with our Survey Explorer.

Physical Measurements

Trained staff measure participants during initial appointments at our All of Us Research Program partner locations. Data from these visits include height, weight, BMI, waist circumference, hip circumference, blood pressure, and heart rate. Program staff also note, if applicable, participants’ pregnancy status and wheelchair use at the visit.

Mobile Health Data (wearable devices)

All of Us participants can contribute information from their own wearable health tracking devices, such as Fitbit. Wearables track biometric data like heart rate and blood pressure.