Notice Number: NOT-OD-17-044
Release Date: February 21, 2017
National Institutes of Health (NIH)
This Request for Information (RFI) seeks public comments on the data submission and access processes for the NIH National Center for Biotechnology Information (NCBI) database of Genotypes and Phenotypes (dbGaP), and on the management of data in dbGaP, in order to consider options to improve and streamline these processes and to maximize the utility of dbGaP.
Response to this RFI is voluntary. Responders are free to address any or all of the topics listed in the request or any other relevant topics respondents recognize as important for NIH to consider for dbGaP data submission, access, and management. Respondents should not feel compelled to address all items. Instructions on how to respond to this RFI are provided at http://osp.od.nih.gov/content/nih-request-information-processes-database-genotypes-and-phenotypes-dbgap-data-submission-access-and
NIH Policies for the sharing of genomic and associated phenotypic data, the 2014 NIH Genomic Data Sharing (GDS) Policy  and its predecessor, the 2007 NIH Policy for Sharing of Data Obtained in NIH Supported or Conducted Genome-Wide Association Studies (GWAS Policy)  , set forth expectations and responsibilities to ensure the broad and responsible sharing of genomic research data in a timely manner. Fundamental to NIH’s stewardship of these data is respect for and protection of research participants’ interests. In 2007, NIH developed dbGaP  to archive and distribute the results of human genotype-phenotype studies that fall under these policies, in a manner that is consistent with the consent of the study participants whose genomic data are to be shared. dbGaP, a controlled-access data repository, currently serves as a central portal to submit, locate and request access to human genomic (e.g., GWAS, sequencing, expression, epigenomics data) and associated phenotypic and exposure datasets. NIH has established a governance system to facilitate the development and oversight of consistent, transparent, and efficient processes for using dbGaP and related genomic data sharing activities under the NIH GDS Policy .
As of January 2017, dbGaP maintains 4,625 datasets from 786 studies, representing over 1.2 million unique research participants. To date, over 44,000 Data Access Requests (DARs) submitted by 4,898 investigators from 46 countries have been processed. Even though dbGaP is a rapidly growing and highly utilized resource and many improvements to the dbGaP data submission and access processes have been made , NIH believes that the processes for requesting and submitting data could be streamlined and improved. Through this RFI, NIH seeks public feedback on the dbGaP data submission and access processes, and data management practices, to inform NIH about how to make dbGaP systems more user-friendly and efficient as they continue to grow and evolve.
The NIH invites feedback pertaining to any opportunities or challenges related to the following topics, as well as potential areas and opportunities to improve understanding, efficiency, or transparency of the processes associated with these topics:
1. dbGaP Study Registration and Data Submission . Examples of areas of possible comments include, but are not limited to:
2. dbGaP Data Access Request (DAR) and Review . Examples of areas of possible comments include, but are not limited to:
3. Policies for the Management and Use of dbGaP Data. Over time, NIH has received feedback about existing practices for managing access to data subject to the GDS Policy. To further inform NIH policies and practices for the management of dbGaP data, NIH is interested in public feedback on topics such as the following:
4. General Comments. NIH welcomes general comments on any other topics with regard to dbGaP data submission, access, and management.
Comments on the topic areas of interest should be submitted electronically to the following webpage: https: http://osp.od.nih.gov/content/nih-request-information-processes-database-genotypes-and-phenotypes-dbgap-data-submission-access-and or mailed to: Office of Science Policy (OSP), National Institutes of Health, 6705 Rockledge Drive, Suite 750, Bethesda, MD 20892, or by fax to: 301-496-9839 by April 7, 2017.
This RFI is for planning purposes only and should not be construed as a policy, solicitation for applications, or as an obligation on the part of the Government to provide support for any ideas identified in response to it. Please note that the United States Government will not pay for the preparation of any information submitted or for its use of that information.
Comments received, including any personal information, will be posted without change after the close of the comment period to the NIH GDS website . Please do not include any proprietary, classified, confidential, or sensitive information in your response. We look forward to your input and hope that you will share this RFI document with your colleagues. Updates to this document, if any, will be noted.
The Government reserves the right to use any non-proprietary technical information in summaries of the state of the science, and any resultant solicitation(s). The NIH may use information gathered by this RFI to inform development or modification of data sharing databases, websites, policies and practices, processes and procedures, and supporting documentation (e.g., guidance, FAQs).
 dbGaP was developed by the National Center for Biotechnology Information, National Library of Medicine, NIH. https://www.ncbi.nlm.nih.gov/gap
 Insert link to publication when available: OSP Poliscope blog post describing previous improvements to dbGaP
 For the purposes of this document, genomic summary statistics are defined as: calculated summary statistics, including genotype counts, allele frequencies, effect size estimates and standard errors, and p-values calculated from a study sample.
Please direct all inquiries to:
NIH Office of Science Policy
Division of Scientific Data Sharing Policy