March 6, 2023
PA-20-272 - Administrative Supplements to Existing NIH Grants and Cooperative Agreements (Parent Admin Supp Clinical Trial Optional)
Office of Data Science Strategy (ODSS)
National Eye Institute (NEI)
National Heart, Lung, and Blood Institute (NHLBI)
National Human Genome Research Institute (NHGRI)
National Institute of Allergy and Infectious Diseases (NIAID)
National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS)
National Institute of Biomedical Imaging and Bioengineering (NIBIB)
Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD)
National Institute on Deafness and Other Communication Disorders (NIDCD)
National Institute of Dental and Craniofacial Research (NIDCR)
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)
National Institute on Drug Abuse (NIDA)
National Institute of General Medical Sciences (NIGMS)
National Institute of Mental Health (NIMH)
National Institute of Neurological Disorders and Stroke (NINDS)
National Institute of Nursing Research (NINR)
National Institute on Minority Health and Health Disparities (NIMHD)
National Library of Medicine (NLM)
Fogarty International Center (FIC)
National Center for Complementary and Integrative Health (NCCIH)
Office of Strategic Coordination (Common Fund)
National Cancer Institute (NCI)
All applications to this funding opportunity announcement should fall within the mission of the Institutes/Centers. The following NIH Offices may co-fund applications assigned to those Institutes/Centers.
Division of Program Coordination, Planning and Strategic Initiatives, Office of Research Infrastructure Programs (ORIP)
This Notice announces the availability of supplements to active grants which are intended to support collaborations that bring together expertise in biomedicine, data management, and artificial intelligence and machine learning (AI/ML) to make NIH-supported data useful and usable for AI/ML analytics. This initiative is aligned with the NIH Strategic Plan for Data Science, which describes actions aimed at modernizing the biomedical research data ecosystem and making data FAIR (Findable, Accessible, Interoperable, and Reusable) with high impact for open science. For the purposes of this Notice, AI/ML is inclusive of machine learning (ML), deep learning (DL), and neural networks (NN).
Artificial intelligence and machine learning (AI/ML) are a collection of data-driven technologies with the potential to significantly advance biomedical research. NIH makes a wealth of biomedical data available and reusable to research communities, however, not all of these data are able to be used efficiently and effectively by AI/ML applications. The goal of this Notice is to make the data generated through NIH-funded research AI/ML-ready and shared through repositories, knowledgebases or other data sharing resources.
For the purposes of this Notice, AI/ML is inclusive of machine learning (ML), deep learning (DL), and neural networks (NN). Making data AI/ML-ready is not simply formulaic. It requires engagement with and feedback from AI/ML applications. Furthermore, feedback from AI/ML applications can improve the understanding of the data to improve future re-use.
Some aspects of AI/ML-readiness are better understood than others. For example, data to be analyzed by AI/ML tools, such as PyTorch and TensorFlow, which are used to build and deploy AI/ML applications, must conform to specific data formats. The FAIR principles, through the use of data and metadata standards (ontologies, taxonomies, terminologies), facilitate combining data from different sources to support biomedical AI/ML applications.
Some other aspects of what is needed to make data AI/ML ready must be discovered through iterative and exploratory testing. These might include how to best represent information for a particular AI/ML use-case, how to correct for noise, and what level of specificity or uncertainty of labels is tolerable for a desired AI/ML application.
For many AI/ML applications, the training dataset must be sufficiently large to be considered AI/ML ready. Thus, readying these data for computation necessitates knowledge of big data management practices, for example how best to prepare data to be partitioned to enable computational feasibility.
Decentralized machine learning (sometimes referred to as Federated or distributed learning) is a paradigm of machine learning where a model is trained iteratively on data in multiple locations.� This paradigm can facilitate the use of data that, for privacy or other reasons, cannot be aggregated or moved.� Preparing data for decentralized ML requires harmonization and testing as well as capabilities for standardized access and, possibly, enhanced data and model governance to protect privacy.� � �
Furthermore, there are increasing expectations that AI/ML ready data be accompanied by documentation to include information about data provenance and bias to help researchers make more informed and ethical decisions about the selection of data and application of AI/ML-models. For example, imbalanced datasets can result in AI/ML algorithms that lead to biased clinical decisions and, potentially, a misalignment with NIH goals to improve minority health and reduce health disparities for marginalized populations. AI/ML-readiness should be guided by a concern for human and clinical impact and therefore requires attention to ethical, legal, and social implications of AI/ML including but not limited to (1) biases in datasets, algorithms, and applications; (2) issues related to identifiability and privacy; (3) impacts on disadvantaged or marginalized groups; (4) health disparities; and (5) unintended, adverse social, individual, and community consequences of research and development.
It is the NIH vision to establish a modernized and integrated biomedical data ecosystem that adopts the latest data science technologies, including AI and ML, and best practice guidelines arising from community consensus, such as the FAIR principles, and open-source development. This effort is described in the NIH Data Science Strategic Plan and led by the NIH Office of Data Science Strategy (ODSS).
This opportunity is intended to support collaborations that bring together expertise in biomedicine, data management, and AI/ML to improve the AI/ML-readiness of data generated from NIH-funded research and shared through repositories, knowledgebases or other data sharing resources.
Applications submitted in response to this NOSI are strongly encouraged to include the following information:
NIH is particularly interested in proposals that will advance the ethical development of AI-ready data, and transparent practices that enhance the ethical re-use of data for AI/ML applications.
These supplements may be used to support a variety of activities including, but not limited to, the following:
These efforts are expected to be informed by best practices in data management and engagement with the AI/ML community.
Significant skills in data management and AI/ML are expected to be needed to identify and address gaps in AI-readiness. Thus, supplements are primarily intended to provide support for data management and AI/ML collaborators, engagement events such as hackathons, and computing and storage costs required to improve the AI-readiness of data.
The scope of each proposed project is defined by and limited to the aims of the funded project for which the supplement is being sought.
Applicants partnering with industry to test novel methods or infrastructures may be considered. The integration of causal models and causal inference in AI/ML is within scope.
A broad range of projects involving the management of data repositories, or other shared data resources are eligible regardless of the scientific area of emphasis. Both open and controlled access data, including clinical data, are within scope.
Awardees should be willing to participate in virtual meetings organized by NIH. Applications that are not appropriate and out of scope for this NOSI include:
ICO Specific Considerations
Office of Strategic Coordination
The NIH Office of Strategic Coordination (Common Fund - https://commonfund.nih.gov/) supports multiple transformative research programs that generate new technologies, methods, and data.� Many of these programs produced rich public data sets containing multi-dimensional molecular and phenotypic data from humans and model organisms.� Established Common Fund data sets listed below are well-poised for increased community use:
OSC is interested in proposals that substantially leverage at least one of the above datasets.� Substantial leverage is defined as use and citation of the dataset(s) in the envisioned research products of the proposed work (manuscripts, presentations, book chapters, portals, etc.).
Application and Submission Information
To be eligible, the parent award must be able to receive funds in FY2023 (Oct. 1, 2022 - Sept. 30, 2023).� Applicants are strongly encouraged to contact the program officer of the parent award to confirm eligibility.
Funds must be used to meet increased costs that are within the scope of the approved award, but were unforeseen when the new or renewal application or grant progress report for non-competing continuation support was submitted.
Funds can be used to cover cost increases that are associated with achieving certain new research objectives, as long as the research objectives are within the original scope of the peer reviewed and approved project, or the cost increases are for unanticipated expenses within the original scope of the project.One-time supplement budget requests cannot exceed $200,000 direct costs. The number of awards will be contingent on availability of funds and receipt of meritorious applications.�
Eligible Activity Codes:
Additional funds may be awarded as supplements to parent awards using any Activity Code that is listed in PA-20-272 with the following exceptions:�Small business activity codes (such as R41, R42, R43, R44, U43, U44, and Fast Track) are excluded, as well as S10, P60, R13, and U13 awards.
Note that not all participating NIH Institutes and Centers (ICs) support all the activity codes that may otherwise be allowed. Applicants are therefore strongly encouraged to consult the program officer of the parent grant to confirm eligibility.
For awards that are already primarily funded to deliver reusable data to the community, applicants should provide strong justification for why additional funds are needed to support AI/ML-readiness given that these activities could have been supported through the parent award. �This award cannot be used to supplement the NIH Data Management Sharing (DMS) costs in parent award.�
Applications for this initiative must be submitted using PA-20-272 - Administrative Supplements to Existing NIH Grants and Cooperative Agreements (Parent Admin Supp Clinical Trial Optional) or its subsequent reissued equivalent.
All instructions in the SF424 (R&R) Application Guide and PA-20-272 must be followed, with the following additions:
Administrative Evaluation Process
Submitted applications must follow the guidelines of the IC that funds the parent grant. Administrative Supplements do not receive peer review. Each IC will conduct administrative reviews of applications submitted to their IC separately. The most meritorious applications will be evaluated by a trans-NIH panel of NIH staff and supported based upon availability of funds. The criteria described below will be considered in the administrative evaluation process:
It is strongly recommended that the applicants contact their respective program officers at the Institute supporting the parent award in advance to:
Confirm that the supplement falls within scope of the parent award;
Request the requirements of the IC for submitting applications for administrative supplements
Investigators planning to submit an application in response to this NOSI are also strongly encouraged to contact and discuss their proposed research/aims with the scientific contact listed on this NOSI in advance of the application receipt date.
Following submission, applicants are strongly encouraged to notify the program contact at the IC supporting the parent award that a request has been submitted in response to this NOSI in order to facilitate efficient processing of the request.
NIH is particularly interested in applications that enhance the participation of individuals from groups that are underrepresented in the biomedical, clinical, behavioral, and social sciences. (See�NIH�s Interest in Diversity NOT-OD-20-031.)
For further information, please consult our Frequently Asked Questions page.
Please direct all inquiries to:
Fenglou Mao, PhD
Office of Data Science Strategy (ODSS)
Division of Program Coordination, Planning, and Strategic Initiatives
Office of the Director