NETWORK FOR LARGE-SCALE SEQUENCING OF THE MOUSE GENOME

Release Date:  December 23, 1998

RFA:  HG-99-001

P.T.

National Human Genome Research Institute

Letter of Intent Receipt Date: March 1, 1999
Application Receipt Date: April 29, 1999

PURPOSE

The purpose of this RFA is to establish a Mouse Genome Sequencing Network that
will support the mapping and sequencing of the mouse genome.  The goals of the
Network are to generate the necessary mapping resources and begin production of
a working draft of the DNA sequence of the mouse genome.  Applications for both
pilot sequencing projects in new groups and the expansion of the capacity of
existing sequencing centers are encouraged. The long-term goal of the Mouse
Genome Sequencing Network is to complete a high quality DNA sequence of the mouse
genome in cooperation with the international mouse research community.

HEALTHY PEOPLE 2000

The National Institutes of Health (NIH) is committed to achieving the health
promotion and disease prevention objectives of "Healthy People 2000," a NIH-led
national activity for setting priority areas. This RFA, Network for Large Scale
Sequencing of the Mouse Genome, is related to several priority areas including
cancer, heart disease and stroke, diabetes and chronic disability conditions, and
maternal and infant health. Potential applicants may obtain a copy of "Healthy
People 2000" at  http://odphp.osophs.dhhs.gov/pubs/hp2000/.

REQUIREMENTS

Applications may be submitted by domestic non-profit and for-profit
organizations, private and public, such as universities, colleges, private
companies, hospitals, laboratories, units of state or local governments, and
eligible agencies of the Federal government. Applications from minority
individuals and women are encouraged. Applications from foreign institutions will
not be accepted; however subcontracts to foreign institutions will be considered.
For this RFA, NIH intramural investigators are eligible to become components of
the Mouse Genome Sequencing Network.  Applications from intramural groups will
be peer reviewed in competition with extramural investigators and, if successful,
will receive funding from an appropriate source.  Existing sequencing groups that
are requesting funding for expansion of sequencing capacity for mouse production
sequencing must have deposited 2 Mb of finished DNA sequence in a public database
by the application date.

MECHANISM OF SUPPORT

The administrative and funding mechanism to be used to support this program will
be the Cooperative Agreement (U01), an "assistance" mechanism, which is
distinguished from a regular research grant in that substantial scientific and/or
programmatic involvement by NHGRI staff with the awardee is anticipated. The
cooperative agreement is used when participation by NIH staff is warranted to
support and/or stimulate the recipient's activity by involvement in and otherwise
working jointly with the award recipient in a partner role; NIH staff will not
assume direction, prime responsibility, or a dominant role in the activity.
Details of the responsibilities, relationships, and governance of the studies
funded under cooperative agreement(s) are discussed later in this document under
the section "Terms and Conditions of Award".  Each component, with the exception
of NIH intramural projects, of the Mouse Genome Sequencing Network will be
awarded as a separate U01.

The length of project period that may be requested is discussed for each type of
project described under Research Objectives and Scope (see below).  The sizes of
the awards will vary depending upon the research objectives of each. The
anticipated award date is September 30, 1999.

It is NHGRI's intention to continue the Mouse Genome Sequencing Network through
the completion of the mouse genomic DNA sequence.  This RFA solicits applications
to initiate the Mouse Genome Sequencing Network.  During the third year of
support of the applications funded under this RFA, it is anticipated that a
renewal solicitation will be issued to complete the sequence.

FUNDS AVAILABLE

This is a trans-NIH initiative managed by the NHGRI on behalf of the NIH.  The
estimated funds available for the first year of support for awards under this RFA
will be $21 million (total costs) to support all of the objectives listed below.
It is expected that from one to four efforts will be funded for characterization
of mouse BAC libraries, generation of a mapping resource and construction of a
map of sequenced clones.  For the sequence production component of the Network,
it is anticipated that three to five new sequence production projects and up to
four existing centers will be funded.  One to three intramural projects may be
funded.  Applicants may request funding for any one or combination of the
objectives listed below.

This level of support is dependent on the receipt of a sufficient number of
applications of high scientific merit. Beyond the first year, the funding level
for the overall Mouse Genome Sequencing Network will be dependent upon the
availability of funds.  The funding level of each of the projects and sequencing
centers will be subject to an annual evaluation by the Scientific Advisory Panel
of the Mouse Genome Sequencing Network (see below for details). The usual NIH
policies governing grants administration and management will apply.

RESEARCH OBJECTIVES AND SCOPE

Background

The NHGRI is currently engaged, along with several other federal, private, and
international organizations, in a research program called the Human Genome
Project (HGP).  The goals of the HGP, as set out in a series of five-year plans
(see http://WWW.NHGRI.NIH.GOV/HGP/HGP_goals/ ), are to characterize the genomes
of human and selected model organisms through complete mapping and sequencing,
to develop technologies for genomic analysis, to examine the ethical, legal, and
social implications of human genetics research, and to train scientists who will
be able to utilize the tools and resources developed through the HGP to pursue
biological studies that will improve human health.

The HGP started in 1990.  Significant progress toward completing these goals has
been made in the past eight years and several goals have already been achieved.
The genetic mapping goals for both the human and the mouse have been met.  The
human and mouse physical mapping goals are nearly complete. The goal of
determining 80Mb of genomic DNA sequence by 1998 has been exceeded (for example,
over 200 Mb of human DNA sequence have been determined, as have 5 Mb of mouse
genomic DNA in regions syntenic to those that have been sequenced in the human). 
The DNA sequence of the E. coli and S. cerevisiae genomes has been completely
determined (as have those of several other microorganisms), and the sequence of
the C. elegans genome has just been published.

In the fall of 1998, the NHGRI and DOE published a new five-year plan (Science,
Vol. 282, p 682, October 23, 1998), http://www.nhgri.nih.gov/98plan/. The plan
calls for an accelerated timetable for the sequencing of the human genome with
completion of a "working draft" by 2001 and the final sequence by 2003. A
timetable for sequencing the mouse genome is also included.  The purpose of this
RFA is to fund research to establish the infrastructure and resources needed to
meet or exceed the goals for the mouse genome laid out in the new five-year plan.

The detailed characterization of the mouse genome is of widespread interest
across the NIH.  In order to help define and establish priorities for the genomic
analysis of the mouse, the NIH Director convened a meeting of a distinguished
group of scientists in March 1998; a follow-up meeting was held in October 1998.
Summaries of the two meetings can be found at
http://www.nih.gov/welcome/director/reports/mgenome.htm and  
http://www.nih.gov/welcome/director/reports/mgenom3.htm.  (Links to all the
reports concerning the mouse genome initiative can be found on the page
http://www.nih.gov/welcome/director/keymeet.htm, under the heading Special
Reports).  It was assumed at the first meeting that sequencing the mouse genome
would be based on prior construction of sequence-ready physical maps.  However,
at the second meeting, a "sequence first, map second" strategy was discussed and
recommended. The second meeting also recommended that a working draft quality
sequence based on mapped BACs be developed useful intermediate to the finished
sequence and should be taken on as an objective on an accelerated timetable.

At the October meeting, a strategy for efficiently generating a working draft
sequence was recommended.  This starts with the sequencing of clones chosen
randomly from a library of fingerprinted and end-sequenced BACs. These clones
would not have to be mapped at the time of selection for sequencing, but
subsequently would be mapped.  Close integration of all groups involved in such
a whole genome, BAC-based sequencing effort would be required.  The necessary
coordination could be achieved by a central server that would collect mapping and
sequencing information from all groups and allow database searches against
completed projects, on-going projects, and BAC end sequences to avoid extensive
overlaps as each sequencer chooses the next BAC for sequencing.  The National
Center for Biotechnology Information (NCBI) is committed to designing the central
server required for this effort. This sequencing approach would initially lead
to sequence coverage in small contigs and single BACs.  At an appropriate time,
a more directed strategy for clone selection would be implemented to enlarge the
contigs and close the gaps between them.  The attendees at the October meeting
also agreed that the strain that should be used for the initial sequencing is
C57BL6/J.  A BAC library for this strain has been constructed and is available
through commercial distributors.

This "sequence first, map later" approach recommended at the October 5, 1998
meeting was thought to have several benefits.  It would allow mouse sequencing
to scale up rapidly, and would eliminate the need for every group participating
in the effort to have a large-scale mapping capability.  Groups with only limited
experience with sequencing or in building maps, but who are interested in
becoming "start up" centers for mouse genomic sequencing, would be able to
concentrate their learning curves on sequencing alone, thereby increasing their
chances for success.  Although these advantages are sufficient to warrant focus
on this strategy at present, an important lesson of the genome project to date
is that the methods and strategies can emerge quickly.  Therefore, which ever
strategy an applicant proposes, (map first, sequence later or sequence first, map
later or yet another strategy) must remain flexible and willing to adapt to
changes in the field as strategies evolve rapidly.  Additionally, applicants must
be willing to coordinate efforts with the international community that is
participating in sequencing the mouse genome.

NHGRI has successfully initiated new large-scale sequencing efforts through a
pilot project program for human genome sequencing. This program was established
to test different strategies and technologies for scaling up the production of
high quality finished genomic DNA sequence efficiently and cost effectively. 
During the three-year program, significant progress has been made in many aspects
of sequencing, including technology implementation, management, cost reduction,
and capacity increase.  Many of the lessons learned over the past three years
will be used to scale up production to the levels necessary to complete the
sequencing of the human genome by 2003.  To date, the pilot project program has
produced more than 124 Mb of finished human DNA sequence.  The pilot mouse genome
sequencing program being solicited through this RFA will be patterned after the
human sequencing program in many ways.  It is expected that the experience that
has been gained to date in the sequencing of the human genome can be applied to
enable new groups to participate in the sequencing of the mouse genome and allow
the sequencing of the mouse genome to scale up more rapidly.

Research Scope

The initial goals of the Mouse Genome Sequencing Network will be to establish
mouse genomic mapping resources, to establish pilot sequencing projects and to
expand the capacity of existing large-scale sequencing groups to generate a
working draft sequence of 90% of the mouse genome sequence by 2003.  This RFA
calls for projects (from both the extramural and the NIH intramural research
community) with the following objectives:

A.  BAC library characterization: For the strategy of sequencing chosen BACs to
be successful, a considerable amount of information about the BAC clones is
necessary in order to guide and coordinate the effort.  Applications are sought
to fingerprint and end sequence the currently available C57BL/6J BAC library. 
If other BAC libraries are found to be necessary for successful completion of the
mouse genomic sequence, they will have to be characterized similarly. Proposals
are not limited to fingerprinting and BAC end sequencing and applicants are
encouraged to propose other, better ways of characterizing BAC libraries. 
However, the application must propose an efficient strategy for characterizing
an entire library in a way that will contribute to the success of a genome-wide,
working draft sequence strategy.  A plan to make the data rapidly available to
the research community must be included in the proposal.  Library
characterization should be completed in one to two years.

B.  Generation of a mapping resource, such as a set of radiation hybrids for
constructing a radiation hybrid map, and of a map containing anchor markers and
all of the sequenced clones.  Although the sequencing strategy being discussed
for the mouse genome does not require a map to guide the initial clone selection,
efficient completion of the sequence will depend on the availability of a map on
which sequenced clones have been localized.  Therefore, a suitable mapping
resource is needed to generate a map that can support the sequencing effort
beyond the initial random phase.  This map should contain enough existing mapped
markers to ensure that it can be easily cross-referenced to existing maps of the
mouse genome.  Proposals should also include enough effort to efficiently map all
clones sequenced during the term of the grant.  This is estimated to be 10,000
clones in the first three years.  As the mapping effort must be tightly linked
to the genomic sequencing projects, applicants must describe plans for
coordination and cooperation with the national and international sequencing
efforts, as well as with the centralized database.  The mapping resource should
be completed as soon as possible, ideally within one year.  Mapping will continue
until the mouse sequence is completed.

C.  Establishment of sequencing efforts by groups that wish to participate in the
sequencing of the mouse genome, whether or not they are presently engaged in
large-scale sequencing activities.  Applicants for production sequencing, in both
new and existing centers, must address the items listed in "Application Guidance
for Production Sequencing" under Application Procedures.  Applications to
complete this portion of the RFA may request three years of funding.

Two types of sequencing projects will be included in the Mouse Genome Sequencing
Network:

1.  New pilot projects for sequencing the mouse genome.  The initiation of the
mouse genomic sequencing effort provides an opportunity for NHGRI to establish
new groups interested in participating in large-scale sequencing. Applicants who
wish to participate in this way must propose concrete plans that address all
issues related to the rapid scale up of sequencing activities in a new center,
including the following:

o   Sequencing Experience: Although it is not expected that those applying for
new sequence production centers will have much experience with large-scale
production sequencing, any such experience should be described.  A list of
questions on this topic, to be answered as part of the progress report, is
attached.
o  Strategic Plan: All applications for pilot sequencing projects must present
a strategic plan, including a time line, that describes how and when the proposed
effort will scale up to the point at which it can make a substantial contribution
to the mouse genome sequencing effort.
o  Cost:  The unit cost of working draft sequence (measured as the cost per lane
of sequencing) in a pilot project may initially be quite high as compared to
established sequencing efforts (estimated to be between $5 and $7.50 per lane). 
However, from NHGRI's experience with the human genome pilot sequencing program,
it can be expected that these costs should rapidly decline as the pilot project
scales.  Thus, by the end of the second year, it is expected that the per lane
sequencing cost for a new pilot project would have decreased significantly and
be comparable to other sequencing centers by the end of the third year.  This
rapid decline in cost will be necessary if a group is to be competitive for
further scale up at the end of the initial three years.  NHGRI is seeking
ambitious but realistic plans to lower costs and scale up to initiate the project
as quickly as possible. A list of questions on this topic, to be answered as part
of the progress report, is attached.
o  Sequence Quality: Sequence quality is an important issue in the Human Genome
Project.  It is not expected that applicants for new pilot projects will have
much, if any, sequence data available to be checked for quality.  However, it is
anticipated that by the end of the first year of funding sufficient data
(sequence of at least two large-insert clones) will have been generated to allow
quality assessment. See section below, entitled "Sequence Quality" under NHGRI
Policies Concerning Pilot Project and Large-scale Sequencing for more information
about NHGRI plans to assess sequence quality.
o  New Technological Developments: Successful applications will need a clear-cut
scientific and budgetary strategy for maintaining a state-of-the-art sequencing
capability. Applicants must address the ways in which the proposed project will
allow incorporation/integration of any new technological developments in DNA
sequencing that arise during the term of the grant.

2.  Additional capacity for mouse genomics in experienced groups.  In order to
rapidly initiate the sequencing of the mouse genome, this RFA invites experienced
sequencing groups to apply for funds to expand their capacity to initiate or
expand mouse genomic sequencing. Experienced groups are defined as those that
have produced at least 2 Mb of finished DNA sequence which has been deposited in
a public data base.

In order to evaluate requests from existing centers, reviewers will need reliable
information about the current sequencing effort. A list of questions on this
topic, to be answered as part of the progress report, is attached.  If, at the
time of submission of the application, the quality of a center's sequence data
has not been checked in the past six months, then it will be analyzed in a
quality control exercise similar to that previously conducted for human sequence. 
The analysis will take place between the date of submission of the grant
application and the review. Information about previous quality control exercises
can be found at 
http://www.nhgri.nih.gov/Grant_info/Funding/Statements/RFA/quality_standard.html
NHGRI estimates that with the funds available for this RFA, new and existing
sequencing centers should be able to generate at least a 5-fold coverage of a
significant fraction of the mouse genome during the first three years of the
Sequencing Network and the entire mouse genome by 2003. However, applicants may
propose generating sequence of a different depth of coverage depending upon the
center's ability to scale its capacity and decrease its costs.  A principal
investigator may submit an application for one or more of the objectives A, B,
and C1 or C2 listed above.

NHGRI POLICIES CONCERNING PILOT PROJECT AND LARGE-SCALE SEQUENCING

Over the past two years, as the human sequencing program began to produce
significant quantities of human genomic DNA sequence, a number of issues arose
that required the development of new policies by NHGRI. These policies will be
extended to both extramural and NIH intramural investigators funded as part of
the Mouse Genome Sequencing Network. Thus, where appropriate, applicants must
present plans to adhere to the policies.

Intellectual Property.  In NHGRI's opinion, in the absence of specific biological
information, genomic DNA sequence information should not be patented but released
into the public domain where it will be freely available for use by the entire
research community.  NHGRI will monitor its grantees' activities with respect to
patenting genomic sequence. (see web site:
http://www.nhgri.nih.gov/Grant_info/Funding/Statements/RFA/intellectual_property.html).

Data Release.  Applicants should fully describe their plans for the release of
mapping data and of finished and unfinished sequence data.

Finished mapping and sequence data: The U.S. HGP has adopted a policy of
encouraging rapid release of mapping and sequence data into public databases.
This policy extends to all genomic sequence funded with NHGRI funds. (see
http://www.nhgri.nih.gov/Grant_info/Funding/Statements/RFA/data_release.html).

Unfinished sequence data: Participants in the international human DNA sequencing
effort have recommended that early stage sequence data, from any organism should
be rapidly released (Genome Research, Vol. 8, Issue 5, 413-413, May 1998). 
Accordingly, NHGRI has determined that its grantees should release DNA sequence
assemblies of 2,000 base pair units or larger from all organisms being sequenced
with its support within 24 hours of assembly (see
http://www.nhgri.nih.gov/Grant_info/Funding/Statements/).

Sequence Quality.  NHGRI has adopted a quality standard for finished DNA sequence
of 99.99% accuracy, with no gaps either within or between clones.  It is more
difficult to define a comparable quality standard for working draft sequence. 
The shotgun sequence reads that make up the working draft should be of high
enough quality that they will be used in finishing the sequence of the region and
not be discarded, i.e., be "interconvertible" with those that make up finished
sequence. In that way, the working draft sequence called for in this RFA will
contribute directly to high quality finished mouse sequence that meets the
quality standard.  Applicants must address how they will assess the quality of
their sequence reads to ensure that they meet the standard of
intereconvertibility.  If a better quality standard is developed for working
draft sequence, data from groups funded as part of the Mouse Genome Sequencing
Network will have to meet the standard.  At regular intervals, a quality control
center, funded by NHGRI through a separate mechanism, will check the quality of
sequence generated by the Mouse Genome Sequencing Network.  The quality control
center will participate as part of the Steering Committee of the Sequencing
Network.

SPECIAL REQUIREMENTS FOR COOPERATIVE AGREEMENTS

I. Definitions

ARBITRATION PANEL: A panel that is formed to review scientific or programmatic
disagreement (within the scope of the award) that may arise between award
recipients and NHGRI. It will be composed of three members: a designee of the
Steering Committee chosen without the NHGRI staff voting, one NHGRI designee, and
a third designee with expertise in the relevant area who is chosen by the other
two (in the case of an individual disagreement, the first member may be chosen
by the individual awardee). The Arbitration Panel will help resolve both
scientific and programmatic issues that develop during the course of work that
restrict progress.

AWARDEE: The institution to which the cooperative agreement is awarded.

COOPERATIVE AGREEMENT: An assistance mechanism in which there is anticipated
substantial NHGRI programmatic involvement with the recipient organization during
the performance of the planned activity.

MOUSE GENOME SEQUENCING NETWORK: A group of scientists, each funded by a separate
cooperative agreement, working together to complete the DNA sequence of the mouse
genome.  The Mouse Genome Sequencing Network will be made up of extramural, and
possibly NIH intramural, researchers.  In order to participate in the Mouse
Genome Sequencing Network, a P.I., whether funded through a cooperative agreement
or another mechanism, must agree to abide by the decisions of the Steering
Committee and the NHGRI policies that apply to genomic resources and large-scale
sequencing.

NHGRI PROGRAM DIRECTOR(S): A scientist(s) of the NHGRI extramural staff who
provides normal stewardship for the award and who, in addition, has substantial
scientific/programming involvement during conduct of this activity, as defined
in the terms and conditions of award. This involvement includes coordinating
NHGRI's participation in the Mouse Genome Sequencing Network, functioning as a
peer with the Principal Investigators, facilitating the partnership relationship
between NHGRI and the Mouse Genome  Sequencing Network, helping to maintain the
overall scientific balance in the program commensurate with new research and
emerging research opportunities, and ensuring that the Mouse Genome  Sequencing
Network program is consistent with the NHGRI missions and goals and with the
activities of other mouse sequencing groups.

PRINCIPAL INVESTIGATOR (P.I.): The person who assembles the project, is
responsible for submitting the application in response to this RFA, and is
responsible for the performance of the project. The Principal Investigator will
coordinate project activities scientifically and administratively.  The Principal
Investigator may be a researcher from the extramural or NIH intramural research
community.

STEERING COMMITTEE (SC): A committee that is the main governing board of the
Mouse Genome Sequencing Network. Membership includes the NHGRI Program
Director(s), the P.I. of each awarded cooperative agreement and intramural
project (including the projects to characterize the mouse libraries and the new
and existing sequence production centers) and the quality control center, and
three research scientists with relevant expertise, but who are not affiliated
with any of the projects participating in the Mouse Genome Sequencing Network.
Each group and advisor will have one vote.

SCIENTIFIC ADVISORY PANEL (AP): A panel that evaluates the progress of the Mouse
Genome Sequencing Network and provides recommendations to the Director, NHGRI
about continued support of the components of the Mouse Genome  Sequencing
Network. The Advisory Panel will be composed of four to six senior scientists
with relevant expertise and who are not P.I.s of a cooperative agreement involved
in the Mouse Genome Sequencing Network. It will be appointed by the NHGRI
Director.  The AP will meet at least annually and a report will be prepared to
the NIH Institute and Center Directors on the progress of the sequencing network.

II. Terms and Conditions of Award

The following terms and conditions will be incorporated into the award statement
and will be provided to the Principal Investigator, as well as the appropriate
institutional official, at the time of award. Intramural participants in the
Mouse Genome Sequencing Network will be held to similar conditions for their
funding.  The following special terms of award are in addition to, and not in
lieu of, otherwise applicable OMB administrative guidelines, HHS grant
administration regulations at 45 CFR Parts 74 and 92 [Part 92 is applicable when
State and local Governments are eligible to apply], and other HHS, NIH, and NIH
grant administration policies:

1.  The administrative and funding instrument used for this program will be the
Cooperative Agreement (U01) grants and intramural project(s).  The cooperative
agreement is an "assistance" mechanism (rather than an "acquisition" mechanism),
in which substantial NIH scientific and/or programmatic involvement with the
awardee is anticipated during the performance of the activity. Under the
Cooperative Agreement, the NIH purpose is to support and/or stimulate the
recipient's activity by involvement in and otherwise working jointly with the
award recipient in a partner role, but it is not to assume direction, prime
responsibility, or a dominant role in the activity. Consistent with this concept,
the dominant role and prime responsibility for the activity resides with the
awardee(s) for the project as a whole, although specific tasks and activities in
carrying out the study will be shared among the awardee(s) and the NHGRI Program
Director(s).

2.  P.I. Rights and Responsibilities

The P.I. (extramural or NIH intramural) will have the primary responsibility for
defining the details for the project within the guidelines of the RFA and for
performing the scientific activity. The P.I. will agree to accept close
coordination, cooperation, and participation of NHGRI staff in those aspects of
scientific and technical management of the project as described under "NHGRI
Program Staff Responsibilities".

The P.I. of a sequence production center will:

o  Determine experimental approaches, design protocols, set project milestones
and conduct experiments
o  Ensure that the genomic sequence produced meets a quality standard and cost
agreed upon at the time of award
o  Ensure that data are released according to NHGRI policies and that results are
published and submitted to a public database
o  Submit data for quality assessment in any manner specified by the Steering
Committee and the Advisory Committee.
o  Submit periodic progress reports in a standard format, as agreed upon by the
Steering Committee and the Advisory Committee
o  Adhere to the NHGRI policies regarding intellectual property, data release and
other policies that might be established during the course of this activity
o  Accept and implement the common guidelines and procedures approved by the
Steering Committee
o  Accept and participate in the cooperative nature of the group
o  Attend Steering Committee meetings
o  Coordinate and collaborate with other U.S. and international groups sequencing
the mouse genome

The P.I. (extramural or NIH intramural) of a project to characterize mapping
resources will:

o  Determine experimental approaches, design protocols, set project milestones
and conduct experiments
o  Ensure the production of resources that will be used in mapping and sequencing
the mouse genome
o  Ensure that data are released according to NHGRI policies and that results are
published and submitted to a public database
o  Submit periodic progress reports in a standard format, as agreed upon by the
Steering Committee and the Advisory Committee
o  Adhere to the NHGRI policies regarding intellectual property, data release and
other policies as might be established during the course of this activity
o  Accept and participate in the cooperative nature of the group
o  Accept and implement the common guidelines and procedures approved by the
Steering Committee
o  Attend Steering Committee meetings
o  Coordinate and collaborate with other U.S. and international groups generating
resources for sequencing the mouse genome

3. NHGRI Program Staff Responsibilities:

The NHGRI Program Director(s) will have substantial scientific/programmatic
involvement during the conduct of this activity through technical assistance,
advice and coordination such as participating in the design of Mouse Genome
Sequencing Network activities, advising in the selection of sources or resources,
coordinating or participating in collection and/or analysis of data, advising in
management and technical performance, or participating in the preparation of
publications. However, the role of NHGRI will be to facilitate and not to direct
the activities. It is anticipated that decisions in all activities will be
reached by consensus of the Mouse Genome Sequencing Network and that NHGRI staff
will be given the opportunity to offer input to this process. One to three NHGRI
Program Director(s) shall participate as a member of the Steering Committee. 
NHGRI staff will have total of one vote.

The Program Director(s) will:

o  Participate (with the other Steering Committee members) in the group process
of setting research priorities, deciding optimal research approaches and protocol
designs, and contributing to the adjustment of research protocols or approaches
as warranted. The Program Director(s) will assist and facilitate the group
process and not direct it.
o  Serve as liaison, helping to coordinate activities among the awardees; act as
a liaison to the NHGRI and the other Institutes and Centers of the NIH, and as
an information resource about extramural genome research activities. The Program
Director(s) will coordinate the efforts of the Sequencing Network with other U.S.
efforts and the international mouse sequencing community.
o  Attend the Steering Committee meetings as a voting member and assist in
developing operating guidelines, quality control procedures, and consistent
policies for dealing with recurrent situations that require coordinated action.
The Program Director(s) must be informed of all major interactions of members of
the Steering Committee. The NHGRI Program Director(s) will be responsible for
scheduling the time and preparing concise (3 to 4 pages) minutes or a summary of
the Steering Committee meetings, which will be delivered to members of the group
within 30 days after each meeting.  The Program Director(s) will report progress
to the other NIH Institutes and Centers two to three times per year.
o  Lend his/her relevant expertise and overall knowledge of the NHGRI- and NIH-
sponsored research to facilitate the selection of scientists not affiliated with
the awardee institutions who are to serve on the Advisory Committee and the
Steering Committee.
o  Serve as liaison between the Steering Committee and the Advisory Committee,
attending Advisory Committee meetings in a non-voting liaison member role.
o  Serve on subcommittees of the Steering Committee and the Advisory Panel, as
appropriate.
o  Provide advice in the management and technical performance of the
investigation.
o  Provide members of the Steering Committee access to the Quality Control
Center, which will also be a member of the Steering Committee.
o  Serves as scientific liaison between the awardees and other program staff at
NHGRI.
o  Assist in promoting the availability of the mouse genome sequence and related
resources to the scientific community at large.
o  Retain the option to recommend the withholding or reduction of support from
any project (extramural or NIH intramural) within the Mouse Genome Sequencing
Network that substantially fails to achieve its sequencing goals at the quality
and cost agreed to at the time of award, fails to remain state of the art in its
production sequencing capabilities, or fails to comply with the Terms and
Conditions of the award.
o  Participate in data analyses, interpretations, and where warranted, co-
authorship of the publication of results of studies conducted through the Mouse
Genome Sequencing Network.

4. Collaborative Responsibilities

The Steering Committee will serve as the main governing board of the Mouse Genome
Sequencing Network and will overlap substantially with the Human Genome
Sequencing Network Steering Committee. The Steering Committee membership will
include the NHGRI Program Director(s), the P.I. from each awarded cooperative
agreement (including those of the projects to characterize and generate mapping
resources, sequence production centers and the quality control center),
intramural projects and three research scientists with relevant expertise, but
who are not affiliated with any of the cooperative agreements or intramural
project(s). The rest of the Steering Committee will appoint the latter three
members by majority vote. One of these three members will be nominated to serve
as the Chair of the Steering Committee and will be appointed by the Program
Director(s). Additional members may be added by action of the Steering Committee.
Some of the outside scientists serving on the Mouse Genome Sequencing Steering
Committee may also serve on the Human Genome Sequencing Steering Committee. 
Other government staff may attend the Steering Committee meetings, if their
expertise is required for specific discussions. The National Center for
Biotechnology will maintain the central server to coordinate this project and
will attend the Steering Committee meetings. It will not be a member of the
committee or have a vote.

The Steering Committee will be responsible for discussing progress within the
Mouse Genome Sequencing Network, and for advising NHGRI as to how the Mouse
Genome Sequencing Network can complete the working draft mouse DNA sequence
within the stated goals of time and accuracy, and within budget. The Steering
Committee will work with the quality control center to develop uniform procedures
for data quality assessment. Members of the Steering Committee will be required
to accept and implement the common guidelines and procedures approved by the
Steering Committee.

The meetings of the Mouse and Human Genome Sequencing Steering Committees will
partially overlap and NHGRI staff will work with the steering committees to
select some overlapping outside scientists for the respective committees. Within
one month after award of the cooperative agreements and intramural projects, the
NHGRI Program Director(s) and the P.I.s will meet (perhaps by telephone
conference) to select the three outside committee members and to nominate a chair
from among those three.  The Program Director(s) will appoint the Chair and
schedule the first meeting of the Steering Committee once the Chair has been
selected. The Chair of the Steering Committee will be responsible for
coordinating the Committee's activities, preparing meeting agendas, and chairing
meetings. A meeting schedule will be developed at the first meeting.  At least
two meetings will be held each year, either in Bethesda or at one of the sites.
One of the meetings will partially overlap with the annual meeting of the
Advisory Committee.  The purpose of joint meeting will be to allow direct
interaction between members of the Mouse and Human Genome Sequencing Networks and
the Advisory Committee, prior to the latter's annual evaluation of the Mouse
Genome Sequencing Network's progress. Subcommittees will be established by the
Steering Committee, as it deems appropriate.

5. Scientific Advisory Panel

The Scientific Advisory Panel will be responsible for reviewing and evaluating
the progress of the Mouse Genome Sequencing Network toward producing a working
draft sequence of the mouse DNA sequence by 2003. The Advisory Panel will be
composed of four to six senior scientists with relevant expertise and may
partially overlap with the Advisory Panel for the Human Genome Network. The
Director, NHGRI, will select the members and Chair. The membership of the
Advisory Panel may be enlarged permanently, or on an ad hoc basis, as needed.

The Advisory Panel will meet at least once a year. The first part of this meeting
will be a joint meeting with the Steering Committee to allow the Advisory Panel
members to interact directly with the members of the Mouse Genome Sequencing
Network. Annually, the Advisory Panel will make recommendations regarding
progress of the Mouse Genome Sequencing Network and present advice about changes,
if any, which may be necessary in the Mouse Genome Sequencing Network program to
the Director, NHGRI. At the end of the second year of funding, the Advisory Panel
will perform a rigorous evaluation of the progress of the Mouse Genome Sequencing
Network (both extramural and NIH intramural projects) to determine if funds need
to be redistributed among the projects to enable those that have been most
successful to scale up in the third year in anticipation of further scale up
after renewal the following year.

6. Arbitration Process

Any disagreement that may arise on scientific/programmatic matters (within the
scope of the award), between award recipients or NIH intramural projects and the
NHGRI may be brought to arbitration.  An Arbitration Panel, composed of three
members - one Mouse Genome  Sequencing Network Steering Committee designee, one
NHGRI designee (with the NHGRI extramural member not voting), or the individual
U01 awardee or NIH intramural project in the event of an individual disagreement
and a third designee with expertise in the relevant area and chosen by the other
two designees, will be convened.  This special arbitration procedure in no way
affects the awardee's right to appeal an adverse action that is otherwise
appealable in accordance with NIH regulations 42 CFR Part 50, Subpart D and HHS
regulation at 45 CFR Part 16.

7. Yearly Milestones

All Awardees (extramural and NIH intramural) participating in the Mouse Genome
Sequencing Network will be asked to define yearly milestones at the time of the
award and to update these milestones annually at the anniversary date. In accord
with the procedures described above, NHGRI may withhold or reduce funds for
projects that substantially fail to meet their milestones or to maintain the
state of the art.

LETTER OF INTENT

Prospective applicants from both the extramural and the NIH intramural research
communities are asked to submit, by March 1, 1999, a letter of intent that
includes a descriptive title of the overall proposed research, the name, address
and telephone number of the Principal Investigator, the number and title of this
RFA, and a list of the key investigators and their institution(s) and projects.

The letter of intent is to be sent to:

Dr. Jane L. Peterson
Program Director, Large Scale Sequencing
National Human Genome Research Institute
38 Library Drive, Room 614, MSC 6050
Bethesda, MD 20892-6050

PUBLIC BRIEFING ON THE RESEARCH NETWORK FOR LARGE-SCALE SEQUENCING OF THE MOUSE
GENOME

Prospective applicants are invited to attend a briefing on this Sequencing
Network program on February 1, 1999 in the Natcher Building, Conference Room A
from 1-3 pm.  NHGRI staff will explain the purpose of the program, provide
detailed instructions about the application process and answer questions. 
Applicant institutions are urged to send a representative to this briefing.  For
further information about the meeting or accommodations in the area, please
contact the program staff listed in this RFA.

APPLICATION PROCEDURES

The research grant application form PHS 398 (rev. 4/98) is to be used in applying
for these grants. Application kits are available at most institutional offices
of sponsored research; from the Division of Extramural Outreach and Information
Resources, National Institutes of Health, 6701 Rockledge Drive, MSC 7910,
Bethesda, MD 20892-7910, telephone 301/710-0267, Email: GrantsInfo@nih.gov; and
from the program administrator listed under INQUIRIES.  Application kits are also
available on the web at:   http://www.nih.gov/grants/forms.htm

NIH intramural applicants see section "Additional Instructions for NIH Intramural
Project Applicants" below.

The RFA label available in the NIH 398 application form must be affixed to the
bottom of the face page of the application. Failure to use this label could
result in delayed processing of the application such that it may not reach the
review Panel in time for review. In addition, the RFA title and number must be
typed on line 2 of the face page of the application form and the YES box must be
marked.

Submit a signed, typewritten original of the application, including the
Checklist, and three signed photocopies, in one package to:

CENTER FOR SCIENTIFIC REVIEW
NATIONAL INSTITUTES OF HEALTH
6701 ROCKLEDGE DRIVE, SUITE 1040 - MSC 7710
BETHESDA, MD 20892-7710
BETHESDA, MD 20817 (for express/courier service)

At the time of submission, two additional copies of the application must also be
sent to:

Dr. Ken Nakamura
Office of Scientific Review
National Human Genome Research Institute
38 Library Drive, Room 609, MSC 6050
Bethesda, MD  20982-6050
Telephone: (301) 402-0838

Applications must be received by April 29, 1999.  If an application is received
after that date, it will be returned to the applicant without review. The Center
for Scientific Review (CSR) will not accept any application in response to this
RFA that is essentially the same as one currently pending initial review, unless
the applicant withdraws the pending application.  The CSR will not accept any
application that is essentially the same as one already reviewed.  This does not
preclude the submission of substantial revisions of applications already
reviewed, but such applications must include an introduction addressing the
previous critique.

APPLICATION GUIDANCE FOR PRODUCTION SEQUENCING

Applicants must consider and address the following in preparing applications for
sequence production projects called for in this RFA:

Sequence Production Plan. The applicant must present a plan and propose
milestones for achieving the proposed level of scale up. This plan must
thoroughly discuss the applicant's choice of sequence coverage for the working
draft, why it is the best choice for the funds requested and how it will lead
eventually to finished mouse sequence. The sequencing plan must cover all phases
of working draft sequence production, starting with subcloning of a BAC clone
through assembly and deposition of the sequence data in the unfinished division
of GenBank. It will be important to discuss bottlenecks or other problems that
may be anticipated as the project increases in scale and how they will be
addressed

Sequence Cost. The calculated cost of sequencing (both prior and projected
sequencing costs) must take into account all of the expenses associated with
working draft sequence production, beginning with construction of a subclone
library, through assembly of the working draft sequence to the depth proposed.
The total cost of sequencing must also include any production-related technology
development (see below) that has been or will be supported by the project.
However, the applicant must also provide a breakdown of costs so that the
reviewers can evaluate the contribution of different cost elements, such as
production-related technology development, to the reported total cost.

Sequence Quality: Applicants who have generated at least 2 Mb of finished
sequence must agree to submit their data for quality assessment during both the
pre-award period (in order to allow the peer reviewers to evaluate this important
factor), if it has not been evaluated by a NHGRI Quality Assessment exercise in
the last six months, and during the course of the project period.  The assessment
will be conducted using the methods employed in the previous NHGRI quality
assessment exercises
(http://www.nhgri.nih.gov/Grant_info/Funding/Statements/RFA/quality_standard.html), 
in which all NHGRI grantees funded for large-scale sequencing cooperated
in assessment of each other's data. This evaluation will be conducted after the
application is submitted, but prior to the review meeting. The finished sequence
to be evaluated will be chosen by NHGRI staff from a list of finished clones
submitted by the P.I. in the progress report. During the course of the project
period, the assessment will be done by an NHGRI-funded quality control center.
Each sequence production center must also implement an internal quality control
program that evaluates sequence accuracy and contiguity of the assembled working
draft sequence. If a program is already in use in the applicant's project,
evidence of its usefulness must be presented in the Progress Report.

Production-related technology implementation. One of the goals of the NHGRI
sequencing program is the continual improvement of the efficiency of production
sequencing and decrease in its cost. This will facilitate completion of the mouse
genome sequence in the shortest possible time and at the lowest possible cost.
Applicants must present a plan to address this issue and discuss how their
proposed project will implement new technologies and strategies as they become
available. Any cost incurred in technology implementation must be included in the
total cost of the sequence.

Management Plan. The management of a sequencing center requires a significant
commitment by the P.I. of the project.  A P.I. for a new pilot project is
expected to devote at least 30% effort to the project. A P.I. of an existing
large-scale sequencing center, who is already be devoting more than 30% effort
to the existing center, must commit an additional 10% to the mouse sequencing. 
If the P.I. of an existing project is not already committed to 30% effort on that
project, the commitment on this project should be greater than 10%.  The
applicant must propose a management plan for the project that takes into account
the changes that will occur as the project scales up.

Additional Instructions for NIH Intramural Projects

NIH intramural project applicants must use the NIH 398 application form using the
following instructions.

o  On the Face Page, fill out only items 1., 2., 3.  (leave 3c. blank), 4., 5,
6, 7, and 8.  The remainder of the items should be left blank, and the
application must not be signed by either the PI or an NIH Institute official. 
The RFA label must be affixed to the bottom of the Face Page, as described above
in section 2.
o  Do not submit Checklist", "Personnel Report", or "Personal Data" pages.
o  PI must provide information about current support in his laboratory,
specifically the yearly funding level of the laboratory, a brief description of
the research with an indication of whether there is overlap with the current
application.
o  The P.I. must obtain the approval of his/her NIH Institute Scientific Director
for applying, for collaboration, for participating as a component of the Mouse
Genome Sequencing Network under the terms and conditions of the RFA, and for
complying with the policies of the Steering Committee.  A copy of that letter of
approval must be provided as part of a cover letter, addressed to the NHGRI
Office of Scientific Review.
o The composite budget page and the individual budget pages for each part a
complete budget and budget justification for funds covering the years requested
in the format requested in the Form 398.
o  Submit one unsigned, typewritten original of the application, and twenty
photocopies of the application and six photocopies of the appendices to:

Scientific Review Administrator
Office of Scientific Review
National Human Genome Research Institute
38 Library Drive, Room 609, MSC 6050
Bethesda, MD  20982-6050

Do not send the application or any copies to the Center for Scientific Review.
NIH intramural project applications must be received by April 29, 1999.  If
an application is received after that date, it will be returned to the applicant
without review.

REVIEW CONSIDERATIONS

A. General Considerations

Upon receipt, applications will be reviewed for completeness by CSR and for
responsiveness by the NHGRI. Incomplete applications will be returned to the
applicant without further consideration. If NHGRI staff find that the application
is not responsive to the RFA, it will be returned without further consideration.

Applications that are complete and responsive to the RFA will be evaluated for
scientific and technical merit by an appropriate peer review group convened by
the NHGRI in accordance with the review criteria stated below.  As part of the
initial merit review, a process will be used by the initial review group in which
applications receive a written critique and undergo a process in which only those
applications deemed to have the highest scientific merit, generally the top half
of the applications under review, will be discussed, assigned a priority score,
and receive a second level review by the National Advisory Council for Human
Genome Research.

All applications will be judged on the basis of the scientific and technical
merit of the proposed projects and the documented ability of the investigators
to meet the RESEARCH OBJECTIVES of the RFA.

B. Review Criteria

The application must be directed toward attaining the programmatic goals as
stated under RESEARCH OBJECTIVES. The following criteria will be used by peer
review groups to evaluate these applications:

For projects to characterize mapping resources:

o  Scientific merit of the proposed research plan to provide well characterized
mapping resources that will be useful in sequencing the mouse genome
o  Likelihood that the data and resources generated will contribute significantly
to making the mouse genomic sequencing more efficient and cost effective
o  Cost and efficiency of the plans proposed
o  Prior demonstrated success and quality of the proposed plan for developing and
characterizing mapping resources
o  Track record of the P.I. and other key personnel
o  Past compliance with NHGRI data release policies, and plans for data and
materials release.
o  Plans to coordinate efforts with other U.S. and international large-scale
sequencing groups and mapping efforts
o  Availability of the facilities, resources, expertise and technology necessary
to perform the research, and the level of institutional commitment.
o  Appropriateness of the proposed budget and time-line in relation to the
proposed research.

For sequence production centers:

o  Pilot Sequence Projects (new centers)
o  Likelihood that the sequencing strategy, data management and overall
management plans proposed will support a successful scale up of the center so
that it is likely to make a significant contribution to the completion of the
working draft and complete sequence of the mouse genome.
o  Quality of the plans to integrate any new strategies and technology
developments into the sequencing effort
o  Sequence quality:
a)  Merit of sequence quality assessment plans, including  monitoring and
minimizing sequencing errors, and other QA/QC plans
b)  Results from NHGRI sequence quality assessment exercises
o  Track record of the P.I. and other key personnel
o  Past compliance with NHGRI policies, where appropriate, and plans for data
release
o  Plans to coordinate efforts with other U.S. and international large-scale
sequencing groups and mapping efforts
o  Availability of the facilities, resources, expertise and technology necessary
to perform the research, and the level of institutional commitment
o  Appropriateness of the proposed budget and time-line in relation to the
proposed research.

Existing large-scale sequencing centers:

o  Likelihood that the project  will produce a significant fraction of the mouse
working draft sequence:
o  Prior experience and quality of the proposed plan for:
a) Producing high quality sequence data
b) Increasing sequence throughput
c) Decreasing sequencing costs, including efficiency improvements due to
implementation of new technologies or strategies.
o  Prior experience and quality of the proposed plan for identifying and solving
critical integration problems, including adequacy of the informatics activities.
o  Sequence quality:
a)  Merit of sequence quality assessment plans, including monitoring and
minimizing sequencing errors, and other QA/QC plans
b)  Results from NHGRI sequence quality assessment exercises
o  Track Record of the P.I. and other key personnel
o  Quality of the center's existing management, including workflow, plans for
further scale-up to accommodate mouse genomic sequencing, divisions of
labor/responsibility among components, coordination between components,
appropriate staffing, training, etc.
o  Past compliance with NHGRI data release policies, and plans for data release.
o  Plans to coordinate efforts with other U.S. and international large-scale
sequencing groups and mapping efforts
o  Availability of the facilities, resources, expertise and technology necessary
to perform the research, and the level of institutional commitment.
o  Appropriateness of the proposed budget and time-line in relation to the
proposed research.

AWARD CRITERIA

Awards will be made on the basis of scientific and technical merit as determined
by peer review, including the significance of the projected contribution toward
meeting the NHGRI program goal of contributing to the completion of the mouse
genome working draft DNA sequence by the year 2003, program needs and balance,
data release and intellectual property, and the availability of funds.

INQUIRIES

Written and telephone inquiries concerning this RFA are encouraged. The
opportunity to clarify issues or questions about the RFA from potential
applicants is welcome.

Direct inquiries regarding programmatic issues to:

Dr. Jane L. Peterson (for sequencing)
Dr. Bettie Graham  (for mapping resources)
Division of Extramural Research
National Human Genome Research Institute
38 Library Drive, Room 614, MSC 6050
Bethesda, MD  20892-6050
Telephone:  (301) 496-7531
FAX:  (301) 480-2770
Email:  Jane_Peterson@nih.gov
Email:  Bettie_Graham@nih.gov

Direct inquiries regarding fiscal matters to:

Ms. Jean Cahill
Grants Administration Branch
National Human Genome Research Institute
38 Library Drive, Room 613, MSC 6050
Bethesda, MD  20892-6050
Telephone:  (301) 402-0733
FAX:  (301) 402-1951
Email:  Jean_Cahill@nih.gov

Schedule
Letter of Intent Receipt Date:  March 1, 1999
Application Receipt Date:       April 29, 1999
Scientific Review Date:         June 1999
Advisory Council Date:          September 1999
Anticipated Award Date:         September 1999

AUTHORITY AND REGULATIONS

This program is described in the catalog of Federal Domestic Assistance No.
93.172.  Awards are made under the authority of the Public Health Service Act,
Title IV, Part A (Public Law 78-410, as amended by Public Law 99-158, 42 USC 241
and 285) and administered under NIH grants policies and Federal Regulations 42
CFR Part 52 and 45 CFR Parts 74 and 92. This program is not subject to the
intergovernmental review requirements of Executive Order 122372 or Health Systems
Agency review.

The NIH strongly encourages all grant and contract recipients to provide a smoke-
free workplace and promote the non-use of all tobacco products. In addition,
Public Law 103-227, the Pro-Children Act of 1994, prohibits smoking in certain
facilities (or in some cases, any portion of a facility) in which regular or
routine education, library, day care, health care or early childhood development
services are provided to children. This is consistent with the NIH mission to
protect and advance the physical and mental health of the American people.

PROGRESS REPORT: list of issues/questions to be addressed in the Progress Report
for new pilot sequencing projects (have completed less than 2 Mb of sequence that
has been placed in a public database)

Instructions:
The following series of questions is from the last several reviews NHGRI has
conducted for large-scale sequencing. They are designed to provide reviewers with
answers to questions that are central to the review of your grant. The questions
also provide a format to assist the reviewers in locating the answer to these
pertinent questions.  Therefore, it is preferable that the answers to the
questions appear immediately after the questions, if they do not, the location
of the answer must be provided in that space.  In order for your application to
be considered by the review committee, these questions MUST be answered.

For the purposes of this RFA, completed data is defined as any sequence data
deposited in a publicly available database.

Section A. Text

Please answer the following questions. The total length for this section should
not exceed 10-15 pages (5000-7500 words).
We encourage brief, concise answers. Please focus these answers on your
experience, rather than future plans, unless specifically asked for.
1.  How does your experience support the promise of your group to successfully
scale? To what level (in lanes/year) do you propose to scale your sequencing
effort in the next grant year?  Discussion must include, but is not limited to:
o  Prior experience in sequencing (including sequence production for types of
sequence other than genomic sequence) and increasing throughput. What is the
amount of sequence (of any type; e.g., cDNA, STSs, ESTs) produced and deposited
in a public database in the last three-year period by your project? Based on an
average of the last three months of sequencing, what is the total current
capacity of your group (include lanes per month, percentage of successful lanes,
number of base pairs per lane at of least phred 20--or equivalent--quality)?
Please include capacity from sequencing DNA of other organisms and capacity due
to funding from all sources. If your rate of sequencing increased over the last
year, please describe how.  Have the rates of different parts of the sequencing
process changed, and how have these changes contributed to your overall progress?

o  Prior experience in attaining milestones. What example(s) can you provide that
demonstrate that you have proposed milestones for a research project and then met
them on schedule?  Examples need not be in sequence production, but, if possible,
include resource production of some type.  What internal metrics have you used
to evaluate progress in the past and what internal metrics (for example,
reads/month, failed lanes, base pairs per lane in GenBank, etc.) do you believe
will be the most useful to you in managing your project's sequencing performance?

o  If appropriate, prior demonstrated experience in decreasing costs.  If you
have no experience in resource production and cost reduction, describe your plans
to control and/or decrease costs during the pilot project period.  On what
evidence do you base your cost estimates?  Total costs include all equipment and
indirect costs, as well as direct costs. How do you plan to monitor costs
internally?  Note below you will be asked to report costs in a specific format. 
It may be useful to incorporate that format into your planning.
o  If appropriate, prior experience in solving the critical integration problems
that arise as a program increases in scale. If you have no experience in resource
production, describe your plans to achieve process integration in your sequencing
project.  Particular attention should be given to how the informatics activities
facilitate data management, within the components as well as post-sequencing.
Examples would be useful.

2. Describe your plans for incorporating new strategies and technologies into
your sequencing project, and how this will affect productivity. Are there unique
technologies and strategies that you plan to test? Do you have any experience in
testing technology in sequencing or other fields?  If so, discuss the advantages
they have provided. Please describe collaborative efforts that may have
contributed to your success.

3. Discuss how your group plans to check the quality of the sequence it produces.
(Note: if you are participating in the quality assessment exercise you will be
given an opportunity to address its results.  If your data has been checked by
a quality assessment exercise in the past six months, the results from that
exercise will be provided to the reviewers by NHGRI.)

4. Have you had a management plan in the past, and if so, how do you expect it
to change to meet the needs of this project?  If not, what management plan do you
envision for the three year period of this project? Discuss, for example:
workflow strategy, division of responsibility between various components,
production/materials quality, training, hiring, turnover, or other management
issues.

5. State your data release policy.

6. This is an extra space for you to provide any additional comments that may be
useful.

Section B. Graphical and Tabular Material

Please provide the following material.

1. If you are currently producing DNA sequence of any type, please provide a
graph showing sequence output per month for at least the last 12 months. Please
distinguish between different types of sequence, e.g.,  finished or unfinished
genomic, EST stequence, STS sequence, etc.  (This should be a noncumulative
monthly total.)

2. If you are currently producing DNA sequence of any type, please provide a
graph indicating the numbers of lanes run per week for the past year and the
number of failed lanes for the same time period.

3. If you are currently producing any type of DNA sequence, please provide a
table of :

a. database entries, giving, where appropriate, laboratory name, accession
number, total size, unique size, number of gaps, number of ambiguities, estimated
number of errors (if available for unedited clones), and number of edits (for
finished clones). Please provide detailed legends

b. clones in progress, giving, where appropriate, laboratory name, GenBank
accession number, size, number of gaps, and status (e.g., libraries constructed)

4. It is important for the reviewers of your application to evaluate your budget
request in terms of how much of the funds are needed for each activity. The
format given below will provide them with information to understand the cost of
your shotgun sequencing and give you an opportunity to explain how you plan to
decrease it. Therefore, please provide budget information in the format given
below for the previous year (if you currently have a funded sequencing effort)
and for all (up to three) years of your request.

The categories are defined as:

FTEs are the number (not costs) of full time equivalents (100% effort) in
personnel needed for each activity.
Labor costs are those costs, including benefits, attributed to personnel involved
in shotgun  sequencing activities, but excluding personnel in
management/administration, informatics and process development.
Supplies costs are those attributed to the supplies needed for the shotgun
sequencing activities, excluding management/administration, informatics and
process development.
Management/Administration costs are costs of personnel (including benefits) and
other expenses related to managing and administering the shotgun sequencing
component of the project (for example, supplies needed for administration would
be included here).
Informatics costs are the costs of personnel (including benefits), supplies and
equipment used in all aspects of data management.  Informatics activities are
those involving higher level activities such as research and development,
annotation, etc.
Information Management includes routine computerized information management
activities such as routine data tracking, systems administration, etc.
Process Development costs are those that are incurred in the short term in
improving the process or the technology being used for shotgun sequencing. These
costs include personnel (including benefits), supplies, etc.
Allocated costs are those that contribute to more than one of the production
activities. They are not to be distributed among the production activities in the
table, but considered separately.  Non-computer equipment is to be depreciated
over five years. The 'large' and 'small' categories reflect the institutionally-
defined threshold for size of equipment purchases to be subject to indirect
costs.  For currently funded efforts, equipment purchased for the last two grant
years (1997-1998) should be included in the depreciation figures.
Computer equipment is to be depreciated at three years or less; the depreciation
time should be indicated.
Indirect costs are those costs paid to the grantee institution as part of the
indirect cost agreement.
The total should be the total cost of each activity.

Cost Table

PRODUCTION
ACTIVITY                  #FTE'S     DIRECT   INDIRECT     TOTAL     %

Subclone Construction
--labor
--supplies

Template Preparation
--labor
--supplies

Sequencing Reaction
--labor
--supplies

Gel Electrophoresis
--labor
--supplies

ALLOCATED COSTS
Other
--labor
--supplies
--travel
--service contracts
--off-site training

Management and Admin.
--labor
--supplies

Informatics
--labor
--supplies

Information Management
--labor
--supplies

Process Development
--labor
--supplies

Equipment (non-computer)
--small
--large

Equipment (computer)

10. This is an extra space to provide any other graphical material you feel would
be helpful.

PROGRESS REPORT:  list of issues/questions to be addressed in the Progress Report
for existing sequencing groups (have already completed at least 2 Mb sequence
that has been placed in a public database)

Instructions:

The following series of questions is based on experience from the last several
reviews NHGRI has conducted for large-scale sequencing. The questions are
designed to provide reviewers with information that is central to the review of
your grant.  Because the questions are organized in a format that will assist the
reviewers in locating the answers, it is preferable that your answers appear
immediately after the questions; if they do not, the location of the answer must
be provided in that space.  In order for your application to be considered by the
review committee, these questions MUST be answered.

For the purposes of this RFA, completed data is defined as any sequence data
deposited in a publicly available database.  Thus, "completed" data includes
genomic sequence finished to Bermuda standards, unfinished, ESTs, STSs, etc.

Section A. Text

Please answer the following questions. The total length for this section should
not exceed 10-15 pages (5000-7500 words).  We encourage brief, concise answers.
Please focus these answers on your progress, rather than future plans, unless
specifically asked for.

1. To what level (in lanes/yr) do you propose to scale your shotgun capacity to
accommodate mouse genome sequencing in the first grant year? How does your past
effort support the ability of your group to successfully scale to this level?
Discussion must include, but is not limited to:
o  Prior experience in sequencing (including sequence production for other types
of sequence, in addition to genomic sequence) and in increasing throughput.  What
is the amount of sequence (of any type; e.g., cDNA, STSs, ESTs, etc.) produced
and deposited in a public database in the last three-year period by your project? 
Based on an average of the last three months of shotgun production, what is the
total current capacity (in lanes) of your center (include lanes per month,
percentage of successful lanes, number of basepairs per lane at of least phred
20--or equivalent--quality)?  Please include capacity from sequencing DNA of
other organisms and capacity due to funding from all sources.  How has your rate
of shotgun production increased over the last year?  How have the rates of
different parts of the process changed, and how have these changes contributed
to your overall progress?
o  Prior demonstrated success in attaining milestones.  What internal metrics
(for example, lanes/month, failed lanes, basepairs per lane in GenBank, etc.) do
you consider to be the most useful to you in managing your center's performance?
o  Prior demonstrated success in decreasing costs; costs should be given as cost
per lane. Total costs include all equipment and indirect costs, as well as direct
costs. How does your center monitor costs internally? Note that you must also
give your costs broken down into the categories given in the table below (Section
B, question 9), for the past year and projected for the future years of the
application.
o  Prior demonstrated success in solving the critical integration issues that
arise as a program increases in scale. Particular attention should be given to
how informatics activities facilitate data management, within the components as
well as post-sequencing.

2.  Describe your success in incorporating new strategies and technologies into
your sequencing process, and how this has affected productivity.  Are there
unique technologies and strategies that have been tested by your center?  If so,
discuss the advantages they have provided. Please describe collaborative efforts
with other centers that may have contributed to your success.

3. Discuss how your center checks the quality of the sequence it produces. For
this section, please make reference to graphical materials in section B below as
needed. (Note: if you are participating in the quality assessment exercise you
will be given an opportunity to address its results.  If your data has been
checked by a quality assessment exercise in the past six months, please include
the results from that exercise in this answer.  Note that the NHGRI staff will
provide reviewers with the report and your responses written at the time the
exercise was completed.)

4. How and why has your management plan evolved as your sequencing effort has
scaled up? Discuss, for example: workflow strategy, division of responsibility
between various components, production/materials quality control (distinct from
what was in part 3 above), training, hiring, turnover, or other management
issues.

5. State your data release policy.

6. This is an extra space for you to provide any additional comments that may be
useful.

Section B. Graphical and Tabular Material

Please provide the following material.

1. Please provide a graph indicating the numbers of lanes run per week for the
past year and the number of failed lanes for the same time period.

2. Please provide a histogram showing the cycle time (in months) of clones that
have been taken to the working draft stage, and one for those remaining in the
pipeline, indicating how long they have been in the pipeline. Cycle time is
defined as starting when a large insert clone is subcloned in preparation for the
sequencing pipeline, and ending when the assembled clone sequence is deposited
in a public database.

3. Please provide a graph showing shotgun sequence output per month for at least
the last 12 months.  (This should be a noncumulative monthly total.)

4. Please provide a table of completed large-insert clones (both clones that have
been finished to "Bermuda' standards and those that have been taken to a working
draft standard) over 30 kb, giving laboratory name, GenBank accession number,
total size, unique size, number of gaps, number of ambiguities (N's,Y's or R's;
these may be noted separately in the table cell), estimated number of errors (if
available for unedited clones), and number of edits (for finished clones).  If
this list includes more than 50 clones, the list may be submitted electronically
on a disc or by e-mail.

5. Please provide a table of unfinished clones in progress, giving laboratory
name, GenBank accession number, size, number of gaps, and status (e.g., libraries
constructed). If this list includes more than 50 clones, the list may be
submitted electronically on a disc or by e-mail.

6. It is important for the reviewers of your application to evaluate your budget
request in terms of how much of the funds are needed for each activity. The
format given below will provide them with information to understand the cost of
your strategy and give you an opportunity to explain how you plan to decrease the
costs of sequencing. Therefore, please provide budget information in the format
given below for the previous year (if you currently have a funded sequencing
effort) and for all (up to five) years of your request. (Note that for year
before those requested in the current proposal, the row labeled 'sequencing--
working draft' should have 0 for each entry.)

The categories are defined as:

Subclone library construction: Begins at the point at which a large-insert clone
is selected for sequencing, i.e. from the point at which the isolate containing
the large-insert clone is taken from a glycerol stock or a colony on a plate. 
Ends at the point at which agar plates with the subclone library (as colonies or
plaques) are available and ready to pick.
Template Preparation: Begins with the plated sub-clone libraries.  Ends with
samples ready to pipette into the sequencing reactions.
Sequencing Reaction: Begins with "dipping the pipette into" the template DNA. 
Ends at the point at which the sample is ready to be loaded onto the sequencing
gel (but does not include any pre-loading treatments, e.g. heating, addition of
loading dye).
Gel Electrophoresis: Begins with the pre-loading processing.  Ends with the
collection of the data.
FTEs are the number (not costs) of full time equivalents (100% effort) in
personnel needed for each activity.
Labor costs are those costs, including benefits, attributed to personnel involved
in shotgun  sequencing activities, but excluding personnel in
management/administration, informatics and process development.
Supplies costs are those attributed to the supplies needed for the shotgun
sequencing activities, excluding management/administration, informatics and
process development.
Management/Administration costs are costs of personnel (including benefits) and
other expenses related to managing and administering the shotgun sequencing
component of the project (for example, supplies needed for administration would
be included here).
Informatics costs are the costs of personnel (including benefits), supplies and
equipment used in all aspects of data management.  Informatics activities are
those involving higher level activities such as research and development,
annotation, etc.
Information Management includes routine computerized information management
activities such as routine data tracking, systems administration, etc.
Process Development costs are those that are incurred in the short term in
improving the process or the technology being used for shotgun sequencing. These
costs include personnel (including benefits), supplies, etc.
Allocated costs are those that contribute to more than one of the production
activities. They are not to be distributed among the production activities in the
table, but considered separately.  Non-computer equipment is to be depreciated
over five years. The 'large' and 'small' categories reflect the institutionally-
defined threshold for size of equipment purchases to be subject to indirect
costs.  For currently funded efforts, equipment purchased for the last two grant
years (1997-1998) should be included in the depreciation figures.
Computer equipment is to be depreciated at three years or less; the depreciation
time should be indicated.
Indirect costs are those costs paid to the grantee institution as part of the
indirect cost agreement.
The total should be the total cost of each activity.

Cost Table

PRODUCTION
ACTIVITY                  #FTE'S     DIRECT     INDIRECT   TOTAL  %

Subclone Construction
--labor
--supplies

Template Preparation
--labor
--supplies

Sequencing Reaction
--labor
--supplies

Gel Electrophoresis
--labor
--supplies

ALLOCATED COSTS
Other
--labor
--supplies
--travel
--service contracts
--off-site training

Management and Admin.
--labor
--supplies

Informatics
--labor
--supplies

Information Management
--labor
--supplies

Process Development
--labor
--supplies

Equipment (non-computer)
--small
--large

Equipment (computer)

10.  This is an extra space to provide any other graphical material you feel
would be helpful.


Return to Volume Index

Return to NIH Guide Main Index


Office of Extramural Research (OER) - Home Page Office of Extramural
Research (OER)
  National Institutes of Health (NIH) - Home Page National Institutes of Health (NIH)
9000 Rockville Pike
Bethesda, Maryland 20892
  Department of Health and Human Services (HHS) - Home Page Department of Health
and Human Services (HHS)
  USA.gov - Government Made Easy