The Description of the Invention
52 The specification contains a summary of the invention extending over eight paragraphs. The specification states at [0023]:
[0023] The present invention provides methods, systems, and compositions that allow the identification and selection of cattle with superior genetic potential for desirable characteristics. Accordingly, the present invention provides methods, compositions, and systems for managing, selecting and mating, breeding, and cloning cattle. These methods for identification and monitoring of key characteristics of individual animals and management of individual animals maximize their individual potential performance and edible meat value. The methods of the invention provide systems to collect, record and store such data by individual animal identification so that it is usable to improve future animals bred by the producer and managed by the feedlot. The methods, compositions, and systems provided herein utilize information regarding genetic diversity among cattle, particularly single nucleotide polymorphisms (SNPs), and the effect of nucleotide occurrences of SNPs on important traits.
And at [0029]-[0030]:
[0029] In another embodiment, the present invention provides a method for identifying a bovine target sequence, such as a gene, associated with a trait, by identifying an open reading frame present in a target region of the bovine genome, wherein the target region is located on the bovine genome less than or equal to about 500,000 nucleotides of a single nucleotide polymorphism (SNP) corresponding to position 300 of any one of SEQ ID NOS:19473 to 21982, and analyzing the open reading frame to determine whether it affects the trait, thereby identifying a bovine gene associated with the trait. In one aspect, the target region is located within about 5000 nucleotides of a single nucleotide polymorphism (SNP) corresponding to position 300 of any one of SEQ ID NOS:19473 to 21982.
[0030] In another embodiment, the present invention provides a method for identifying a bovine single nucleotide polymorphism (SNP) associated with a trait, that includes identifying a test SNP in a target region of a bovine genome, wherein the target region is less than or equal to about 500,000 nucleotides of a SNP position corresponding to position 300 of one of SEQ ID NOS:19473 to 21982, and identifying an association of the test SNP to the trait, thereby identifying the test SNP as associated with the trait In certain aspects, the target region includes at least 20 contiguous nucleotides of SEQ ID NOS:24493 to 64886. In another aspect, for example, the target region includes at least 20 contiguous nucleotides of SEQ ID NOS:19473 to 21982. The present invention also provides isolated polynucleotides that include the identified SNPs.
53 The specification also includes a detailed description of the invention. The specifications states at [0035]:
[0035] In one embodiment, the present invention provides an isolated polynucleotide that includes a fragment of at least 20 contiguous nucleotides of the bovine genome, or a complement thereof, wherein the isolated polynucleotide includes a nucleotide occurrence of a single nucleotide polymorphism (SNP) associated with a trait, wherein the SNP is in disequilibrium with a SNP corresponding to position 300 of any one of SEQ ID NOS:19473 to 21982. In certain aspects, the polynucleotide is located about 500,000 or less nucleotides from position 300 of SEQ ID NOS:19473 to 21982 on the bovine genome. As disclosed in the Examples herein, the linkage disequilibrium for cattle is about 500,000 nucleotides. Therefore, it is expected that other SNPs can be identified that are associated with the same traits based on the fact that these other SNPs are located less than or equal to about 500,000 nucleotides of the identified associated SNP on the bovine genome. In certain aspects, the polynucleotide is from an Angus, Charolais, Limousin, Hereford, Brahman, Simmental or Gelbvieh bovine subject.
None of the claims of the patent application is to an isolated polynucleotide. Nevertheless, the description at [0035] is significant in so far as it describes the relationship between SNPs located at position 300 of SEQ ID NOS: 19473 to 21982 and other SNPs located about 500,000 or less nucleotides from that position. The specification teaches that it is expected that other SNPs can be identified that are associated with the same traits based on the fact that they are located within a certain distance of the SNP located at position 300.
54 The detailed description of the invention also includes the following at [0126]-[0127]:
[0126] In another embodiment of the invention, a method is provided for identifying SNPs that are associated with a trait by using the associated SNPs disclosed herein. The method is based on the fact that other markers in close proximity to the associated SNP marker will also associate with the trait because markers in linkage disequilibrium with the associated SNP marker will also be in linkage disequilibrium with the gene(s) influencing the trait. SNPs in linkage disequilibrium can be used in lieu of determining a SNP or mutation to predict the presence or absence of a phenotypic trait or contributor to a phenotypic trait. Accordingly, in certain embodiments, the present invention provides a method for identifying a SNP associated with a trait, that includes identifying a test SNP that is in disequilibrium with a SNP corresponding to position 300 of SEQ ID NOS:19473 to 21982.
[0127] As illustrated in the Examples section, it has been determined that disequilibrium exists across the region of 500,000 bp from the associated SNP in each direction. Other markers within this 500,000 bp region will also be in disequilibrium with the associated SNP and with the trait of interest, and can be used to infer associations with the trait of interest. Genomic segments containing the markers can be adjacent to the associated SNP marker or contained within a separate island of sequence distant from the associated SNP.
The specification makes clear in paragraph [0127] that there will be other markers within 500,000 base pairs of position 300 of SEQ ID NOS: 19473 to 21982 that will be in linkage disequilibrium with the SNP at position 300 allowing them to be used to predict the presence or absence of a particular trait. It is not suggested that all SNPs within that distance will be in linkage disequilibrium with the SNPs at position 300.
55 The specification includes a number of examples that are used to illustrate the invention.
56 Example 1 is an illustration of the generation of a high density bovine genetic SNP map created through a whole genome sequencing of the bovine genome using the shotgun sequencing approach. Shotgun sequencing is a laboratory technique used for determining the DNA sequence of a genome which involves breaking the genome into a collection of small DNA fragments that are sequenced individually. A computer is then used to identify the overlaps in the DNA sequences and to place individual fragments in their correct order to reconstitute the genome. Each of the overlapping DNA sequences (or "contigs") is given an identifying number. For example, one of the relevant SNPs discussed in the examples is identified in the specification as SNP3 (SNP MMBT22302) which is located at position 300 on the contig identified as SEQ ID No 19666.
57 Example 2 illustrates how the high density SNP map was used to identify the SNPs associated with a series of bovine traits. These traits include marbling, tenderness, fat thickness, yield, and daily gain. According to the specification there are 2510 SNPs associated with those five traits. These SNPs are identified in electronic files incorporated by reference into the specification. These electronic files include tables that identify the 2510 SNPs and the traits with which they are associated and other SNPs within a distance of 500,000 nucleotides of them.
58 Example 3 is as follows:
Example 3
Determination of the distance of disequilibrium in cattle
[0198] This example utilizes a few of the associated SNPs disclosed in Example 2, to identify additional SNPs that are associated with the same traits, using the physical proximity on the genome of the SNPs. Furthermore, the results are used to calculate a distance of disequilibrium in cattle. In this example, "shear force" is used to refer to tenderness, "vision retail yield" is used to refer to retail yield, and "average daily gain" is used to refer to daily gain.
[0199] In the past 10 years numerous methods have been developed to identify alleles associated with phenotypic effects, traits or diseases. Linkage disequilibrium and measures of linkage disequilibrium have been of particular interest for studies of complex traits or diseases … LD occurs where blocks or regions of neighboring markers are co-inherited from a common ancestor. The degree of LD varies considerably throughout the genome and is a function of time, recombination events, mutation rate and population structure. The extent of LD can vary from a few thousand base pairs to several centimorgans. This has been most extensively documented in human studies … Similar results have been observed in other species including cattle … These studies and others have also shown that a SNP or multiple SNPs associated with a phenotype can be used as predictive of gene(s) causing differences in trait phenotypes within a region of high LD although they may or may not be the precise causative gene … While it has been established that markers can be identified that associate with a specific trait, and, therefore, become diagnostic for the trait, the distance that disequilibrium reaches has not been determined in cattle with a dense marker map. Therefore, an experiment to determine the disequilibrium distance in cattle was performed using the high-density SNP map disclosed in Example 1.
[0200] The high-density SNP map disclosed in Example 1 was used to identify SNPs that are in physical proximity to a few of the associated SNPs disclosed in Example 2. Nucleotide occurrences of the SNPs were determined using the method disclosed in Example 2. A determination of whether on-test SNPs was associated with a trait was performed as disclosed in Example 2.
[0201] As discussed above, the study was performed to verify the assumption that markers that are in close physical proximity on the bovine genome will associate with the same trait(s) because markers in linkage disequilibrium with the associated SNP marker will also be in linkage disequilibrium with the mutation(s) influencing the trait.
[0202] As indicated in Table 2, SNP3 (MMBT22302) is significantly associated with the trait of average daily gain ("ADG" in Table 2). Several SNPs were identified using the high-density SNP map of Example I that are located at various distances from SNP3 on the bovine genome (Table 2). For example, SNP2 is 466,047 nucleotides from SNP3. Furthermore, SNP5 was identified which is 408,732 nucleotides from SNP3. SNP6 was identified which is 1.0 million nucleotides from SNP3. Finally, SNP4 was identified, which is 308,742 nucleotides.
[0203] As illustrated in Table 2, SNPs that were located within 500,000 nucleotides of SNP3 also were associated with average daily gain, whereas those that were located greater than 500,000 nucleotides from SNP3 were not associated with average daily gain. For example, linkage disequilibrium reaches 466,047 bases to SNP2, but not to SNPI at 1.5 Mb; linkage disequilibrium reaches to 408,732 bases to SNP5, but not to SNP6 at 1.0 Mb. SNP4, which is 308,742 nucleotides from SNP3, was discovered by sequencing the contig of DNA that maps to this region in 4 different breeds of cattle. It is also in disequilibrium with average daily gain.
[0204] Table 2. Disequilibrium analysis in relation to SNP distance from MMBT22302.
59 The disequilibrium analysis presented in Table 2 shows that the four other SNPs within 500,000 base pairs of SNP3 are also associated with the average daily gain trait with which SNP3 is associated. The remaining two SNPs, neither of which was shown to be associated with that trait, were both located more than 1,000,000 base pairs in either direction of SNP3. A similar analysis appears in Table 3 in relation to SNP9 (MMBT03905) and in Table 4 in relation to SNP16 (MMBT02782).
60 The discussion of Example 3 concludes at [0208]:
[0208] The results of this Example indicate that disequilibrium in cattle exists across the region of 500,000 nucleotides from an associated SNP, in each direction. Therefore, it is expected that when an associated SNP is identified, other markers within this 500,000bp region will also be in disequilibrium with the associated SNP and with the trait of interest, and can be used to infer associations with the trait of interest.
61 There are two points to note about paragraph [0208]. First, the results included in Example 3 are said to indicate that disequilibrium (which in this context we understand to refer to LD) in cattle will exist within a distance of 500,000 nucleotides in either direction from SNP3 (or the other SNPs identified in Example 3 that are shown to be associated with a relevant trait). Secondly, these results are said to give rise to an expectation that other SNPs within the 500,000 base pair region either side of SNP3 will also be in disequilibrium with SNP3. This is said to provide an additional basis for inferring associations between SNPs located within the specified region and the relevant trait.
62 What to our minds is most significant about paragraph [0208] is that it does not expressly or impliedly suggest that every SNP within the 500,000 base pair region of an associated SNP will be in disequilibrium with the associated SNP. Our understanding of this part of the specification is confirmed by statements that appear in other parts of the document, and expert evidence accepted by the primary judge, which shows that the degree of LD between two polymorphisms is influenced by a number of different factors in addition to distance between their loci.