KAMBLE NITIN UTTAM (GB)
WO1998040503A1 | 1998-09-17 | |||
WO1998035051A1 | 1998-08-13 |
US8440431B2 | 2013-05-14 | |||
US8440432B2 | 2013-05-14 | |||
US8450471B2 | 2013-05-28 | |||
US8697359B1 | 2014-04-15 | |||
US4873192A | 1989-10-10 |
HIKARU SATOH ET AL: "Mutation of the Plastidial -alpha--Glucan Phosphorylase Gene in Rice Affects the Synthesis and Structure of Starch in the Endosperm", THE PLANT CELL, AMERICAN SOCIETY OF PLANT BIOLOGISTS, US, vol. 20, no. 7, 1 July 2008 (2008-07-01), pages 1833 - 1849, XP008145527, ISSN: 1040-4651, [retrieved on 20080711], DOI: 10.1105/TPC.107.054007
HIGGINS JODY E ET AL: "Characterization of starch phosphorylases in barley grains : Characterization of starch phosphorylases in barley grains", vol. 93, no. 9, 3 January 2013 (2013-01-03), GB, pages 2137 - 2145, XP093029810, ISSN: 0022-5142, Retrieved from the Internet
TICKLE P ET AL: "Characterization of plastidial starch phosphorylase in Triticum aestivum L. endosperm", JOURNAL OF PLANT PHYSIOLOGY, ELSEVIER, AMSTERDAM, NL, vol. 166, no. 14, 15 September 2009 (2009-09-15), pages 1465 - 1478, XP026498041, ISSN: 0176-1617, [retrieved on 20090612], DOI: 10.1016/J.JPLPH.2009.05.004
SONG LIU ET AL: "Editing of Rice Endosperm Plastidial Phosphorylase Gene OsPho1 Advances Its Function in Starch Synthesis", vol. 28, no. 3, 1 May 2021 (2021-05-01), AMSTERDAM, NL, pages 209 - 211, XP093029801, ISSN: 1672-6308, Retrieved from the Internet
DATABASE Uniprot [online] 5 December 2018 (2018-12-05), ANONYMOUS: "Alpha-1,4 glucan phosphorylase", XP093029715, retrieved from https://rest.uniprot.org/uniprotkb/A0A3B6KPU0.txt Database accession no. A0A3B6KPU0
DATABASE Uniprot [online] 8 May 2019 (2019-05-08), ANONYMOUS: "Alpha-1,4 glucan phosphorylase", XP093029722, retrieved from https://rest.uniprot.org/unisave/A0A446TFD3?format=txt&versions=10 Database accession no. A0A446TFD3
SAMBROOK ET AL.: "Molecular Cloning: A Library Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
KUNKEL, PROC. NATL. ACAD. SCI. USA, vol. 82, 1985, pages 488 - 492
KUNKEL ET AL., METHODS IN ENZYMOL., vol. 154, 1987, pages 367 - 382
"Techniques in Molecular Biology", 1983, MACMILLAN PUBLISHING COMPANY
KRYSAN ET AL., THE PLANT CELL, vol. 11, December 1999 (1999-12-01), pages 2283 - 2290
MA ET AL., MOLECULAR PLANT, DOI:10.1016/J.MOLP.2015.04.007, 2015
HAWKINS ECHEN JWATSON-LAZOWSKI AAHN-JARVIS JBARCLAY JEFAHY BHARTLEY MWARREN FJSEUNG D: "STARCH SYNTHASE 4 is required for normal starch granule initiation in amyloplasts of wheat endosperm", NEW PHYTOL, vol. 230, no. 6, June 2021 (2021-06-01), pages 2371 - 2386, XP071313894, DOI: 10.1111/nph.17342
PARK, S. H. ET AL.: "Starch granule size distribution of hard red winter and hard red spring wheat: Its effects on mixing and breadmaking quality", JOURNAL OF CEREAL SCIENCE, vol. 49, 2009, pages 98 - 105, XP025780216, DOI: 10.1016/j.jcs.2008.07.011
SACCOMANNO BBERBEZY PFINDLAY KSHOESMITH JUAUY CVIALLIS BTRAFFORD K: "Characterization of wheat lacking B-type starch granules", J CEREAL SCI., vol. 104, March 2022 (2022-03-01), pages 103398
STARK JRLYNN A: "Starch granules large and small", BIOCHEM SOC TRANS., vol. 20, no. 1, February 1992 (1992-02-01), pages 7 - 12
STODDARD, F.L.SARKER, R.: "Characterization of Starch in Aegilops Species", CEREAL CHEMISTRY, vol. 77, 2000, pages 445 - 447
WASHINGTON, J. M. ET AL.: "Developing Waxy Barley Cultivars for Food, Feed and Malt", BARLEY GENETICS, vol. VIII, 2000, pages 303 - 306
ZEEMAN SCTHORNEYCROFT DSCHUPP NCHAPPLE AWEEK MDUNSTAN HHALDIMANN PBECHTOLD NSMITH AMSMITH SM: "Plastidial alpha-glucan phosphorylase is not required for starch degradation in Arabidopsis leaves but has a role in the tolerance of abiotic stress", PLANT PHYSIOL., vol. 135, no. 2, June 2004 (2004-06-01), pages 849 - 58
CRAM ET AL., BMC PLANT BIOLOGY, vol. 19, 2019, pages 474
CLAIMS: 1. A genetically altered plant, part thereof or plant cell comprising reduced or abolished expression or activity of at least one PHS1 (plastidial a-glucan phosphorylase) gene. 2. The genetically altered plant, part thereof or plant cell of claim 1 , wherein the plant comprises a mutation in at least one gene encoding PHS1 and/or the PHS1 promoter. 3. The genetically altered plant, part thereof or plant cell of claim 1 or 2, wherein the mutation is a loss of function or partial loss of function mutation, wherein preferably the mutation reduces or abolished the phosphorylase activity of PHS1 . 4. The genetically altered plant, part thereof or plant cell of claim 1 , wherein the plant comprises an RNAi interference construct that reduces or abolishes the expression of at least one PHS1 gene. 5. The genetically altered plant, part thereof or plant cell of any preceding claim, wherein the PHS1 gene encodes a PHS1 polypeptide comprising SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47 or a functional variant or homologue thereof. 6. The genetically altered plant, part thereof or plant cell of claim 5, wherein the functional variant or homologue has at least 60% overall sequence identity to SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47. 7. The genetically altered plant, part thereof or plant cell of any preceding claim, wherein the plant is selected from wheat, barley, rye, maize, potato and sorghum. 8. A method of altering the starch granule characteristics of a plant, the method comprising reducing the expression or activity of at least one PHS1 (plastidial a- glucan phosphorylase) gene. 9. A method of producing a genetically altered plant with an altered starch granule characteristic, the method comprising reducing the expression or activity of at least one PHS1 (plastidial a-glucan phosphorylase) gene. 10. The method of claim 8 or 9, wherein the plant has a bimodal starch granule composition comprising a first population of starch granules of a first granule size range and at least a second population of starch granules of a second, smaller size range, and wherein the method comprises reducing the number of granules in the second population and/or increasing the size of starch granules in the second population. 11. The method of claims 8 to 10, wherein the plant comprises A-type and B-type starch granules, and wherein the method comprises reducing the number and/or increasing the size of B-type starch granules. 12. The method of claims 8 to 11 , wherein the method comprises introducing at least one mutation into at least one gene encoding PHS1 and/or the PHS1 promoter. 13. The method of claims 8 to 12, wherein the mutation is a loss of function or partial loss of function mutation, wherein preferably the mutation reduces or abolishes the phosphorylase activity of PHS1 . 14. The method of claims 8 to 11 , wherein the method comprises introducing and expressing in the plant an RNAi interference construct that reduces or abolishes the expression of at least one PHS1 gene. 15. The method of claims 8 to 14, wherein the PHS1 gene encodes a PHS1 polypeptide comprising SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47 or a functional variant or homologue thereof. 16. The method of claims 8 to 15, wherein the functional variant or homologue has at least 60% overall sequence identity to SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47. 17. The method of claims 9 to 18, wherein the plant is selected from wheat, barley, rye, maize, potato and sorghum. 18. A plant, plant part or plant cell obtained or obtainable by the method of any of claims 8 to 17. 19. Grain derived from the genetically altered plant of any of claims 1 to 7, wherein the grain is characterised by reduced or abolished expression of a PHS1 gene and/or reduced or abolished activity of a PHS1 polypeptide. 20. The grain of claim 19, wherein the grain comprises at least one mutation in a PHS1 gene and/or promoter. 21 . Starch obtained or obtainable from at least one plant cell of the genetically altered plant of any of claims 1 to 7 or the grain of any of claims 19 or 20. 22. A food or feed composition prepared from the grain of any of claims 19 or 20 or the starch of claim 21. 23. Use of the grain of any of claims 19 or 20 or the starch of claim 21 as a food or feedstuff, or in any pharmaceutical or industrial application. 24. A nucleic acid construct comprising a nucleic acid sequence encoding at least one DNA-binding domain or protospacer element that can bind to at least one target sequence in a PHS1 gene and/or promoter, wherein preferably the target sequence is selected from SEQ ID NO: 49 to 52, 84 or 85 or a variant thereof, wherein preferably, the sequence of the protospacer element is selected from SEQ ID Nos 53 to 56 or 86 or 87 or a variant thereof. 25. The nucleic acid construct of claim 24, wherein said construct encodes at least one single-guide RNA (sgRNA), wherein said sgRNA comprises the tracrRNA sequence and the crRNA or protospacer sequence, wherein the sgRNA comprises or consists of a sequence selected from SEQ ID Nos 58 to 61 or 88 or 89 or a functional variant thereof. An isolated plant cell transfected with at least one nucleic acid construct as defined in claim 24 or 25 or transfected with a single guide (sg) RNA molecule wherein the sgRNA comprises a sequence selected from SEQ ID Nos 62 to 65 or a functional variant thereof. |
FIELD OF THE INVENTION
The invention relates to methods for altering the starch granule composition in starch storage organs. Also described are genetically altered plants characterised by the above phenotype as well as methods of producing such plants.
BACKGROUND OF THE INVENTION
Starch is a vital source of calories in human diets and an important industrial raw material for the manufacture of many food and non-food products (such as biofuels, paper, pharmaceuticals and textiles) (Jobling, 2004; Smith, 2008; Santelia and Zeeman, 2010). Most plants store a portion of carbon fixed through photosynthesis during the day as starch in leaf chloroplasts. This starch is then degraded to provide energy for growth and metabolism at night. Some plants also accumulate high levels of starch in amyloplasts (i.e. non-photosynthetic chloroplasts for starch storage) of seeds and storage organs. Starch is the major carbohydrate component of many of our staple crops, including cereal grains (wheat, barley, maize, rye, oat), tubers and storage roots (potato, cassava, yam, sweet potato), and banana fruits.
Native starch exists as insoluble, semi-crystalline granules that are composed of two distinct glucose polymers, amylopectin and amylose (Zeeman et al., 2010; Pfister and Zeeman, 2016; Goren et al., 2018). Amylopectin is the major component of the starch granule and is a highly branched polymer consisting of a-1 ,4-linked glucan chains with a-1 ,6-linked branch points. The structure of amylopectin facilitates the formation of double helices between adjacent branches, which form the crystalline regions of the starch granule. Amylose consists of long linear a-1 ,4-linked chains with very few branches, and is thought to reside in the more amorphous regions of the starch granule. The ratio of amylopectin to amylose, as well as the size and shape of starch granules vary greatly depending on botanical source and organ. In Arabidopsis leaves, starch granules are approximately 1 pm in diameter, and contain about 8-10% amylose (Zeeman et al., 2002; Seung et al., 2015). Cereal starches typically contain approximately 15-30% amylose, but the morphology of the granules vary greatly between species (Jane, 1994; Jane et al. , 1999). In rice and maize, starch granules have a size distribution between 3-8 m and 5-20 pm respectively (Jane, 1994; Lindeboom et al., 2004). Cereal crops of the Triticeae (wheat, rye and barley) have a bimodal distribution of granule size, with larger A-type granules and smaller B-type granules. In wheat, A-type granules are 20-30 pm in diameter, while B-type granules (which initiate about 10-15 days after the initiation of A-type granules) are 2-7 pm in diameter (Bechtel et al., 1990; Howard et al., 2011). A-type granules account for more than 70% of wheat endosperm starch by weight, but less than 10% of the granules by number (Lindeboom et al., 2004).
Studying granule initiation not only has potential to reveal mechanisms underpinning this unique spatiotemporal pattern of A- and B-type granule formation, but can also lead to new approaches to modify starch granule size distributions. There is commercial interest in reducing the number of B-type granules in wheat and barley, and/or increasing their size. In brewing, B-type granules can escape filtration due to their small size. These granules gelatinise during wort boiling, and produce an undesirable “starch haze” in the final beer (Stark and Lynn 1992). B-type granules also ineffectively sediment in a liquid suspension, for example during the industrial preparation of starch and gluten from flour - where B-type granules are lost in the waste water during starch purification and hinders the effective separation of starch from gluten (Stoddard and Sarker, 2000). In wheat, B- type granules have also been proposed to have negative effects on both flour processing and bread-making quality (Park et al., 2009). The lack of B-type granules can also confer unique functional properties to wheat starch and flour and increase water absorption, creating further variation in starch properties that can be exploited to produce a range of food and non-food products (Saccomanno et al. 2022). Accordingly, there is significant value in reducing the number of B-type starch granules.
The initiation of proper A-type granules in wheat requires both STARCH SYNTHASE 4 (SS4) and B-GRANULE CONTENT1 (BGC1). Loss of either protein results in supernumerary granule initiations in most amyloplasts at early grain development, which later fuse to form ‘compound’ starch granules (Chia et al., 2020; Hawkins et al., 2021). To date, there have been no proteins identified as being specifically involved in B-type granule formation. There therefore exists a need to alter the starch granule composition of plants, and in particular to inhibit B-type granule formation or otherwise reduce the levels of B-type granules in plants. The present invention addresses this need.
SUMMARY OF THE INVENTION
We demonstrate that PHS1 is required for normal starch granule number and size distribution in the wheat endosperm, due to a specific role of the enzyme in B-type granule formation. We presented three lines of evidence to support this role: Firstly, mutants of durum wheat defective in phs1 had fewer but larger B-type granules, with no differences in the number or size of A-type granules (Figure 4 and 7). Secondly, granule size distributions in the mutants were identical to those of the wild type during the early stages of grain development, but deviated from the wild type at 18 dpa - the timepoint after B-type granules started forming in the wild type (Figures 7-8). Finally, the bgc1-1 phs1-1 double mutant had similar granule size distributions to the bgc1-1 mutant, suggesting that the further loss of PHS1 in a background that only has A-type granules has no effect on granule size distribution (Figure 9). Thus, PHS1 appears to be only required for normal B-type granule formation in the wheat endosperm, and not for A-type granule formation.
Based on our findings, we present a model where A- and B-type granule initiation occur via distinct mechanisms involving different enzymes (Figure 12). B-type granule initiation is fundamentally different to A-type granule initiation in that B-type granules initiate within amyloplasts that already contain an A-type granule. They can therefore be considered “secondary” granule initiations. The A-type granule can act as a source of substrates that can prime B-type granule formation, particularly as MOS are released through the process of amylopectin trimming. By contrast, A-type granules are the first granules to form within each amyloplast during early grain development, and are more likely to require de novo primer formation. We propose that PHS1 may only be important in the secondary B-type granule initiations, perhaps as it processes existing MOS substrates in the plastid in a manner that allows B-type granule initiation.
PHS1 catalyses a reversible reaction, whereby a-1 ,4-linked glucan chains can be degraded via a phosphorolysis reaction that releases glucose-1 -phosphate (G1 P), or the glucan chain can be extended using G1 P as a substrate. In addition to plastidial PHS1 , most plants have a cytosolic isoform, PHS2, which is important for maltooligosaccharide (MOS) metabolism during nighttime starch mobilisation. Compared to PHS2 and other non-plant phosphorylases, PHS1 has a unique 78-amino-acid (L78) insertion that may hinder binding to larger branched substrates, increasing its relative affinity for linear MOS.
It was not expected that PHS1 would be involved in the initiation of B-type granules, as previous reports suggested that in plants without a bimodal distribution of granules, PHS1 was largely redundant or had a negative effect on total starch content of the grain and thus yield (Zeeman et al. 2004; Satoh et al. 2008). Furthermore, in plants with a bimodal distribution of granules, RNAi silencing of PHS1 was previously reported to have no effect on starch granule size distributions (Higgins et al 2013). In contrast to previous reports, we show here that severely reducing or abolishing the expression or activity of PHS1 significantly inhibits the number and decreases the size of B-type granules without affecting starch content. This result could not have been predicted.
Since PHS1 mutations also did not affect plant growth or grain yield (Figure 3), PHS1 represents another gene target to reduce the content of B-type granule content in plants with a bimodal distribution of starch granules, such as wheat. Varieties with low B-type granules are desirable for bread making. Recently work with BGC1 mutants also demonstrated that starch without B-type granules had higher water absorption, reduced grain hardness and higher protein content (Saccomanno et al., 2022 ). As described above, B-type granules also cause processing problems due to their small size (Stoddard and Sarker, 2000; Park et al., 2009). Since reduced B-type granules can only be achieved through BGC1 by reducing gene dosage rather than through knockouts, the phenotype is easier to achieve in a polyploid species like wheat than in a diploid species like barley. Thus, PHS1 is a more suitable gene target for reducing B-type granules than BGC1 as the phenotype can be achieved through a homozygous knockout mutation.
In one aspect of the invention, there is provided a genetically altered plant, part thereof or plant cell comprising reduced or abolished expression or activity of at least one PHS1 (plastidial a-glucan phosphorylase) gene.
Preferably, the plant comprises a mutation in at least one gene encoding PHS1 and/or the PHS1 promoter. More preferably, the mutation is a loss of function or partial loss of function mutation, wherein preferably the mutation reduces or abolishes the phosphorylase activity of PHS1 .
In an alternative embodiment, the plant comprises an RNAi interference construct that reduces or abolishes the expression of at least one PHS1 gene.
In one embodiment, the PHS1 gene encodes a PHS1 polypeptide comprising SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 or 39 or a functional variant or homologue thereof. Preferably, the functional variant or homologue has at least 60% overall sequence identity to SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47. In another embodiment, the homologue is selected from SEQ ID NO: 43 and 47.
Preferably, the plant has a bimodal starch granule composition comprising a first population of starch granules of a first granule size range and at least a second population of starch granules of a second, smaller size range, and wherein the method comprises reducing the number of granules in the second population and/or increasing the size of starch granules in the second population.
In one embodiment the plant is selected from the Triticeae tribe. In another embodiment, the plant is selected from wheat, barley, rye, maize, potato and sorghum. Preferably, the plant is a plant with a bimodal distribution of starch granules. In one embodiment, the plant is selected from wheat or barley or rye.
In another aspect of the invention, there is provided a method of altering the starch granule characteristics of a plant, the method comprising reducing the expression or activity of at least one PHS1 (plastidial a-glucan phosphorylase) gene.
In a further aspect of the invention, there is provided a method of producing a genetically altered plant with an altered starch granule characteristic, the method comprising reducing the expression or activity of at least one PHS1 (plastidial a-glucan phosphorylase) gene.
Preferably, the plant has a bimodal starch granule composition comprising a first population of starch granules of a first granule size range and at least a second population of starch granules of a second, smaller size range, and wherein the method comprises reducing the number of granules in the second population and/or increasing the size of starch granules in the second population.
In one embodiment the plant is selected from the Triticeae tribe. In another embodiment, the plant is selected from wheat, barley, rye, maize, potato and sorghum. Preferably, the plant is a plant with a bimodal distribution of starch granules. In one embodiment, the plant is selected from wheat or barley or rye.
In one embodiment, the plant comprises A-type and B-type starch granules, and wherein the method comprises reducing the number and/or increasing the size of B-type starch granules.
In one embodiment, the method comprises introducing at least one mutation into at least one gene encoding PHS1 and/or the PHS1 promoter. Preferably, the mutation is a loss of function or partial loss of function mutation, wherein preferably the mutation reduces or abolishes the phosphorylase activity of PHS1 .
In another embodiment, the method comprises introducing and expressing in the plant an RNAi interference construct that reduces or abolishes the expression of at least one PHS1 gene.
In one embodiment, the PHS1 gene encodes a PHS1 polypeptide comprising SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 or 39 or a functional variant or homologue thereof. Preferably, the functional variant or homologue has at least 60% overall sequence identity to SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 or 47. In another embodiment, the homologue is selected from SEQ ID NO: 43 and 47.
In one embodiment the soluble sugar contents of the grains of said plant is not affected.
In a preferred embodiment A-type starch granule initiation is not affected.
In another aspect there is provided a plant, plant part or plant cell obtained or obtainable by the method of the invention. In another aspect of the invention, there is provided a grain derived from the genetically altered plant of the invention, wherein the grain is characterised by reduced or abolished expression of a PHS1 gene and/or reduced or abolished activity of a PHS1 polypeptide. Preferably, the grain comprises at least one mutation in a PHS1 gene and/or promoter.
In another aspect of the invention there is provided starch obtained or obtainable from at least one plant cell of the genetically altered plant of the invention or the grain of the invention.
In another aspect, there is provided a food or feed composition prepared from the grain of the invention or the starch of the invention.
In another aspect, there is provided the use of the grain of the invention or the starch of the invention as a food or feedstuff.
In another aspect, there is provided the use of the grain of the invention or the starch of the invention in any pharmaceutical or industrial application.
DESCRIPTION OF THE FIGURES
The invention is further described in the following non-limiting figures:
Figure 1 . Identification of wheat PHS1 loci and generation of loss-of-function mutants. A) Gene models of PHS1 homeologs in Chinese Spring. Exons are depicted in dark blue and the 5’ and 3’ UTRs are depicted in light blue. The position of the start (ATG) and stop (TGA) codons are indicated with green arrows. The position of the mutations in the TILLING mutants are indicated with red arrows. B) Native PAGE of phosphorylase activity. Crude extracts of leaves from 4-week-old seedlings of the wild type (WT), phs1- 1 single (aa BB and AA bb) and double (aa bb) mutants, and wild-type segregant controls (AA BB) were separated on 7.5% polyacrylamide gels containing 0.3% glycogen (180 pg protein per well). Activity bands were visualised by staining in Lugol’s iodine solution. C) As for B), but with phs1-6.
Figure 2. Plant and grain phenotypes of Taphsl mutants. The wild-type sibling control (AA BB), single mutants (aa BB and AA bb) and the double mutant (aa bb) from phs1-1 and phs1-6 lines were compared with the Kronos wild type (WT). A) Photograph of plants at maturity. Bar = 5 cm. B) Grain yield per plant. C) Photographs of grains showing both dorsal and ventral sides, as well as a cut section through the middle of the grain. D) Thousand grain weight. E) Grain size, indicated as area. For panels D and E, data represent the mean ± SEM from n = 15-16 plants. Values with different letters are significantly different under a one-way ANOVA and Tukey’s posthoc test at p < 0.05.
Figure 3. The Taphsl mutants have normal leaf starch content. The wild-type sibling control (AA BB), single mutants (aa BB and AA bb) and double mutants (aa bb - for both Taphs1-1 and Taphs1-6) were compared with the Kronos wild type (WT). Seedlings were grown for two weeks under 16 h day/8 h night, and harvested at the End of Day (ED) and End of Night (EN). Values are the mean ±SEM from n = 5 plants, and those with different letters are significantly different under a one-way ANOVA and Tukey’s posthoc test at p < 0.05.
Figure 4. Endosperm starch from the Taphs1-1 mutant has fewer B-type granules. A) Scanning electron micrographs of purified endosperm starch. Bar = 10 pm. B) Granule size distributions were determined using a Coulter counter, and the data were expressed as relative % volume (of total starch) vs. granule diameter plots. C and D) B-type granule volume (% of total starch) and the average diameter of B-type granules were extracted from the relative volume vs. diameter plots by fitting a mixed normal distribution. E) Same as B, but expressed as relative starch granule number (% of total granule number) vs. diameter plots. F) The percentage of small granules by number (smaller than 10 pm) were calculated from the coulter counter data. G) The average diameter of A-type granules was calculated from the relative volume vs. diameter plots, as for panels C and D. For panels B-G, plots show the mean from the analysis of n=4-6 replicate starch extractions, each from grains from a separate plant. The shading (on panels B and E) and error bars (on panels C, D, F, G) represent the SEM. Values with different letters are significantly different under a one-way ANOVA with Tukey’s post-hoc test (p < 0.05).
Figure 5. Interaction between wheat PHS1 and BGC1 in Nicotiana benthamiana. A) Plastidial localisation of YFP-tagged PHS1 and BGC1 transiently expressed in Nicotiana leaves. Bar = 5 pm. B) Pairwise immunoprecipitation (IP) of PHS1 :YFP and BGC1 :RFP co-expressed in Nicotiana leaves, using anti-YFP beads. Input and IP samples were blotted with YFP (top panels) and RFP antibodies (bottom panels). Chloroplast-targeted YFP (cTP:YFP) and RFP (cTP:RFP) were used as controls to exclude unspecific binding to the fluorescent protein tags.
Figure 6. Expression of wheat PHS1 loci. A) Expression levels of PHS1 homeologs in the endosperm of durum wheat across different stages of grain development. Values are in transcripts per million (TPM) and are means ± SEM from n = 3 replicates per time point. B) Native PAGE of phosphorylase activity. Crude extracts were produced from dissected endosperms across different stages of grain development and were separated on 7.5% polyacrylamide gels containing 0.3% glycogen (180 pg protein per well). Activity bands were visualised after incubation with G1 P by staining in Lugol’s iodine solution.
Figure 7. Loss of PHS1 affects granule number but not total starch content. The endosperm was dissected from developing grains of WT, phs1-1 and phs1-6 double mutants and corresponding wild-type controls, harvested at 8, 14, 18 and 22 dpa, with n = 3 individual plants for each genotype per time point. A) Starch content of the endosperm. Values are expressed relative to the fresh weight of the dissected endosperm. B) Starch granule number in the endosperm. Starch was purified from dissected endosperm and the number of granules was determined using a Coulter counter running in volumetric mode (analysing 2 mL of the suspension). Values are expressed relative to the fresh weight of the dissected endosperm. Values with different letters are significantly different under a one-way ANOVA with Tukey’s post-hoc test (p < 0.05).
Figure 8. PHS1 affects granule size distributions during mid-grain development. The endosperm was dissected from developing grains of WT, phs1-1 and phs1-6 double mutants and corresponding wild-type controls, harvested at A-E) 8 dpa, B-F) 14 dpa, C- G)18 dpa and D-H) 22 dpa, with n = 3 individual plants for each genotype per time point. A-D) Granule size distributions were analysed on the Coulter counter and the data were expressed as relative % volume (of total starch) vs. granule diameter plots. The shading represents the ± SEM. E-H) Starch granule morphology observed using scanning electron microscopy. Bars = 15 pm.
Figure 9. Loss of PHS1 does not affect granule size distribution in the bgc1-1 mutant. A) Scanning electron micrographs of purified endosperm starch. Bar = 15 pm. B) Granule size distributions were determined using a Coulter counter, and the data were expressed as relative % volume (of total starch) vs. granule diameter plots. C) A-type granule diameter extracted from the relative volume vs. diameter plots by fitting a bimodal mixed normal distribution, except for genotypes with bgc1-1, where a unimodal distribution was fitted. D) The percentage of small granules by number (smaller than 10 pm) were calculated from the Coulter counter data. For panels B-D, plots show the mean from the analysis of n=3 replicate starch extractions, each from grains from a separate plant. The shading (in panel B) and error bars (in panels C and D) represent the ±SEM. Values with different letters are significantly different under a one-way ANOVA with Tukey’s post-hoc test (p < 0.05).
Figure 10. Soluble glucan quantification in the developing endosperm tissue. A) Total soluble glucans, B) methanol precipitable fraction of soluble glucans (phytoglycogen and long MOS), and C) the non-precipitable fraction (short MOS) in perchloric acid extracts of grain of WT, phs1-1 and phs1-6 double mutants and corresponding wild-type controls, harvested at 8, 14, 18 and 22 dpa, with n = 3 individual plants for each genotype per time point. Values are expressed relative to the fresh weight of the dissected endosperm. Values with different letters are significantly different under a one-way ANOVA with Tukey’s post-hoc test (p < 0.05).
Figure 11. Soluble sugar quantification in the developing endosperm tissue. A) Representative HPAEC-PAD chromatograms, used for the quantification of B) glucose, C) fructose, D) sucrose and E) maltose. Perchloric acid extracts of dissected endosperm harvested at 8, 14, 18 and 22 dpa were analysed, with n = 3 individual plants for each genotype per time point. Values are expressed relative to the fresh weight of the dissected endosperm. Values with different letters are significantly different under a oneway ANOVA with Tukey’s post-hoc test (p < 0.05).
Figure 12. Model of PHS1 action in B-type starch granules initiation in wheat endosperm. During wheat grain development, a single A-type granule initiates in each amyloplast during early grain development and grows around 6-8 days post anthesis (dpa). B-type granules initiate later in grain development, at least partially in stromules, around 15-20 days post anthesis. We propose PHS1 acts on maltooligosaccharides (MOS), likely released from A-type granules by isoamylases (ISA), to initiate B-type granules. This process is disrupted in mutants lacking a functional PHS1 protein, resulting in fewer initiations of B-type starch granules. The fewer B-type granules means each one has a greater share of substrates for granule growth, leading to larger B-type granules in the mutant at grain maturity.
DETAILED DESCRIPTION OF THE INVENTION
The present invention will now be further described. In the following passages, different aspects of the invention are defined in more detail. Each aspect so defined may be combined with any other aspect or aspects unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of botany, microbiology, tissue culture, molecular biology, chemistry, biochemistry and recombinant DNA technology, bioinformatics which are within the skill of the art. Such techniques are explained fully in the literature.
As used herein, the words "nucleic acid", "nucleic acid sequence", "nucleotide", "nucleic acid molecule" or "polynucleotide" are intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA molecules (e.g., mRNA), natural occurring, mutated, synthetic DNA or RNA molecules, and analogs of the DNA or RNA generated using nucleotide analogs. It can be single-stranded or double-stranded. Such nucleic acids or polynucleotides include, but are not limited to, coding sequences of structural genes, anti-sense sequences, and non-coding regulatory sequences that do not encode mRNAs or protein products. These terms also encompass a gene. The term "gene" or “gene sequence” is used broadly to refer to a DNA nucleic acid associated with a biological function. Thus, genes may include introns and exons as in the genomic sequence, or may comprise only a coding sequence as in cDNAs, and/or may include cDNAs in combination with regulatory sequences.
The terms "polypeptide" and "protein" are used interchangeably herein and refer to amino acids in a polymeric form of any length, linked together by peptide bonds.
The term “granule” or “starch granule” as used herein refers to a semi-crystalline granule that is composed of two glucose polymers of glucans - amylopectin and amylose. The size, shape, amylose to amylopectin ratio, any crystalline to amorphous material ratio, starch supramolecular architecture, and amylose-lipid complexes may vary from plant to plant. Starch granules are synthesised and stored in plastids - particularly chloroplasts of leaves and non-photosynthetic plastids specialised for starch storage (amyloplasts) in seeds and vegetative storage organs.
The aspects of the invention involve recombinant DNA technology and exclude embodiments that are solely based on generating plants by traditional breeding methods.
In an aspect of the invention, there is provided a method for altering a starch granule characteristic in a plant by reducing or abolishing the expression of at least one plastidial a-glucan phosphorylase (referred to herein as “PHS1” or “PHO1” (phosphate 1)) nucleic acid and/or reducing or abolishing the activity of a PHS1 polypeptide.
As used here, altering a starch granule characteristic in a plant may mean altering the characteristic in all plastids or plastid-types (e.g. photosynthetic plastids (in leaves)) and non-photosynthetic plastids (e.g. amyloplasts). For example therefore, the starch granule characteristic may be altered in leaves or seeds or vegetative storage organs, or all of leaves, seeds and vegetative storage organs. Most preferably the starch granule characteristic is altered in seeds and/or vegetative storage organs.
The plant may comprise at least two populations of starch granules of different sizes - e.g. a first and second size. In one embodiment, the plant comprises a first population of starch granules that have a mean diameter larger than the mean diameter of the second population of starch granules. The first population may be referred to “A-type granules”. The second population may be referred to as “B-type granules”. Granules with a mean diameter of between 10 and 30 pm, preferably between 16 and 23 pm, more preferably above 10 pm and even more preferably above 15 pm may be referred to as A-type granules. Granules with a mean diameter of between 1 and 10 pm, preferably below 10 pm, and even more preferably between 4 and 7pm may be referred to as B-type granules. The shape of the different granule populations may also differ. For example, A-type granules may be discoid or lenticular, and B-type granules may be spherical or polygonal. In wheat, for example, A-type granules typically make up more than 70% of the total weight of starch, whereas B-type granules comprise up to 90% of the granules in number. In one embodiment, there is provided a method of reducing the number of granules in one population of starch granules or abolishing one population of starch granules. Preferably the method comprises reducing the number of B-type granules or abolishing B-type granules. By “reducing” is meant a reduction in the number of B-type starch granules in the plant by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% 95% or more, more preferably a reduction of between 10 and 40%, more preferably between 25 and 30% in the number of B-type starch granules (as shown in Figure 4F) compared to the number of B-type starch granules present in a plant where the expression and/or activity of PHS1 is not altered. By “abolishing” is meant that no B-type granules are produced or can be detected. The percentage of small starch granules (including B-type granules and small A-type granules) (for example, below 10 pm, relative to the total number of starch granules) may be measured, as shown for example in Figure 4E. As shown in Figure 4F. there is a 25-30% reduction in the relative number of B-type granules in the PHS1 mutant.
Alternatively, the volume of B-type granules may be considered as a percentage of the total volume of granules (i.e. A and B-type granules combined). This may also be referred to herein as the granule volume. Typically, for example in wheat, the volume of B-type granules is approx. 20-30%. As shown in Figure 4C, the methods of the invention can be used to reduce the volume percent of B-type granules to around 10 and 15%. Thus, the methods of the invention can be used to reduce the B-granule volume (compared to wild-type) by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% 95% or more, more preferably between 30 and 70% and even more preferably between 50 and 60%. This results of the method of the invention can also be considered as causing an increase in the mean granule size (or diameter) of the total granule population.
In an alternative or additional embodiment, the method may also comprise increasing the size (for example, the average granule diameter) of starch granules, and in particular where there exists two or more populations of starch granules, as described above, the method may comprise increasing the size of one population of starch granules. Accordingly, the method may comprise increasing the size, such as the average granule diameter, of B-type granules. This increase in size may be at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% 95% or more, preferably between 30 and 70% and even more preferably between 45 and 55% compared to the size of B-type starch granules present in a plant where the expression and/or activity of PHS1 is not reduced/abolished. As shown in Figure 4, the methods of the invention can be used to increase the size of B-type granules by about 60% (e.g. from about 5 to 8pm).
Reducing the amount of B-type granules, as well as increasing the size of B-type granules, can bring several advantages by reducing the heterogeneity in starch granule size. For example in brewing, B-type granules can escape filtration due to their small size, gelatinise during wort boiling, and produce an undesirable “starch haze” in the final beer (Stark and Lynn 1992). Reducing the total amount of B-type granules would help alleviate this problem, and increasing the size of any remaining B-type granules would reduce the chance that they escape filtration.
In the industrial preparation of starch and gluten from flour, B-type granules are lost in the waste during starch purification because they do not effectively sediment, and they hinder the effective separation of starch from gluten (Stoddard and Sarker, 2000). The amount of non-sedimenting granules can be reduced by both reducing total B-type granule volume, and increasing the volume of any remaining B-type granules so that they have increased sedimentation efficiency.
In wheat, B-type granules have been proposed to have negative effects on both flour processing and bread-making quality (Park et al., 2009). The lack of B-type granules can also confer unique functional properties to wheat starch and flour, including higher water absorption (Saccomanno et al. 2022). Thus, starches with altered B-type granule volume and size may have novel physical and chemical properties, which can be exploited in making various food and non-food products.
Thus, there are significant advantages to increasing both the number and the size of the B-type granules.
In summary, the method of the invention may reduce the number but increase the size of B-type granules in a plant.
Granule size may be measured by a number of techniques that would be known to the skilled person. In one embodiment, granule size can be measured using a particle size analyser, which uses laser scattering to measure the total volume of particles of a given size, expressed as a percentage of the total volume of all particles. In one example, purified starch can be suspended in water and measured on a particle size analyser, such as a Beckman-Coulter Multisizer 4e Coulter counter, or the Coulter LS-230 laserscattering instrument (Beckman Coulter). In another embodiment, granule size can be measured using light microscopy, where starch granule area in the images are measured using the Particle Analysis plugin of Imaged software (v.2.0.0; https://imagej.net/). The area can be used to calculate diameter, assuming the granules were perfect circles. Unlike the first method with the particle size analyser, this method calculates the percentage of granules with a given size relative to the total number of granules (rather than as volumes as measured on a Coulter counter), and is a direct measure of size (rather than inferred from laser scattering).
The methods described herein may also comprise the step of measuring an altered starch characteristic, for example, a decrease in the number of B-type granules and/or an increase in the size of B-type granules using the methods described above.
As used herein, the terms “reducing” means a decrease in the levels of PHS1 expression and/or activity by up to or more than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90% when compared to the level in a wild-type or control plant. In one embodiment, reducing means a decrease in at least 50% compared to the level in a wild-type or control plant. Reducing may or may not encompass changes in the absolute PHS1 transcript level, preferably it does not. Reducing also may or may not encompass abolishing expression. The term “abolish” expression means that no expression of PHS1 is detectable (no transcript) or that no functional PHS1 polypeptide is produced. Methods for determining the level of PHS1 expression and/or activity would be well known to the skilled person. These reductions can be measured by any standard technique known to the skilled person. For example, a reduction in the expression and/or content levels of at least PHS1 expression may be a measure of protein and/or nucleic acid levels and can be measured by any technique known to the skilled person, such as, but not limited to, any form of quantitative PCR, gel electrophoresis and immunoblotting or chromatography (e.g. HPLC).
A reduction in activity may also refer to a complete or partial loss-of-function. Accordingly, the method may result in complete or partial loss of phosphorylase activity. By complete, may mean that no phosphorylase activity can be detected. The phosphorylase activity can be measured using any known technique in the art, for example, using native PAGE with phosphorylase activity staining (as in Figure 1), or using in vitro assays for phosphorylase that measure Glc-1-P incorporation into any glucan acceptor substrate. The incorporation may be quantified by measuring Glc-1-P depletion or release of inorganic phosphate.
In a particular embodiment, the method comprises reducing or abolishing the expression of at least one PHS1 nucleic acid and/or reducing or abolishing the activity of a PHS1 polypeptide in a starch storage organ, such as a tuber or grain. Alternatively, or additionally, the method comprises reducing or abolishing the expression of at least one PHS1 nucleic acid and/or reducing or abolishing the activity of a PHS1 polypeptide in an amyloplast, preferably in the endosperm of the plant. Alternatively, or additionally, the method comprises reducing or abolishing the expression of at least one PHS1 nucleic acid and/or reducing or abolishing the activity of a PHS1 polypeptide in the leaves of the plant. As a result, the characteristics of starch granules is altered in at least one plastid or plastid-type in a plant.
In one embodiment, the method comprises introducing at least one mutation into the, preferably endogenous, gene encoding PHS1 and/or the PHS1 promoter. The mutation may be any mutation that reduces or abolishes the phosphorylase activity of PHS1. As such, the mutation may be considered to be a partial or loss-of-function mutation.
The method may not comprise introducing a mutation into any other gene. In other words, only the PHS1 gene and/or promoter is mutated.
In another aspect of the invention there is provided a genetically altered plant, part thereof or plant cell characterised in that the expression of PHS1 is reduced or abolished or the plant does not express a functional or fully functional PHS1 protein. In one embodiment, the plant is a reduction (knock down) or loss of function (knock out) mutant wherein the function of the PHS1 nucleic acid sequence is reduced or lost compared to a wild type or control plant. Preferably, the plant is a knock out and not a knock down, meaning that PHS1 expression is abolished or significantly abolished (e.g. by at least 80%, 90% or 95% or more) or that the plant expresses a PHS1 protein with no detectable function and/or activity (e.g. phosphorylase function). To this end, a mutation is introduced into either the PHS1 gene sequence or the corresponding promoter sequence which disrupts the transcription of the gene or the function of the protein. The mutation may be any mutation that reduces or abolishes the phosphorylase activity of PHS1 .
In one embodiment, the plant has a bimodal size distribution of starch granules (such as, but not limited to) wheat, rye and barley, and preferably said bimodal distribution comprises a first population of starch granules of a first size or size range and a second population of starch granules of a second smaller size or size range. For example, the plant may comprise A-type and B-type granules, as described in detail above. Preferably the plant is characterised by a reduced number of starch granules in the second population and/or an increase in size of starch granules in the second population. As such, the plant, plant part or plant cell may be characterised by fewer (e.g. a decrease in number compared to a wild-type or control plant) B-type granules and/or larger (e.g. an increase in granule diameter compared to a wild-type or control plant) B-type granules).
The plant may also be characterised by an increase in the average granule size.
The plant may comprise at least one mutation in the promoter and/or at least one mutation in at least one gene encoding PHS1. In one embodiment the plant may comprise a mutation in both the promoter and the at least one gene for PHS1. Preferably said mutation is introduced into a least one plant cell and a plant regenerated from the at least one mutated plant cell.
The plant may not comprise a mutation introduced into any other gene. In other words, only the PHS1 gene and/or promoter is mutated.
The mutation may be introduced into the coding region of the PHS1 gene. Alternatively, the mutation may be in an intronic sequence, a splice site or the 5’IITR or 3’IITR. In a further alternative, at least one mutation or structural alteration may be introduced into the PHS1 promoter such that the PHS1 gene is either not expressed (i.e. expression is abolished) or expression is reduced, as defined herein. In an alternative embodiment, at least one mutation may be introduced into the PHS1 gene such that the altered gene does not express a full-length (i.e. expresses a truncated) PHS1 protein or does not express a fully functional PHS1 protein. In this manner, the (phosphorylase) activity of the PHS1 polypeptide can be considered to be reduced or abolished as described herein. In any case, the mutation may result in the expression of PHS1 with no, significantly reduced or altered biological activity in vivo.
By “at least one mutation” is meant that where the PHS1 gene is present as more than one copy or homeologue (with the same or slightly different sequence) there is at least one mutation in at least one gene. In one embodiment, all genes are mutated. In another embodiment, where the plant is a tetrapioid, for example tetrapioid wheat, the PHS1 gene is mutated on the A genome only or the A and B genome. In another embodiment, where the plant is a hexapioid, for example hexapioid wheat, the PHS1 gene is mutated on the A and/or B and/or D genome or more preferably, the A and B and D genomes
In one embodiment, the sequence of the PHS1 gene comprises or consists of a nucleic acid sequence selected from SEQ ID NO: 1 , 5, 9, 13, 17, 21 , 25, 29, 33 and 37 (genomic) or 2, 6, 10, 14, 18, 22, 26, 30, 34 and 38 (CDS) or a functional variant or homologue thereof and encodes a polypeptide as defined in one of SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 respectively or a functional variant or homologue thereof. The genomic DNA sequences of the 6A homeolog is shown in SEQ ID NO: 1 , 13, 25 and 33 respectively the 6B homelog is shown in SEQ ID NO: 17, 29 and 37, and the 6D (homeolog for e.g. Cadenza) is shown in SEQ ID NO: 21 , and functional variants thereof. The cDNA sequences are shown in SEQ ID NOs 14, 26 and 35 (6A), 18, 30 and 38 (6B) and 22 (6D- Cadenza).
As used throughout, by “PHS1 promoter” is meant a region extending at least or approx. 2 kbp upstream of the ATG codon of the PHS1 ORF. In one embodiment, the sequence of the PHS1 promoter comprises or consists of a nucleic acid sequence as defined in any one of SEQ ID NO: 4, 8, 12, 16, 20, 24, 28, 32, 36 and 40 or a functional variant or homologue thereof.. In one embodiment, the PHS1 promoter may also include 5’ UTR sequences.
In the above embodiments an ‘endogenous’ nucleic acid may refer to the native or natural sequence in the plant genome. Also included in the scope of this invention are functional variants (as defined herein) and homologs of the above identified sequences. Examples of homologs are shown in SEQ ID NOs: 41 to 48. Accordingly, in one embodiment, the homolog is barley and encodes a polypeptide selected from SEQ ID NOs: 43; or the homolog comprises or consists of a nucleic acid sequence selected from SEQ ID NOs: 41 or 42. In another embodiment, the PHS1 promoter homolog comprises or consists of a nucleic acid sequence as defined in SEQ ID NO: 44. In another embodiment, the homolog is rye and encodes a polypeptide selected from SEQ ID NOs: 43; or the homolog comprises or consists of a nucleic acid sequence selected from SEQ ID NOs: 41 or 42. In another embodiment, the PHS1 promoter homolog comprises or consists of a nucleic acid sequence as defined in SEQ ID NO: 44.
The term “functional variant” (or “variant”) as used herein with reference to any of the sequences described herein refers to a variant sequence or part of the sequence which retains the biological function of the full non-variant sequence. Accordingly, in the context of PHS1 , a functional variant also has phosphorylase activity. As used herein, a homolog may also be referred to as functional. That is, the homolog also has phosphorylase activity.
The functional variant may comprise variant of the PHS1 gene, which has sequence alterations that do not affect function, for example in non-conserved residues. Also encompassed is a variant that is substantially identical, i.e. has only some sequence variations, for example in non-conserved residues, compared to the wild type sequences as shown herein and is biologically active (e.g. has phosphorylase activity). Alterations in a nucleic acid sequence which result in the production of a different amino acid at a given site that do not affect the functional properties of the encoded polypeptide are well known in the art. For example, a codon for the amino acid alanine, a hydrophobic amino acid, may be substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result in substitution of one negatively charged residue for another, such as aspartic acid for glutamic acid, or one positively charged residue for another, such as lysine for arginine, can also be expected to produce a functionally equivalent product. Nucleotide changes which result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule would also not be expected to alter the activity of the polypeptide. Each of the proposed modifications is well within the routine skill in the art, as is determination of retention of biological activity of the encoded products.
In one embodiment, a functional variant has at least 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41 %, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51 %, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%,
61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71 %, 72%, 73%, 74%, 75%,
76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%,
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to the non-variant nucleic acid or amino acid sequence.
The term homolog, as used herein, also designates a PHS1 promoter or PHS1 gene orthologue from other plant species. A homolog may have, in increasing order of preference, at least 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41 %, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51 %,
52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61 %, 62%, 63%, 64%, 65%, 66%,
67%, 68%, 69%, 70%, 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81 %,
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%,
97%, 98%, or at least 99% overall sequence identity to the amino acid represented by any of SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 and 47 or to the nucleic acid sequences as shown by SEQ ID NOs: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37, 38, 41 , 42, 45 and 46. A PHS1 promoter orthologue may have, in increasing order of preference, at least 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41 %, 42%, 43%, 44%, 45%, 46%, 47%,
48%, 49%, 50%, 51 %, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61 %, 62%,
63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71 %, 72%, 73%, 74%, 75%, 76%, 77%,
78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%,
93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to the nucleic acid sequences as shown in SEQ ID NOs 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44 and 48. In one embodiment, overall sequence identity is at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, most preferably 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99%. Functional variants of PHS1 homologs as defined above are also within the scope of the invention.
PHS1 features a highly conserved Glycosyltransferase-35 (GT-35) domain typically found in starch and glycogen phosphorylases, and which constitutes most of the amino acid sequence (amino acids 77-962 of wheat PHS1-5A and PHS1-5D, and 80-965 of PHS1-5B of SEQ ID NO: 3, 11 , 15, 23, 27 or 35, or homologous positions in for example barley or rye). All plastidial phosphorylase isoforms also have an “L78 insertion” that is proposed to be involved in substrate selection (amino acids 483-559 of wheat PHS1-5A and PHS1-5D, and 486-562 of PHS1-5B of SEQ ID NO: 3, 11 , 15, 23, 27 or 35 or homologous positions in for example barley or rye). Accordingly, in one embodiment the PHS1 protein comprises at least one conserved GT-35 domain and/or L78 insertion domain and one mutation is introduced into at least GT-35 domain and/or L78 insertion domain to affect protein function. The mutation may encode an amino acid substitution that disrupts the function of the GT-35 domain and/or the L78 insertion domain.
Accordingly, in a further embodiment, the PHS1 nucleic acid (coding) sequence encodes a PHS1 protein with at least one conserved domain, wherein the conserved domain is a GT-35 domain and/or L78 insertion domain, as described above.
Two nucleic acid sequences or polypeptides are said to be "identical" if the sequence of nucleotides or amino acid residues, respectively, in the two sequences is the same when aligned for maximum correspondence as described below. The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence over a comparison window, as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. When percentage of sequence identity is used in reference to proteins or peptides, it is recognised that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acids residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art. For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters. Non-limiting examples of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms. The overall sequence identity of a variant can be determined using any number of sequence alignment programs known in the art. As an example, Emboss
Stretcher from the EMBL-EBI may be used: default ir extend = 4 for
The skilled person would understand that suitable homologues and the homologous positions in these sequences can be identified by sequence comparisons (e.g. BLAST, alignments) and identifications of conserved domains. Phylogenetic tree analysis using nucleotide or amino acid sequences can be used to establish orthology to PHS1. There are predictors in the art that can be used to identify such sequences. The function of the homologue can be identified as described herein and a skilled person would thus be able to confirm the function, for example using a phosphorylase assay. Homologous positions or as used herein “corresponding positions in homologous sequences” can thus be determined by performing sequence alignments once the homologous sequence has been identified. For example, homologues can be identified using a BLAST search of the plant genome of interest using the wheat PHS1 as a query (i.e. one of the sequences defined in SEQ ID NOs: 1 to 40).
Thus, the nucleotide sequences of the invention and described herein can also be used to isolate corresponding sequences from other organisms, particularly other plants, for example crop plants. In this manner, methods such as PCR, hybridization, and the like can be used to identify such sequences based on their sequence homology to the sequences described herein. Topology of the sequences and the characteristic domains structure can also be considered when identifying and isolating homologs. Sequences may be isolated based on their sequence identity to the entire sequence or to fragments thereof. In hybridization techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen plant. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labelled with a detectable group, or any other detectable marker. Methods for preparation of probes for hybridization and for construction of cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook, et al., (1989) Molecular Cloning: A Library Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, New York).
Hybridization of such sequences may be carried out under stringent conditions. By "stringent conditions" or "stringent hybridization conditions" is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, preferably less than 500 nucleotides in length.
Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). Duration of hybridization is generally less than 24 hours, usually about 4 to 12. Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
In a further embodiment, a variant as used herein can comprise a nucleic acid sequence encoding a PHS1 polypeptide as defined herein that is capable of hybridising under stringent conditions as defined herein to a nucleic acid sequence as defined herein, such as in any of SEQ ID NOs: 1 , 5, 9, 13, 17, 21 , 25, 29, 33, 37, 41 and 45 (genomic) or 2, 6, 10, 14, 18, 22, 26, 30, 34 and 38, 42 and 46 (CDS).
In one embodiment, the method comprises reducing or abolishing the expression of at least one nucleic acid encoding a PHS1 polypeptide or reducing or abolishing the activity of an PHS1 polypeptide, as described herein, wherein the method comprises introducing at least one mutation into at least one PHS1 gene and/or promoter, wherein the PHS1 gene comprises or consists of a. a nucleic acid sequence encoding a polypeptide as defined in one of SEQ ID NOs: SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 and 47.; or b. a nucleic acid sequence as defined in one of SEQ ID NOs: 1 , 5, 9, 13, 17, 21 , 25, 29, 33, 37, 41 and 45 (genomic) or 2, 6, 10, 14, 18, 22, 26, 30, 34 and 38, 42 and 46 (CDS) or c. a nucleic acid sequence with at least 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to either (a) or (b); or d. a nucleic acid sequence encoding a PHS1 polypeptide as defined herein that is capable of hybridising under stringent conditions as defined herein to the nucleic acid sequence of any of (a) to (c); and wherein the PHS1 promoter comprises or consists of e. a nucleic acid sequence as defined in one of SEQ ID NOs: 4, 8, 12, 16, 20, 24,
28, 32, 36, 40, 44 and 48 ; f. a nucleic acid sequence with at least 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to (e); or g. a nucleic acid sequence capable of hybridising under stringent conditions as defined herein to the nucleic acid sequence of any of (e) to (f).
In another embodiment, the genetically altered plant, plant part thereof or plant cell reduced or abolished expression of at least one nucleic acid encoding a PHS1 polypeptide or reduced or abolished activity of an PHS1 polypeptide, as described herein, wherein the genetically altered plant, part thereof or plant cell comprises at least one mutation in at least one PHS1 gene and/or promoter, wherein the PHS1 gene comprises or consists of a. a nucleic acid sequence encoding a polypeptide as defined in one of SEQ ID NOs: SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35, 39, 43 and 47; or b. a nucleic acid sequence as defined in one of SEQ ID NOs: 1 , 5, 9, 13, 17, 21 , 25,
29, 33, 37, 41 and 45 (genomic) or 2, 6, 10, 14, 18, 22, 26, 30, 34 and 38, 42 and 46 (CDS) or c. a nucleic acid sequence with at least 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to either (a) or (b); or d. a nucleic acid sequence encoding a PHS1 polypeptide as defined herein that is capable of hybridising under stringent conditions as defined herein to the nucleic acid sequence of any of (a) to (c); and wherein the PHS1 promoter comprises or consists of e. a nucleic acid sequence as defined in one of SEQ ID NOs: 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44 and 48 ; f. a nucleic acid sequence with at least 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% overall sequence identity to (e); or g. a nucleic acid sequence capable of hybridising under stringent conditions as defined herein to the nucleic acid sequence of any of (e) to (f).
In one embodiment, a mutation is introduced into two PHS1 genes, wherein the first PHS1 gene encodes a protein selected from SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 (preferably the PHS1 gene comprises or consists of a nucleic acid sequence selected from SEQ ID NO: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37 and 38) and wherein the second PHS1 gene encodes a protein selected from (different from the first) SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 (preferably the PHS1 gene comprises or consists of a nucleic acid sequence selected, different from the first, from SEQ ID NO: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37 and 38).
In a further embodiment, a mutation is introduced into three PHS1 genes, wherein the first PHS1 gene encodes a protein as defined in SEQ ID NO: SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 (preferably the PHS1 gene comprises or consists of a nucleic acid sequence selected from SEQ ID NO: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37and 38); and wherein the second PHS1 gene encodes a protein selected from (different from the first) SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 (preferably the PHS1 gene comprises or consists of a nucleic acid sequence selected, different from the first, from SEQ ID NO: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37 and 38); and wherein the third PHS1 gene encodes a protein selected from (different from the first and second) SEQ ID NO: 3, 7, 11 , 15, 19, 23, 27, 31 , 35 and 39 (preferably the PHS1 gene comprises or consists of a nucleic acid sequence selected, different from the first and second, from SEQ ID NO: 1 , 2, 5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37 and 38. In a preferred embodiment, the mutation that is introduced into the endogenous PHS1 gene or promoter thereof to alter the biological activity and/or expression levels of the PHS1 gene or protein can be selected from the following mutation types:
1. a "missense mutation", which is a change in the nucleic acid sequence that results in the substitution of one amino acid for another amino acid;
2. a "nonsense mutation" or "STOP codon mutation", which is a change in the nucleic acid sequence that results in the introduction of a premature STOP codon and, thus, the termination of translation (resulting in a truncated protein); in plants, the translation stop codons may be selected from "TGA" (UGA in RNA), "TAA" (UAA in RNA) and "TAG" (UAG in RNA); thus any nucleotide substitution, insertion, deletion which results in one of these codons to be in the mature mRNA being translated (in the reading frame) will terminate translation.
3. an "insertion mutation" of one or more nucleotides or one or more amino acids, due to one or more codons having been added in the coding sequence of the nucleic acid;
4. a "deletion mutation" of one or more nucleotides or of one or more amino acids, due to one or more codons having been deleted in the coding sequence of the nucleic acid;
5. a "frameshift mutation", resulting in the nucleic acid sequence being translated in a different frame downstream of the mutation. A frameshift mutation can have various causes, such as the insertion, deletion or duplication of one or more nucleotides.
6. a “splice site” mutation, which is a mutation that results in the insertion, deletion or substitution of a nucleotide at the site of splicing (i.e. either a splice acceptor or splice donor mutation) where any one or more of the above mutations leads to a loss or partial loss of function in PHS1 (e.g. a complete or partial loss of phosphorylase activity).
In one embodiment, the mutation is a STOP codon mutation. For example, the mutation may be a G to A mutation at position 1755 of SEQ ID NO: 33 or a corresponding position in a homologous sequence (this may be referred to as the K4533 mutation); or a G to A mutation at position 2025 of SEQ ID NO: 37 or a corresponding position in a homologous sequence (this may be referred to as the K2864 mutation).
In another embodiment, the mutation is a splice acceptor mutation. For example, the mutation may be a G to A mutation at position 4790 of SEQ ID NO: 33 or a corresponding position in a homologous sequence. This may be referred to as the K4367 mutation.
In another embodiment, the mutation is a splice donor mutation. For example, the mutation may be a G to A mutation at position 2287 of SEQ ID NO: 37 or a corresponding position in a homologous sequence. This may be referred to as the K0238 mutation.
In one embodiment, the method may comprise introducing one or more of the K4533, K4367, K0238 and K2864 (or homologous mutations) as described herein into a PHS1 nucleic acid. In a further embodiment, the method may comprise introducing the following mutations. a K4367 and a K2864 mutation (or homologous mutations) into a PHS1-1 nucleic acid. A plant comprising both mutations is described herein as PHS1-1 ; or a K4533 and a K0238 mutation (or homologous mutations) into a PHS1 nucleic acid. A plant comprising both mutations is described herein as PHS1-6.
In general, the skilled person will understand that at least one mutation as defined above and which leads to the insertion, deletion or substitution of at least one nucleic acid or amino acid compared to the wild-type PHS1 promoter or PHS1 nucleic acid or protein sequence can affect the biological activity of the PHS1 protein.
In one embodiment a mutation may be introduced into the PHS1 promoter and at least one mutation is introduced into the PHS1 gene.
In one embodiment, the mutation is introduced using mutagenesis or targeted genome editing. That is, in one embodiment, the invention relates to a method and plant that has been generated by genetic engineering methods as described above, and does not encompass naturally occurring varieties. Targeted genome modification or targeted genome editing is a genome engineering technique that uses targeted DNA double-strand breaks (DSBs) to stimulate genome editing through homologous recombination (HR)-mediated recombination events. To achieve effective genome editing via introduction of site-specific DNA DSBs, four major classes of customisable DNA binding proteins can be used: meganucleases derived from microbial mobile genetic elements, ZF nucleases based on eukaryotic transcription factors, transcription activator-like effectors (TALEs) from Xanthomonas bacteria, and the RNA-guided DNA endonuclease Cas9 from the type II bacterial adaptive immune system CRISPR (clustered regularly interspaced short palindromic repeats). Meganuclease, ZF, and TALE proteins all recognize specific DNA sequences through protein-DNA interactions. Although meganucleases integrate nuclease and DNA-binding domains, ZF and TALE proteins consist of individual modules targeting 3 or 1 nucleotides (nt) of DNA, respectively. ZFs and TALEs can be assembled in desired combinations and attached to the nuclease domain of Fokl to direct nucleolytic activity toward specific genomic loci.
Upon delivery into host cells via the bacterial type III secretion system, TAL effectors enter the nucleus, bind to effector-specific sequences in host gene promoters and activate transcription. Their targeting specificity is determined by a central domain of tandem, 33-35 amino acid repeats. This is followed by a single truncated repeat of 20 amino acids. The majority of naturally occurring TAL effectors examined have between 12 and 27 full repeats.
These repeats only differ from each other by two adjacent amino acids, their repeatvariable di-residue (RVD). The RVD that determines which single nucleotide the TAL effector will recognize: one RVD corresponds to one nucleotide, with the four most common RVDs each preferentially associating with one of the four bases. Naturally occurring recognition sites are uniformly preceded by a T that is required for TAL effector activity. TAL effectors can be fused to the catalytic domain of the Fokl nuclease to create a TAL effector nuclease (TALEN) which makes targeted DNA double-strand breaks (DSBs) in vivo for genome editing. The use of this technology in genome editing is well described in the art, for example in US 8,440,431 , US 8,440,432 and US 8,450,471. Cermak T et al. describes a set of customized plasmids that can be used with the Golden Gate cloning method to assemble multiple DNA fragments. As described therein, the Golden Gate method uses Type IIS restriction endonucleases, which cleave outside their recognition sites to create unique 4 bp overhangs. Cloning is expedited by digesting and ligating in the same reaction mixture because correct assembly eliminates the enzyme recognition site. Assembly of a custom TALEN or TAL effector construct and involves two steps: (i) assembly of repeat modules into intermediary arrays of 1-10 repeats and (ii) joining of the intermediary arrays into a backbone to make the final construct. Accordingly, using techniques known in the art it is possible to design a TAL effector that targets a PHS1 gene or promoter sequence as described herein.
Another genome editing method that can be used according to the various aspects of the invention is CRISPR. The use of this technology in genome editing is well described in the art, for example in US 8,697,359 and references cited herein. In short, CRISPR is a microbial nuclease system involved in defense against invading phages and plasmids. CRISPR loci in microbial hosts contain a combination of CRISPR-associated (Cas) genes as well as non-coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage (sgRNA). Three types (l-lll) of CRISPR systems have been identified across a wide range of bacterial hosts. One key feature of each CRISPR locus is the presence of an array of repetitive sequences (direct repeats) interspaced by short stretches of non-repetitive sequences (spacers). The non-coding CRISPR array is transcribed and cleaved within direct repeats into short crRNAs containing individual spacer sequences, which direct Cas nucleases to the target site (protospacer). The Type II CRISPR is one of the most well characterized systems and carries out targeted DNA double-strand break in four sequential steps. First, two noncoding RNA, the pre-crRNA array and tracrRNA, are transcribed from the CRISPR locus. Second, tracrRNA hybridizes to the repeat regions of the pre-crRNA and mediates the processing of pre-crRNA into mature crRNAs containing individual spacer sequences. Third, the mature crRNA:tracrRNA complex directs Cas9 to the target DNA via Watson- Crick base-pairing between the spacer on the crRNA and the protospacer on the target DNA next to the protospacer adjacent motif (PAM), an additional requirement for target recognition. Finally, Cas9 mediates cleavage of target DNA to create a double-stranded break within the protospacer.
One major advantage of the CRISPR-Cas9 system, as compared to conventional gene targeting and other programmable endonucleases is the ease of multiplexing, where multiple genes can be mutated simultaneously simply by using multiple sgRNAs each targeting a different gene. In addition, where two sgRNAs are used flanking a genomic region, the intervening section can be deleted or inverted (Wiles et al., 2015).
Cas9 is thus the hallmark protein of the type II CRISPR-Cas system, and is a large monomeric DNA nuclease guided to a DNA target sequence adjacent to the PAM (protospacer adjacent motif) sequence motif by a complex of two noncoding RNAs: CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA). The Cas9 protein contains two nuclease domains homologous to RuvC and HNH nucleases. The HNH nuclease domain cleaves the complementary DNA strand whereas the RuvC-like domain cleaves the non-complementary strand and, as a result, a blunt cut is introduced in the target DNA. Heterologous expression of Cas9 together with an sgRNA can introduce site-specific double strand breaks (DSBs) into genomic DNA of live cells from various organisms. For applications in eukaryotic organisms, codon optimized versions of Cas9, which is originally from the bacterium Streptococcus pyogenes, have been used.
The single guide RNA (sgRNA) is the second component of the CRISPR/Cas system that forms a complex with the Cas9 nuclease. sgRNA is a synthetic RNA chimera created by fusing crRNA with tracrRNA. The sgRNA guide sequence located at its 5' end confers DNA target specificity. Therefore, by modifying the guide sequence, it is possible to create sgRNAs with different target specificities. The canonical length of the guide sequence is 20 bp. In plants, sgRNAs have been expressed using plant RNA polymerase III promoters, such as U6 and U3. Accordingly, using techniques known in the art it is possible to design sgRNA molecules that targets a PHS1 gene or promoter sequence as described herein. In one embodiment, the method comprises using any of the nucleic acid constructs or sgRNA molecules described herein.
Cas9 expression plasmids for use in the methods of the invention can be constructed as described in the art.
In one embodiment, the method uses a sgRNA to introduce a targeted SNP or mutation, in particular one of the substitutions described herein, into a PHS1 gene. As explained herein, the introduction of a template DNA strand, following a sgRNA-mediated snip in the double-stranded DNA, can be used to produce a specific targeted mutation (i.e. a SNP) in the gene using homology directed repair. In an alternative embodiment, at least one mutation may be introduced into the PHS1 gene and/or promoter, particularly at the positions described above, using any CRISPR technique known to the skilled person. In another example, sgRNA (for example, as described herein) can be used with a modified Cas9 protein, such as nickase Cas9 or nCas9 or a “dead” Cas9 (dCas9) fused to a “Base Editor” - such as an enzyme, for example a deaminase such as cytidine deaminase, or TadA (tRNA adenosine deaminase) or ADAR or APOBEC. These enzymes are able to substitute one base for another. As a result no DNA is deleted, but a single substitution is made.
Alternatively, more conventional mutagenesis methods can be used to introduce at least one mutation into a PHS1 gene or PHS1 promoter sequence. These methods include both physical and chemical mutagenesis. A skilled person will know further approaches can be used to generate such mutants, and methods for mutagenesis and polynucleotide alterations are well known in the art. See, for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Patent No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company, New York) and the references cited therein.
In one embodiment, insertional mutagenesis is used, for example using T-DNA mutagenesis (which inserts pieces of the T-DNA from the Agrobacterium tumefaciens Ti-Plasmid into DNA causing r loss of gene function mutations); site-directed nucleases (SDNs) or transposons as a mutagen. Insertional mutagenesis is an alternative means of disrupting gene function and is based on the insertion of foreign DNA into the gene of interest (see Krysan et al, The Plant Cell, Vol. 11 , 2283-2290, December 1999). Accordingly, in one embodiment, T-DNA is used as an insertional mutagen to disrupt PHS1 gene or PHS1 promoter expression. T-DNA not only disrupts the expression of the gene into which it is inserted, but also acts as a marker for subsequent identification of the mutation. Since the sequence of the inserted element is known, the gene in which the insertion has occurred can be recovered, using various cloning or PCR-based strategies. The insertion of a piece of T-DNA in the order of 5 to 25 kb in length generally produces a disruption of gene function. If a large enough population of T-DNA transformed lines is generated, there are reasonably good chances of finding a transgenic plant carrying a T-DNA insert within any gene of interest. Transformation of spores with T-DNA is achieved by an Agrobacterium-mediated method which involves exposing plant cells and tissues to a suspension of Agrobacterium cells. The details of this method are well known to a skilled person. In short, plant transformation by Agrobacterium results in the integration into the nuclear genome of a sequence called T-DNA, which is carried on a bacterial plasmid. The use of T-DNA transformation leads to stable single insertions. Further mutant analysis of the resultant transformed lines is straightforward and each individual insertion line can be rapidly characterized by direct sequencing and analysis of DNA flanking the insertion. Gene expression in the mutant is compared to expression of the PHS1 nucleic acid sequence in a wild type plant and phenotypic analysis is also carried out.
In another embodiment, mutagenesis is physical mutagenesis, such as application of ultraviolet radiation, X-rays, gamma rays, fast or thermal neutrons or protons. The targeted population can then be screened to identify an PHS1 mutant with reduced expression or activity.
In another embodiment of the various aspects of the invention, the method comprises mutagenizing a plant population with a mutagen. The mutagen may be a fast neutron irradiation or a chemical mutagen, for example selected from the following non-limiting list: ethyl methanesulfonate (EMS), methylmethane sulfonate (MMS), N-ethyl-N- nitrosurea (ENU), triethylmelamine (1'EM), N-methyl-N-nitrosourea (MNU), procarbazine, chlorambucil, cyclophosphamide, diethyl sulfate, acrylamide monomer, melphalan, nitrogen mustard, vincristine, dimethylnitosamine, N-methyl-N'-nitro- Nitrosoguanidine (MNNG), nitrosoguanidine, 2-aminopurine, 7,12 dimethyl- benz(a)anthracene (DMBA), ethylene oxide, hexamethylphosphoramide, bisulfan, diepoxyalkanes (diepoxyoctane (DEO), diepoxybutane (BEB), and the like), 2-methoxy- 6-chloro-9 [3-(ethyl-2-chloroethyl)aminopropylamino]acridine dihydrochloride (ICR-170) or formaldehyde. Again, the targeted population can then be screened to identify a PHS1 gene or promoter mutant.
In another embodiment, the method used to create and analyse mutations is targeting induced local lesions in genomes (TILLING), reviewed in Henikoff et al, 2004. In this method, seeds are mutagenised with a chemical mutagen, for example EMS. The resulting M1 plants are self-fertilised and the M2 generation of individuals is used to prepare DNA samples for mutational screening. DNA samples are pooled and arrayed on microtiter plates and subjected to gene specific PCR. The PCR amplification products may be screened for mutations in the PHS1 target gene or promoter using any method that identifies heteroduplexes between wild type and mutant genes. For example, but not limited to, denaturing high pressure liquid chromatography (dHPLC), constant denaturant capillary electrophoresis (CDCE), temperature gradient capillary electrophoresis (TGCE), or by fragmentation using chemical cleavage. Preferably the PCR amplification products are incubated with an endonuclease that preferentially cleaves mismatches in heteroduplexes between wild type and mutant sequences. Cleavage products are electrophoresed using an automated sequencing gel apparatus, and gel images are analyzed with the aid of a standard commercial image-processing program. Any primer specific to the PHS1 nucleic acid sequence may be utilized to amplify the PHS1 nucleic acid sequence within the pooled DNA sample. Preferably, the primer is designed to amplify the regions of the PHS1 gene where useful mutations are most likely to arise, specifically in the areas of the PHS1 gene that are highly conserved and/or confer activity as explained elsewhere. To facilitate detection of PCR products on a gel, the PCR primer may be labelled using any conventional labelling method. In an alternative embodiment, the method used to create and analyse mutations is EcoTILLING. EcoTILLING is molecular technique that is similar to TILLING, except that its objective is to uncover natural variation in a given population as opposed to induced mutations. The first publication of the EcoTILLING method was described in Comai et al. 2004.
Rapid high-throughput screening procedures thus allow the analysis of amplification products for identifying a mutation conferring the reduction or inactivation of the expression of the PHS1 gene as compared to a corresponding non-mutagenised wild type plant. Once a mutation is identified in a gene of interest, the seeds of the M2 plant carrying that mutation are grown into adult M3 plants and screened for the phenotypic characteristics associated with the target gene PHS1. Loss of and reduced function mutants with an altered starch granule size and/or number compared to a control can thus be identified.
In an alternative embodiment, the expression of the PHS1 gene may be reduced at either the level of transcription or translation. For example, expression of a PHS1 nucleic acid or PHS1 promoter sequence, as defined herein, can be reduced or silenced using a number of gene silencing methods known to the skilled person, such as, but not limited to, the use of small interfering nucleic acids (siNA) against PHS1. “Gene silencing" is a term generally used to refer to suppression of expression of a gene via sequence-specific interactions that are mediated by RNA molecules. The degree of reduction may be so as to totally abolish production of the encoded gene product, but more usually the abolition of expression is partial, with some degree of expression remaining. The term should not therefore be taken to require complete "silencing" of expression.
In one embodiment, the siNA may include, short interfering RNA (siRNA), doublestranded RNA (dsRNA), micro-RNA (miRNA), antagomirs and short hairpin RNA (shRNA) capable of mediating RNA interference. The inhibition of expression and/or activity can be measured by determining the presence and/or amount of PHS1 transcript using techniques well known to the skilled person (such as Northern Blotting, RT-PCR and so on).
Thus, according to the various aspects of the invention a plant may be transformed to introduce a RNAi, shRNA, snRNA, dsRNA, siRNA, miRNA, ta-siRNA, amiRNA or cosuppression molecule that has been designed to target the expression of an PHS1 nucleic acid sequence and selectively decreases or inhibit the expression of the gene or stability of its transcript. Preferably, the RNAi, snRNA, dsRNA, shRNA siRNA, miRNA, amiRNA, ta-siRNA or cosuppression molecule used according to the various aspects of the invention comprises a fragment of at least 17 nt, preferably 22 to 26 nt and can be designed on the basis of the information shown in any of SEQ ID NOs: 1 , 2,5, 6, 9, 10, 13, 14, 17, 18, 21 , 22, 25, 26, 29, 30, 33, 34, 37, 38, 41 , 42, 45 or 46. Guidelines for designing effective siRNAs are known to the skilled person.
Plants obtained or obtainable by such method which carry a mutation in the endogenous PHS1 gene or promoter locus are also within the scope of the invention, where preferably the mutation is a loss or partial loss of function mutation as described above.
Accordingly, the genetically altered plant or plant cell may alternatively comprise a nucleic acid construct expressing an RNAi molecule targeting the PHS1 gene as described herein. In one embodiment, said construct is stably incorporated into the plant genome. These techniques also include gene targeting using vectors that target the PHS1 gene and which allows for integration of a transgene at a specific site. The targeting construct is engineered to recombine with the target gene (PHS1), which is accomplished by incorporating sequences from the gene itself into the construct. Recombination then occurs in the region of that sequence within the gene, resulting in the insertion of a foreign sequence to disrupt the gene. With its sequence interrupted, the altered gene will be translated into a nonfunctional protein, if it is translated at all.
In a further aspect of the invention, there is provided a method of altering a physiochemical property of starch, the method comprising reducing the expression of at least one PHS1 nucleic acid and/or reducing the activity of a PHS1 polypeptide as described above. The physiochemical property may be selected from gelatinisation temperature, swelling power and viscosity. An increase as used herein may be at least 1 %, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10% 11 %, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 30%, 40% or 50% in comparison to a control plant.
In another aspect of the invention, there is provided a method of shifting a plant with a bimodal size distribution of starch granules towards a unimodal distribution of starch granules, the method comprising reducing or abolishing the expression of at least one PHS1 nucleic acid and/or reducing or abolishing the activity of a PHS1 polypeptide as described above.
In a further aspect of the invention, there is provided a method of altering the initiation of B-type granule formation in a starch storage organ of a plant, the method comprising reducing or abolishing the expression of at least one PHS1 nucleic acid and/or reducing or abolishing the activity of at least one PHS1 polypeptide as described above. Preferably, said method does not affect initiation of A-type granule formation.
As described above, preferably the plant has a bimodal size distribution of starch granules, such as wheat, rye and barley, wherein preferably said bimodal distribution comprises a first population of starch granules of a first size or size range and a second population of starch granules of a second size or size range. In one embodiment the plant is selected from the Triticeae tribe.
In one embodiment, the method has no (obvious detectable) effect on total starch content and/or plant growth. By “plant growth” here is meant for example, but not limited to, grain yield, thousand grain weight and grain size. In an embodiment, the method has no (detectable) effect on the soluble sugar content of grains. Examples of soluble sugars include but are not limited to maltooligosacharides, glucose, fructose and sucrose.
In an embodiment, the method has no (detectable) effect on A-type granule initiation.
In an embodiment, the method disrupts or stops the interaction between PHS1 and BGC1 , altering the initiation of B-type granule formation, resulting in fewer and/or larger B-type granules. Preferably, A-type granule initiation is not (detectably) affected.
In another aspect of the invention there is provided a method for producing a genetically altered plant as described herein. In one embodiment, the method comprises introducing at least one mutation into the PHS1 gene and/or PHS1 promoter of preferably at least one plant cell using any mutagenesis technique described herein. Preferably said method further comprising regenerating a plant from the mutated plant cell.
The method may further comprise selecting one or more mutated plants, preferably for further propagation. Preferably said selected plants comprise at least one mutation in the PHS1 gene and/or promoter sequence. Preferably said plants are characterised by reduced or abolished PHS1 expression and/or a reduced or abolished levels of PHS1 polypeptide activity. Expression and/or activity levels (e.g. phosphorylase activity) of PHS1 can be measured by any standard technique known to the skilled person, including those described herein. A reduction is as described herein.
The selected plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, a first generation (or T1) transformed plant may be selfed and homozygous second-generation (or T2) transformants selected, and the T2 plants may then further be propagated through classical breeding techniques. The generated transformed organisms may take a variety of forms. For example, they may be chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells transformed to contain the expression cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). In a further aspect of the invention there is provided a plant obtained or obtainable by the above described methods.
For the purposes of the invention, a “genetically altered plant” or “mutant plant” is a plant that has been genetically altered compared to the naturally occurring wild type (WT) plant. In one embodiment, a mutant plant is a plant that has been altered compared to the naturally occurring wild type (WT) plant using a mutagenesis method, such as any of the mutagenesis methods described herein. In one embodiment, the mutagenesis method is targeted genome modification or genome editing. In one embodiment, the plant genome has been altered compared to wild type sequences using a mutagenesis method. Such plants have an altered phenotype as described herein, such as an altered starch granule characteristic. Therefore, in this example, an altered starch granule characteristic is conferred by the presence of an altered plant genome, for example, a mutated endogenous PHS1 gene or PHS1 promoter sequence. In one embodiment, the endogenous promoter or gene sequence is specifically targeted using targeted genome modification and the presence of a mutated gene or promoter sequence is not conferred by the presence of transgenes expressed in the plant. In other words, the genetically altered plant can be described as transgene-free.
A plant according to the various aspects of the invention, including the transgenic plants, methods and uses described herein may be a monocot or a dicot plant. Preferably, the plant is a crop plant or a biofuel plant. By crop plant is meant any plant which is grown on a commercial scale for human or animal consumption or use. In a preferred embodiment, the plant is a cereal. In one embodiment the plant is not Arabidopsis. In another embodiment, the plant is not rice.
In a most preferred embodiment, the plant is a plant that has a bimodal distribution of starch granules. For example, in one embodiment the plant is selected from the Triticeae tribe, including but not limited to wheat, barley and rye. In a further embodiment, the plant is wheat. In a further embodiment, the plant is barley.
The term "plant" as used herein encompasses whole plants, ancestors and progeny of the plants and plant parts, including seeds, fruit, shoots, stems, leaves, roots (including tubers), flowers, tissues and organs, wherein each of the aforementioned comprise the nucleic acid construct as described herein. The term "plant" also encompasses plant cells, suspension cultures, callus tissue, embryos, meristematic regions, gametophytes, sporophytes, pollen and microspores, again wherein each of the aforementioned comprises the nucleic acid construct as described herein.
The invention also extends to harvestable parts of a plant of the invention as described herein, but not limited to seeds, leaves, fruits, flowers, stems, roots, rhizomes, tubers and bulbs. The aspects of the invention also extend to products derived, preferably directly derived, from a harvestable part of such a plant, such as dry pellets or powders, oil, fat and fatty acids, starch or proteins. Another product that may derived from the harvestable parts of the plant of the invention is biodiesel. The invention also relates to food products and food supplements comprising the plant of the invention or parts thereof. In one embodiment, the food products may be animal feed. In another aspect of the invention, there is provided a product derived from a plant as described herein or from a part thereof.
The terms “seed” and “grain” as used herein can be used interchangeably. The terms "increase", "improve" or "enhance" as used herein are also interchangeable. Similarly, the terms starch “grain” or “granule” are also interchangeable.
In a most preferred embodiment, the plant part or harvestable product is a seed or grain. Therefore, in a further aspect of the invention, there is provided a seed or grain produced from a genetically altered plant as described herein.
In another embodiment of the invention, the plant part is a starch storage organ comprising starch granules which have a mean granule size which is increased by 3% or more, 5% or more, 10% or more, 15% or more, 20% or more, 25% or more, 30% or more, 35% or more, 40% or more or 45% or more compared to the mean granule size in a control or wild-type plant.
In an alternative embodiment, the plant part is pollen, a propagule or progeny of the genetically altered plant described herein. Accordingly, in a further aspect of the invention there is provided pollen, a propagule or progeny of the genetically altered plant as described herein. In another aspect of the invention, there is provided starch or a starch component obtained or obtainable from at least one plant cell of the genetically altered plant described herein or the grain of a genetically altered plant described herein.
There is also provided a food or feed composition prepared from the grain or starch or starch composition described herein.
Finally, there is also provided the use of the grain or starch described herein as a food or feedstuff, in biofuel (bioethanol) production or in any pharmaceutical, cosmetic or industrial application. Examples of industrial applications include the brewing, papermaking and plastic industries. Examples of food include bread, biscuits, baked goods based on wheat flour and pasta.
A control plant as used herein according to all of the aspects of the invention is a plant which has not been modified according to the methods of the invention. Accordingly, in one embodiment, the control plant does not have altered expression of a PHS1 nucleic acid and/or altered activity of a PHS1 polypeptide, as described herein. In an alternative embodiment, the plant been genetically modified, as described above. In one embodiment, the control plant is a wild type plant. The control plant is typically of the same plant species, preferably having the same genetic background as the modified plant.
Genome editing constructs for use with the methods for targeted genome modification described herein
By “crRNA” or CRISPR RNA is meant the sequence of RNA that contains the protospacer element and additional nucleotides that are complementary to the tracrRNA.
By “tracrRNA” (transactivating RNA) is meant the sequence of RNA that hybridises to the crRNA and binds a CRISPR enzyme, such as Cas9 thereby activating the nuclease complex to introduce double-stranded breaks at specific sites within the genomic sequence of at least one PHS1 nucleic acid or promoter sequence. By “protospacer element” is meant the portion of crRNA (or sgRNA) that is complementary to the genomic DNA target sequence, usually around 20 nucleotides in length. This may also be known as a spacer or targeting sequence.
By “sgRNA” (single-guide RNA) is meant the combination of tracrRNA and crRNA in a single RNA molecule, preferably also including a linker loop (that links the tracrRNA and crRNA into a single molecule). “sgRNA” may also be referred to as “gRNA" and in the present context, the terms are interchangeable. The sgRNA or gRNA provide both targeting specificity and scaffolding/binding ability for a Cas nuclease. A gRNA may refer to a dual RNA molecule comprising a crRNA molecule and a tracrRNA molecule.
By “TAL effector” (transcription activator-like (TAL) effector) or TALE is meant a protein sequence that can bind the genomic DNA target sequence (a sequence within the PHS1 gene or promoter sequence) and that can be fused to the cleavage domain of an endonuclease such as Fokl to create TAL effector nucleases or TALENS or meganucleases to create megaTALs. A TALE protein is composed of a central domain that is responsible for DNA binding, a nuclear-localisation signal and a domain that activates target gene transcription. The DNA-binding domain consists of monomers and each monomer can bind one nucleotide in the target nucleotide sequence. Monomers are tandem repeats of 33-35 amino acids, of which the two amino acids located at positions 12 and 13 are highly variable (repeat variable diresidue, RVD). It is the RVDs that are responsible for the recognition of a single specific nucleotide. HD targets cytosine; Nl targets adenine, NG targets thymine and NN targets guanine (although NN can also bind to adenine with lower specificity).
In another aspect of the invention there is provided a nucleic acid construct wherein the nucleic acid construct comprises a nucleic acid sequence that encodes at least one DNA- binding domain. In one embodiment, the DNA-binding domain can bind to a sequence in the PHS1 gene and/or promoter. Preferably said sequence is selected from one of SEQ ID NO: 49 to 52, 84 and 85 and are target sequences in a PHS1 gene. In one embodiment, the nucleic acid construct comprises one or more DNA-binding domains, such that the construct can bind to one or more, preferably at least two or three sequences in the PHS1 gene. In a further embodiment, said construct further comprises a nucleic acid encoding at least one sequence specific nuclease (SSN) such as Fokl or a Cas protein.
In one embodiment, the nucleic acid construct encodes at least one protospacer element wherein the sequence of the protospacer element is selected from SEQ ID NO: 53 to 56, 86 and 87 or a variant thereof. In one example, the nucleic acid construct may comprise one, two or three protospacer sequences, wherein the sequence of the protospacer sequences is selected from SEQ ID NO:53, 54, 55, 56, 86 and 87.
In a further embodiment, the nucleic acid construct comprises a crRNA-encoding sequence. As defined above, a crRNA sequence may comprise the protospacer elements as defined above and preferably additional nucleotides that are complementary to the tracrRNA. An appropriate sequence for the additional nucleotides will be known to the skilled person as these are defined by the choice of Cas protein.
In another embodiment, the nucleic acid construct further comprises a tracrRNA sequence. Again, an appropriate tracrRNA sequence would be known to the skilled person as this sequence is defined by the choice of Cas protein. Nonetheless, in one embodiment said sequence comprises or consists of a sequence as defined in SEQ ID NO:57 or a variant thereof.
In a further embodiment, the nucleic acid construct comprises at least one nucleic acid sequence that encodes a sgRNA (or gRNA). Again, as already discussed, sgRNA typically comprises a crRNA sequence or protospacer sequence and a tracrRNA sequence and preferably a sequence for a linker loop. In a preferred embodiment, the nucleic acid construct comprises at least one nucleic acid sequence that encodes a sgRNA sequence as defined in any of SEQ ID NO: 62 to 65 or variant thereof. More preferably the nucleic acid sequence that encodes a sgRNA comprises or consists of a sequence selected from SEQ ID NO: 58 to 61 , 88 and 89 or a variant thereof.
In a further embodiment, the nucleic acid construct may further comprise at least one nucleic acid sequence encoding an endoribonuclease cleavage site. Preferably the endoribonuclease is Csy4 (also known as Cas6f). Where the nucleic acid construct comprises multiple sgRNA nucleic acid sequences the construct may comprise the same number of endoribonuclease cleavage sites. In another embodiment, the cleavage site is 5’ of the sgRNA nucleic acid sequence. Accordingly, each sgRNA nucleic acid sequence is flanked by a endoribonuclease cleavage site.
The term ‘variant’ refers to a nucleotide sequence where the nucleotides are substantially identical to one of the above sequences. The variant may be achieved by modifications such as insertion, substitution or deletion of one or more nucleotides. In a preferred embodiment, the variant has at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity to any one of the above described sequences. In one embodiment, sequence identity is at least 90%. In another embodiment, sequence identity is 100%. Sequence identity can be determined by any one known sequence alignment program in the art.
The invention also relates to a nucleic acid construct comprising a nucleic acid sequence operably linked to a suitable plant promoter. A suitable plant promoter may be a constitutive or strong promoter or may be a tissues-specific promoter. In one embodiment, suitable plant promoters are selected from, but not limited to, oestrum yellow leaf curling virus (CmYLCV) promoter or switchgrass ubiquitin 1 promoter (Pvllbil) wheat U6 RNA polymerase III (Tall6) CaMV35S, wheat U6 or maize ubiquitin (e.g. Ubi 1) promoters. Alternatively, expression can be specifically directed to particular tissues of wheat seeds through gene expression-regulating sequences. In one embodiment, the promoter is selected from the U6 promoter (for example as defined in SEQ ID NO:68) and the ubiquitin 1 promoter (for example as defined in SEQ ID NO:69). In a preferred embodiment the sequences are codon-optimised for the plant in question.
The nucleic acid construct of the present invention may also further comprise a nucleic acid sequence that encodes a CRISPR enzyme. By “CRISPR enzyme” is meant an RNA- guided DNA endonuclease that can associate with the CRISPR system. Specifically, such an enzyme binds to the tracrRNA sequence. In one embodiment, the CRIPSR enzyme is a Cas protein (“CRISPR associated protein), preferably Cas 9 or Cpf1 , more preferably Cas9. In a specific embodiment Cas9 is codon-optimised Cas9 (optimised for the plant in which it is expressed). In one example, Cas9 has the sequence described in SEQ ID NO: 66 or a functional variant or homolog thereof. In another embodiment, the CRISPR enzyme is a protein from the family of Class 2 candidate x proteins, such as C2c1 , C2C2 and/or C2c3. In one embodiment, the Cas protein is from Streptococcus pyogenes. In an alternative embodiment, the Cas protein may be from any one of Staphylococcus aureus, Neisseria meningitides, Streptococcus thermophiles or Treponema denticola.
The term “functional variant” as used herein with reference to Cas9 refers to a variant Cas9 gene sequence or part of the gene sequence which retains the biological function of the full non-variant sequence, for example, acts as a DNA endonuclease, or recognition and/or binding to DNA. A functional variant also comprises a variant of the gene of interest which has sequence alterations that do not affect function, for example non-conserved residues. Also encompassed is a variant that is substantially identical, i.e. has only some sequence variations, for example in non-conserved residues, compared to the wild type sequences as shown herein and is biologically active. In one embodiment, a functional variant of SEQ ID NO: 66 has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% overall sequence identity to the nucleic acid represented by SEQ ID NO:66. In a further embodiment, the Cas9 protein has been modified to improve activity.
Suitable homologs or orthologs can be identified by sequence comparisons and identifications of conserved domains. The function of the homolog or ortholog can be identified as described herein and a skilled person would thus be able to confirm the function when expressed in a plant.
In a further embodiment, the Cas9 protein has been modified to improve activity. For example, in one embodiment, the Cas9 protein may comprise the D10A amino acid substitution, this nickase cleaves only the DNA strand that is complementary to and recognized by the gRNA. In an alternative embodiment, the Cas9 protein may alternatively or additionally comprise the H840A amino acid substitution, this nickase cleaves only the DNA strand that does not interact with the sRNA. In this embodiment, Cas9 may be used with a pair (i.e. two) sgRNA molecules (or a construct expressing such a pair) and as a result can cleave the target region on the opposite DNA strand, with the possibility of improving specificity by 100-1500 fold. In a further embodiment, the Cas9 protein may comprise a D1135E substitution. The Cas 9 protein may also be the VQR variant. Alternatively, the Cas protein may be comprise a mutation in both nuclease domains, HNH and RuvCOlike and therefore is catalytically inactive. Rather than cleaving the target strand, this catalytically inactive Cas protein can be used to prevent the transcription elongation process, leading to a loss of function of incompletely translated proteins when co-expressed with a sgRNA molecule. An example of a catalytically inactive protein is dead Cas9 (dCas9) caused by a point mutation in RuvC and/or the HNH nuclease domains (Komor et al., 2016 and Nishida et al., 2016).
In a further embodiment, a Cas protein, such as Cas9 may be further fused with a repression effector, such as a histone-modifying/DNA methylation enzyme or a Cytidine deaminase (Komor et al. 2016) to effect site-directed mutagenesis. In the latter, the cytidine deaminase enzyme does not induce dsDNA breaks, but mediates the conversion of cytidine to uridine, thereby effecting a C to T (or G to A) substitution.
In a further embodiment, the nucleic acid construct comprises an endoribonuclease. Preferably the endoribonuclease is Csy4 (also known as Cas6f) and more preferably a codon optimised csy4, for example as defined in SEQ ID NO: 67. In one embodiment, where the nucleic acid construct comprises a cas protein, the nucleic acid construct may comprise sequences for the expression of an endoribonuclease, such as Csy4 expressed as a 5’ terminal P2A fusion (used as a self-cleaving peptide) to a cas protein, such as Cas9.
In one embodiment, the cas protein, the endoribonuclease and/or the endoribonuclease- cas fusion sequence may be operably linked to a suitable plant promoter. Suitable plant promoters are already described above, but in one embodiment, may be the Zea Mays Ubiquitin 1 promoter or U6 promoter.
Suitable methods for producing the CRISPR nucleic acids and vectors system are known, and for example are published in Molecular Plant (Ma et al., 2015, Molecular Plant, D0l:10.1016/j.molp.2015.04.007), which is incorporated herein by reference.
In an alternative aspect of the invention, the nucleic acid construct comprises at least one nucleic acid sequence that encodes a TAL effector, wherein said effector targets a PHS1 gene and/or promoter sequence, preferably selected from SEQ ID NO 49, 50, 51 , 52, 84 and 85. Methods for designing a TAL effector would be well known to the skilled person, given the target sequence. Examples of suitable methods are given in Sanjana et al., and Cermak T et al, both incorporated herein by reference. Preferably, said nucleic acid construct comprises two nucleic acid sequences encoding a TAL effector, to produce a TALEN pair. In a further embodiment, the nucleic acid construct further comprises a sequence-specific nuclease (SSN). Preferably such SSN is a endonuclease such as Fokl. In a further embodiment, the TALENs are assembled by the Golden Gate cloning method in a single plasmid or nucleic acid construct.
In another aspect of the invention, there is provided a sgRNA molecule, wherein the sgRNA molecule comprises a crRNA sequence and a tracrRNA sequence and wherein the crRNA sequence can bind to at least one sequence selected from SEQ ID NOs 49 to 52, 84 and 85 or a variant thereof. In one embodiment, the nucleic sequence of the sgRNA molecule is defined in any of SEQ ID NO: 58 to 61 , 88 or 89 or variant thereof. In other words, the RNA sequence of the sgRNA is encoded by a nucleic acid sequence selected from SEQ ID NO: 58 to 61 , 88 or 89. In one example only, the RNA sequence of one sgRNA of the invention is defined in SEQ ID NO: 62 to 65 or a variant thereof. A “variant” is as defined herein. In one embodiment, the sgRNA molecule may comprise at least one chemical modification, for example that enhances its stability and/or binding affinity to the target sequence or the crRNA sequence to the tracrRNA sequence. Such modifications would be well known to the skilled person, and include for example, but not limited to, the modifications described in Rahdar et al., 2015, incorporated herein by reference. In this example the crRNA may comprise a phosphorothioate backbone modification, such as 2’-fluoro (2’-F), 2’-O-methyl (2’-O-Me) and S-constrained ethyl (cET) substitutions.
In another aspect of the invention, there is provided an isolated nucleic acid sequence that encodes for a protospacer element (as defined in any of SEQ ID NOs: 43 to 56, 86 or 87), or a sgRNA (as described in any of SEQ ID NO: 58 to 61 , 88 or 89). There is also provided an isolated sgRNA molecule as defined in any of SEQ ID NO: 62 to 65.
In another aspect of the invention, there is provided a plant or part thereof or at least one isolated plant cell transfected with at least one nucleic acid construct as described herein. Cas9 and sgRNA may be combined or in separate expression vectors (or nucleic acid constructs, such terms are used interchangeably). In other words, in one embodiment, an isolated plant cell is transfected with a single nucleic acid construct comprising both sgRNA and Cas9 as described in detail above. In an alternative embodiment, an isolated plant cell is transfected with two nucleic acid constructs, a first nucleic acid construct comprising at least one sgRNA as defined above and a second nucleic acid construct comprising Cas9 or a functional variant or homolog thereof. The second nucleic acid construct may be transfected below, after or concurrently with the first nucleic acid construct. The advantage of a separate, second construct comprising a cas protein is that the nucleic acid construct encoding at least one sgRNA can be paired with any type of cas protein, as described herein, and therefore are not limited to a single cas function (as would be the case when both cas and sgRNA are encoded on the same nucleic acid construct).
In one embodiment, the nucleic acid construct comprising a cas protein is transfected first and is stably incorporated into the genome, before the second transfection with a nucleic acid construct comprising at least one sgRNA nucleic acid. In an alternative embodiment, a plant or part thereof or at least one isolated plant cell is transfected with mRNA encoding a cas protein and co-transfected with at least one nucleic acid construct as defined herein. Alternatively, as described in Example 2, the nucleic acid constructs can be transiently expressed in the target plant cell.
Cas9 expression vectors for use in the present invention can be constructed as described in the art. Examples of suitable promoters include the Actin, CaMV35S, wheat U6 or maize ubiquitin (e.g. Ubi 1 ) promoter, as described above.
In an alternative aspect of the present invention, there is provided an isolated plant cell (transiently or stably) transfected with at least one nucleic acid construct or sgRNA molecule as described herein.
In a further aspect of the invention, there is provided a genetically modified or edited plant comprising the transfected cell described herein. In one embodiment, the nucleic acid construct or constructs may be integrated in a stable form. In an alternative embodiment, the nucleic acid construct or constructs are not integrated (i.e. are transiently expressed). Accordingly, in a preferred embodiment, the genetically modified plant is free of any sgRNA and/or Cas protein nucleic acid. In other words, the plant is transgene free.
The term "introduction", “transfection” or "transformation" as referred to herein encompasses the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a genetic construct of the present invention and a whole plant regenerated there from. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art.
The transfer of foreign genes into the genome of a plant is called transformation. Transformation of plants is now a routine technique in many species. Any of several transformation methods known to the skilled person may be used to introduce the nucleic acid construct or sgRNA molecule of interest into a suitable ancestor cell. The methods described for the transformation and regeneration of plants from plant tissues or plant cells may be utilized for transient or for stable transformation.
Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant (microinjection), gene guns (or biolistic particle delivery systems (biolistics)) as described in the examples, lipofection, transformation using viruses or pollen and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts, ultrasound- mediated gene transfection, optical or laser transfection, transfection using silicon carbide fibers, electroporation of protoplasts, microinjection into plant material, DNA or RNA-coated particle bombardment, infection with (non-integrative) viruses and the like. Transgenic plants, can also be produced via Agrobacterium tumefaciens mediated transformation, including but not limited to using the floral dip/ Agrobacterium vacuum infiltration method as described in Clough & Bent (1998) and incorporated herein by reference.
Accordingly, in one embodiment, at least one nucleic acid construct or sgRNA molecule as described herein can be introduced to at least one plant cell using any of the above described methods. In an alternative embodiment, any of the nucleic acid constructs described herein may be first transcribed to form a preassembled Cas9-sgRNA ribonucleoprotein and then delivered to at least one plant cell using any of the above described methods, such as lipofection, electroporation or microinjection.
Optionally, to select transformed plants, the plant material obtained in the transformation is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants. For example, the seeds obtained in the abovedescribed manner can be planted and, after an initial growing period, subjected to a suitable selection by spraying. A further possibility is growing the seeds, if appropriate after sterilization, on agar plates using a suitable selection agent so that only the transformed seeds can grow into plants. As described in the examples, a suitable marker can be bar-phosphinothricin or PPT. Alternatively, the transformed plants are screened for the presence of a selectable marker, such as, but not limited to, GFP, GUS (P- glucuronidase). Other examples would be readily known to the skilled person. Alternatively, no selection is performed, and the seeds obtained in the above-described manner are planted and grown and PHS1 expression or protein levels measured at an appropriate time using standard techniques in the art. This alternative, which avoids the introduction of transgenes, is preferable to produce transgene-free plants.
Following DNA transfer and regeneration, putatively transformed plants may also be evaluated, for instance using PCR to detect the presence of the gene of interest, copy number and/or genomic organisation. Alternatively or additionally, integration and expression levels of the newly introduced DNA may be monitored using Southern, Northern and/or Western analysis, both techniques being well known to persons having ordinary skill in the art.
The generated transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, a first generation (or T1) transformed plant may be selfed and homozygous second-generation (or T2) transformants selected, and the T2 plants may then further be propagated through classical breeding techniques.
In a further related aspect of the invention, there is also provided, a method of obtaining a genetically modified plant as described herein, the method comprising a. selecting a part of the plant; b. transfecting at least one cell of the part of the plant of paragraph (a) with at least one nucleic acid construct as described herein or at least one sgRNA molecule as described herein, using the transfection or transformation techniques described above; c. regenerating at least one plant derived from the transfected cell or cells; d. selecting one or more plants obtained according to paragraph (c) that show altered expression or activity of PHS1 .
In a further embodiment, the method also comprises the step of screening the genetically modified plant for SSN (preferably CRISPR)-induced mutations in the PHS1 gene or promoter sequence. In one embodiment, the method comprises obtaining a DNA sample from a transformed plant and carrying out DNA amplification to detect a mutation in at least one PHS1 gene or promoter sequence.
In a further embodiment, the methods comprise generating stable T2 plants preferably homozygous for the mutation (that is a mutation in in at least one PHS1 gene or promoter sequence).
Plants that have a mutation in at least one PHS1 gene or promoter sequence can also be crossed with another plant also containing at least one mutation in at least one PHS1 gene or promoter sequence to obtain plants with additional mutations in the PHS1 gene or promoter sequence. The combinations will be apparent to the skilled person. Accordingly, this method can be used to generate a T2 plants with mutations on all or an increased number of homoelogs, when compared to the number of homoeolog mutations in a single T 1 plant transformed as described above.
A plant obtained or obtainable by the methods described above is also within the scope of the invention.
A genetically altered plant of the present invention may also be obtained by transference of any of the sequences of the invention by crossing, e.g., using pollen of the genetically altered plant described herein to pollinate a wild-type or control plant, or pollinating the gynoecia of plants described herein with other pollen that does not contain a mutation in at least one of the PHS1 gene or promoter sequence. The methods for obtaining the plant of the invention are not exclusively limited to those described in this paragraph; for example, genetic transformation of germ cells from the ear of wheat could be carried out as mentioned, but without having to regenerate a plant afterward.
In a further final aspect of the invention, there is provided a method of screening a population of plants and identifying and/or selecting a plant that will have altered expression and/or activity of PHS1 and therefore an alteration in granule size distribution in a plant, as described herein, compared to a control or wild-type plant, the method comprising detecting at least one polymorphism or mutation in the PHS1 gene and/or promoter, wherein said mutation or polymorphism leads to an alteration in the level of expression and/or activity of the PHS1 protein compared to the level in a plant not carrying said mutation or polymorphism (e.g. a control or wild-type plant). Said mutation or polymorphism may comprise at least one insertion and/or at least one deletion and/or substitution.
Suitable tests for assessing the presence of a polymorphism would be well known to the skilled person, and include but are not limited to, Isozyme Electrophoresis, Restriction Fragment Length Polymorphisms (RFLPs), Randomly Amplified Polymorphic DNAs (RAPDs), Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), Sequence Characterized Amplified Regions (SCARs), Amplified Fragment Length polymorphisms (AFLPs), Simple Sequence Repeats (SSRs-which are also referred to as Microsatellites), and Single Nucleotide Polymorphisms (SNPs). In one embodiment, Kompetitive Allele Specific PCR (KASP) genotyping is used.
The method may also comprise the step of assessing whether the polymorphism has an effect on a starch granule characteristic as described herein. Methods to screen for an effect on starch granule characteristic would be well known to the skilled person, but could be carried out using a coulter counter, a microscope, a flow cytometer or any other particle sizing instrument, again as described above.
In one embodiment, the method comprises a) obtaining a nucleic acid sample from a plant and b) carrying out nucleic acid amplification of one or more PHS1 gene and/or promoter alleles using one or more primer pairs. In a further embodiment, the method may further comprise introgressing the chromosomal region comprising at least one of said low-PHS1-expressing/activity polymorphisms into a second plant or plant germplasm to produce an introgressed plant or plant germplasm. Preferably the expression or activity of PHS1 in said second plant will be altered (compared to a control or wild-type plant), and more preferably said second plant will display an alteration in at least one of starch B-type granule number and/or size, as described above.
In a further aspect of the invention there is provided a method of altering a starch granule characteristic, as described above, in a plant, the method comprising a. screening a population of plants for at least one plant with at least one of the above described polymorphisms or mutations; and b. further altering (i.e. reducing/abolishing or increasing) the expression of at least one PHS1 nucleic acid and/or altering (i.e. reducing/abolishing or increasing) the activity of a PHS1 polypeptide in said plant by introducing at least one mutation into the nucleic acid sequence encoding PHS1 or at least one mutation into the promoter of PHS1 as described herein or using RNA interference as described herein.
By “further altering” is meant reducing or increasing the level of PHS1 expression to a level lower than that in the plant with the at least one of the above-described PHS1 polymorphisms. The terms “reducing” or “increasing” means a decrease in the levels of PHS1 expression and/or activity by up to 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80% or 90% when compared to the level in a control plant or the level in the initially screened plants.
While the foregoing disclosure provides a general description of the subject matter encompassed within the scope of the present invention, including methods, as well as the best mode thereof, of making and using this invention, the following examples are provided to further enable those skilled in the art to practice this invention and to provide a complete written description thereof. However, those skilled in the art will appreciate that the specifics of these examples should not be read as limiting on the invention, the scope of which should be apprehended from the claims and equivalents thereof appended to this disclosure. Various further aspects and embodiments of the present invention will be apparent to those skilled in the art in view of the present disclosure. "and/or" where used herein is to be taken as specific disclosure of each of the two specified features or components with or without the other. For example "A and/or B" is to be taken as specific disclosure of each of (i) A, (ii) B and (iii) A and B, just as if each is set out individually herein.
Unless context dictates otherwise, the descriptions and definitions of the features set out above are not limited to any particular aspect or embodiment of the invention and apply equally to all aspects and embodiments which are described.
The foregoing application, and all documents and sequence accession numbers cited therein or during their prosecution ("appln cited documents") and all documents cited or referenced in the appln cited documents, and all documents cited or referenced herein ("herein cited documents"), and all documents cited or referenced in herein cited documents, together with any manufacturer's instructions, descriptions, product specifications, and product sheets for any products mentioned herein or in any document incorporated by reference herein, are hereby incorporated herein by reference, and may be employed in the practice of the invention. More specifically, all referenced documents are incorporated by reference to the same extent as if each individual document was specifically and individually indicated to be incorporated by reference.
The invention is now described in the following non-limiting examples.
Example 1 : The phs1 mutant of wheat has no obvious effects on vegetative growth.
To study the role of PHS1 in wheat, we first identified the gene models corresponding to PHS1. BLAST searches against the wheat RefSeq 1.1 genome of cultivar Chinese Spring (Appels et al., 2018) on Ensembl plants (Kersey et al., 2018) revealed three homeologs of PHS1 encoded on group 5 chromosomes. These were PHS1-5A (TraesCS5A02G395200.1), PHS-5B (TraesCS5B02G400000.2) and PHS-5D (TraesCS5D02G404500.1)(Figure 1A). In the durum wheat reference genome, these corresponded to PHS1-5A (TRITD5Av1G205670) and PHS1-5B (TRITD5Bv1G201740). All homeologs had 15 exons and 14 introns. We examined gene expression of the different PHS1 homeologs in Chinese Spring using the wheat expression browser (Borrill et al., 2016). All homeologs showed similar levels of expression across all examined tissues. PHS1 transcripts were detectable in leaves/shoots, roots, spike and grains. However, consistent with previous studies (Schupp and Ziegler, 2004), the highest level of expression in the endosperm was during early grain development (10 dpa), although h robust levels of transcript were detectable at later timepoints (20-30 dpa).
We generated two mutant wheat lines: phs1-1 and phs1-6 with mutations in both the A and B homeologs. The AA BB (wild-type sibling control), aa BB and AA bb (single mutants), and aa bb (double mutant) genotypes were selected in the F2 generation using KASP genotyping.
To verify the effect of the mutations on PHS1 activity, we separated leaf protein extracts on native PAGE gels and stained for phosphorylase activity. Wheat protein extracts are reported to produce three distinct bands using this technique, of which the plastidial phosphorylase has the highest mobility (Schupp and Ziegler, 2004). We observed four distinct activity bands, with two closely migrating bands in the region expected for plastidial phosphorylase (Figure 1 B). Both bands were absent in the phs1-1 and phs1- 6 aa bb double mutants, and the two aa BB and AA bb single mutants of phs1-1 were missing either one of the two bands. We concluded that the two bands represent PHS1- 5A and PHS1-5B activities, and the selected mutations for phs1-1 and phs1-6 mutant lines eliminate all detectable PHS1 activity.
Overall, we observed no detectable effect of the phs1 mutants on plant growth in glasshouses (Figure 2A). However, in some experiments, plants from all phs1-1 lines (including the mutants and the wild-type siblings) were slightly shorter than wild-type plants. This is likely due to background mutations rather than the loss of PHS1 activity, since it was also observed in the wild-type sibling from the phs1-1 cross, and was not observed in any of the phs1-6 lines. Similarly, there was no effect of phs1 mutations on the grain yield per plant, but a decrease was seen in all phs1-1 lines (Figure 2B) (on this point, it is important to note that the TILLING mutants are full of other mutations and therefore here, the comparison is not with wild-type but with the plants not carrying a mutation in PHS1 (e.g. AA BB)). Again, there was no significant difference in grain yield per plant or seed number between the phs1-1 double mutant and its wild-type sibling control, or between the phs1-6 double mutant and its wild-type sibling control. All lines produced seeds that appeared normal (Figure 2C), and we did not observe any significant effects of phs1 mutations on grain size or thousand grain weight (Figure 2D, E). We therefore conclude that loss of PHS1 activity does not affect vegetative growth or seed yield.
Example 2: Granule initiation in leaves
To test whether the loss of PHS1 affected leaf starch metabolism, we first quantified starch content in leaves of seedlings. We did not see a significant difference in starch content between the wild type and mutants, at either the end of day or end of night, and all lines turned over starch during the night (Figure 3).
Example 3: PHS1 is required for normal B-type granule size and number
We investigated the effect of phs1 mutations on starch synthesis in the endosperm. First, we measured starch content. The phs1-1 double mutant had identical starch content to the wild-type sibling control and the wild type (Table 1). Interestingly, the double mutant had a small but significant reduction in amylose content.
We then purified starch granules from mature grains and examined their morphology using Scanning Electron Microscopy (SEM). Interestingly, starch from the phs1 double mutants had visibly fewer B-type granules than the wild type (Figure 4A). No differences were observed between the single mutants and the control lines. Quantification of granule size distribution on the Coulter counter confirmed that the double mutant had fewer, larger B-type granules. When the relative volume of starch was plotted against granule diameter, the B-type granule peak was much less prominent in the double mutant than in the wild type, and appeared to be shifted towards the larger size range (Figure 4B). By fitting a mixed normal distribution to the diameter-volume plots, we calculated that the double mutant had less than half of the B-type granule content (by volume) compared to the wild-type and single mutants, despite having significantly larger B-type granules (Figure 4C, D). As the large size of the B-type granules in the double mutant affects the area of the B-type granule peak when plotted on a volume basis, we also plotted the size distributions on a relative number basis (Figure 4E), which showed a large shift in the proportion of A-type vs. B-type granules in the double mutant. As curves cannot be reliably fitted to the number-diameter plots due to the small size of the A-type granule peak, we set a numerical size cutoff for small granules (<10 pm), which includes B-type granules and some smaller A-type granules. This showed a significant 26% decrease in the relative number of small granules in the double mutant compared to the wild type (Figure 4F). By contrast, the size of A-type granules was unaffected by the Taphsl mutations (Figure 4G).
Taken together, these data suggest that loss of PHS1 results in the synthesis of fewer, but larger, B-type granules, suggesting it has a role in the initiation of B-type granules. Thus, mutations in PHS1 can be exploited to reduce the heterogeneity in starch granule size in wheat and other related Triticeae crops, creating higher quality grains for brewing and the purification of starch and gluten. Starch with fewer, larger B-type granules may also have novel physical and chemical properties that can be exploited in manufacturing both food and non-food products.
Example 4: BGC1 interacts with proteins involved in starch synthesis and other cellular processes
We confirmed that BGC1 interacts with PHS1 , in a series of heterologous assays carried out using transiently expressed proteins in Nicotiana benthamiana leaves. We confirmed that both PHS1 and BGC1 located to the chloroplast in this heterologous system (Figure 5A). PHS1 was located in the stroma, whereas BGC1 was located around starch granules, and in punctate structures. We then used pairwise immunoprecipitations with YFP/RFP-tagged proteins in Nicotiana (Figure 5B). BGC1 :RFP co-purified with PHS1 :YFP in the IP with anti-YFP beads when the two fusion proteins were coexpressed (Figure 5B). BGC1 :RFP did not purify in control reactions when co-expressed with a chloroplast-targeted YFP, nor did a chloroplast-targeted RFP co-purify with PHS1 :YFP.
Example 5: PHS1 expression
The expression patterns of PHS1 and BGC1 were compared during grain development. We used our previously generated RNAseq dataset for endosperm development in durum wheat cultivar, Kronos (Chen et al., 2022). PHS1 expression was detectable at all stages of grain development tested between 6-30 days post anthesis (dpa) but was particularly strong between 8-13 dpa (Figure 6A). No differences in expression were observed between the A- and B- homeologs. PHS1 activity levels in endosperm extracts were visualised on native PAGE gels. Consistent with the transcript levels, robust PHS1 activity was detected at all timepoints tested between 8-28 dpa, but was stronger at the earlier than later timepoints, especially at 12 dpa (Figure 6B). To assess PHS1 expression in different tissues, we used public data for bread wheat cultivar Chinese Spring in the wheat expression browser (Borrill et al., 2016). PHS1 transcripts were detectable in leaves/shoots, roots, spike, and grains, and all homeologs showed similar levels of expression across all examined tissues.
Example 6: PHS1 acts during B-type granule initiation in grain development
To examine if the loss of PHS1 specifically affected B-type granule formation, we measured starch content, granule number and granule size distributions throughout grain development. We harvested developing grains from the phs1-1 and phs1-6 double mutants, as well as their corresponding wild-type controls, and carried out all starch analyses on dissected endosperms. Similar to in mature grains, we did not observe significant differences in total starch content between the mutants and wild-type controls at any time point during grain development (Figure 7A). We then used the Coulter counter to quantify the number of starch granules per mg of endosperm, as well as to assess granule size distributions. The number of starch granules did not differ between the mutants and the wild type at 8 or 14 dpa. However, at 18 and 22 dpa, both the phs1 mutants had significantly fewer starch granules than the wild-type controls (Figure 7B).
Since B-type granules initiate typically between 15-20 dpa, and only A-type granules are present before this timepoint, phs1 mutants contain normal numbers of A-type granules (Figure 8A, B), but have fewer B-type granules than the wild type (Figure 8C, D). This was also strongly supported by the granule size distributions from the Coulter counter and in SEM (Figure 8A-H). The distributions were unimodal in the mutants and the wildtype controls at the 8 and 14 dpa timepoints (Figure 8A, B, E, F). However, at the 18 and 22 dpa timepoints, the distributions turned bimodal - suggesting all genotypes could initiate B-type granules (Figure 8C, D, G, H). However, the B-type granule peaks were smaller and shifted towards the larger sizes in the phs1 mutants (Figure 8C, D). Larger, and fewer B-type granules were observed in the SEM images of starch from these timepoints (Figure 8G, H). There were no defects in A-type granule morphology at any timepoint. Together, these data suggest that PHS1 is required for normal B-type granule initiation, but not A-type granule initiation (Figure 8).
Our finding that PHS1 is required solely for B-type granule initiation was further supported in a genetic approach. Reduced gene dosage of BGC1 affects the number of B-type granules - such as in the bgc1-1 mutant of durum wheat, Kronos, which almost has no B-type granules, due to a loss-of-function mutation in the BGC1-A 1 homeolog and a missense mutation in the BGC1-B1 homeolog. We crossed this bgc1-1 line to the phs1-1 line, and isolated a bgc1-1 phs1-1 quadruple mutant in the F2 generation. We also isolated from the cross a wild-type sibling control, and bgc1-1 and phs1-1 mutant siblings as controls. We purified starch granules from mature grains of these lines and examined their morphology using Scanning Electron Microscopy (SEM). Starch from the bgc1-1 and bgc1-1 phs1-1 mutants had visibly fewer B-type granules than the wild type, wild-type sibling control, and phs1-1 mutant siblings (Figure 9A). Coulter counter analysis of granule size distributions showed that the reduction of B-type granule number was stronger in bgc1-1 than in phs1-1 (Figure 9B), since bgc1-1 had almost no detectable B- type granule peak (when plotted by volume), whereas phs1-1 had a smaller B-type granule peak that was between that of the wild-type control and bgc1-1. Importantly, the granule size distribution of bgc1-1 phs1-1 quadruple mutants was almost identical to bgc1-1. This shows that the loss of PHS1 in the bgc1-1 background (which only has A- type granules), has no further effect on starch granule size distributions. Using curve- fitting on these plots, we calculated that the size of A-type granules was unaffected by the phs1 mutations in the bgc1-1 background (which only has A-type granules) (Figure 9C). Next, we calculated the percentage of starch granules that were smaller than <10 pm. There was a significant decrease in the relative number of small granules in the bgc1-1 phs1-1 quadruple mutant and bgc1-1 and phs1-1 mutant siblings compared to the wild type and wild-type sibling control (Figure 9D). Interestingly, like the phs1 double mutants, we observed that the bgc1-1 also has a small reduction in amylose content. Taken together, our results show that PHS1 has an exclusive role in B-type granule initiation, and does not appear to be required for A-type granule initiation.
Example 7: Loss of PHS1 in wheat does not affect total maltooligosaccharides (MOS) levels in the endosperm
We investigated whether there was MOS accumulation in the developing endosperm of wheat phs1 mutants. However, levels of MOS or other soluble sugars were unaltered in the phs1 mutants. We quantified the total soluble glucans in perchloric acid extracts. For all genotypes, soluble glucans were highest at 8 DPA, and decreased in abundance during grain development, but were always relatively low compared to starch content (approx. 10% of starch content at 8 dpa, and <0.001% at 22 dpa) (Figure 10). There were no consistent differences between the phs1 mutants and their controls. We then quantified the methanol precipitable fraction of soluble glucans (phytoglycogen and long MOS), and calculated the non-precipitable fraction (short MOS) as the difference between the total soluble glucan and precipitable glucans. Like the total glucans, these decreased in abundance as the grain developed, and there were no consistent differences between the phs1 mutants and controls. Additional High Performance Anion Exchange Chromatography with Pulsed Amperometric Detection (HPAEC-PAD) analyses confirmed this result, since maltose and other detectable MOS did not change in abundance or chain length distribution pattern in the phs1 mutants (Figure 11). In addition, glucose, fructose and sucrose levels remained unchanged in the mutants at all timepoints (Figure 11). Materials and Methods
Plant materials and growth
Wheat plants were grown in climate-controlled glasshouses for all grain analyses, and in controlled environment rooms (CERs) for the analysis of starch in leaves. The glasshouses were set to provide a minimum 16 h of light at 20°C, and 16°C during the dark. The CERs were set to provide 16 h light at 20°C (light intensity was 300 pmol photons m -2 s -1 ) and 8 h dark at 16°C. The glasshouses and CERs were set to 60% relative humidity.
Mutant lines of durum wheat (Triticum turgidum cv. Kronos) were obtained from the wheat in silico TILLING resource (http://www.wheat-tillinq.com)(Krasileva et al., 2017). Lines Kronos4533(K4533) and Kronos4367(K4367) were obtained for PHS1-5A, and lines Kronos2864(K2864) and Kronos0238(K0238) were obtained for PHS1-5B (Figure 1). Plants were crossed to combine mutations in the A- and B- homeologs. The wild-type segregant (AA BB), single homeolog mutants (aa BB, AA bb) and the double homeolog mutant (aa bb) were selected in the F2 generation using KASP V4.0 genotyping (LGC, Teddington) with the primers in Table S1.
Native gel analysis of PHS activity
The visualisation of a-glucan phosphorylase activity by native PAGE was carried out using the method of (Zeeman et al., 2004) with minor modifications. Leaf samples were homogenised in extraction medium [100 mM MOPS (3-(N-morpholino)propanesulfonic acid), pH 7.2, 1 mM DTT (dithiothreitol), 1 mM EDTA, and 10% (v/v) ethanediol, Complete protease inhibitor cocktail (Roche)], then spun at 20,000g for 10 min. Soluble proteins were collected in the supernatant, and the protein concentration of the extracts was determined using the Bradford method. Equal amounts of protein (180 pg) were loaded per lane on native PAGE gels (7.5% acrylamide and 0.3% oyster glycogen in resolving gel; 3.75% acrylamide in stacking gel) and were electrophoresed at 200V for 2 h at 4°C. Gels were washed twice in 100 mM Tris-HCI, pH 7.0, 1 mM DTT, then incubated in 100 mM Tris-HCI, pH 7.0, 1 mM DTT, 50 mM Glc-1-P overnight at 20°C. Activity bands were stained with Lugol’s iodine solution (L6146, Sigma, St. Louis).
Grain morphometries Grain yield per plant was quantified as the total weight of grains harvested from each plant. The thousand grain weight and grain size were quantified using the MARViN seed analyser (Marvitech GmbH, Wittenburg).
Starch and amylose content
Starch content of leaves and grains (in glucose equivalents) were quantified as described in Hawkins et al. (2021). Briefly, for leaves, two-week-old seedlings were harvested at the base of the lowest leaf and flash frozen in liquid N2, then homogenised in 0.7 M perchloric acid. Insoluble material was collected by centrifugation, washed three times in 80% ethanol, then resuspended in water. Starch was digested using a- amylase/amyloglucosidase (Roche, Basel), and the released glucose was assayed using the hexokinase/glucose-6-phosphate dehydrogenase assay (Roche). For grains, flour (5-10 mg) was dispersed in 100 mM sodium acetate buffer, pH 5, and the starch was digested with thermostable a-amylase at 99°C for 7 min. Amyloglucosidase was added to the digestion, and was further incubated at 50°C for 35 min. Both the thermostable a-amylase and amyloglucosidase were from the Total Starch Assay kit (K- TSTA, Megazyme, Bray). The digested sample was centrifuged to remove insoluble material, and glucose was measured in the supernatant as for the leaf starch quantification.
Starch purification - granule morphology, size distribution and amylose content
For starch purification from mature grains, grains (3-5 grains per extraction) were soaked overnight in ddH2O at 4°C, then homogenised in a mortar and pestle with excess ddH2O. The homogenates were filtered through a 70 pm nylon mesh, then centrifuged at 3000g, 5 min, before resuspending the starch pellet in water. The starch suspension was centrifuged at 2500g, 5 min on a cushion of 90% (v/v) Percoll, 50 mM Tris-HCI, pH 8. The pellet was washed twice in 50 mM Tris-HCI, pH 6.8, 10 mM EDTA, 4% SDS (v/v), 10 mM DTT, then twice in ddH2O, before finally resuspending in ddH2O.
The morphology of purified granules was examined using a Nova NanoSEM 450 (FEI, Hillsboro) scanning electron microscope (SEM). To quantify starch granule size distributions, the purified starch was suspended in Isoton II electrolyte solution (Beckman Coulter, Indianapolis), and particle sizes were measured using a Multisizer 4e Coulter counter (Beckman Coulter) fitted with a 70 pm aperture tube. A minimum of 100,000 particles was measured per sample. These data were used to produce relative volume vs. diameter and relative number vs. diameter plots. A mixed bimodal (two log-normal distributions) were fitted to the relative volume vs. diameter plots to calculate mean diameters of A- and B-type granules, and the B-type granule volume percentage (defined as volume occupied by B-type granules as a percentage of the total volume of starch).
Amylose content of the purified starch granules was estimated using an iodine colourimetry method (Washington et al., 2000). Briefly, starch (1 mg) was dispersed overnight at room temperature in 1 M NaOH. The solution was neutralised to pH 7 using 1 M HCI, and 5 pL of this solution was diluted in 220 pL of water and 25 pL Lugol’s iodine solution (Sigma). The reaction was incubated at room temperature for 10 minutes prior to absorbance measurements at 535 nm and 620 nm. Apparent amylose content was estimated using the formula:
Apparent amylose content = 1.4935*exp [2 - 7029 * (Absorbance 620/Absorbance 535)1
Isolation, cloning, plasmid construction for PHS1 and BGC1
Total RNA was extracted from leaves of two-week-old wheat seedlings using RNease kit with on-column DNase I digestion (Qiagen). cDNA was synthesized (using 2 pg RNA) using the Go Script™ Reverse Transcriptase Kit (Promega) following the manufacturer’s instruction. The full-length wheat PHS1 cDNA sequence was amplified using gene specific primers listed in Supplemental Table S3 and inserted into Gateway entry vector, pENTR, using the pENTR/D-TOPO kit (Invitrogen). To produce constructs with a C- terminal YFP or RFP tag, we recombined the coding sequences from PHS1 :pENTR and BGC1 :pDONR221 (Hawkins et al., 2020) into the plant expression vectors pB7YWG2 (35S promoter; C-terminal YFP) or pB7RWG2 (35S promoter; C-terminal RFP) using Gateway LR clonase (Invitrogen). To generate the chloroplast-targeted YFP and RFP proteins, the rubisco small subunit (RbcS) transit peptide (Kim et al., 2010) was cloned into pDONR221 , and recombined into the pB7YWG2 and pB7RWG2.
Transient expression of PHS1 and BGC1 in N. benthamiana and protein localisation
Constructs for YFP-tagged PHS1 and BGC1 expression were transformed into Agrobacterium tumefaciens (strain GV3101). Agrobacterium cells were grown at 28°C in Luria-Bertani medium with the appropriate antibiotics. Cells were pelleted, washed, and resuspended in the infiltration medium [10 mM 2-(N-morpholino) ethanesulfonic acid, 10 mM MgCI2, and 0.15 mM acetosyringone, pH 5.6], The cell suspension was then infiltrated into the intercellular spaces between abaxial epidermal cells of intact Nicotiana benthamiana leaves using a 1 mL plastic syringe. Infiltrated plants were incubated overnight in dark followed by 2 days in a 16 h/8 h light-dark cycle. YFP fluorescence in leaf samples was observed using a laser scanning confocal microscope (SP8; Leica) with a 63x water-immersion lens. Images were processed using LAS X software.
Pulldown assay, immunoprecipitations and mass spectrometry
For the pulldown assay to identify proteins associating with BGC1 , the bait protein (recombinant His-tagged BGC1) was expressed in E. coli as described in Hawkins et al. (2020) and purified in its native state using Ni-NTA agarose beads (Qiagen) as previously described (Seung et al., 2013). To produce endosperm extract, endosperms were dissected from developing grains (collected at 18 dpa) of wild-type Kronos plants and homogenised in ice-cold extraction medium [50 mM Tris-HCI, pH 8, 1 mM DTT, 1% (v/v) Triton X-100, 150 mM NaCI, and Roche Complete Protease Inhibitor cocktail] at a rate of 1 mL buffer per 100 mg tissue. Insoluble material was removed by centrifugation at full speed for 5 min at 4°C, and proteins were collected in the supernatant. Recombinant BGC1-His protein (2.5 pg) was added to the supernatant (1 mL) and was incubated for 1 h at 4°C. pMACS magnetic beads conjugated to anti-His (Miltenyi Biotec) were added and incubated for 1 h at 4°C to retrieve the bait protein together with interacting proteins. The beads were captured with a pColumn on a magnetic stand (Miltenyi Biotec), washed three times with wash medium [50 mM Tris-HCI, pH 8, 1 mM DTT, 1% (v/v) Triton X- 100, 300 mM NaCI, and Roche Complete Protease Inhibitor cocktail], then three times with wash medium without Triton X-100, before eluting the bound proteins with elution medium [50 mM Tris-HCI, pH 6.8, and 2% (w/v) SDS].
The eluted proteins were precipitated with chloroform/methanol according to (Pankow et al., 2016). Protein pellets were resuspended in 50 pl of 2.5% sodium deoxycholate (SDC; Merck) in 0.2 M EPPS-buffer (Merck), pH 8.5 and reduced, alkylated, and digested with trypsin in the SDC buffer according to standard procedures. After the digest, the SDC was precipitated by adjusting to 0.2% trifluoroacetic acid (TFA), and the clear supernatant subjected to C18 SPE. Samples were dried in a SpeedVac concentrator (Thermo Fisher Scientific, #SPD120) and the peptides dissolved in 0.1%TFA/3% acetonitrile.
Peptides were analysed by nanoLC-MS/MS on an Orbitrap Eclipse™ Tribrid™ mass spectrometer with a FAIMS Pro Duo source, coupled to an UltiMate® 3000 RSLCnano LC system (Thermo Fisher Scientific, Hemel Hempstead, UK). The samples were loaded and trapped using a trap cartridge (Pepmap Neo C18, 5 pm, 300 pm x 5 mm, Thermo) with 0.1% TFA at 15 pl min -1 for 3 min. The trap was then switched in-line with the analytical column (nanoEase M/Z column, HSS C18 T3, 100 A, 1.8 pm; Waters, Wilmslow, UK) for separation using the following gradient of solvents A (water, 0.1% formic acid) and B (80% acetonitrile, 0.1 % formic acid) at a flow rate of 0.2 pl min -1 : 0-3 min 3% B (during trapping); 3-10 min linear increase B to 7%; 10-100 min increase B to 32%; 100-148 min increase B to 50%; followed by a ramp to 99% B and re-equilibration to 3% B, for a total running time of 180 minutes. Mass spectrometry data were acquired with the FAIMS device set to three compensation voltages (-35V, -50V, -65V) at standard resolution for 1 s each with the following MS settings in positive ion mode: MS1/OT: resolution 120K, profile mode, mass range m/z 300-1800, spray voltage 2800 V, AGC 4e 5 , maximum injection time 50 ms; MS2/IT: data dependent analysis was performed using HCD fragmentation with the following parameters: cycle time of 1 s in IT turbo for each FAIMS CV, centroid mode, isolation window 1.0 Da, charge states 2-5, threshold 1.0e 4 , CE = 30, normalised AGC target 100%, max. inject time set to Auto, dynamic exclusion 1 count, 15 s exclusion, exclusion mass window ±10 ppm.
The acquired raw data were processed and quantified in Proteome Discoverer 3.0 (Thermo) using the incorporated search engine CHIMERYS® (MSAID® Munich, Germany). The processing workflow included recalibration of MS1 spectra (RC) and the Minora Feature Detector for quantification with min. trace length=7 and S/N threshold=3. The Top N Peak Filter (10 per 100 Da) was applied and the CHIMERYS® search was performed with the prediction model inferys_2.1_fragmentation, enzyme trypsin with 2 missed cleavages, peptide length 7-25, fragment tolerance 0.5 Da, variable modification oxidation (M), fixed modifications carbamidomethyl (C). Percolator was used for validation using q-value and FDR 0.01 (strict) and 0.05 (relaxed).
In the consensus workflow quantification was performed with a maximum RT Shift of 3 min and a mass tolerance of 4 ppm between runs. Protein quantification was based on the top 3 most abundant unique peptides per protein group. Missing values were replaced by low abundance resampling. Protein abundance ratios were calculated from the 3 replicates per sample. The hypothesis test was performed by a background-based t-test and the p-values adjusted according to BH.
For pairwise co-immunoprecipitations, proteins were transiently expressed in N. benthamiana leaves as described above, and proteins were extracted as described for the pulldown assay. The supernatant was incubated for 1 h at 4°C with pMACS magnetic beads conjugated to anti-YFP. After incubation, the beads were recovered using a pColumn (Miltenyi Biotec) on a magnetic stand. The beads were washed five times with wash medium [50 mM Tris-HCI, pH 8, 1 mM DTT, 1% (v/v) Triton X-100, 300 mM NaCI, and Roche Complete Protease Inhibitor cocktail] before eluting the bound proteins with SDS-PAGE loading buffer [50 mM Tris-HCI, pH 6.8, 2% (w/v) SDS, 100 mM DTT, 3% (v/v) glycerol, 0.005% (w/v) bromophenol blue]. The eluates were analysed using SDS- PAGE and immunoblotting, using anti-YFP (Torrey Pines; TP401 - 1 :5000) and anti-RFP (Abeam pic; ab34771 - 1 :2000) primary antibodies. Proteins were detected using chemiluminescence from horseradish peroxidase-coupled secondary antibodies Antirabbit HRP (Sigma; A0545), 1 :15,000.
Quantification of starch, sugars, MOS, granule number and size in developing grains
Endosperms were dissected from developing grains collected at 8, 14, 18 and 22 dpa. In each extraction, three to five individual endosperms of known fresh weight were used. After homogenisation in 0.7 M perchloric acid, homogenates were spun at 10,000g for 5 mins, and the supernatant was immediately neutralized using neutralisation buffer (2 M KOH, 400 mM MES). This neutralised soluble fraction was used for sugar and MOS quantification (see below).
The pellet was resuspended in ddH2O and equally divided into two fractions. One fraction was used to quantify starch content following the method described above for leaves. The other was used for starch purifications (as described above for mature grains), and all granules purified within this fraction was resuspended in known volumes of Isoton II electrolyte solution (Beckman Coulter, Indianapolis). The suspension was analysed in a Multisizer 4e Coulter counter (Beckman Coulter) fitted with a 70 pm aperture tube and running in volumetric mode (analysing 2 mL of the suspension). This gave the number of granules in the suspension, which could be used to calculate the number of granules per starting fresh weight of endosperm; as well as granule size distribution plots.
For the quantification of soluble glucans: Total soluble glucans (neutralised soluble fraction without precipitation) and methanol-precipitable soluble glucans (after precipitation) were quantified. For methanol precipitation, one volume of neutralised soluble fraction was mixed with 4 volumes of pure methanol, mixed, and incubated overnight at -20°C. Precipitated glucans were collected by centrifugation at 10,000g, 5 min, then washed with 75% methanol, and dried. Dry pellets were suspended in one volume of ddH2O. To quantify glucans, the neutralised soluble fraction and resuspended precipitated glucans were digested using a-amylase/amyloglucosidase (Roche, Basel), and released glucose was assayed using the hexokinase/glucose-6-phosphate dehydrogenase assay (Roche).
HPAEC-PAD was used to quantify glucose, fructose, sucrose, and maltose, and to visualise MOS accumulation patterns. The neutralised soluble fraction was purified on sequential columns of Dowex 50Wand Dowex 1 (Sigma) as described previously Seung et al. (2013). Purified samples were separated on an ICS-5000 HPLC fitted with a CarboPac™ PA20 column (3 x 250 mm, CV=1.06 ml; Dionex). The mobile phase consisted of eluate A (100 mM NaOH) and eluate B (150 mM NaOH, 500 mM sodium acetate), following a gradient program of: 0-7 min, 0% B; 7.0-26.5 min, a concave gradient to 80% B; 26.5-32.0 min, 80% B; 32.0-32.1 min, linear gradient to 0% B, 32.1- 40.0 min, 0% B with flow rate of 0.25 ml/min.
Statistical analysis Statistical analyses were carried out using the SPSS program (SPSS Statistics, IBM).
Accession numbers
The accession numbers corresponding to the genes investigated in this study are:
TraesCS5A02G395200 (PHS1-A1), TraesCS5B02G400000 (PHS1-B1)
TRITD5Av1G205670 (PHS1-A 1), TRITD5Bv1G201740 (PHS1-B1) :
TraesCS4A02G284000 (BGC1-A1), TraesCS4B02G029700 (BGC1-B1) :
TRITD4Av1G198830 (BGC1-A1), TRITD0Uv1G034540 (BGC1-B1).
References
Hawkins E, Chen J, Watson-Lazowski A, Ahn-Jarvis J, Barclay JE, Fahy B, Hartley M, Warren FJ, Seung D. STARCH SYNTHASE 4 is required for normal starch granule initiation in amyloplasts of wheat endosperm. New Phytol. 2021 Jun;230(6):2371-2386.
Park, S. H. et al. “Starch granule size distribution of hard red winter and hard red spring wheat: Its effects on mixing and breadmaking quality.” Journal of Cereal Science 49 (2009): 98-105.
Saccomanno B, Berbezy P, Findlay K, Shoesmith J, llauy C, Viallis B, Trafford K. Characterization of wheat lacking B-type starch granules. J Cereal Sci. 2022 Mar; 104: 103398
Stark JR, Lynn A. Starch granules large and small. Biochem Soc Trans. 1992 Feb;20(1):7-12.
Stoddard, F.L. and Sarker, R. (2000), Characterization of Starch in Aegilops Species. Cereal Chemistry, 77: 445-447.
Washington, J. M., et al. (2000). "Developing Waxy Barley Cultivars for Food, Feed and Malt." Barley Genetics VIII: 303-306.
Zeeman SC, Thorneycroft D, Schupp N, Chapple A, Week M, Dunstan H, Haldimann P, Bechtold N, Smith AM, Smith SM. Plastidial alpha-glucan phosphorylase is not required for starch degradation in Arabidopsis leaves but has a role in the tolerance of abiotic stress. Plant Physiol. 2004 Jun;135(2):849-58. SEQUENCE LISTING
SEQ ID NO: 1 >Wheat (hexapioid) PHS1 5A (Chinese Spring reference genome) genomic DNA
CATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTCGCATC GC GCGCGCTGCACACCACCGTCAGCGCGACCACTGCCGCCAATGGCGACCGCCTCGCCGCC GCTCGCCACCCCCTTCCGCCCGCTCGCCGTCGCCGGAGGTAGCGGCGGACTCGTCGTGG GCGCCAGGGCCGTGGTACCGCCGAGGCGGGGGCGGCGGGGGTTCGTGGTGCGGAGCG TGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCCCGACCGAGGAAGGTAAGCGGCCGCG CGCTCCCATTGGCTGCCACGACCTGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTC GCTGGTGCGACCTGAATTTGTGATCCTCTGACTGACGGCCTATGTGTTCCCCGGGTTTCG GGCTGTTGATTTTGTGCGCGCGCATTGCTAATCTACTTCTCTAGGATTTGTTAGGGCTCA G AGCAATCTTAAATTCGCGAATACACTTCGACGAATGTAGTCTGAATTTACTGAAACTAGC TT AGTTTGGCGCGTCAAGATAGACTGATGTATTTATGGTAGCAAATTCTGAAAACTTCTTCT GT GCGTGGACAGCCACCCCTCGTGGATGATGCAGCAAGTGCTAACACAGTGTTTGCTAGTAC TTGCTAGTATCTGTAGCTGTAACAAAATCCTTCGAATCTGAACTAGTGAGATAAATCGTT TG AGATTTTTGCTGAAGATTACACAGTATCAATATTTCTGAAGGTGATTGGAACAGTATAGC AT TTCAAACTGGCAACCAAGCAGGATCAGTATACTGACGCAATTTTTTTCTGAAGGTGGTTG G AATACCGGTATGAATTTTATAATCATTGAATTAAGGCGTAAGATTTGGTAACAAAAACTG CTT TTCACGGGAAACGATGAAACATGGCACAATTCAACCTGACTTTCTTTTATGAAAAGGCTC TC AAGCTGCCTTGTACTCCCTCCGTCCCATAATATAAGAACGTTTTTGACATTAGTGTAGTA CC AAAAACGTTCTTATATTATGGGACAGAGGGAGTATTTATTAATGGATAAAAAGAGAGGTA CA GGGTTCTAGTAGTAACATATGCAAATGGTGATACAGTATATTATCTACGATTTTGTGCAA GA ATTTTCACACGAAAACAATATTCCATGCAGTTTTTAGTTTTGACTTTTTTTTTTGNNNNN NNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGAACCCAACATACAG TACAATAGTACAAATGCCGGGGCGGCAGCAGAGAGAGAGGCAGGGGGCGACCCCCCCAT GCTATGTTACATGCAGCTTTGTTCTACCCACGCAGAGCCTGTAGCCAGTTTTGACTCTTT TT GAATTGTTTGGATTGCTTACCACATGTGTCACGTGGTTTTAGTTTTGACTCTTGACTGTT TAA
TACCTACTCTTTTTAAGATTTTGGATTCTAATGTCATCTGTCTTGCAACTCAGAGCT TTCAGC CGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAAACATCCAGCACCATGCAGACTT C ACACCGTTATTCTCGCCAGAGCACTCTTCACCCCTAAAGGCTTACCATGCAACTGCCAAA A GTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACATATGACTATTACAACAAAGTGA ATG CAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCAGGTCTCGCAATATACTTTCTTCA GC CTTATCTCATACCTTTTGGTAACTGCGTTATGATGTTATATGTTTTTTTTTGTACTGCAG GGA AGAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTA A AACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGTAAGCCATTTTTGTCAGAGAA T GGTTTTTAGTCCATATCAAAATTATTGCCGAGCCATTTTTCTAAATAACATTTTATTATT ATTC ATGAACTTCCATATTTTTTTCAAATCTGCAAAAGGAGAAGTTTTTTTCCTGGGGGTTCCA CC ATCTGACTTATATTTGTTATGCAGTTTTCAGAGTGATTGACTATGAAATTACGCCAAAAC AC GAGCATTGCCTAGAGTGCCCTGTAGTTTTAGAAATGTTAAGATTGGATATGTTACTTTTC TA TTACAATGAAACAGGAACCAGATCCTGCCCTTGGCAATGGTGGACTAGGCCGTCTAGCGT CCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGT A CAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATGGTCAGGAGGAGGTAGCTGAGAA T TGGCTAGAGGTAGTGATATGTCTTCTACTTGTTTACTGTTTGGAAGCGTTGATTATGTTG GT TTCTGTTTTAATGATCTGCATGTTAGCTTCTTTTGCCATTTATAACTGATGCTGTATCTC CAG ATGGGAAATCCATGGGAGATTGTAAGAAATGATGTCTCTTATCCTCTGAAATTCTATGGC AA AGTGGTTGAAGGCACTGATGGGAGAAAACACTGGATTGGAGGAGAGAATATCAAGGCTGT GGCACATGATGTTCCTATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTG GT CAACAACAGTACCATCACAAAACTTTGATTTGGGAGCTTTTAATGCTGGGGATCATGTCA AG GCCAACGAAGCACATCTAAATGCTGAAAAGGTATTCCTTTGTCTGCATTTATTTTGAATA TA AATCATTAAATCTCAATAACTATTGATTTAAATGGTAAACACTAAACATTGTCTAGCTAG TAG TCACTGTGGTGTCCCTCAATGTCATTTTAGAATGTGCACAGTCAAATGTTTGATAACTTT GA TTGTAATAATATGATAATATACAGAAACTTTGATTGACACTAGATTTTTCATAAACATAC TTAT TGTATTTCCATGTAAATAGAAAAGCTGATGTTCAGAGGTTTGTGTGTCAAAATGAACAGA GT TTGTGCACACATAATATCTTCATCTGGTGTTTGCAGTTCTCTAGGTTGTGTTTGCGTTTT CTC CTGCTCTGCCTTAACTAATCTCAGCATATGCATCAGGCTACATTGCTCCTCTGTGTCTAT AA AAAATAAGTCTGATGATATATGGTTGCAGTCCAGTAGTTCGTAGATGGCACAGCATGGAA G AACTTTTCACATGGAAGCCTTACCCTTATAATTTTAATTATACCTTTTGAGTGTTTGTGC ATT GTGTACATACACCATTAAGTATGGGTGCTCAAAAATGGTTTATCCCAAACTGTGCAAAAG CC GCCACTGTTTGTGTATTAATTTTTTTGAACTAGCTAAACAACTTGACCGAACAAGATACA TAC AATTGCACTCACTTCTGAAGCAGCTCAGTTGGACGGATTTGTTCCAATCCCGTGCATGGG T TAAATTGTCGAGTTGTCCTCGCTTGTTCAGTAGTAGGAGTACTATCTATTATATTTATAC TAA AGTTTTCTTTTTGTGATACAGTACAAACTTTTAGTAATACTTACATTTGCGACCACTGTG GAC AAAATTTATTATGTCTAACTTAACTTTATCACTGGTCTTCTCATCATGCCAGATATGCCA CGT ATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAAACAACAATATAC A CTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCT C TCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACACCCATCCAACAC T GTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAGGGATTGAGCTGGAATGAGTC T TGGAGTATCACGGAAAGGTATCACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCC T CCAGTTCCAGTTAACTAATTTTATTTCCTCATCTGTCTTCCATTCAATCATGTACTGATT TTCT GATATCCACATCATTTGTAGAACCGTTGCATACACTAACCACACAGTACTTCCTGAAGCT CT GGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTGAGATTATAGA A AGAATAGATGAGGAGGTATCAAATGTGCAGATAATATTATACAATTAATTTCATGCTCTG TA GTTCAACTTATGGTTTACTTGTCTTCATCGTTGATTTTAAACAGCTGATGAACACCATCG TCT CAAAATATGGAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCT TA GACAACGTTGATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAG GG GAAGTTGCTTGTTGAATCTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGA A GAGGAGGAAAACATTCTATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAA GCTCCTGATGCAGAAAAGGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGAT C CTCAGTTACCTAGAGTTGTTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTA A TGGTGTGGCTGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGA GG TATTTGCATGATGATTTGGAAACCGTGATCTTATTTTCCAATGGCTGCTATAATCGTCAG GT GATATGGTATTCACCATCTGCCATTTTCTTGTGTTCTTTGCCAGATGTGGCCTACTAAGT TC CAAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTA A GTACAATAATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTG CA GGACTGAAGAAGGTACTTCCAGTAGTTTTATTCCCCAATAATCAGTAATCTAATGCTTCC TT CACTTCAATGTTAACTGTGCATTGTTTTTACATTGATGTGTGTTCATCCTGTAACAGTTT GCT GATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGGAATAACAAGATGAAGGTA G TTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCCAGATGCAATGTTTGATGTGC A GGTATATTTTGTACTAGAAAACATGTGTCTTCTTATAAATATGGTTTAACTCAGCTCTTG GTA CTGGACACTTAGGTGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAATATCCTTGGT AT CATTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAGCTTTGT TC CAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAGAGGATTG T GAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTGGAGATCTACT GA AGGTAAATTTGTGTCTTTGATGCCACTTATCAGTCTTAATTTTCATGTTCTGTCACTGGC AAG TTGACTCCGACTGAAAAGCATTGAATGTTCAGAGTTTCAAAGGAAAACAGAGCAGTGAAT G TTCTTAATGTCGGTGCTTGTAAAACTGAGCAACTGAATAGCCACAGGTTTTTTAGAGAAC AA TATAGCCACAAATTTTTTTGACACTTTTATTGCTATCAGCATCTGATGCTGATATTACTG ATA ACTAGATGAGGTAAATATAAGGTGACAGGTTAAGTCTGAGGATTTTTGTTTGTTATCCAG GT TGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCATTCCTGCCAGTGAATT GT CCCAGCATATCAGGTAATATTCTGCCTACTCTTATATAGTATGTGAATACATAAAAAAAG AAT ATTACCTGGAGTTGGCTGACATAGAATTTTTCATGTGCCTTTTTGTTTGCTATAATATGT ACC TGGAGTCTTACTGGGATAGTGCCATCCAGAAGTGCCACCCCAGTTTCTATTTCTGTATAA G ACGTGCATTTTTAACAGTATTACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAA CC AGCAACATGAAGTTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAAT GT GGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGA A ATCGCTGGTTTGAGGCAAGAGAGAGCTGAAGGGAAGGTACCACACTTGTTTATATAGCTG T GCCTACTATCAAACTAGTAAAGATCTCTCCCATATGTGCTGTCCAAATAGTTTAGCTGAA GG AAAATCATAACTTTACAAGTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGA AG
AGGTTAAGGAATACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGA TGGG TTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTT TC CCAGTTACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTT G
TATAACTGACGCTGTTTGTTTTCTGGCTGAAATGCTCACAGTGATTATGCAAACTTT CTGAT
ATGTGATGGCTGACATCAGTGTTTCACTGTGAACGACTTCATTGATTCGTAGGAAGG AATAT
TTCTTAGTTTAAAGTTTCAAACTGAAGTATCTATCGGTGTGTTGGAAGATCATAAAC AATCA
GAGAATATAGTTGCTAGATGTGCTATATTACTCATCTGTTGGCTGTTGAGTGTTGAT ACATG
AGCTTTGATGTAGCTAGACCTTCAGGAAAGTGCTTCTGTGCATTCCAATAAAAAAAT AACTT
ATAACTTAACAAAACAAAACAATTAGGTTGAACCTCACCAGGGGGCCTTTTGTTATC TGTTT
TCTGAAATGAAAATCAGAAGTGGGGAGCTCCTCAATTCGAAAAAGAAAAAGAAAATC AGGG
GGTAACTGAGGGTCTGGTTTCCCTCTGTTGGAGCCCCCAAGAATTCTGTGGTCTTAT TTTT
GTAAGAGTATAAGATGCACTGCTCCGTTAAGTTTCTCCTTCACCCATTTGTTTTCAT TGCTT
CGTGACATTTCGTGTGACAGCTCAGATCACCATATGCATCCCATTTGTACACATTAT TTTGC
TACCGTGGATATTTCCACTCTGTTTCATGTGTGACATTTTTAATTCGTTACATTGTT ACTCTA
AACATTAGTCTAATAGATTTTCAGAGTTTTGACCCTCCTAATCACATAATATCCGTT TCGACC
TGCAGCTATGGACAAGAATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCA GTGA
CCGGACGATCCACGAGTATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCC CTG
AATAATTCAAGCAGCTTCCAGGAACCTTGAGAGTCTATGAGGCGCGATTGGTCGCTG ATTT
GCCTTGATTCAGCGAGAATGTATTGCTCCCCCCAGTTTTGTATAGGCATATGCAAGT TCTG
ACATGGACCTGAGTGCTGAGAGCATGCAGGGTAGTTCATCCTTAGCAATATTGCATG TCAG
TTAGCCAGTATTACCCAGGCTTATGGTGTGTGTATTTCGCTGAATAAAGTTGGAGCA GAGT
CATCAAACAAACATACTTTGGTGTATCTGTAATGATAAATAAATGGTACATGTGACT GAACA
AAAAGTGTCGTTGGCGTGCACATGTAGTAGAAGCATGTTTCTTCAGAGAGGGTGCAA AGCT
GAAAGCACCCCAGCTTTGGAGTGAGAATGCAGAAGTTACGAAAACATTTGTTTACAT CCAA
AGTGCAACGGGATTCGTCAAGAGTTCAGAGTCTTCCATCACGGTCATAGAGCATGTC CAAT
AGCAGCCCTATAAACCTATAACAGAGCAGTAGCAGTGGTCCCTCACATTTTGAACCA AAAA
TGCAGCTCCAACGGCTGCCCTATTTGTAAAATTTACTTCAAAATTTGTTGGACAGCC CTAAA
ATGCAACACCAACTGATGTAAATGTACAACTATTCGGCTGCTGCCCTAAAATTTGCT CCAAG
TGGCGAACCACTCAGGACCAC
SEQ ID NO: 2 >Wheat (hexapioid) PHS1 5A (Chinese Spring reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCCCCTTCCGCCCGCTCGCCGTCGCCGGA G GTAGCGGCGGACTCGTCGTGGGCGCCAGGGCCGTGGTACCGCCGAGGCGGGGGCGGC GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCCCGA CCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAAACA T CCAGCACCATGCAGACTTCACACCGTTATTCTCGCCAGAGCACTCTTCACCCCTAAAGGC T TACCATGCAACTGCCAAAAGTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACATAT GA CTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCAGGG AA GAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAA A ACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAA T GGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCA G CATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATG G TCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAA TGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAA A CACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTAC A AGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAACTTTG ATT
TGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAGCACATCTAAATGCTG AAAA GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAA A CAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C CCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAGGGATTGAG C TGGAATGAGTCTTGGAGTATCACGGAAAGAACCGTTGCATACACTAACCACACAGTACTT C CTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATTATAGAAAGAATAGATGAGGAGCTGATGAACACCATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAAACATTC T ATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCTAGAGT TG TTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTC A
CAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAA GTTCC
AAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAAT TAAG
TACAATAATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACT TGCAG
GACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAA GGAA
TAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCC AGAT
GCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAAT ATCCT
TGGTATCATTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAA GAGC
TTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCA AAGA
GGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTG GAGAT
CTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCATT CCTGC
CAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAA CAT
GAAGTTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGA GATC
AGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATC GCTG
GTTTGAGGCAAGAGAGAGCTGAAGGGAAGTTTGTGCCTGACCCGAGATTTGAAGAGG TTA
AGGAATACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTT CTTT
GGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCC CAGT
TACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACA AGAA
TGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACG AGT
ATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 3 >Wheat (hexapioid) PHS1 5A (Chinese Spring reference genome) protein
MATASPPLATPFRPLAVAGGSGGLWGARAVVPPRRGRRGFVVRSVASDREVRGPAPT EEEL
SAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLILNWNATYD YYNKVNAK
QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNGGLG RLASCF
LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPL KFYGK
VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAG DHVKAN
EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWED FPSKVAV
QMNDTHPTLCIPELMRILMDVKGLSWNESWSITERTVAYTNHTVLPEALEKWSLDIM QKLLPRH
VEIIERIDEELMNTIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEKK GKLLVESLESI
AEADEKTESEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFTKYDPQLPRWR MANLC
VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSTI ISKWIGS
DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYWSPDAMFDVQ VKRIH
EYKRQLLNILGIIYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFIT DVAATVNYD
PDIGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCILIG TLDGANVEI
REEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGTGNYDEL MGSLEG
NEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIH EYAKDI
WGISPVIMP
SEQ ID NO: 4 >Wheat (hexapioid) PHS1 5A (Chinese Spring reference genome) promoter (-2kb upstream of ATG)
CACCACGCAGCCTAGGGGCACCGTCTGGCCATCTGGAGCCCCGGTGGCTCCCTTGGA CC
CCATCTTCTCGCTAGTGGGGCCTTGGTCCAAAATAATCCATGTAATTTTTCATGTTT TCCGG
CTTCGTATTTTTTTGGTAAAAACCCAAAAACAATAGAAAAGAAGAATTGGCCTCTGA CACTA GATTGATAGGTTAGTCCAAATCAATATTATCAAAAATTGTGCCCAAACCATATACAAATG ATA
TAAATATAACTAGAATACTTTATAAATTTAGATACGTTTGATAAAGTGCCAAGAATG GAAGAG AAGTGATCATATGCCGCCCTGTCCATATCGGGCTCATCAGAATGTTCCCACCATCAGACT C
CAGCCCCAGGATTATTAACTTATGTCGGCGGTGCGCTGAATGGATCTTGAAGAAGGC GGA
ATTTGTGTCGCCCTCACGTAACCAGGAGAACTTGGCAGGTTGACGCGCAATCGACCT CTC CAGCAAAGCAAGTCCTAGGTAGGCGCGCTTCAGCTCGCAGAGTAACCGTACTTCTGGAGG
AGCAAGCGGGTGGTGATCTTGAGCCGCCGAGACGAGCAATCAATTACCGAGCGAGAA GC ATCTGAGTGGAGATGTTACCGAGCGTTCTGGAACTCCAACTCTGCCGTCAGTGCATTGTC A
GCTTGTGCTTTGCGTAGATTCGCCGGAAAGGATCGGGGTCATGCACGCCAGAATTCC AGG TGTCAGAGACCACCTAATGAAAACCATCGAGCTTTGATCAAAATAGCTCAAGATGGAATC T
CCGATCACCTTGGGAGCGAAAATCTTACTAGTTTCGTTCAAGGCCAAGGTAGTTATA TATTT GTGTGTGCAATTACTAATCATTGGCCGCTTAGCAAACTCTTATTGGTTCGATAAACGTTA AA GTCATCCACTTTGAGAAAAAATGCTACTTGCTACAAACCCTTTCACTTTGGGAGGAGGGG G
GGTGGGTGGGGCAACATGTATCAGGATGATGGTGCCACCCTTGGCAAGGAATAAATC CTT
GCCGCCTCGTCCTTGTTTGGATGCTGCTTCTTGTCATCTCAAGGAGGCGTGTGGAGG GTG
GTGTGTCCCCGGATCTTGTTCTCTCGGTTGGGTTTTTTCTATGGCAGGTTTATCTCA GTGGT
AGTGTCACGTGAAGCAGATGACGCGACTATGCGTCTTGGTCCTCTGCGCTCTAATTC CGG
CAGATGAAGGTTGACCAGCTTCAGCCTCGTCGGGGAGCGTATGAGATTTGTGTTTTT TGAC
TCCATCTGGTCGGACCTGTGGCATTATGTAGACCCTCTCATGTATTTTTTTCTCCTG GACAG
TAGTCTGTTACACAAATCGTGGTCATCGGATCTTCTAGTCCACATCGGCGACTTTTT CGACC
GCCACTTCTACAAACTCCTTATTTTCAACAAGTTTCCTTCATTGAGATGGAAGTCCA GGCAA
TGGCTTTGCTCACGGCACACTGCCGGCGGATGCAAGAGAAAGAAGACATTCGACACC CCT
AAAACTATGAAGATGTAATTTTTAATCTTTGTAAGTGTTTTTGTGAGGATTTGTGCT ACTTTT
AATATATTGCCTTTTGCCTTTTGCAAGGGAAAAAAACTAAACCATTACACGAAGAGA AGAGC
CATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACAGCACGGGGGCAATTCCC AGC
CGTCGAAATGCGCTGCAACGGAACCAGCGCTCCCCGACACGGCGGCACATGAGCCAC AA
CTCGTACCATGCACCGTGCACCGGCGGTGGGCCCACTGGCCAGCGTCCCCACACGTG TA
CCGGCGGCCACAATTACACGGCACGGCACGGCGACGGAGAGATATTTTAACGCCTTC CAA
TTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCT CGC
GTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCGCCA
SEQ ID NO: 5 >Wheat (hexapioid) PHS1 5B (Chinese Spring reference genome) genomic DNA
TTGCACGGCACGGCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGAAGCTCCCC ATCA CTCGCTCGGGTATCAGAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGC ACACCACCGTCAGCGCGACCACTGGCGCCAATGGCGACCGCCTCGCCGCCGCTCGCCAC CGCCTTCCGCCCGCTCGCCGCCGCCGGCGGCGCCGGAGGAGGAGGAGCGCACGCCGTG GGCGCCGCGGGCCGGGTCGCGCCGAGGCGGGGGCGCCGGGGCTTCGTGGTGCGGAGC GTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCTCGACCGAGGAAGGTAAGCGGCCGC GCGCTCCCGTTGGCTACGACGACCCGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTT CGCTGGTGCGACCTGAATTTGTGATCCTCTGACTGATGGTCTATGTGTTCCCGGGTTTCG G GCTGTTGATTTTGTGCGCGCGCATTGCTAATCTACTGCTCTAGGATCTGCTAGGGTTCAG A GGAATCTTAAATTCGTGAATAAGCTTCGACGAATGTATTCTGAATTTACTGAAACTGGCT TA GTTTGGGGCGTCAAGTTAGAATGATGTATTTATGGCACAGTAGCAAATTCTGACAAATTC TT CTGTGCGTGGACAGCCACCCCTTGTGGATGATGCAGCAAGTGTCAACATACTGTCGCTAC TATCTGTAGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGTACAGTATATTATCATT TTA CAAAGAGTAGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGGTTGCCATTGAGGAGT A AGTAGAATGCAAATGGTGATACAGTATATTATCTACCATTTTGTGCAAGAATTTTCACAC GG AAACAATATCCCATGCAATTTTTAGTTTTGACTCTCGACTGTTTATACCTACTCTTTGTA AAA TTTTGGATCCTAATATTGTTGTTCTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCAT TG ATTCTTCCGCCATCGCGTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCAC C AGAGCACTCTTCACCCCTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCT GA TCATCAATTGGAATGCAACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATT AC CTGTCCATGGAGTTTTTGCAGGTCTCACAATATACTTCCTTTAGCCTTATCTCAATACCT TTT GGTAACTGCGTTATGATGTTATATGGTTGTTATCTGTGTACTGCAGGGAAGAGCTCTCAC A AATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTGGA C AGAACCTAGAGGATGTTGCTAGTCAGGTAAGTGATTTTTGTCGGAGATTGGTTTTTAGTC C GTATCAAAATTATCGCAGGGCCATTGTTTTTATTTTGAAAACGCATTTTATTATTAATCA TCA AGTTCCCTTTGTTTAAAAACTATCTGAAAAGAAGAAGTTTTTTTCCTGAGGGTTCCACCG TC TGACTTATGATCTTTATGCAGTTTTCAGAGTGACTATGAAATTACGCCAAAACACGATCT TAT TGATGCCTAGAGTGCCCTGTAGTTTAGAAATCTAAGATTGGATATGTTACTTTTCTATTA CAA TGAAACAGGAACCAGATCCTGCCCTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTT TTCTGGATTCTATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGAT AT GGCCTCTTTAAGCAAATCATAGCAAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTA GAGGTAGTGATATGTCTTCTACTTGTTTACTGTTTGGAAGTGTTGATTATGTTGGTTTAT TTA TGTTCTAATGACCTGCATGTTAGCTTCTTTTCTATTTATAACTGATGCTGTATCTCCAGA TGG GAAATCCATGGGAGATCGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAG TG GTTGAAGGCACTGATGGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCA CATGATGTTCCTATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCA AC AACAGTACCATCACAAAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGC C AATGAAGCACATCTAAATGCTGAAAAGGTATTCTTTCGTCTACTTTATTTTGAATATAAA CCA TTAACTCTCAACAACTACTGATTCAAAGGCTAAACAGTGCTTTGCTAGTAGTTACTGTGT TTT CTCTTAGATGTCATTTTATAATGTGTGCAGTCAATCGTTTGATATACACAAACCTATTTG GAT TTCTCCTGTAAAAATGTTGTCACTAGATTTTTCATGAAAAAACTTATTGTATATGTAAAT AGA AAAGCTGAGGCTCAGGTGCTTTGTGTTCAAATGACAAGTAAAAATGAACAGAGTTTGTGC A CCCATAACTTCTTCACCTGGTGTGTGCAATTCTTTCGGTTGTATTTATGCGTGTTCTTAT TCT GCCTGAATGATTCTCAGCATATGCATCAGGCTACATTTCTCCTCTGTGTCAAGAAAAAAT AA GTCTGACCAGATGCAGTCCAGTAGTCTTTAGATCTTTTCACATGGAAGCGTTACTCTATG AT TTTTATTATACCTTTAGAGTGTTTGTGCATTGTGCACATATACCATGAAGTATGGGTGTT CAA ATGTGGTTCTCTCAAACTGTGCTAAATCCACCACTGTTTGCCAGTATTCATTTTTCTGAA CTA GCAACAACTTGACTGGACAAGATACATTGTACTCACATCTGAAGCTGCTGGGTTGGACAG A TTTGTCCGACCCGTCTTTGAACATCTCTACTATTTAGTGTGCATGGGTTAAATTGTTGAC TTT TCCTCGTTTGTTCAGTAGTAGGAGCACTATCTAATTATATTTATATTCAAGTTTTCTCTG TGT GTTACGTTATAAACTTTTAGTCATACTTACATTTGTGGACAAAAGTTATCATGTCTAGCT TAA GTTTGTGACTGATCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGAAT C ATCAGAGGGGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCA G GACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCC T CTAAAGTTGCAGTTCAGATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGA G AATACTGATGGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGTA C CACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCAGCTCCAATTAACTAATT GT ATTGCCTCATCTGTCTTCCATTCAATTATGTACTGATTCTCTGATATCATTTGTAGAACC GTT GCATACACTAACCACACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATG C AGAAACTTCTACCCCGACATGTTGAGATCATAGAAACAATAGATGAGAAGGTAACAAATG T GCAGATAATATTATGCAATAAATTTCATTCTCTGTAGTTCAACTTATGGTTTACTTATCA ACA TCGTTGATTTTAAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGATATT TC ACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCTC TG TTGCTAAACTGTTTATTAAACCTAAAGAGAAAACGGGGAAGTTGCTTGTTCAATCTTTGG AG TCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGAGGAAAACATTCTATCCGAG A CAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGATC CTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGTTGTTCGAA TG GCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGTGAA A TTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAAACC AT GATCTTATTTTCTAACGGCTGCTATAATCGCCAGGTAATATGGTATCACCATCTGCCATT TT CTTATGTTATTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAACGGAGTAACTC CC AGGCGTTGGATCCGGTTTTGTAATCCTGAATTAAGTGCAATCATTTCAAAATGGATAGGC TC TGATGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAAT TT TAATTTCCAAATATCAGTAATCTAATGCTTCCTTCACTTCCATGTTACCTGTGCCTTGTT TTT ACATTGATGTGTGTTCATCCCGTAATAGTTTGCTGATGATGAGGATCTGCAATCAGAATG G CGTACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGA T ATGTTGTCAGCCCAGATGCGATGTTTGATGTGCAGGTATATTTTGGACTAGAAACATGTG T CTCCTGTTAAATATGGTTTAACTCGGATCTTGGTACTTGACACTTAGGTGAAAAGAATAC AT GAATATAAGCGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAA G AAATGAGTGCCAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGA A AGCTTTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGC T ACTGTAAATTATGACCCTGATGTTGGAGATTTACTGAAGGTAAATTTGTGTCTTCAATGG CA CTTATCATTCTAAATTTTCATGATCTGTCACTAGCAATTTGACTCCGACTAGGGGTTACT GA AAAGCACTGAATGTTCCGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGTCGG T GCTTGTAGAACTGAGAAACTGAATAGCCACAGTTTTTTTTTAGAGAACAATAGCCACAAT TG TTCTGATGCTGATTTTGCTGATAACTAGATGAGGTAAATACAAGGTGACAGGTTAAGTTC TG AGGATTTTTGTTTGTTATCCAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGA GA AGCTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGGTAATATTCTGGCTACTCTTATA TA GTGCTCCCTCCGTCCCAAAATTCTTGTCTTAGATTTGTCCAAATACGGATGTATCTAGTT AC ATTTTAGTGTTAGATACATCCGTATCTAGACAAATGTAAGAGAAGAATTTTGGGACGGAG G GAGTATGTGAATACAAAGAAAAGGAGTCTTATCTGGAGGTGACTGATGTGGAAATTTTCA T GTGCCTTTTTGTTTGCTACTCCCTCCGTCCCATAATATAAGAGCGTTTTTGACAGTACAC TA TTTTCAAAAATGCTCTTATAATATGGGACGGAGGGAGTATAATATATACCTGGAGTCTTG CT GGGATAGTGACATCCAGAAGTGCCACGCCAGCTTCTATTTCTGTACAAGACGTGCATTTT T AACACTATTACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACATGA A GTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAG A
GAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCT GGTT
TGAGGCAAGAGAGGGCTGAAGGAAAGGTACCACACTTGTTTATATAGCTGTGCCTAC TATC
AAACTAGTAAAGATATCTCCCATATGTGTGCTGTCCAAATAGTTTAGCTGAAGGAAA ATCAT
AACTTTATCATTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGG TTAAG
GAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCT TTGG
AAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCA GTTAC
ATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATA ACTG
ACGCTGTTTGTTTTCTGGCTGAAATGCATACGGTGATTATGCAAACTTTCTGATATG TGATG
GCTGACATCAGTGTTTCACTGTTAATGACTTTACTGATTCGTAGGAAGGAATATTTC TTAATT
CAAAGTTTCAAACCGGAGTATCTATCGATGTGTTGGAACTTGGAAGATCATAAACAA TCAGA
GAATATAGTTGCTAGATGTGCTATACTACTCATCTGTTGGCTGTTGAGTGTTGATAC ATGAG
CTTTGATAAACCTAGCTAGACCTTCAGGAAAGTGCTTCTATGCATTCAATAAAAAAA TAACTT
TATAACTTAACAAAATAATTAGGTTGAACCTCGCCAGGGGGGCCCTTTTGTAATCTG TTTTC
TGAAATGAAAATCAGAAGCAGGGAGCTCCTCGATTCTAAAATGAAAAATAAAATCAG GGAT
AACTGAGGGTCTGGTTTCCCTCTGTTGGAGCCCCAGAGAATTCTGCGGTCTTATTTT TGTA
AGAGTATAAGATGCACTGCTCCGTTAAGTTTCTCCTTCATCCATTTGTTTTCTTTGC TTCGTG
ACATTTGGTGTAACAGAGCATCCCATTTGTACACATTATTTTGCTATCGTGGATATT TCCACT
CTGTTTCATGTGCGACATTTTAAATTCGTTACATCGTTACTAACTACACTTAGTCTG ATAGAT
TTTCAGAGTTTTGGTCCTCCTAATAACATAATATCCGTTTCTACCTGCAGCTATGGA CAAGA
ATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCAT GAG
TACGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGAATAATTCAAGCA GCTT
CCAGGAACCTTGAGAGTCTATGAGGCGCGATTGGTCGCATATTTGCCTCGATTCAGC GAG
AATGCATTGCTCCCCCCCAGTTTTGTATAGGCATATGCAAGTTCTGACATGGACCTG AGTG
CTGAGAGCATGCAGGGTAGTTAATCCTTAGCAATCTTGCGTGTCAGTTGGCCAGTAT TACC
CAGGCTTATGGTGTGTGTATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAA CATA
CTTTGGTGTATCTGTAATGATAAATAAATGGTAAATGACTGAACAAAAAGTGTCGTT GGCGC
ACACATGTAGTAGAAGCATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCCCAA CTTT
GAAGTGAGAATGCAGAAGTCACGAAAACATTTGTTTACATCCAAAGTGCAATGGGAT TCGT
CAAGAGTTCAGAATCTTCCCTATAATAGAGTAGTAGCAGTGGTCCCTCACATTTTGA
SEQ ID NO: 6 >Wheat (hexapioid) PHS1 5B (Chinese Spring reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGCCGGC G
GCGCCGGAGGAGGAGGAGCGCACGCCGTGGGCGCCGCGGGCCGGGTCGCGCCGAGGC
GGGGGCGCCGGGGCTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGC
CCGCCTCGACCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCA TCGC
GTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCACCAGAGCACTCTTC ACCC
CTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGG AATGC
AACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGA GTTTT
TGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATG CAGA
AGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCC TGCC
CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACC CTAA
ATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCA TAGC
AAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGA GAT
CGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCAC TGAT
GGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCT ATT
CCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCA TCACA
AAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACA TCTA
AATGCTGAAAAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAA ATTC
TTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTC GTTTT
GAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTT CAG
ATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGAGAATACTGATGGAT ATAAA
GGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAA CCA
CACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCT ACCC
CGACATGTTGAGATCATAGAAACAATAGATGAGAAGCTGATGAACAACATCGTCTCA AAATA
TGGAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGA CAAC
GTTGATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAACGGGG AAGTT GCTTGTTCAATCTTTGGAGTCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGA G
GAAAACATTCTATCCGAGACAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCT CCT GATGCAGAAAAGGAGGATCCTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAG T
TACCTAGAGTTGTTCGAATGGCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATG GTGTG GCTGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGG CC TACTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAA T
CCTGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAAC ACTGA
TAAACTTGCAGGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCG TACT GCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTT G
TCAGCCCAGATGCGATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGC AGCT GCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAAGAAATGAGTGCCAAAGA TA GAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTACG TA CAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGACCCT G
ATGTTGGAGATTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTG AGAAG CTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGA A CCAGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGA AT GTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCT
GAAATCGCTGGTTTGAGGCAAGAGAGGGCTGAAGGAAAGTTTGTGCCTGACCCGAGA TTT GAAGAGGTTAAGGAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTG A TGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGG A
TTTTCCCAGTTACATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAA GCTAT GGACAAGAATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGA TCCATGAGTACGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 7 >Wheat (hexapioid) PHS1 5B (Chinese Spring reference genome) protein
MATASPPLATAFRPLAAAGGAGGGGAHAVGAAGRVAPRRGRRGFWRSVASDREVRGP AST
EEELSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWN ATYDYYNKV
NAKQAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNG GLGRLA
SCFLDSMATLNYPAWGYGLRYRYGLFKQIIAKDGQEEVAENWLEMGNPWEIVRNDVS YPVKF
YGKVVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAF NAGDHA
KANEAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLN WEDFPSK
VAVQMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSL DIMQKLLP
RHVEIIETIDEKLMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKE KTGKLLVQSL
ESIAEGDEKTESQEEENILSETAEKKGGSDSEEAPDAEKEDPVYELDPFAKYDPQLP RWRMA
NLCVVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPEL SAIISK
WIGSDDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYVVSPDA MFDVQ
VKRIHEYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRI VKFITDVAA
TVNYDPDVGDLLKWFVPDYNVSVAEKLIPASELSQHISTAGMEASGTSNMKFAMNGC ILIGTL
DGANVEIREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGVFG TSNYDE
LMGSLEGNEGYGRADYFLVGKDFPSYIECQQKVDEAYRDQKLWTRMSILNTAGSPKF SSDRTI
HEYAKDIWDISPVIMP
SEQ ID NO: 8 >Wheat (hexapioid) PHS1 5B (Chinese Spring reference genome) promoter (-2kb upstream of ATG)
TCATCCTCGGTGAAGGGCTGCTCAAGTGCAGGGAGGTCAAAGTGATGTGGGTCAATG CTA
TGGGGATCAAGGGAGAGGGTCGGTGCTCAAAAGTGCCAAGAATGGAAGAGAAGTGAT CAT ATACCGCCTTGGCCATATTGGGCTCGCTGGAATGTGCCCACCATCAGGCTGCAGCCCTAG
AATCATTAACTTCTGTCGCTAGTGCGTGGAATGGATCTTGAAGAAGGCGGAATTTGT GTCG CCCTCACGTAGCCAGGAGAACTTGACAGGGACCTCTCCAGTGAAGCAAGTCCTAGGTAGG
CGCGCTTCAGCTCGCGGAGTAACTAGACTTCTGGAGGAGCTACCAGGTGGTGATCTT GAG
CCGCATCGAGACGAGCGGTCTACTCCCGAGCGAGAAGCATCTGAGTGGACATGTTAC TGA
GCGTTCTGGGGATCCAACTCTATAGTCAGCGCATTGTCAGCTTGAGCTTTGCGTAGA TCCA
TCGGAAAGGATCGGGGTCATGCACGCTAGAATTCCAGGTGTCAGAGACCACCTGACT AAA
ACCATCGAGCTTTCGCCAAAATAGCTGAAGATGGAATCTTTGATCACCTTGGGAGCA AAAA TTTCATTGGAGGCCAAGTTAGTTCTATATTTGTGTCTGCAATTACTGATCATTGGCCGCT TA
GCAAACTCCTATTGGTTCGGTAAAGCTTAAGGTCATCCACTTTGGAAAAATGCTACT TGCTA CAAACCCTCTCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNTGGCTTTTCTAGTTGAGAGGAAGCAAAGACGTATCAAGACGACAATATCACCCT C GGCAAGGAATAAGTCCTTCCCGCCTCGTCCTCGTTTCGATGCTGCTTCTTGTCATTGCGG T GAGGCGCGTGGAGGTTGGCGTGTCCCCAAATCTAGTTATCTGGGTCGGGGTTTTTCTTTG GCAGGTTTATCTCAGTGGTAGTGTCCCGTGAAGGGGATGACACGACTATGTGTCTTGGTC CTCCATGCTCTAGTTTTGACATGTGAAGGTTGGCCAACTTCAGCGTTGTCGGGGAGCATA T GAGATTTGTGTTCCTTTACTTCGCCTGGTCAGACCTGCGGTATTATATAAGCCCTCTCAT GT GTTTTCTTCTCTTGGGCAGCAGTCAGTTACGCAAATCGTGGTCGCTGACATCTTCTAGTC C ACATTGGCGACTTCCCGACCGCCGCTTCTACAAGCTCCTTATTTTTCAACAAGTTTTCTC CA CCGAGATGGAAATCAAGAGGCGGCAACGGCTTTGCTCATGGCGCACTGCCCATGGATGCA
GGAGAAGGAAGACATTCCACACCCCTCAAACTATGAAGATGTAATTTTTAGTTTTTG TAAGT ATTTTTGTGAGGATTTGTGCTACTTTTAATATATATGGCCTTTTGCCTTTTTCCAGGAAA GGA AAAACTAAACCATTACATGAGGCCATTCCCATCCGTCCAATAAATACGACAAAACGCTTT AC AGCACGGGGGCAATTCCCAGCCGTCGAAATGCGCTGCAACGGAACTAGCTCTCCCAGACA GACACAGCGGCACATGAGCCACGCCTCCCATCGCAACCGTGCACCGGCGGTGGGCCCAC CGGCCAGCGCCCCCACACGTGTACCGGCGGCCAGCATTGCACGGCACGGCGACGGAGA GATATTTTTAACGCCTTCCAATTCAGAAGCTCCCCATCACTCGCTCGGGTATCAGAGCGG G CCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCAC TGGCGCCA
SEQ ID NO: 9 >Wheat (hexapioid) PHS1 5D (Chinese Spring reference genome) genomic DNA
AGTGGTAGTGTCTCGTGAAAGCGGATGACGCGACTATGTGTCTTGGTCCTCCACGCT CTA GTTTCGACAGGTGAAGGTTGGCCAGCTTCGGCCTCGTCGGGGAGCGTATGAGATTTGTGT TCCTTTACTTCGTCTGGCCAGACTTCCCGACCGCCGCTTCTACAAGCTCCTTATTTCAAG C AAGTTTTCTTCACCGAGATGGAAGTCCAGAGGCGGCAACAGCTTTGCTCACGGCGCAGTG GCGGTGGATGCAGGAGAAAGAAGACATTCAACACCCCTAAACTATGAAGATGTAATTTTT T ATTTTTGTAAGTGCTTTTGTGAGGACTTGTGCTACTTTTAATATGTGGCCTTTTGCCTTT TTC CAGAAAAAAAAAACTAAACCATTACACGAGGAGAAGAGCCATTCCCAGCCGTCCAATAAA C ACGACAAAACGCTTTACAGCACGGGGGCAGTTCCCAGCCGTCGAAATGCGCTGCAACGGA ACTAGTAGCTCTCCCTGACACGGCGGCACATGAGCCACGCCTTGCGCCGTGCACCGGCG GTGGGCCCACCGGCCAGTGCCCCCACACGTGTACCGGCGGCCAGCATTGCACGGCACG GCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGG T ACAGAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCA GCGCGACCACTGCCACCAATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCC GCTCGCCGCCGGCGGCGGCACCGGAGGAGGAGGAGCGCACGCCGCGGGCTGGGTCGC GCCGAGGCGGGGGCGCCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGT GCGGGGGCCGGCCTCGACCGAGGAAGGTAAGCGCCGCGCGCTCCCGTTGGCTACGACG ACCCGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGTGCGACCTGAATTTG T GATCCTCTGACTGACGGTCTCTGTGTTCCCCGGTTTCGGGCTATTGATTTTGTGCGCGCG C ATTGCTAATCTACTGCTCTAGGATTTGTTAGGGTTCAGAGGAATCTTAAATTCGTGCATA AA
CTTCGACGAATGTATTCTGAATTTACTGAAACTGGCTTAGTTTGGCGCATCAAGTTA GAATG ATGTATTTATGGTACAGTAGCAAATTCTGAGAAATTCTTCTGTGCCTGCACAGCCACCCC TT GTGGATGATGCAGCAAGTGTCAACATACTGTCGCTACTATCTGTAGCTGTAACAAAATCT TA CGCCTTGTTTTCTGGTGGTACAGTAGTATTATCATTTTACAAAGAGTAGCTGTAACAAAA TC TTACGCCTTGTTTTCTGGTGGCTTGCCATTGGGGAGTAGAATGCAAATGGTGATACAGTA T ATTATCTACGATTTTGTGCAAGAATTTTCACACGAAAACAATGTCCCATGCAGTTTTTAG TTT TGACTCTTGACTGTTTAATACCTACTCTTTGTAAGATTTCGGATCCTAATATTGTTGTTC TTG CAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAATAT CC AGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCCTAAAGGCTT A CCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAATGCAACATATGA CT ATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGTCT CA CAATATACTTCCTTTAGCCTTATCTCAATACTTTTTGGTAACTGCGTTATGATGTTATAT GGT TATTATCTGTGTACTGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAAC A GGACAATATGCAGAAGCGTTAAAACAACTTGGACATAACCTAGAGGATGTTGCTAGTCAG G TAAGTGATTTTTGTCAGAGATTGGTTTTTAGTCTGTATCAAAATTATCGCAGAGCCATTT TTT TTTTCTAAAAAACACATTTTATTATTAATCATCAAGTTCCTTTTGTTTAAAAACTATCTG AAAA GAAGAAGTATTTTCCCCTGGGGGTTCCAGCATCTGACTTATATTCTTTATGCAGTTTTCA GA GTGAGTGACTATGAAATTACGCCAAAACACGAGCTTATTGATGCCTAGAGTGCCCTGTAG T TTAGAAATCTATTATTGGATATGTTACTTTTCTATTACAATGAAACAGGAACCAGATCCT GCC CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTA A ATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAA CA AAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATATGTCTTCTACTT G TTTACTGTTTCGAAGTGTTGACTATGTTGGTTTATGTTTTAATGACCTGCATGTTAGCTT CTT TTGCTGTTATAACTGATGCTGTATCTCCAGATGGGAAATCCATGGGAGATTGTAAGAAAT GA TGTCTCTTATCCTGTGAAATTCTACGGCAAAGTGGTTGAAGGGACTGATGGGAGAAAACA C TGGATTGGAGGAGAGAATATTAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACAAG A CTAAAACTACCAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAATTTTGATC TG GGAGCTTTCAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTAAATGCTGAAAAG GTATTCTTCCGTCTACATTTATTTTGAACGTAAACCATTAACTCTCAACAACTATTGATT CAA AGGCTAAACACTGTCTTGCTAGTAGTTACTGTGTTTTCTCTTAGATGTCATTTTAGAATG TG CGCGGTCAATCGTTTGATAACTTTGATAGTAATAATATACTGGTATACACAAACCTATTT GG GTTTCTCATGTAAAAATGTCGTCACCAAATTTTTCATGAAAATACTTATTGTATTTTCAT GTAA ATAGAAAAGCTGATGGTCAGGTGTTTTGTGTGTCAAATGACAAGTAAAAATGAACAGAGT TT GTGCACCAATAACTTCTTCACCTGGTGTTTGTAATTCTTTCGGTTGTATTTATGTGTTTT CTT ATTCTGCCTTAATTAACTCAGCATATGCATCAGGTTACATTACTCCTCTGTGTCAAGAAA AA AGAAGTCTGATGAGATGCAGTCCAGTAGTCTGTAGATGGCACATTGTAGAAGAACTTTTC A CATGGAAACGTTACTCTGTGATTTTTGTTATGCCTTTTGAGTGTTTGTGCATTGTGCACA TAT ACCATTAAGTATAAGTGTTCAAAAGTGGTTCTCTCATACTGTGCAAACTAGCTAAACAAC TT GACTGGACAAGATAAATTGTACTCACATCTGAAGCTGCCGGGTTGGACAGATTTGTCCGA A CCATCTTTGAACATCCCTACTAGTGTGCATGGGTTAAATTGTTGACTTTTCCTCGTTTGT TC AGTAGTAGGACTATCTAAATATATTTATATTCAAGTTTTCTTTGTGTGTTACGTTATAAA CTTT TAGTCATAATTACATTTGTGACCATTGTGGACAAAATTTATCATGTCTAACTTAAGTTTG TGA CTGGTCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAG G GGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTA TT TCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTT G
CAGTTCAGATGAATGACACTCATCCAACACTGTGCATTCCCGAGTTAATGAGAATAC TGAT GGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGTACCACAGAAG GGTACTGCAGTTTGATGTGTTCTGCTGTTCCTCCAGTTTCAATTAACTAATTTTATCCCT CAT ATGTCTTCCATTCAATAATAAACTGATTGTCTGATATCATTTGTAGAACCGTTGCATACA CTA ACCACACGGTACTTCCTGAAGCCCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTC T ACCTCGACATGTTGAGATCATAGAAACAATAGATGAGGAGGTAACAAATGTGCAGATAAT A TTATACAATTAATTTCATGCTCTGTAGGTCAACTTATGGTTTACTTATCTTCATCGTTGA TTTT AAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGATATTTCACTGTTAAA AC AGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCTCTGTGGCTAAAC T GTTTGTTAAAACAAAAATGAAAAAGGGGAAGTTGCTTGTTGAATCTTTGGAGTCTATTGC TG AAGCTGACGAGAAAACTGAGCCAGAAGAAGAGGAAAACATTCTATCCGAGACAACAGAGA AGAAGGGCGAGTCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGATCCTGAGTATG AGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGTTGTTCGAATGGCAAACC TC TGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGTGAAATTGTTAAG C AAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAAACCATGATCTTA TC TTCTAACGGCTGCTATAATCGCCAGGTAATATGGTATTCACCATCTGCCATTTTCTTGTG TT CTTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGT T GGATCCGGTTTTGTAATCCTGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATG AC TGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAATTTTAATT TC CAATAATCAGTAATCTAATGCTTCCTTCACTTCCATGTTACCCGTGCCTTGTTTTTACAT TGA TATGTGTTCATCCCGTAATAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACT G CTAAAAGGAATAACAAGATGAAAGTAGTTTCACTGATAAGAGATAAGACTGGATATGTTG TC AGCCCAGATGCGATGTTTGATGTGCAGGTATATTTTGGACAAGAAACATCTGTCTTCTGT TA AATATGGTTTAACTCGGATCTTGGTACTTGACACTTAGGTGAAAAGAATACATGAATATA AG CGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGGTACAAGAAGATGAAAGAAATGAGT G CAAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTG C CACTTATGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAA TT ATGATCCTGATATTGGAGATTTACTGAAGGTAAATTTGTGTCTTCAATGGCACTTATTAT TCT TAATTTTCATGTTCTGTCACTGGGAATTTGACTCCGACTAGGGGTTACTGAAAAGCAGTG AA TGTTCCGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGTCGGTGCTTGTAGAA C CGAGAAACTGAATAGCCACAGTATTTTTTAGAGAACAATAGCCACAATTGTTTTGACCCT TC TATTGCTATCAAGCATCTGATGCCGATTTTACTGATAACTAGATGAGGTAAATACAAGGT GA CAGGTTAAGTTCTGAGGATTTTTGTTTGTTATACAGGTTGTATTTGTCCCAGATTATAAT GTT AGCGTTGCTGAGAAGCTCATTCCTGCCAGTGAATTGTCTCAGCATATCAGGTAATATTCT G GCTACTCTTATATAGTATGTGAATACATAGAAAAGGAGTCTTACCTGGAGTTGGCTGATG TG GAAATTTTCATGTGCCGTTTTGTTTGCTATAATATATACCTGGAGTCTTGCTGGGATAGT GC CATCCAGAAGTGCCACGCCAGCTTCTATTTCTGTACAGGATGTGCATTTTTAACACTATT AC TATCCATTCAGTACCGCTGGAATGGAGGCTAGTGGAACCAGCAACATGAAGTTTGCAATG A ATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGGTTG G AGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGTTTGAGGCAAGA G AGGGCTGAAGGAAAGGTACCATCACTTTGCAAACATCTCCAAATTCGTTCCTCCAGTATG T GAAACAAATATCCACTTGTTTATATAGCTGTGCCTACTATCAAACTACTCCCTCCGTCCA TA ATATAAGAGTGTTTTTTACACTACACTAGTGTCAAAAACGCTCTTATATTATGGGACGGA GG GAGTAGTAAAGATCTCTCCCAAATGTGTGCTGTCCAAATAGTTTAGCTGAAGGAAAATTA TA AATCATTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAGGAA TA CGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCTTTGGAAGG A AATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTACATT GA ATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGACTC T GTTTGTTTTCTGGCTGAAATGCACACAGTGATTATGCAAACTTTCTGATATGTGATGGCT GA CATCAGTGTTTCACTGTGAACGACTTCACTGATTCGTAGGAAGGAATATTTCTTAGTTCA AA GTTTCAAACTGAAGTATCTATCGGTGTGTTGGAAGATCATAAACAATCAGAGAATATAGT TG CTCGATGTGCTATATTACTCATCTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGATGA AC CTAGCTAGACCTTCGGGAAAGTGCTTCTATGCATTCCAATAAAAAAAACTTATATCTTAA CA AAACAATTAGGTTGAACCTCGCCAGGGGGCTTTTTGTTATCTGTTTCCTGAAATGAAAAT CA GAAGCGGGGAGCTCCTCAATTTGAAAAAAAAATCAAGGGTAACTGAGGGTCTGGTTTCCC T CTGTTGGAGCCCCCAAGAATTCTGTGGTCTTATTTTTGTAATAGTATAAGATGCACTGCT CC GTTAAGTTTCTCCTTCATCCATTTGCTTTTATTGCTTCGTGACATTTCGTGTAACAGCTC AGA TTACCATGCATCCCATTACACATTATTTTGCTACTGTGGATATTTCCACTCTGTTTCATG CTT GACATTTTTAATTCGTTACATCGTGACCAACTAAACATAGTCTGATAGATTTTCAGAGTT TTG ATCCTCCTAATCACATAATATCCGTTTCTACCTGCAGCTATGGACAAGAATGTCTATCCT CA ACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTACGCCAAGGACA TATGGGATATCAGCCCTGTCATCATGCCCTGAATAATTCAAGCAGCTTCCAGGAACCTTG A GAGTCTATGAGGCGCGATTGGTCGCAGATTTGCCTCGATTCAGCGAGAATGTATTGCTCC C CCCAGTTTTGTATAGGCATATGCAAGTTCTGGCATGGACCTGAGTGCTGAGAGCATGCAG GGTAGTTCATCCTTAGCAATCTTGCATGTCAGTTGGCCAGTATACCCAGGCTTATGGTGT G TATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAACATACTTTGGTGTATCTGTA AT GATAAATAAAGGGTAAATGACTGAACAAAAAGTGTCGTTGGCGCACACATGTAGTAGAAG C ATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCCCCAGCTTTGAAATGAGAATGCAG A AGTTGGGAAAACATTTGTTTACATCCAAAGTGCAACGGGATTCGTCAAGAGTTCAGAGTC T TCCATCACGGCCATAGAGCATGTCCAATAGCAGTCCTATAATAGAGCAGTAGCAGTGGTC C CTCATATTTTGAACCAAAAACGCAGCTCCAACGGCTGCCCTATTTGTAAAATTTACTTCA AA ATTTGTTGGACAGCCCGAAAATGCAACACCAGCTGATGTAAATGTACAACACTTCGGCTG C TGCCCTAAAATTTGCTTCAAGTGGCGAACCACTCAAGACCACAAGAAAACAGTTGTTTTA C GTTACTTTGTGCCAAAAACGTTGCTCCAACAACTGCGCTATAATAGGCCACCCTAATGTA TT TTAGGGCTGTTGTTGGAGATACATATTTTCGGGTGCCTAAAAATAAAATCATTTTTTGGT GC CTTACGTTTTACGCCAACCACTCTTTTGTGCCCTATTATAGGGCTGCTATTGGAGATACT CT AACAGTTCCATTTTTCCATAAAAGTTTCTGATTTGGAGGGAGTTGAGCAAAAAACAAAGT TT TTGTAAACGTGAATTTTTAGTGGCCTTTATAGTTAGCCCAACTCCCCCTCCACCTAA
SEQ ID NO: 10 >Wheat (hexapioid) PHS1 5D (Chinese Spring reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGGCGGC G
GCACCGGAGGAGGAGGAGCGCACGCCGCGGGCTGGGTCGCGCCGAGGCGGGGGCGCC
GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCGGCCTCGA C
CGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAA TATC CAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCCTAAAGGCT T ACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAATGCAACATATG AC TATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGGA AG AGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAA A CAACTTGGACATAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAAT G
GTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATC CAGC ATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATGG T CAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAAT GATGTCTCTTATCCTGTGAAATTCTACGGCAAAGTGGTTGAAGGGACTGATGGGAGAAAA C ACTGGATTGGAGGAGAGAATATTAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACA A
GACTAAAACTACCAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAATTT TGATCT GGGAGCTTTCAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTAAATGCTGAAAA GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCTTGAA A CAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C
TCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATATAAAGGGATT GAGC TGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAACCACACGGTACTT C CTGAAGCCCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATCATAGAAACAATAGATGAGGAGCTGATGAACAACATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC
AGCCTCTGTGGCTAAACTGTTTGTTAAAACAAAAATGAAAAAGGGGAAGTTGCTTGT TGAAT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGCCAGAAGAAGAGGAAAACATTC T ATCCGAGACAACAGAGAAGAAGGGCGAGTCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGT T GTTCGAATGGCAAACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATT C
ACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTA AGTTC CAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTA A GTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTG CA GGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGG A ATAACAAGATGAAAGTAGTTTCACTGATAAGAGATAAGACTGGATATGTTGTCAGCCCAG AT
GCGATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTGCTAAAT ATCC TTGGTATCGTTTACCGGTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGA G CTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTATGTACAGGCAAA G AGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTGGA GA TTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGAAGCTCATTCC TG
CCAGTGAATTGTCTCAGCATATCAGTACCGCTGGAATGGAGGCTAGTGGAACCAGCA ACAT GAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGAT CA GAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTG G TTTGAGGCAAGAGAGGGCTGAAGGAAAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAA GGAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCTTT G
GAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCC AGTTA CATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACAAGAAT G TCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTAC GCCAAGGACATATGGGATATCAGCCCTGTCATCATGCCCTGA
SEQ ID NO: 11>Wheat (hexapioid) PHS1 5D (Chinese Spring reference genome) protein
MATASPPLATAFRPLAAGGGTGGGGAHAAGWVAPRRGRRGFWRSVASDREVRGPAST EEE LSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWNATYDYY NKVNAK QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGHNLEDVASQEPDPALGNGGLGRLA SCF LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPVKFY GK VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAGDHA KAN EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDFPS KVAV QMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSLDIMQKL LPRH VEIIETIDEELMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFVKTKMKKGKL LVESLES IAEADEKTEPEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFAKYDPQLPRVVRM ANLC VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSAIISK WIGS
DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYWSPDAMFDVQ VKRIH EYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYD
PDIGDLLKVVFVPDYNVSVAEKLIPASELSQHISTAGMEASGTSNMKFAMNGCILIG TLDGANVE
IREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGVFGTSNYDE LMGSLE
GNEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTI HEYAKDI
WDISPVIMP
SEQ ID NO: 12 >Wheat (hexapioid) PHS1 5D (Chinese Spring reference genome) promoter (- 2kb upstream of ATG)
TATGTTAGTCCAAATAAATATCATCTAAAATTGTGCCTAAACCATATACAAATGATG TAAATG TGGACATAATACTTCATAAAGTATAGATATGTTTGAGACGTATCAGGAGCGTCGACCTTT TT
GGGAAGCCAAGTGAGCAAAGCCTCATCAAGTCTTGGGAATCCACGGTCATTGAGGGA GAA AAATTTATCAAAGGTCTCACACATACCCCCTTTGATGATGGACCAGCAGGACCGCAGTAA T
TTGGCCGTGAAGCCATCTAGCCCCGTCAACTTGCCGGAAGGGAACCTCTTGACCGCC TCC TAAATTTCATCCTCAGTGAAGGGTTGCTCAAGTGCAGGGAGGTCAAAGTGATGCGGGTCG
GTACTATGGAGATCAAGGGAGAGAGTCAGTGCTCAAAAGTGTCAAGAATGGAAGAGA AGT GATCATATGCCGCCCTGGCCATATCGGGCTCGCCAGAATGTGCCCACCATCAGACTGCAC
CCCTAGGATTATTAACTTCTGTCGTCGGTGCACAGAATGGATCTTGAAGAAGGTGGA ATTT GTGTCGCCCTCACGTAGCCAGGAGAACCTGGCAGGCAGATGTGCAATCGACCTCTCCACC
GAAGCAAACCCTAGGTAGGCGCGCTTCAGCTCGCGGAGTAACCATACTTCTGGAGGA GCA AGCAGGCGGTGATCTTGAGCCGCATCGAGATGAGCGGTCAACTCCCGAGCGAGAAGCAT
CTGAGTGGAGATGTTACCAAGCGTTCTAGACCTCCAACTCTACAGTCAGCGCATTGT CAGC TTGAGCTTGGCGTAGATCCCCTGAAAATGATTGGGGTCATGCACGCCAGAATTCCAGGTG T CAGAGACCACCTAATGAAAACAATCGAGCTTTGGCCAAAATAGCTCAAAATGGAATCTCC G
ATCACCTTGGGAGTGAAAATGTTACTAGTTTCGTTGGAGGCCAAGGTAGTTATGTAT TTGTG TGTGCAATTACTAATCATTGGCCCTTAGCAAACTCCTATTGGTTCAAACCGTTTCACTTT GA
AAAAAATGCTACTTGCTACAAACCGTTTCACTTGGAGGGGCGTCTTTTCTAGTTGAG AGGG AGCAAAAGGCGTATCAAGACGACAACGCCACCCTCGGCAATGTACAAGTCCTTCCCCGTC
CTCGTTTCGATGCTGCTTTTTGTCATCGCAGCGAGGCGTGTAGAGGGTGGTGTGTCC CCA GATCTAGTTATCTGGGTTGGGGTTTTTCTTTGGCAGGTTTATCTCAGTGGTAGTGTCTCG TG
AAAGCGGATGACGCGACTATGTGTCTTGGTCCTCCACGCTCTAGTTTCGACAGGTGA AGG TTGGCCAGCTTCGGCCTCGTCGGGGAGCGTATGAGATTTGTGTTCCTTTACTTCGTCTGG C
CAGACTTCCCGACCGCCGCTTCTACAAGCTCCTTATTTCAAGCAAGTTTTCTTCACC GAGAT GGAAGTCCAGAGGCGGCAACAGCTTTGCTCACGGCGCAGTGGCGGTGGATGCAGGAGAA AGAAGACATTCAACACCCCTAAACTATGAAGATGTAATTTTTTATTTTTGTAAGTGCTTT TGT GAGGACTTGTGCTACTTTTAATATGTGGCCTTTTGCCTTTTTCCAGAAAAAAAAAACTAA AC CATTACACGAGGAGAAGAGCCATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACA G CACGGGGGCAGTTCCCAGCCGTCGAAATGCGCTGCAACGGAACTAGTAGCTCTCCCTGAC ACGGCGGCACATGAGCCACGCCTTGCGCCGTGCACCGGCGGTGGGCCCACCGGCCAGT GCCCCCACACGTGTACCGGCGGCCAGCATTGCACGGCACGGCGACGGAGAGATATTTTTA ACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGGGCCGCGTGAC TCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCACCA
SEQ ID NO: 13 >Wheat (hexapioid) PHS1 5A (Cadenza) genomic DNA
GGCACGGCGACGGAGAGATATTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACT CACT
CGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGC A
CACCACCGTCAGCGCGACCACTGCCGCCAATGGCGACCGCCTCGCCGCCGCTCGCCA CC
CCCTTCCGCCCGCTCGCCGTCGCCGGAGGTAGCGGCGGACTCGTCGTGGGCGCCAGG G
CCGTGGTACCGCCGAGGCGGGGGCGGCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCG
ATCGGGAGGTGCGGGGGCCCGCCCCGACCGAGGAAGGTAAGCGGCCGCGCGCTCCCA T
TGGCTGCCACGACCTGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGT GCG
ACCTGAATTTGTGATCCTCTGACTGACGGCCTATGTGTTCCCCGGGTTTCGGGCTGT TGAT
TTTGTGCGCGCGCATTGCTAATCTACTTCTCTAGGATTTGTTAGGGCTCAGAGCAAT CTTAA
ATTCGCGAATACACTTCGACGAATGTAGTCTGAATTTACTGAAACTAGCTTAGTTTG GCGCG
TCAAGATAGACTGATGTATTTATGGTAGCAAATTCTGAAAACTTCTTCTGTGCGTGG ACAGC
CACCCCTCGTGGATGATGCAGCAAGTGCTAACACAGTGTTTGCTAGTACTTGCTAGT ATCT
GTAGCTGTAACAAAATCCTTCGAATCTGAACTAGTGAGATAAATCGTTTGAGATTTT TGCTG AAGATTACACAGTATCAATATTTCTGAAGGTGATTGGAACAGTATAGCATTTCAAACTGG CA
ACCAAGCAGGATCAGTATACTGACGCAATTTTTTTCTGAAGGTGGTTGGAATACCGG TATG
AATTTTATAATCATTGAATTAAGGCGTAAGATTTGGTAACAAAAACTGCTTTTCACG GGAAA
CGATGAAACATGGCACAATTCAACCTGACTTTCTTTTATGAAAAGGCTCTCAAGCTG CCTTG
TACTCCCTCCGTCCCATAATATAAGAACGTTTTTGACATTAGTGTAGTACCAAAAAC GTTCT
TATATTATGGGACAGAGGGAGTATTTATTAATGGATAAAAAGAGAGGTACAGGGTTC TAGTA
GTAACATATGCAAATGGTGATACAGTATATTATCTACGATTTTGTGCAAGAATTTTC ACACG
AAAACAATATTCCATGCAGTTTTTAGTTTTGACTTTTTTTTTTGAATTTTCACGCGC GCGGGG
GGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNCAGAGAGAGAGGCAGGGGGCGACCCCCCCATGCTATGTTACA TG
CAGCTTTGTTCTACCCACGCAGAGCCTGTAGCCAGTTTTGACTCTTTTTGAATTGTT TGGAT
TGCTTACCACATGTGTCACGTGGTTTTAGTTTTGACTCTTGACTGTTTAATACCTAC TCTTTT
TAAGATTTTGGATTCTAATGTCATCTGTCTTGCAACTCAGAGCTTTCAGCCGTGCTA ACTTC
CATTGATTCATCCGCCATCGCATCAAACATCCAGCACCATGCAGACTTCACACCGTT ATTCT
CGCCAGAGCACTCTTCACCCCTAAAGGCTTACCATGCAACTGCCAAAAGTGTTTTTG ATTC
TCTGATATTGAATTGGAATGCAACATATGACTATTACAACAAAGTGAATGCAAAGCA AGCTT
ATTACCTATCCATGGAATTTTTGCAGGTCTCGCAATATACTTTCTTCAGCCTTATCT CATACC
TTTTGGTAACTGCGTTATGATGTTATATGTTTTTTTTTGTACTGCAGGGAAGAGCTC TTACAA
ATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTG GACA
GAACCTAGAGGATGTTGCTAGTCAGGTAAGCCATTTTTGTCAGAGAATGGTTTTTAG TCCAT
ATCAAAATTATTGCCGAGCCATTTTTCTAAATAACATTTTATTATTATTCATGAACT TCCATAT
TTTTTTCAAATCTGCAAAAGGAGAAGTTTTTTTCCTGGGGGTTCCACCATCTGACTT ATATTT
GTTATGCAGTTTTCAGAGTGATTGACTATGAAATTACGCCAAAACACGAGCATTGCC TAGA
GTGCCCTGTAGTTTTAGAAATGTTAAGATTGGATATGTTACTTTTCTATTACAATGA AACAGG
AACCAGATCCTGCCCTTGGCAATGGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGG ATTC
TATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCT CTTT
AAGCAAATCATAACAAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTA GTG
ATATGTCTTCTACTTGTTTACTGTTTGGAAGCGTTGATTATGTTGGTTTCTGTTTTA ATGATC
TGCATGTTAGCTTCTTTTGCCATTTATAACTGATGCTGTATCTCCAGATGGGAAATC CATGG GAGATTGTAAGAAATGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAGGC AC TGATGGGAGAAAACACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCC
TATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGT ACCAT CACAAAACTTTGATTTGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAGCAC A
TCTAAATGCTGAAAAGGTATTCCTTTGTCTGCATTTATTTTGAATATAAATCATTAA ATCTCAA
TAACTATTGATTTAAATGGTAAACACTAAACATTGTCTAGCTAGTAGTCACTGTGGT GTCCC
TCAATGTCATTTTAGAATGTGCACAGTCAAATGTTTGATAACTTTGATTGTAATAAT ATGATA
ATATACAGAAACTTTGATTGACACTAGATTTTTCATAAACATACTTATTGTATTTCC ATGTAAA
TAGAAAAGCTGATGTTCAGAGGTTTGTGTGTCAAAATGAACAGAGTTTGTGCACACA TAATA
TCTTCATCTGGTGTTTGCAGTTCTCTAGGTTGTGTTTGCGTTTTCTCCTGCTCTGCC TTAAC
TAATCTCAGCATATGCATCAGGCTACATTGCTCCTCTGTGTCTATAAAAAATAAGTC TGATG
ATATATGGTTGCAGTCCAGTAGTTCGTAGATGGCACAGCATGGAAGAACTTTTCACA TGGA
AGCCTTACCCTTATAATTTTAATTATACCTTTTGAGTGTTTGTGCATTGTGTACATA CACCAT
TAAGTATGGGTGCTCAAAAATGGTTTATCCCAAACTGTGCAAAAGCCGCCACTGTTT GTGT
ATTAATTTTTTTGAACTAGCTAAACAACTTGACCGAACAAGATACATACAATTGCAC TCACTT
CTGAAGCAGCTCAGTTGGACGGATTTGTTCCAATCCCGTGCATGGGTTAAATTGTCG AGTT
GTCCTCGCTTGTTCAGTAGTAGGAGTACTATCTATTATATTTATACTAAAGTTTTCT TTTTGT
GATACAGTACAAACTTTTAGTAATACTTACATTTGCGACCACTGTGGACAAAATTTA TTATGT
CTAACTTAACTTTATCACTGGTCTTCTCATCATGCCAGATATGCCACGTATTGTATC CAGGG
GACGAATCATCAGAGGGGAAAATTCTTCGCCTGAAACAACAATATACACTATGCTCG GCCT
CCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGG AGGA
CTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACACCCATCCAACACTGTGCATTCC CGAG
TTAATGAGAATACTGATGGATGTTAAGGGATTGAGCTGGAATGAGTCTTGGAGTATC ACGG AAAGGTATCACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCAGTTCCAGTT AA CTAATTTTATTTCCTCATCTGTCTTCCATTCAATCATGTACTGATTTTCTGATATCCACA TCAT
TTGTAGAACCGTTGCATACACTAACCACACAGTACTTCCTGAAGCTCTGGAGAAGTG GAGC
TTGGACATTATGCAGAAACTTCTACCTCGACATGTTGAGATTATAGAAAGAATAGAT GAGGA GGTATCAAATGTGCAGATAATATTATACAATTAATTTCATGCTCTGTAGTTCAACTTATG GTT
TACTTGTCTTCATCGTTGATTTTAAACAGCTGATGAACACCATCGTCTCAAAATATG GAACG GCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGAT CT
TCCAGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCT TGTTG
AATCTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAA ACAT
TCTATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGC AGAA
AAGGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCT AGAGT
TGTTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGA GATT
CACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCATGA TGATT
TGGAAACCGTGATCTTATTTTCCAATGGCTGCTATAATCGTCAGGTGATATGGTATT CACCA
TCTGCCATTTTCTTGTGTTCTTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAAC AAATG
GAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTAAGTACAATAATTT CAAAA
TGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAG GTAC
TTCCAGTAGTTTTATTCCCCAATAATCAGTAATCTAATGCTTCCTTCACTTCAATGT TAACTG
TGCATTGTTTTTACATTGATGTGTGTTCATCCTGTAACAGTTTGCTGATGATGAGGA TCTGC
AATCAGAATGGCGTACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAA GAGA CAAGACTGGATATGTTGTCAGCCCAGATGCAATGTTTGATGTGCAGGTATATTTTGTACT AG AAAACATGTGTCTTCTTATAAATATGGTTTAACTCAGCTCTTGGTACTGGACACTTAGGT GA
AAAGAATACATGAATATAAGCGGCAGCTTCTAAATATCCTTGGTATCATTTACCGCT ACAAG
AAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGC ATATT
TGGCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCAC AGAT
GTCGCAGCTACTGTAAATTATGATCCTGATATTGGAGATCTACTGAAGGTAAATTTG TGTCT
TTGATGCCACTTATCAGTCTTAATTTTCATGTTCTGTCACTGGCAAGTTGACTCCGA CTGAA
AAGCATTGAATGTTCAGAGTTTCAAAGGAAAACAGAGCAGTGAATGTTCTTAATGTC GGTG CTTGTAAAACTGAGCAACTGAATAGCCACAGGTTTTTTAGAGAACAATATAGCCACAAAT TT TTTTGACACTTTTATTGCTATCAGCATCTGATGCTGATATTACTGATAACTAGATGAGGT AAA
TATAAGGTGACAGGTTAAGTCTGAGGATTTTTGTTTGTTATCCAGGTTGTATTTGTC CCAGA
TTATAATGTTAGCGTTGCTGAGACGCTCATTCCTGCCAGTGAATTGTCCCAGCATAT CAGG
TAATATTCTGCCTACTCTTATATAGTATGTGAATACATAAAAAAAGAATATTACCTG GAGTTG
GCTGACATAGAATTTTTCATGTGCCTTTTTGTTTGCTATAATATGTACCTGGAGTCT TACTGG
GATAGTGCCATCCAGAAGTGCCACCCCAGTTTCTATTTCTGTATAAGACGTGCATTT TTAAC
AGTATTACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACATGA AGTT
TGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAG AGAA
GAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGT TTGA
GGCAAGAGAGAGCTGAAGGGAAGGTACCACACTTGTTTATATAGCTGTGCCTACTAT CAAA
CTAGTAAAGATCTCTCCCATATGTGCTGTCCAAATAGTTTAGCTGAAGGAAAATCAT AACTT
TACAAGTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAG GAATA
CGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTTCTTTGGA AGGA
AATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTAC ATTGA
ATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGA CGCT
GTTTGTTTTCTGGCTGAAATGCTCACAGTGATTATGCAAACTTTCTGATATGTGATG GCTGA
CATCAGTGTTTCACTGTGAACGACTTCATTGATTCGTAGGAAGGAATATTTCTTAGT TTAAA
GTTTCAAACTGAAGTATCTATCGGTGTGTTGGAAGATCATAAACAATCAGAGAATAT AGTTG
CTAGATGTGCTATATTACTCATCTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGA TGTAG
CTAGACCTTCAGGAAAGTGCTTCTGTGCATTCCAATAAAAAAATAACTTATAACTTA ACAAAA
CAAAACAATTAGGTTGAACCTCACCAGGGGGCCTTTTGTTATCTGTTTTCTGAAATG AAAAT
CAGAAGTGGGGAGCTCCTCAATTCGAAAAAGAAAAAGAAAATCAGGGGGTAACTGAG GGT
CTGGTTTCCCTCTGTTGGAGCCCCCAAGAATTCTGTGGTCTTATTTTTGTAAGAGTA TAAGA
TGCACTGCTCCGTTAAGTTTCTCCTTCACCCATTTGTTTTCATTGCTTCGTGACATT TCGTGT
GACAGCTCAGATCACCATATGCATCCCATTTGTACACATTATTTTGCTACCGTGGAT ATTTC
CACTCTGTTTCATGTGTGACATTTTTAATTCGTTACATTGTTACTCTAAACATTAGT CTAATA
GATTTTCAGAGTTTTGACCCTCCTAATCACATAATATCCGTTTCGACCTGCAGCTAT GGACA
AGAATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATC CAC
GAGTATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGAATAATTCAA GCAG
CTTCCAGGAACCTTGAGAGTCTATGAGGCGCGATTGGTCGCTGATTTGCCTTGATTC AGCG
AGAATGTATTGCTCCCCCCAGTTTTGTATAGGCATATGCAAGTTCTGACATGGACCT GAGT
GCTGAGAGCATGCAGGGTAGTTCATCCTTAGCAATATTGCATGTCAGTTAGCCAGTA TTAC
CCAGGCTTATGGTGTGTGTATTTCGCTGAATAAAGTTGGAGCAGAGTCATCAAACAA ACAT ACTTTGGTGTATCTGTAATGATAAATAAATGGTACATGTGACTGAACAAAAAGTGTCGTT GG
CGTGCACATGTAGTAGAAGCATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCC CAG CTTTGGAGTGAGAATGCAGAAGTTACGAAAACATTTGTTTACATCCAAAGTGCAACGGGA T TCGTCAAGAGTTCAGAGTCTTCCATCACGGTCATAGAGCATGTCCAATAGCAGCCCTATA A ACCTATAACAGAGCAGTAGCAGTGGTCCCTCACATTTTGAACCAAAAATGCAGCTCCAAC G GCTGCCCTATTTGTAAAATTTACTTCAAAATTTGTTGGACAGCCCTAAAATGCAACACCA AC
TGATGTAAATGTACAACTATTCGGCTGCTGCCCTAAAATTTGCTCCAAGTGGCGAAC CACT CAGGACCACAAGAAAACAGTTGTTTTACGTTGTGCCAAAAATGTTGCTTCAACTACTGC
SEQ ID NO: 14 >Wheat (hexapioid) PHS1 5A (Cadenza) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCCCCTTCCGCCCGCTCGCCGTCGCCGGA G GTAGCGGCGGACTCGTCGTGGGCGCCAGGGCCGTGGTACCGCCGAGGCGGGGGCGGC GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCCCGA CCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAAACA T CCAGCACCATGCAGACTTCACACCGTTATTCTCGCCAGAGCACTCTTCACCCCTAAAGGC T TACCATGCAACTGCCAAAAGTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACATAT GA CTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCAGGG AA GAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAA A ACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAA T GGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCA G CATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATG G TCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAA TGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAA A CACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTAC A AGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAACTTTG ATT TGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAGCACATCTAAATGCTGAAA A GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAA A CAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C CCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAGGGATTGAG C TGGAATGAGTCTTGGAGTATCACGGAAAGAACCGTTGCATACACTAACCACACAGTACTT C CTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATTATAGAAAGAATAGATGAGGAGCTGATGAACACCATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAAACATTC T ATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCTAGAGT TG TTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTC A CAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAAGTT CC AAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTAA G TACAATAATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTGC AG GACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGGA A TAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCCAGA T GCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAATATC CT TGGTATCATTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAG C TTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAG A GGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTGGAG AT CTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCATTCCT GC CAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACAT GAAGTTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGAT C AGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCT G GTTTGAGGCAAGAGAGAGCTGAAGGGAAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTA AGGAATACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTTCTT T GGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAG T TACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACAAGA A TGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGT ATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGA SEQ ID NO: 15 >Wheat (hexapioid) PHS1 5A (Cadenza) protein
MATASPPLATPFRPLAVAGGSGGLVVGARAVVPPRRGRRGFVVRSVASDREVRGPAP TEEEL SAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLILNWNATYDYYN KVNAK QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNGGLGRLA SCF LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPLKFY GK
VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAG DHVKAN EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDFPS KVAV QMNDTHPTLCIPELMRILMDVKGLSWNESWSITERTVAYTNHTVLPEALEKWSLDIMQKL LPRH VEIIERIDEELMNTIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEKKGKL LVESLESI
AEADEKTESEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFTKYDPQLPRWR MANLC VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSTIISK WIGS DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYWSPDAMFDVQVKR IH EYKRQLLNILGIIYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYD
PDIGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCILIG TLDGANVEI REEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGTGNYDELMGS LEG NEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIHEYA KDI WGISPVIMP*
SEQ ID NO: 16 >Wheat (hexapioid) PHS1 5A (Cadenza) promoter (-2kb upstream of ATG)
CACCACGCAGCCTAGGGGCACCGTCTGGCCATCTGGAGCCCCGGTGGCTCCCTTGGA CC
CCATCTTCTCGCTAGTGGGGCCTTGGTCCAAAATAATCCATGTAATTTTTCATGTTT TCCGG
CTTCGTATTTTTTTGGTAAAAACCCAAAAACAATAGAAAAGAAGAATTGGCCTCTGA CACTA
GATTGATAGGTTAGTCCAAATCAATATTATCAAAAATTGTGCCCAAACCATATACAA ATGATA
TAAATATAACTAGAATACTTTATAAATTTAGATACGTTTGATAAAGTGCCAAGAATG GAAGAG
AAGTGATCATATGCCGCCCTGTCCATATCGGGCTCATCAGAATGTTCCCACCATCAG ACTC
CAGCCCCAGGATTATTAACTTATGTCGGCGGTGCGCTGAATGGATCTTGAAGAAGGC GGA
ATTTGTGTCGCCCTCACGTAACCAGGAGAACTTGGCAGGTTGACGCGCAATCGACCT CTC
CAGCAAAGCAAGTCCTAGGTAGGCGCGCTTCAGCTCGCAGAGTAACCGTACTTCTGG AGG
AGCAAGCGGGTGGTGATCTTGAGCCGCCGAGACGAGCAATCAATTACCGAGCGAGAA GC
ATCTGAGTGGAGATGTTACCGAGCGTTCTGGAACTCCAACTCTGCCGTCAGTGCATT GTCA
GCTTGTGCTTTGCGTAGATTCGCCGGAAAGGATCGGGGTCATGCACGCCAGAATTCC AGG
TGTCAGAGACCACCTAATGAAAACCATCGAGCTTTGATCAAAATAGCTCAAGATGGA ATCT
CCGATCACCTTGGGAGCGAAAATCTTACTAGTTTCGTTCAAGGCCAAGGTAGTTATA TATTT
GTGTGTGCAATTACTAATCATTGGCCGCTTAGCAAACTCTTATTGGTTCGATAAACG TTAAA
GTCATCCACTTTGAGAAAAAATGCTACTTGCTACAAACCCTTTCACTTTGGGAGGAG GGGG
GGTGGGTGGGGCAACATGTATCAGGATGATGGTGCCACCCTTGGCAAGGAATAAATC CTT
GCCGCCTCGTCCTTGTTTGGATGCTGCTTCTTGTCATCTCAAGGAGGCGTGTGGAGG GTG
GTGTGTCCCCGGATCTTGTTCTCTCGGTTGGGTTTTTTCTATGGCAGGTTTATCTCA GTGGT
AGTGTCACGTGAAGCAGATGACGCGACTATGCGTCTTGGTCCTCTGCGCTCTAATTC CGG
CAGATGAAGGTTGACCAGCTTCAGCCTCGTCGGGGAGCGTATGAGATTTGTGTTTTT TGAC
TCCATCTGGTCGGACCTGTGGCATTATGTAGACCCTCTCATGTATTTTTTTCTCCTG GACAG
TAGTCTGTTACACAAATCGTGGTCATCGGATCTTCTAGTCCACATCGGCGACTTTTT CGACC
GCCACTTCTACAAACTCCTTATTTTCAACAAGTTTCCTTCATTGAGATGGAAGTCCA GGCAA
TGGCTTTGCTCACGGCACACTGCCGGCGGATGCAAGAGAAAGAAGACATTCGACACC CCT
AAAACTATGAAGATGTAATTTTTAATCTTTGTAAGTGTTTTTGTGAGGATTTGTGCT ACTTTT
AATATATTGCCTTTTGCCTTTTGCAAGGGAAAAAAACTAAACCATTACACGAAGAGA AGAGC
CATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACAGCACGGGGGCAATTCCC AGC
CGTCGAAATGCGCTGCAACGGAACCAGCGCTCCCCGACACGGCGGCACATGAGCCAC AA
CTCGTACCATGCACCGTGCACCGGCGGTGGGCCCACTGGCCAGCGTCCCCACACGTG TA
CCGGCGGCCACAATTACACGGCACGGCACGGCGACGGAGAGATATTTTAACGCCTTC CAA
TTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCT CGC
GTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCGCCA
SEQ ID NO: 17 >Wheat (hexapioid) PHS1 5B (Cadenza) genomic DNA
TTGCACGGCACGGCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGAAGCTCCCC ATCA
CTCGCTCGGGTATCAGAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCT GC ACACCACCGTCAGCGCGACCACTGGCGCCAATGGCGACCGCCTCGCCGCCGCTCGCCAC CGCCTTCCGCCCGCTCGCCGCCGCCGGCGGCGCCGGAGGAGGAGGAGCGCACGCCGTG GGCGCCGCGGGCCGGGTCGCGCCGAGGCGGGGGCGCCGGGGCTTCGTGGTGCGGAGC GTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCTCGACCGAGGAAGGTAAGCGGCCGC GCGCTCCCGTTGGCTACGACGACCCGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTT CGCTGGTGCGACCTGAATTTGTGATCCTCTGACTGATGGTCTATGTGTTCCCGGGTTTCG G GCTGTTGATTTTGTGCGCGCGCATTGCTAATCTACTGCTCTAGGATCTGCTAGGGTTCAG A GGAATCTTAAATTCGTGAATAAGCTTCGACGAATGTATTCTGAATTTACTGAAACTGGCT TA GTTTGGGGCGTCAAGTTAGAATGATGTATTTATGGCACAGTAGCAAATTCTGACAAATTC TT CTGTGCGTGGACAGCCACCCCTTGTGGATGATGCAGCAAGTGTCAACATACTGTCGCTAC TATCTGTAGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGTACAGTATATTATCATT TTA CAAAGAGTAGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGGTTGCCATTGAGGAGT A AGTAGAATGCAAATGGTGATACAGTATATTATCTACCATTTTGTGCAAGAATTTTCACAC GG AAACAATATCCCATGCAATTTTTAGTTTTGACTCTCGACTGTTTATACCTACTCTTTGTA AAA TTTTGGATCCTAATATTGTTGTTCTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCAT TG ATTCTTCCGCCATCGCGTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCAC C AGAGCACTCTTCACCCCTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCT GA TCATCAATTGGAATGCAACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATT AC CTGTCCATGGAGTTTTTGCAGGTCTCACAATATACTTCCTTTAGCCTTATCTCAATACCT TTT GGTAACTGCGTTATGATGTTATATGGTTGTTATCTGTGTACTGCAGGGAAGAGCTCTCAC A AATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTGGA C AGAACCTAGAGGATGTTGCTAGTCAGGTAAGTGATTTTTGTCGGAGATTGGTTTTTAGTC C GTATCAAAATTATCGCAGGGCCATTGTTTTTATTTTGAAAACGCATTTTATTATTAATCA TCA AGTTCCCTTTGTTTAAAAACTATCTGAAAAGAAGAAGTTTTTTTCCTGAGGGTTCCACCG TC TGACTTATGATCTTTATGCAGTTTTCAGAGTGACTATGAAATTACGCCAAAACACGATCT TAT TGATGCCTAGAGTGCCCTGTAGTTTAGAAATCTAAGATTGGATATGTTACTTTTCTATTA CAA TGAAACAGGAACCAGATCCTGCCCTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTT TTCTGGATTCTATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGAT AT GGCCTCTTTAAGCAAATCATAGCAAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTA GAGGTAGTGATATGTCTTCTACTTGTTTACTGTTTGGAAGTGTTGATTATGTTGGTTTAT TTA TGTTCTAATGACCTGCATGTTAGCTTCTTTTCTATTTATAACTGATGCTGTATCTCCAGA TGG GAAATCCATGGGAGATCGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAG TG GTTGAAGGCACTGATGGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCA CATGATGTTCCTATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCA AC AACAGTACCATCACAAAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGC C AATGAAGCACATCTAAATGCTGAAAAGGTATTCTTTCGTCTACTTTATTTTGAATATAAA CCA TTAACTCTCAACAACTACTGATTCAAAGGCTAAACAGTGCTTTGCTAGTAGTTACTGTGT TTT CTCTTAGATGTCATTTTATAATGTGTGCAGTCAATCGTTTGATATACACAAACCTATTTG GAT TTCTCCTGTAAAAATGTTGTCACTAGATTTTTCATGAAAAAACTTATTGTATATGTAAAT AGA AAAGCTGAGGCTCAGGTGCTTTGTGTTCAAATGACAAGTAAAAATGAACAGAGTTTGTGC A CCCATAACTTCTTCACCTGGTGTGTGCAATTCTTTCGGTTGTATTTATGCGTGTTCTTAT TCT GCCTGAATGATTCTCAGCATATGCATCAGGCTACATTTCTCCTCTGTGTCAAGAAAAAAT AA GTCTGACCAGATGCAGTCCAGTAGTCTTTAGATCTTTTCACATGGAAGCGTTACTCTATG AT TTTTATTATACCTTTAGAGTGTTTGTGCATTGTGCACATATACCATGAAGTATGGGTGTT CAA ATGTGGTTCTCTCAAACTGTGCTAAATCCACCACTGTTTGCCAGTATTCATTTTTCTGAA CTA GCAACAACTTGACTGGACAAGATACATTGTACTCACATCTGAAGCTGCTGGGTTGGACAG A TTTGTCCGACCCGTCTTTGAACATCTCTACTATTTAGTGTGCATGGGTTAAATTGTTGAC TTT TCCTCGTTTGTTCAGTAGTAGGAGCACTATCTAATTATATTTATATTCAAGTTTTCTCTG TGT GTTACGTTATAAACTTTTAGTCATACTTACATTTGTGGACAAAAGTTATCATGTCTAGCT TAA GTTTGTGACTGATCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGAAT C ATCAGAGGGGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCA G GACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCC T CTAAAGTTGCAGTTCAGATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGA G AATACTGATGGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGTA C CACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCAGCTCCAATTAACTAATT GT ATTGCCTCATCTGTCTTCCATTCAATTATGTACTGATTCTCTGATATCATTTGTAGAACC GTT GCATACACTAACCACACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATG C AGAAACTTCTACCCCGACATGTTGAGATCATAGAAACAATAGATGAGAAGGTAACAAATG T GCAGATAATATTATGCAATAAATTTCATTCTCTGTAGTTCAACTTATGGTTTACTTATCA ACA
TCGTTGATTTTAAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGAT ATTTC
ACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGC CTCTG
TTGCTAAACTGTTTATTAAACCTAAAGAGAAAACGGGGAAGTTGCTTGTTCAATCTT TGGAG
TCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGAGGAAAACATTCTATCC GAGA
CAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGG ATC
CTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGTTGTTC GAATG
GCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGT GAAA
TTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAA ACCAT
GATCTTATTTTCTAACGGCTGCTATAATCGCCAGGTAATATGGTATCACCATCTGCC ATTTT
CTTATGTTATTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAACGGAGTAA CTCCC
AGGCGTTGGATCCGGTTTTGTAATCCTGAATTAAGTGCAATCATTTCAAAATGGATA GGCTC
TGATGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGT AATTT
TAATTTCCAAATATCAGTAATCTAATGCTTCCTTCACTTCCATGTTACCTGTGCCTT GTTTTT
ACATTGATGTGTGTTCATCCCGTAATAGTTTGCTGATGATGAGGATCTGCAATCAGA ATGG
CGTACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACT GGAT
ATGTTGTCAGCCCAGATGCGATGTTTGATGTGCAGGTATATTTTGGACTAGAAACAT GTGT
CTCCTGTTAAATATGGTTTAACTCGGATCTTGGTACTTGACACTTAGGTGAAAAGAA TACAT
GAATATAAGCGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATG AAAG
AAATGAGTGCCAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTG GGAA
AGCTTTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGC AGCT
ACTGTAAATTATGACCCTGATGTTGGAGATTTACTGAAGGTAAATTTGTGTCTTCAA TGGCA
CTTATCATTCTAAATTTTCATGATCTGTCACTAGCAATTTGACTCCGACTAGGGGTT ACTGA
AAAGCACTGAATGTTCCGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGT CGGT
GCTTGTAGAACTGAGAAACTGAATAGCCACAGTTTTTTTTTAGAGAACAATAGCCAC AATTG
TTCTGATGCTGATTTTGCTGATAACTAGATGAGGTAAATACAAGGTGACAGGTTAAG TTCTG
AGGATTTTTGTTTGTTATCCAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGC TGAGA
AGCTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGGTAATATTCTGGCTACTCTT ATATA
GTGCTCCCTCCGTCCCAAAATTCTTGTCTTAGATTTGTCCAAATACGGATGTATCTA GTTAC
ATTTTAGTGTTAGATACATCCGTATCTAGACAAATGTAAGAGAAGAATTTTGGGACG GAGG
GAGTATGTGAATACAAAGAAAAGGAGTCTTATCTGGAGGTGACTGATGTGGAAATTT TCAT
GTGCCTTTTTGTTTGCTACTCCCTCCGTCCCATAATATAAGAGCGTTTTTGACAGTA CACTA
TTTTCAAAAATGCTCTTATAATATGGGACGGAGGGAGTATAATATATACCTGGAGTC TTGCT
GGGATAGTGACATCCAGAAGTGCCACGCCAGCTTCTATTTCTGTACAAGACGTGCAT TTTT
AACACTATTACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACA TGAA
GTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGAT CAGA
GAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCT GGTT
TGAGGCAAGAGAGGGCTGAAGGAAAGGTACCACACTTGTTTATATAGCTGTGCCTAC TATC
AAACTAGTAAAGATATCTCCCATATGTGTGCTGTCCAAATAGTTTAGCTGAAGGAAA ATCAT
AACTTTATCATTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGG TTAAG
GAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCT TTGG
AAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCA GTTAC
ATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATA ACTG
ACGCTGTTTGTTTTCTGGCTGAAATGCATACGGTGATTATGCAAACTTTCTGATATG TGATG
GCTGACATCAGTGTTTCACTGTTAATGACTTTACTGATTCGTAGGAAGGAATATTTC TTAATT
CAAAGTTTCAAACCGGAGTATCTATCGATGTGTTGGAACTTGGAAGATCATAAACAA TCAGA
GAATATAGTTGCTAGATGTGCTATACTACTCATCTGTTGGCTGTTGAGTGTTGATAC ATGAG
CTTTGATAAACCTAGCTAGACCTTCAGGAAAGTGCTTCTATGCATTCAATAAAAAAA TAACTT
TATAACTTAACAAAATAATTAGGTTGAACCTCGCCAGGGGGGCCCTTTTGTAATCTG TTTTC
TGAAATGAAAATCAGAAGCAGGGAGCTCCTCGATTCTAAAATGAAAAATAAAATCAG GGAT
AACTGAGGGTCTGGTTTCCCTCTGTTGGAGCCCCAGAGAATTCTGCGGTCTTATTTT TGTA
AGAGTATAAGATGCACTGCTCCGTTAAGTTTCTCCTTCATCCATTTGTTTTCTTTGC TTCGTG
ACATTTGGTGTAACAGAGCATCCCATTTGTACACATTATTTTGCTATCGTGGATATT TCCACT
CTGTTTCATGTGCGACATTTTAAATTCGTTACATCGTTACTAACTACACTTAGTCTG ATAGAT
TTTCAGAGTTTTGGTCCTCCTAATAACATAATATCCGTTTCTACCTGCAGCTATGGA CAAGA
ATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCAT GAG
TACGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGAATAATTCAAGCA GCTT
CCAGGAACCTTGAGAGTCTATGAGGCGCGATTGGTCGCATATTTGCCTCGATTCAGC GAG AATGCATTGCTCCCCCCCAGTTTTGTATAGGCATATGCAAGTTCTGACATGGACCTGAGT G
CTGAGAGCATGCAGGGTAGTTAATCCTTAGCAATCTTGCGTGTCAGTTGGCCAGTAT TACC
CAGGCTTATGGTGTGTGTATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAA CATA
CTTTGGTGTATCTGTAATGATAAATAAATGGTAAATGACTGAACAAAAAGTGTCGTT GGCGC
ACACATGTAGTAGAAGCATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCCCAA CTTT
GAAGTGAGAATGCAGAAGTCACGAAAACATTTGTTTACATCCAAAGTGCAATGGGAT TCGT
CAAGAGTTCAGAATCTTCCCTATAATAGAGTAGTAGCAGTGGTCCCTCACATTTTGA
SEQ ID NO: 18 >Wheat (hexapioid) PHS1 5B (Cadenza) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGCCGGC G GCGCCGGAGGAGGAGGAGCGCACGCCGTGGGCGCCGCGGGCCGGGTCGCGCCGAGGC
GGGGGCGCCGGGGCTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGC CCGCCTCGACCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCG C GTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCACCAGAGCACTCTTCACC C CTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAAT GC
AACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGA GTTTT TGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAG A AGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGC C CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTA A
ATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCA TAGC AAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGAT CGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGA T GGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATT
CCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCA TCACA
AAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACA TCTA AATGCTGAAAAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATT C TTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTT TT GAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAG
ATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGAGAATACTGATGGAT ATAAA GGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAACCA CACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACC C CGACATGTTGAGATCATAGAAACAATAGATGAGAAGCTGATGAACAACATCGTCTCAAAA TA
TGGAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGA CAAC GTTGATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAACGGGGAAG TT GCTTGTTCAATCTTTGGAGTCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGA G
GAAAACATTCTATCCGAGACAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCT CCT GATGCAGAAAAGGAGGATCCTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAG T
TACCTAGAGTTGTTCGAATGGCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATG GTGTG GCTGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGG CC TACTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAA T
CCTGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAAC ACTGA
TAAACTTGCAGGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCG TACT GCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTT G
TCAGCCCAGATGCGATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGC AGCT GCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAAGAAATGAGTGCCAAAGA TA GAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTACG TA CAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGACCCT G
ATGTTGGAGATTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTG AGAAG CTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGA A CCAGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGA AT GTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCT
GAAATCGCTGGTTTGAGGCAAGAGAGGGCTGAAGGAAAGTTTGTGCCTGACCCGAGA TTT GAAGAGGTTAAGGAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTG A TGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGG A
TTTTCCCAGTTACATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAA GCTAT GGACAAGAATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGA TCCATGAGTACGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA SEQ ID NO: 19 >Wheat (hexapioid) PHS1 5B (Cadenza) protein
MATASPPLATAFRPLAAAGGAGGGGAHAVGAAGRVAPRRGRRGFWRSVASDREVRGP AST
EEELSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWN ATYDYYNKV
NAKQAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNG GLGRLA
SCFLDSMATLNYPAWGYGLRYRYGLFKQIIAKDGQEEVAENWLEMGNPWEIVRNDVS YPVKF
YGKVVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAF NAGDHA
KANEAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLN WEDFPSK
VAVQMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSL DIMQKLLP
RHVEIIETIDEKLMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKE KTGKLLVQSL
ESIAEGDEKTESQEEENILSETAEKKGGSDSEEAPDAEKEDPVYELDPFAKYDPQLP RWRMA
NLCVVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPEL SAIISK
WIGSDDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYVVSPDA MFDVQ
VKRIHEYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRI VKFITDVAA
TVNYDPDVGDLLKWFVPDYNVSVAEKLIPASELSQHISTAGMEASGTSNMKFAMNGC ILIGTL
DGANVEIREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGVFG TSNYDE
LMGSLEGNEGYGRADYFLVGKDFPSYIECQQKVDEAYRDQKLWTRMSILNTAGSPKF SSDRTI
HEYAKDIWDISPVIMP*
SEQ ID NO: 20 >Wheat (hexapioid) PHS1 5B (Cadenza) promoter (-2kb upstream of ATG)
ATGCATTACCGAGTGGGCCCAGAGATACCTCTCCGTCATACGGAGTGACAAATCCCA GTCT CGATCCGTGTCAACCCAACAGACACTTTCGGAGATACCCGTAGTATACCTTTATAGTCAC C CAGTTATGTTGTGACGTTTGGTACACCCAAAGCACTCCTACGGTGTCCGGGAGTTACACG A TCTCATGGTCTAAGGAAAAGATACTTGACATTGGAAAAGCTCTAGCAAACGAACTACACG A TCTTGTGCTATGCTTAGGATTGGGTCTTGTCCATCACATCATTCTCCTAATGATGTGATC CC GTTATCAACGACATCCAATGTCCATAGTCAAGAAACCATGACTATCATTTGATCAACGAG CT AGTCAACTAGAGGCTTACTAGGGACATGTTGTGGTCTATGTATTCACACATGTATTACGA TT
TCCGGATAACACAATTATAACATGAATAACAGACGATTATCATGAACAAGGAAATAT AATAA TAATCTTTTATTATTGCCTCTAGGGCATATTTCCAACAATTGCGGTGAGGCGCGTGGAGG TT GGCGTGTCCCCAAATCTAGTTATCTGGGTTGGGGTTTTTCTTTGGAAGGTTTATCTCAGT G GTAGTGTCCCGTGAAGCGGATGACACGGCTATGTGTCTTGGTCCTCCATGCTCTAGTTTT G ACATGTGAAGGTTGGCCAACTTCGGCGTTGTCAGGGAGCATATGAGATTTGTGTTCCTTT A CTTCGCCTGGTCAGACCTACGGTATTATATAAGCCCTCTCATGTGTTTTCTTCTCTTGGG CA GCAGTCAGTTACGCAAATCGTGGTCGCTGACATCTTCTAGTCCACATCGGCGACTTCCCG A
CCGCCGCTTCTACAAGCTCCTTATTTTTCAACAAGTTTTCTCCACCGAGATGGAAGT CAAGA GGCGGCAACGGCTTTGCTCATGGCGCACTGCGGATGGATGCAGGAGAAGGAAGACATTC CACACCCCTCAAACTATGAAGATGTAATTTTTAGTTTTTGTAAGTATTTTTGTGAGGATT TGT GCTACTTTTAATATATATGGCCTTTTGCCTTTTTCCAGGAAAGGAAAAACTAAACCATTA CAT GAGGCCATTCCCATCCGTCCAATAAATACGACAAAACGCTTTACAGCACGGGGGCAATTC C CAGCCGTCGAAATGCGCTGCAACGGAACTAGCTCTCCCAGACAGACACAGCGGCACATGA GCCACGCCTCCCATCGCAACCGTGCACCGGCGGTGGGCCCACCGGCCAGCGCCCCCAC
ACGTGTACCGGCGGCCAGCATTGCACGGCACGGCGACGGAGAGATATTTTTAACGCC TTC CAATTCAGAAGCTCCCCATCACTCGCTCGGGTATCAGAGCGGGCCGCGTGACTCTCGCGT CGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGGCGCCA
SEQ ID NO: 21 >Wheat (hexapioid) PHS1 5D (Cadenza) genomic DNA
AGTGGTAGTGTCTCGTGAAAGCGGATGACGCGACTATGTGTCTTGGTCCTCCACGCT CTA
GTTTCGACAGGTGAAGGTTGGCCAGCTTCGGCCTCGTCGGGGAGCGTATGAGATTTG TGT
TCCTTTACTTCGTCTGGCCAGACTTCCCGACCGCCGCTTCTACAAGCTCCTTATTTC AAGC
AAGTTTTCTTCACCGAGATGGAAGTCCAGAGGCGGCAACAGCTTTGCTCACGGCGCA GTG
GCGGTGGATGCAGGAGAAAGAAGACATTCAACACCCCTAAACTATGAAGATGTAATT TTTT
ATTTTTGTAAGTGCTTTTGTGAGGACTTGTGCTACTTTTAATATGTGGCCTTTTGCC TTTTTC
CAGAAAAAAAAAACTAAACCATTACACGAGGAGAAGAGCCATTCCCAGCCGTCCAAT AAAC
ACGACAAAACGCTTTACAGCACGGGGGCAGTTCCCAGCCGTCGAAATGCGCTGCAAC GGA
ACTAGTAGCTCTCCCTGACACGGCGGCACATGAGCCACGCCTTGCGCCGTGCACCGG CG GTGGGCCCACCGGCCAGTGCCCCCACACGTGTACCGGCGGCCAGCATTGCACGGCACG GCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGG T ACAGAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCA GCGCGACCACTGCCACCAATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCC GCTCGCCGCCGGCGGCGGCACCGGAGGAGGAGGAGCGCACGCCGCGGGCTGGGTCGC GCCGAGGCGGGGGCGCCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGT GCGGGGGCCGGCCTCGACCGAGGAAGGTAAGCGCCGCGCGCTCCCGTTGGCTACGACG ACCCGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGTGCGACCTGAATTTG T GATCCTCTGACTGACGGTCTCTGTGTTCCCCGGTTTCGGGCTATTGATTTTGTGCGCGCG C ATTGCTAATCTACTGCTCTAGGATTTGTTAGGGTTCAGAGGAATCTTAAATTCGTGCATA AA CTTCGACGAATGTATTCTGAATTTACTGAAACTGGCTTAGTTTGGCGCATCAAGTTAGAA TG ATGTATTTATGGTACAGTAGCAAATTCTGAGAAATTCTTCTGTGCCTGCACAGCCACCCC TT GTGGATGATGCAGCAAGTGTCAACATACTGTCGCTACTATCTGTAGCTGTAACAAAATCT TA CGCCTTGTTTTCTGGTGGTACAGTAGTATTATCATTTTACAAAGAGTAGCTGTAACAAAA TC TTACGCCTTGTTTTCTGGTGGCTTGCCATTGGGGAGTAGAATGCAAATGGTGATACAGTA T ATTATCTACGATTTTGTGCAAGAATTTTCACACGAAAACAATGTCCCATGCAGTTTTTAG TTT TGACTCTTGACTGTTTAATACCTACTCTTTGTAAGATTTCGGATCCTAATATTGTTGTTC TTG CAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAATAT CC AGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCCTAAAGGCTT A CCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAATGCAACATATGA CT ATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGTCT CA CAATATACTTCCTTTAGCCTTATCTCAATACTTTTTGGTAACTGCGTTATGATGTTATAT GGT TATTATCTGTGTACTGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAAC A GGACAATATGCAGAAGCGTTAAAACAACTTGGACATAACCTAGAGGATGTTGCTAGTCAG G TAAGTGATTTTTGTCAGAGATTGGTTTTTAGTCTGTATCAAAATTATCGCAGAGCCATTT TTT TTTTCTAAAAAACACATTTTATTATTAATCATCAAGTTCCTTTTGTTTAAAAACTATCTG AAAA GAAGAAGTATTTTCCCCTGGGGGTTCCAGCATCTGACTTATATTCTTTATGCAGTTTTCA GA GTGAGTGACTATGAAATTACGCCAAAACACGAGCTTATTGATGCCTAGAGTGCCCTGTAG T TTAGAAATCTATTATTGGATATGTTACTTTTCTATTACAATGAAACAGGAACCAGATCCT GCC CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTA A ATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAA CA AAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATATGTCTTCTACTT G TTTACTGTTTCGAAGTGTTGACTATGTTGGTTTATGTTTTAATGACCTGCATGTTAGCTT CTT TTGCTGTTATAACTGATGCTGTATCTCCAGATGGGAAATCCATGGGAGATTGTAAGAAAT GA TGTCTCTTATCCTGTGAAATTCTACGGCAAAGTGGTTGAAGGGACTGATGGGAGAAAACA C TGGATTGGAGGAGAGAATATTAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACAAG A CTAAAACTACCAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAATTTTGATC TG GGAGCTTTCAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTAAATGCTGAAAAG GTATTCTTCCGTCTACATTTATTTTGAACGTAAACCATTAACTCTCAACAACTATTGATT CAA AGGCTAAACACTGTCTTGCTAGTAGTTACTGTGTTTTCTCTTAGATGTCATTTTAGAATG TG CGCGGTCAATCGTTTGATAACTTTGATAGTAATAATATACTGGTATACACAAACCTATTT GG GTTTCTCATGTAAAAATGTCGTCACCAAATTTTTCATGAAAATACTTATTGTATTTTCAT GTAA ATAGAAAAGCTGATGGTCAGGTGTTTTGTGTGTCAAATGACAAGTAAAAATGAACAGAGT TT GTGCACCAATAACTTCTTCACCTGGTGTTTGTAATTCTTTCGGTTGTATTTATGTGTTTT CTT ATTCTGCCTTAATTAACTCAGCATATGCATCAGGTTACATTACTCCTCTGTGTCAAGAAA AA AGAAGTCTGATGAGATGCAGTCCAGTAGTCTGTAGATGGCACATTGTAGAAGAACTTTTC A CATGGAAACGTTACTCTGTGATTTTTGTTATGCCTTTTGAGTGTTTGTGCATTGTGCACA TAT ACCATTAAGTATAAGTGTTCAAAAGTGGTTCTCTCATACTGTGCAAACTAGCTAAACAAC TT GACTGGACAAGATAAATTGTACTCACATCTGAAGCTGCCGGGTTGGACAGATTTGTCCGA A CCATCTTTGAACATCCCTACTAGTGTGCATGGGTTAAATTGTTGACTTTTCCTCGTTTGT TC AGTAGTAGGACTATCTAAATATATTTATATTCAAGTTTTCTTTGTGTGTTACGTTATAAA CTTT TAGTCATAATTACATTTGTGACCATTGTGGACAAAATTTATCATGTCTAACTTAAGTTTG TGA CTGGTCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAG G GGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTA TT TCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTT G CAGTTCAGATGAATGACACTCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGA T GGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGTACCACAGAAG GGTACTGCAGTTTGATGTGTTCTGCTGTTCCTCCAGTTTCAATTAACTAATTTTATCCCT CAT ATGTCTTCCATTCAATAATAAACTGATTGTCTGATATCATTTGTAGAACCGTTGCATACA CTA
ACCACACGGTACTTCCTGAAGCCCTGGAGAAGTGGAGCTTGGACATTATGCAGAAAC TTCT
ACCTCGACATGTTGAGATCATAGAAACAATAGATGAGGAGGTAACAAATGTGCAGAT AATA
TTATACAATTAATTTCATGCTCTGTAGGTCAACTTATGGTTTACTTATCTTCATCGT TGATTTT
AAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGATATTTCACTGTT AAAAC
AGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCTCTGTGGCTA AACT
GTTTGTTAAAACAAAAATGAAAAAGGGGAAGTTGCTTGTTGAATCTTTGGAGTCTAT TGCTG
AAGCTGACGAGAAAACTGAGCCAGAAGAAGAGGAAAACATTCTATCCGAGACAACAG AGA
AGAAGGGCGAGTCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGATCCTGAGT ATG
AGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGTTGTTCGAATGGCAA ACCTC
TGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGTGAAATTGTT AAGC
AAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAAACCATGATC TTATC
TTCTAACGGCTGCTATAATCGCCAGGTAATATGGTATTCACCATCTGCCATTTTCTT GTGTT
CTTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGG CGTT
GGATCCGGTTTTGTAATCCTGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTG ATGAC
TGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAATTTTA ATTTC
CAATAATCAGTAATCTAATGCTTCCTTCACTTCCATGTTACCCGTGCCTTGTTTTTA CATTGA
TATGTGTTCATCCCGTAATAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGT ACTG
CTAAAAGGAATAACAAGATGAAAGTAGTTTCACTGATAAGAGATAAGACTGGATATG TTGTC
AGCCCAGATGCGATGTTTGATGTGCAGGTATATTTTGGACAAGAAACATCTGTCTTC TGTTA
AATATGGTTTAACTCGGATCTTGGTACTTGACACTTAGGTGAAAAGAATACATGAAT ATAAG
CGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGGTACAAGAAGATGAAAGAAATG AGTG
CAAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTT TTGC
CACTTATGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGT AAATT
ATGATCCTGATATTGGAGATTTACTGAAGGTAAATTTGTGTCTTCAATGGCACTTAT TATTCT
TAATTTTCATGTTCTGTCACTGGGAATTTGACTCCGACTAGGGGTTACTGAAAAGCA GTGAA
TGTTCCGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGTCGGTGCTTGTA GAAC
CGAGAAACTGAATAGCCACAGTATTTTTTAGAGAACAATAGCCACAATTGTTTTGAC CCTTC
TATTGCTATCAAGCATCTGATGCCGATTTTACTGATAACTAGATGAGGTAAATACAA GGTGA
CAGGTTAAGTTCTGAGGATTTTTGTTTGTTATACAGGTTGTATTTGTCCCAGATTAT AATGTT
AGCGTTGCTGAGAAGCTCATTCCTGCCAGTGAATTGTCTCAGCATATCAGGTAATAT TCTG
GCTACTCTTATATAGTATGTGAATACATAGAAAAGGAGTCTTACCTGGAGTTGGCTG ATGTG
GAAATTTTCATGTGCCGTTTTGTTTGCTATAATATATACCTGGAGTCTTGCTGGGAT AGTGC
CATCCAGAAGTGCCACGCCAGCTTCTATTTCTGTACAGGATGTGCATTTTTAACACT ATTAC
TATCCATTCAGTACCGCTGGAATGGAGGCTAGTGGAACCAGCAACATGAAGTTTGCA ATGA
ATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGG TTGG
AGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGTTTGAGGCA AGAG
AGGGCTGAAGGAAAGGTACCATCACTTTGCAAACATCTCCAAATTCGTTCCTCCAGT ATGT
GAAACAAATATCCACTTGTTTATATAGCTGTGCCTACTATCAAACTACTCCCTCCGT CCATA
ATATAAGAGTGTTTTTTACACTACACTAGTGTCAAAAACGCTCTTATATTATGGGAC GGAGG
GAGTAGTAAAGATCTCTCCCAAATGTGTGCTGTCCAAATAGTTTAGCTGAAGGAAAA TTATA
AATCATTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAG GAATA
CGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCTTTGGA AGGA
AATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTAC ATTGA
ATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGA CTCT
GTTTGTTTTCTGGCTGAAATGCACACAGTGATTATGCAAACTTTCTGATATGTGATG GCTGA
CATCAGTGTTTCACTGTGAACGACTTCACTGATTCGTAGGAAGGAATATTTCTTAGT TCAAA
GTTTCAAACTGAAGTATCTATCGGTGTGTTGGAAGATCATAAACAATCAGAGAATAT AGTTG
CTCGATGTGCTATATTACTCATCTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGA TGAAC
CTAGCTAGACCTTCGGGAAAGTGCTTCTATGCATTCCAATAAAAAAAACTTATATCT TAACA
AAACAATTAGGTTGAACCTCGCCAGGGGGCTTTTTGTTATCTGTTTCCTGAAATGAA AATCA
GAAGCGGGGAGCTCCTCAATTTGAAAAAAAAATCAAGGGTAACTGAGGGTCTGGTTT CCCT
CTGTTGGAGCCCCCAAGAATTCTGTGGTCTTATTTTTGTAATAGTATAAGATGCACT GCTCC
GTTAAGTTTCTCCTTCATCCATTTGCTTTTATTGCTTCGTGACATTTCGTGTAACAG CTCAGA
TTACCATGCATCCCATTACACATTATTTTGCTACTGTGGATATTTCCACTCTGTTTC ATGCTT
GACATTTTTAATTCGTTACATCGTGACCAACTAAACATAGTCTGATAGATTTTCAGA GTTTTG
ATCCTCCTAATCACATAATATCCGTTTCTACCTGCAGCTATGGACAAGAATGTCTAT CCTCA ACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTACGCCAAGGACA TATGGGATATCAGCCCTGTCATCATGCCCTGAATAATTCAAGCAGCTTCCAGGAACCTTG A
GAGTCTATGAGGCGCGATTGGTCGCAGATTTGCCTCGATTCAGCGAGAATGTATTGC TCCC
CCCAGTTTTGTATAGGCATATGCAAGTTCTGGCATGGACCTGAGTGCTGAGAGCATG CAG
GGTAGTTCATCCTTAGCAATCTTGCATGTCAGTTGGCCAGTATACCCAGGCTTATGG TGTG
TATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAACATACTTTGGTGTATCT GTAAT
GATAAATAAAGGGTAAATGACTGAACAAAAAGTGTCGTTGGCGCACACATGTAGTAG AAGC
ATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCCCCAGCTTTGAAATGAGAATG CAGA
AGTTGGGAAAACATTTGTTTACATCCAAAGTGCAACGGGATTCGTCAAGAGTTCAGA GTCT
TCCATCACGGCCATAGAGCATGTCCAATAGCAGTCCTATAATAGAGCAGTAGCAGTG GTCC
CTCATATTTTGAACCAAAAACGCAGCTCCAACGGCTGCCCTATTTGTAAAATTTACT TCAAA
ATTTGTTGGACAGCCCGAAAATGCAACACCAGCTGATGTAAATGTACAACACTTCGG CTGC
TGCCCTAAAATTTGCTTCAAGTGGCGAACCACTCAAGACCACAAGAAAACAGTTGTT TTAC
GTTACTTTGTGCCAAAAACGTTGCTCCAACAACTGCGCTATAATAGGCCACCCTAAT GTATT
TTAGGGCTGTTGTTGGAGATACATATTTTCGGGTGCCTAAAAATAAAATCATTTTTT GGTGC
CTTACGTTTTACGCCAACCACTCTTTTGTGCCCTATTATAGGGCTGCTATTGGAGAT ACTCT
AACAGTTCCATTTTTCCATAAAAGTTTCTGATTTGGAGGGAGTTGAGCAAAAAACAA AGTTT
TTGTAAACGTGAATTTTTAGTGGCCTTTATAGTTAGCCCAACTCCCCCTCCACCTAA
SEQ ID NO: 22 >Wheat (hexapioid) PHS1 5D (Cadenza) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGGCGGC G GCACCGGAGGAGGAGGAGCGCACGCCGCGGGCTGGGTCGCGCCGAGGCGGGGGCGCC GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCGGCCTCGAC CGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAATAT C CAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCCTAAAGGCT T ACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAATGCAACATATG AC TATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGGA AG AGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAA A CAACTTGGACATAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAAT G GTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCAG C ATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATGG T CAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAAT GATGTCTCTTATCCTGTGAAATTCTACGGCAAAGTGGTTGAAGGGACTGATGGGAGAAAA C ACTGGATTGGAGGAGAGAATATTAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACA A GACTAAAACTACCAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAATTTTGA TCT GGGAGCTTTCAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTAAATGCTGAAAA GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCTTGAA A CAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C TCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATATAAAGGGATTGAG C TGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAACCACACGGTACTT C CTGAAGCCCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATCATAGAAACAATAGATGAGGAGCTGATGAACAACATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTGGCTAAACTGTTTGTTAAAACAAAAATGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGCCAGAAGAAGAGGAAAACATTC T ATCCGAGACAACAGAGAAGAAGGGCGAGTCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGT T GTTCGAATGGCAAACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATT C ACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAAGT TC CAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTA A GTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTG CA GGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGG A ATAACAAGATGAAAGTAGTTTCACTGATAAGAGATAAGACTGGATATGTTGTCAGCCCAG AT GCGATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTGCTAAATATC C TTGGTATCGTTTACCGGTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGA G CTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTATGTACAGGCAAA G
AGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATT GGAGA TTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGAAGCTCATTCC TG
CCAGTGAATTGTCTCAGCATATCAGTACCGCTGGAATGGAGGCTAGTGGAACCAGCA ACAT
GAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGA GATCA
GAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCG CTGG
TTTGAGGCAAGAGAGGGCTGAAGGAAAGTTTGTGCCTGACCCGAGATTTGAAGAGGT TAA
GGAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTGATGGGTTC TTTG
GAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCC AGTTA
CATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACAAG AATG
TCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAG TAC
GCCAAGGACATATGGGATATCAGCCCTGTCATCATGCCCTGA
SEQ ID NO: 23 >Wheat (hexapioid) PHS1 5D (Cadenza) protein
MATASPPLATAFRPLAAGGGTGGGGAHAAGWVAPRRGRRGFVVRSVASDREVRGPAS TEEE LSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWNATYDYY NKVNAK QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGHNLEDVASQEPDPALGNGGLGRLA SCF LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPVKFY GK VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAGDHA KAN EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDFPS KVAV QMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSLDIMQKL LPRH VEIIETIDEELMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFVKTKMKKGKL LVESLES IAEADEKTEPEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFAKYDPQLPRWRMA NLC VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSAIISK WIGS DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYWSPDAMFDVQVKR IH EYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYD PDIGDLLKVVFVPDYNVSVAEKLIPASELSQHISTAGMEASGTSNMKFAMNGCILIGTLD GANVE IREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGVFGTSNYDELMG SLE GNEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIHEY AKDI WDISPVIMP*
SEQ ID NO: 24 >Wheat (hexapioid) PHS1 5D (Cadenza) promoter (-2kb upstream of ATG)
TATGTTAGTCCAAATAAATATCATCTAAAATTGTGCCTAAACCATATACAAATGATG TAAATG
TGGACATAATACTTCATAAAGTATAGATATGTTTGAGACGTATCAGGAGCGTCGACC TTTTT
GGGAAGCCAAGTGAGCAAAGCCTCATCAAGTCTTGGGAATCCACGGTCATTGAGGGA GAA
AAATTTATCAAAGGTCTCACACATACCCCCTTTGATGATGGACCAGCAGGACCGCAG TAAT
TTGGCCGTGAAGCCATCTAGCCCCGTCAACTTGCCGGAAGGGAACCTCTTGACCGCC TCC
TAAATTTCATCCTCAGTGAAGGGTTGCTCAAGTGCAGGGAGGTCAAAGTGATGCGGG TCG
GTACTATGGAGATCAAGGGAGAGAGTCAGTGCTCAAAAGTGTCAAGAATGGAAGAGA AGT
GATCATATGCCGCCCTGGCCATATCGGGCTCGCCAGAATGTGCCCACCATCAGACTG CAC
CCCTAGGATTATTAACTTCTGTCGTCGGTGCACAGAATGGATCTTGAAGAAGGTGGA ATTT
GTGTCGCCCTCACGTAGCCAGGAGAACCTGGCAGGCAGATGTGCAATCGACCTCTCC ACC
GAAGCAAACCCTAGGTAGGCGCGCTTCAGCTCGCGGAGTAACCATACTTCTGGAGGA GCA
AGCAGGCGGTGATCTTGAGCCGCATCGAGATGAGCGGTCAACTCCCGAGCGAGAAGC AT
CTGAGTGGAGATGTTACCAAGCGTTCTAGACCTCCAACTCTACAGTCAGCGCATTGT CAGC
TTGAGCTTGGCGTAGATCCCCTGAAAATGATTGGGGTCATGCACGCCAGAATTCCAG GTGT
CAGAGACCACCTAATGAAAACAATCGAGCTTTGGCCAAAATAGCTCAAAATGGAATC TCCG
ATCACCTTGGGAGTGAAAATGTTACTAGTTTCGTTGGAGGCCAAGGTAGTTATGTAT TTGTG
TGTGCAATTACTAATCATTGGCCCTTAGCAAACTCCTATTGGTTCAAACCGTTTCAC TTTGA
AAAAAATGCTACTTGCTACAAACCGTTTCACTTGGAGGGGCGTCTTTTCTAGTTGAG AGGG
AGCAAAAGGCGTATCAAGACGACAACGCCACCCTCGGCAATGTACAAGTCCTTCCCC GTC
CTCGTTTCGATGCTGCTTTTTGTCATCGCAGCGAGGCGTGTAGAGGGTGGTGTGTCC CCA
GATCTAGTTATCTGGGTTGGGGTTTTTCTTTGGCAGGTTTATCTCAGTGGTAGTGTC TCGTG
AAAGCGGATGACGCGACTATGTGTCTTGGTCCTCCACGCTCTAGTTTCGACAGGTGA AGG
TTGGCCAGCTTCGGCCTCGTCGGGGAGCGTATGAGATTTGTGTTCCTTTACTTCGTC TGGC
CAGACTTCCCGACCGCCGCTTCTACAAGCTCCTTATTTCAAGCAAGTTTTCTTCACC GAGAT
GGAAGTCCAGAGGCGGCAACAGCTTTGCTCACGGCGCAGTGGCGGTGGATGCAGGAG AA
AGAAGACATTCAACACCCCTAAACTATGAAGATGTAATTTTTTATTTTTGTAAGTGC TTTTGT GAGGACTTGTGCTACTTTTAATATGTGGCCTTTTGCCTTTTTCCAGAAAAAAAAAACTAA AC
CATTACACGAGGAGAAGAGCCATTCCCAGCCGTCCAATAAACACGACAAAACGCTTT ACAG
CACGGGGGCAGTTCCCAGCCGTCGAAATGCGCTGCAACGGAACTAGTAGCTCTCCCT GAC
ACGGCGGCACATGAGCCACGCCTTGCGCCGTGCACCGGCGGTGGGCCCACCGGCCAG T
GCCCCCACACGTGTACCGGCGGCCAGCATTGCACGGCACGGCGACGGAGAGATATTT TTA
ACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGGGCCGCGT GAC
TCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCACC A
SEQ ID NO: 25 >Wheat (tetrapioid) PHS1 5A (Svevo reference genome) genomic DNA
CGAGGGCGGCGGCGAAGGGATCTGGTGTGGTGTCGCTCGAGGGCACCGGACTACCGG A
GGCTTCGCTCGCTCGACCGCACGTTGCGAGTCCAAGACGATTTTTTTTCGCTGGTTT TCTT
TCGTCGGTTTAATTTCGTTCGATCTACTTTGCCTGCGAGGGGAGTGCCTGCAGGCAT CGCT
CCCTGATAACGTAGGTTTGTTCGGTTTTTTTAACGTACGAGGGGATGTGGGAGGAAA TACC
AAAAAAGTATCAAAAAAGACCGGGTGAGGTGGGACGAAAATAAAACCTGGAATGCGA CCTA
CCAACTGAGATATTAGGAGTAGAGATATATGTTTTGTATCTCCTTTTTTTGGTATAA TTTTTTT
TGGTGAAACGCCGTTTCTTTTGTAAGCGAAGATCCATATCTCCGATCAGGCAGATTC TGCA
CATTCTTTTCTTATTTTAGGATTGCTCTTGCAAAAAGAAGATCATCAATCAACTTAG GGATCC
TCCACGCAGACCCACCACGCAGCCTAGGGGCACCGTCTGGCCATCTGGAGCCCCGGT GG
CTCCCTTGGACCCCATCTTCTCGCTAGTGGGGCCTTGGTCCAAAATAATCCATGTAA TTTTT
CATGTTTTCCGGCTTCGTATTTTTTGGTAAAAACCCAAAAACAATAGAAAAGAAGAA TTGGC
CTCTGACACTAGATTGATAGGTTAGTCCAAATCAATATTATCAAAAATTGTGCCCAA ACCAT
ATACAAATGATATAAATATAACTAGAATACTTCATAAATTTAGATACGTTTGATAAA GTGCCA
AGAATGGAAGAGAAGTGATCATATGCCGCCCTGTCCATATCGGGCTCATCAGAATGT TCCC
ACCATCAGACTCCAGCCCCAGGATTATTAACTTATGTCGGCGGTGCGCGGAATGGAT CTTG
AAGAAAGCGGAATTTGTGTCGCCCTCACGTAACCAGGAGAACTTGGCAGGTTGACGC GCA
ATCGACCTCTCCAGCAAAGCAAGTCCTAGGTAGGCGCGCTTCAGCTCGCAGAGTAAC CGT
ACTTCTGGAGGAGCAAGCGGGTGGTGATCTTGAGCCGCCGAGACGAGCAATCAATTA CCG
AGCGAGAAGCATCTGAGTGGAGATGTTACCGAGCGTTCTGGAGCTCCAACTCTGCCG TCA
GTGCATTGTCAGCTTGTGCTTTGCGTAGATTCGCCGGAAAGGATCAGGATCATGCAC GCC
AGAATTCCAGGTGTCAGAGACCACCTAATGAAAACCATCGAGCTTTGATCAAAATAG CTCA
AGATGGAATCTCCGATCACCTTGGGAGCGAAAATCTTACTAGTTTCGTTCAAGGCCA AGGT
AGTTATATATTTGTGTGTGCAATTATTAATCATTGGCCGCTTAGCAAACTCTTATTG GTTCGA
TAAACATTAAAGTCATCCCCACTTTGAGAAAAAATGCTACTTGCTACAAACCCTNNN NNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNGCAAGTAATAAATCCTTGCCGCCTCGTCCTTGTTTGGATGCTGCTTCTTGT CATC
TCAAGGAGGCGTGTGGAGGGTGGTGTGTCCCCGGATCTTGTTCTCTCGGTTGGGTTT TTT
CTATGGCAGGTTTATCTCAGTGGTAGTGTCACGTGAAGCAGATGACGCGACTATGCG TCTT
GGTCCTCTGCGCTCTAATTCCGGCAGATGAAGGTTGACCAGCTTCAGCCTCGTCGGG GAG
CGTATGAGATTTGTGTTTTTTGACTCCATCTGGTCGGACCTGTGGCATTATGTAGAC CCTCT
CATGTATTCTTTTTCTCCTGGACAGTAGTCTGTTACACAAATCGTGGTCGTCGGATC TTCTA
GTCCACATCGGCGACTTTTCCGACCGCCACTTCTACAAACTCCTTATTTTCAACAAG TCTCC
TTCATTGAGATGGAAGTCCAGGCAATGGCTTTGCTCACGGCACACTGCCGGCGGATG CAA
GAGAAAGAAGACATTCGACACCCCTAAAACTATGAAGATGTAATTTTTAATCTTTGT AAGTG
TTTTTGTGAGGATTTGTGCTACTTTTAATATATTGCCTTTTGCCTTTTGCAAGGGAA AAAAAA
CTAAACCATTACACGAAGAGAAGAGCCATTCCCAGCCGTCCAATAAACACGACAAAA CGCT
TTACAGCACGGGGGCAATTCCCAGCCGTCGAAATGCGCTGCAACGGAACCAGCGCTC CC
CGACACGGCGGCACATGAGCCACAACTCGTACCATGCACCGTGCACCGGCGGTGGGC CC
ACTGGCCAGCGTCCCCACACGTGTACCGGCGGCCACAATTACACGGCACGGCACGGC GA
CGGAGAGATATTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGGTA CAGA
GCGCAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGT CA
GCGCGACCACTGCCGCCAATGGCGACCGCCTCGCCGCCGCTCGCCACCCCCTTCCGC CC
GCTCGCCGTCGCCGGAGGTAGCGGCGGACTCGTCGTGGGCGCCAGGGCCGTGGTACC G
CCGAGGCGGGGGCGGCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTG
CGGGGGCCCGCCCCGACCGAGGAAGGTAAGCGGCCGCGCGCTCCCATTGGCTGCCAC G ACCTGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGTGCGACCTGAATTTG T
GATCCTCTGACTGACGGCCTATGTGTTCCCCGGGTTTCGGGCTGTTGATTTTGTGCG CGC GCATTGCTAATCTACTTCTCTAGGATTTGTTAGGGCTCAGAGCAATCTTAAATTCGCGAA TA CACTTCGACGAATGTAGTCTGAATTTACTGAAACTAGCTTAGTTTGGCGCGTCAAGATAG A CTGATGTATTTATGGTAGCAAATTCTGAAAACTTCTTCTGTGCGTGGACAGCCACCCCTC GT GGATGATGCAGCAAGTGCTAACACAGTGTTTGCTAGTACTTGCTAGTATCTGTAGCTGTA A CAAAATCCTTCGAATCTGAACTAGTGAGATAAATCGTTTGAGATTTTTGCTGAAGATTAC AC AGTATCAATATTTCTGAAGGTGATTGGAACAGTATAGCATTTCAAACTGGCAACCAAGCA G GATCAGTATACTGACGCAATTTTTTTCTGAAGGTGGTTGGAATACCGGTATGAATTTTAT AA TCATTGAATTAAGGCGTAAGATTTGGTAACAAAAACTGCTTTTCACGGGAAACGATGAAA CA TGGCACAATTCAACCTGACTTTCTTTTATGAAAAGGCTCTCAAGCTGCCTTGTACTCCCT CC GTCCCATAATATAAGAACGTTTTTGACATTAGTGTAGTACCAAAAACGTTCTTATATTAT GGG ACAGAGGGAGTATTTATTAATGGATAAAAAGAGAGGTACAGGGTTCTAGTAGTAACATAT G CAAATGGTGATACAGTATATTATCTACGATTTTGTGCAAGAATTTTCACACGAAAACAAT ATT CCATGCAGTTTTTAGTTTTGACTTTTTTTTTTGAATTTTCACGCGCGCGGGGGGTGGGGG G GGGGGGAGAGAACTCCCCCACCTGAATATATTTCTCGAACCCAACATACAGTACAATAGT A CAAATGCCGGGGCGGCAGCAGAGAGAGAGGCAGGGGGCGACCCCCCCATGCTATGTTAC ATGCAGCTTTGTTCTACCCACGCAGAGCCTGTAGCCAGTTTTGACTCTTTTTGAATTGTT TG GATTGCTTACCACATGTGTCACGTGGTTTTAGTTTTGACTCTTGACTGTTTAATACCTAC TCT TTTTAAGATTTTGGATTCTAATGTCATCTGTCTTGCAACTCAGAGCTTTCAGCCGTGCTA AC TTCCATTGATTCATCCGCCATCGCATCAAACATCCAGCACCATGCAGACTTCACACCGTT AT TCTCGCCAGAGCACTCTTCACCCCTAAAGGCTTACCATGCAACTGCCAAAAGTGTTTTTG A TTCTCTGATATTGAATTGGAATGCAACATATGACTATTACAACAAAGTGAATGCAAAGCA AG CTTATTACCTATCCATGGAATTTTTGCAGGTCTCGCAATATACTTTCTTCAGCCTTATCT CAT ACCTTTTGGTAACTGCGTTATGATGTTATATGTTTTTTTTTGTACTGCAGGGAAGAGCTC TTA CAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTG G ACAGAACCTAGAGGATGTTGCTAGTCAGGTAAGCCATTTTTGTCAGAGAATGGTTTTTAG TC CATATCAAAATTATTGCCGAGCCATTTTTCTAAATAACATTTTATTATTATTCATGAACT TCCA TATTTTTTTCAAATCTGCAAAAGGAGAAGTTTTTTTCCTGGGGGTTCCACCATCTGACTT ATA TTTGTTATGCAGTTTTCAGAGTGATTGACTATGAAATTACGCCAAAACACGAGCATTGCC TA GAGTGCCCTGTAGTTTTAGAAATGTTAAGATTGGATATGTTACTTTTCTATTACAATGAA ACA GGAACCAGATCCTGCCCTTGGCAATGGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGGA TTCTATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCT C TTTAAGCAAATCATAACAAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTA G TGATATGTCTTCTACTTGTTTACTGTTTGGAAGCGTTGATTATGTTGGTTTCTGTTTTAA TGA TCTGCATGTTAGCTTCTTTTGCCATTTATAACTGATGCTGTATCTCCAGATGGGAAATCC AT GGGAGATTGTAAGAAATGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAG G CACTGATGGGAGAAAACACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGT TCCTATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGT AC CATCACAAAACTTTGATTTGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAG C ACATCTAAATGCTGAAAAGGTATTCCTTTGTCTGCATTTATTTTGAATATAAATCATTAA ATCT CAATAACTATTGATTTAAATGGTAAACACTAAACATTGTCTAGCTAGTAGTCACTGTGGT GT CCCTCAATGTCATTTTAGAATGTGCACAGTCAAATGTTTGATAACTTTGATTGTAATAAT ATG ATAATATACAGAAACTTTGATTGACACTAGATTTTTCATAAACATACTTATTGTATTTCC ATGT AAATAGAAAAGCTGATGTTCAGAGGTTTGTGTGTCAAAATGAACAGAGTTTGTGCACACA TA ATATCTTCATCTGGTGTTTGCAGTTCTCTAGGTTGTGTTTGCGTTTTCTCCTGCTCTGCC TT AACTAATCTCAGCATATGCATCAGGCTACATTGCTCCTCTGTGTCTATAAAAAATAAGTC TG ATGATATATGGTTGCAGTCCAGTAGTTCGTAGATGGCACAGCATGGAAGAACTTTTCACA T
GGAAGCCTTACCCTTATAATTTTAATTATACCTTTTGAGTGTTTGTGCATTGTGTAC ATACAC CATTAAGTATGGGTGCTCAAAAATGGTTTATCCCAAACTGTGCAAAAGCCGCCACTGTTT GT GTATTAATTTTTTTGAACTAGCTAAACAACTTGACCGAACAAGATACATACAATTGCACT CAC TTCTGAAGCAGCTCAGTTGGACGGATTTGTTCCAATCCCGTGCATGGGTTAAATTGTCGA G TTGTCCTCGCTTGTTCAGTAGTAGGAGTACTATCTATTATATTTATACTAAAGTTTTCTT TTT GTGATACAGTACAAACTTTTAGTAATACTTACATTTGCGACCACTGTGGACAAAATTTAT TAT GTCTAACTTAACTTTATCACTGGTCTTCTCATCATGCCAGATATGCCACGTATTGTATCC AG GGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAAACAACAATATACACTATGCTCGG C CTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGA G GACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACACCCATCCAACACTGTGCATTCCC G AGTTAATGAGAATACTGATGGATGTTAAGGGATTGAGCTGGAATGAGTCTTGGAGTATCA C GGAAAGGTATCACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCAGTTCCAG T TAACTAATTTTATTTCCTCATCTGTCTTCCATTCAATCATGTACTGATTTTCTGATATCC ACAT CATTTGTAGAACCGTTGCATACACTAACCACACAGTACTTCCTGAAGCTCTGGAGAAGTG G AGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTGAGATTATAGAAAGAATAGAT GA GGAGGTATCAAATGTGCAGATAATATTATACAATTAATTTCATGCTCTGTAGTTCAACTT ATG GTTTACTTGTCTTCATCGTTGATTTTAAACAGCTGATGAACACCATCGTCTCAAAATATG GA ACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTT GA TCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCT TG TTGAATCTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAA A CATTCTATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGC A GAAAAGGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCT AG AGTTGTTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGA G ATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCATGA TG ATTTGGAAACCGTGATCTTATTTTCCAATGGCTGCTATAATCGTCAGGTGATATGGTATT CA CCATCTGCCATTTTCTTGTGTTCTTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAAC AA ATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTAAGTACAATAATTT CA AAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAG G TACTTCCAGTAGTTTTATTCCCCAATAATCAGTAATCTAATGCTTCCTTCACTTCAATGT TAA CTGTGCATTGTTTTTACATTGATGTGTGTTCATCCTGTAACAGTTTGCTGATGATGAGGA TC TGCAATCAGAATGGCGTACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAA G AGACAAGACTGGATATGTTGTCAGCCCAGATGCAATGTTTGATGTGCAGGTATATTTTGT AC TAGAAAACATGTGTCTTCTTATAAATATGGTTTAACTCAGCTCTTGGTACTGGACACTTA GG TGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAATATCCTTGGTATCGTTTACCGCT AC AAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGC A TATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCA C AGATGTCGCAGCTACTGTAAATTATGATCCTGATATTGGAGATCTACTGAAGGTAAATTT GT GTCTTTGATGCCACTTATCAGTCTTAATTTTCATGTTCTGTCACTGGCAAGTTGACTCCG AC TGAAAAGCATTGAATGTTCAGAGTTTCAAAGGAAAACAGAGCAGTGAATGTTCTTAATGT CG GTGCTTGTAAAACTGAGCAACTGAATAGCCACAGGTTTTTTAGAGAACAATATAGCCACA AA TTTTTTTGACACTTTTATTGCTATCAGCATCTGATGCTGATATTACTGATAACTAGATGA GGT AAATATAAGGTGACAGGTTAAGTCTGAGGATTTTTGTTTGTTATCCAGGTTGTATTTGTC CC AGATTATAATGTTAGCGTTGCTGAGACGCTCATTCCTGCCAGTGAATTGTCCCAGCATAT CA GGTAATATTCTGCCTACTCTTATATAGTATGTGAATACATAAAAAAAGAATATTACCTGG AGT TGGCTGACATAGAATTTTTCATGTGCCTTTTTGTTTGCTATAATATGTACCTGGAGTCTT ACT GGGATAGTGCCATCCAGAAGTGCCACCCCAGTTTCTATTTCTGTATAAGACGTGCATTTT TA ACAGTATTACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACATGAA G TTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAGA G AAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGTT T GAGGCAAGAGAGAGCTGAAGGGAAGGTACCACACTTGTTTATATAGCTGTGCCTACTATC A AACTAGTAAAGATCTCTCCCATATGTGCTGTCCAAATAGTTTAGCTGAAGGAAAATCATA AC TTTACAAGTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAGG AA TACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTTCTTTGGAA G GAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTACA TT GAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGAC G CTGTTTGTTTTCTGGCTGAAATGCTCACAGTGATTATGCAAACTTTCTGATATGTGATGG CT GACATCAGTGTTTCACTGTGAACGACTTCATTGATTCGTAGGAAGGAATATTTCTTAGTT TA AAGTTTCAAACTGAAGTATCTATCGGTGTGTTGGAAGATCATAAACAATCAGAGAATATA GT TGCTAGATGTGCTATATTACTCATCTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGAT GT AGCTAGACCTTCAGGAAAGTGCTTCTGTGCATTCCAATAAAAAAATAACTTATAACTTAA CA AAACAAAACAATTAGGTTGAACCTCACCAGGGGGCCTTTTGTTATCTGTTTTCTGAAATG AA AATCAGAAGTGGGGAGCTCCTCAATTCGAAAAAGAAAAAGAAAATCAGGGGGTAACTGAG GGTCTGGTTTCCCTCTGTTGGAGCCCCCAAGAATTCTGTGGTCTTATTTTTGTAAGAGTA TA AGATGCACTGCTCCGTTAAGTTTCTCCTTCACCCATTTGTTTTCATTGCTTCGTGACATT TC GTGTGACAGCTCAGATCACCATATGCATCCCATTTGTACACATTATTTTGCTACCGTGGA TA TTTCCACTCTGTTTCATGTGTGACATTTTTAATTCGTTACATTGTTACTCTAAACATTAG TCTA ATAGATTTTCAGAGTTTTGACCCTCCTAATCACATAATATCCGTTTCGACCTGCAGCTAT GG ACAAGAATGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATC
CACGAGTATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGAATAATT CAAG CAGCTTCCAGGAACCTTGAGAGTCTATGAGGCGCGATTGGTCGCTGATTTGCCTTGATTC A GCGAGAATGTATTGCTCCCCCCAGTTTTGTATAGGCATATGCAAGTTCTGACATGGACCT G
AGTGCTGAGAGCATGCAGGGTAGTTCATCCTTAGCAATATTGCATGTCAGTTAGCCA GTAT
TACCCAGGCTTATGGTGTGTGTATTTCGCTGAATAAAGTTGGAGCAGAGTCATCAAA CAAA
CATACTTTGGTGTATCTGTAATGATAAATAAATGGTACATGTGACTGAACAAAAAGT GTCGT
TGGCGTGCACATGTAGTAGAAGCATGTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCA CCC
CAGCTTTGGAGTGAGAATGCAGAAGTTACGAAAACATTTGTTTACATCCAAAGTGCA ACGG
GATTCGTCAAGAGTTCAGAGTCTTCCATCACGGTCATAGAGCATGTCCAATAGCAGC CCTA
TAAACCTATAACAGAGCAGTAGCAGTGGTCCCTCACATTTTGAACCAAAAATGCAGC TCCA
ACGGCTGCCCTATTTGTAAAATTTACTTCAAAATTTGTTGG
SEQ ID NO: 26 >Wheat (tetrapioid) PHS1 5A (Svevo reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCCCCTTCCGCCCGCTCGCCGTCGCCGGA G GTAGCGGCGGACTCGTCGTGGGCGCCAGGGCCGTGGTACCGCCGAGGCGGGGGCGGC GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCCCGA CCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAAACA T CCAGCACCATGCAGACTTCACACCGTTATTCTCGCCAGAGCACTCTTCACCCCTAAAGGC T TACCATGCAACTGCCAAAAGTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACATAT GA CTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCAGGG AA GAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAA A ACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAA T GGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCA G CATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATG G TCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAA TGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAA A CACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTAC A AGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAACTTTG ATT TGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAGCACATCTAAATGCTGAAA A GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAA A CAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C CCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAGGGATTGAG C TGGAATGAGTCTTGGAGTATCACGGAAAGAACCGTTGCATACACTAACCACACAGTACTT C CTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATTATAGAAAGAATAGATGAGGAGCTGATGAACACCATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAAACATTC T ATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCTAGAGT TG TTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTC A CAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAAGTT CC AAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTAA G TACAATAATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACTTGC AG GACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGGA A TAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCCAGA T
GCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAAT ATCCT TGGTATCGTTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAG C TTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAG A GGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTGGAG AT CTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCATTCCT GC CAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAACAT GAAGTTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGAT C AGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCT G GTTTGAGGCAAGAGAGAGCTGAAGGGAAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTA AGGAATACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTTCTT T GGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAG T TACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACAAGA A TGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGT
ATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 27 >Wheat (tetrapioid) PHS1 5A (Svevo reference genome) protein
MATASPPLATPFRPLAVAGGSGGLVVGARAVVPPRRGRRGFVVRSVASDREVRGPAP TEEEL SAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLILNWNATYDYYN KVNAK QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNGGLGRLA SCF LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPLKFY GK VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAGDHV KAN EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDFPS KVAV
QMNDTHPTLCIPELMRILMDVKGLSWNESWSITERTVAYTNHTVLPEALEKWSLDIM QKLLPRH VEIIERIDEELMNTIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEKKGKL LVESLESI AEADEKTESEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFTKYDPQLPRWRMAN LC VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSTIISK WIGS DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYWSPDAMFDVQVKR IH EYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYD PDIGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCILIGTLD GANVEI REEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGTGNYDELMGS LEG NEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIHEYA KDI WGISPVIMP
SEQ ID NO: 28 >Wheat (tetrapioid) PHS1 5A (Svevo reference genome) promoter (-2kb upstream of ATG)
CTAGTGGGGCCTTGGTCCAAAATAATCCATGTAATTTTTCATGTTTTCCGGCTTCGT ATTTTT TGGTAAAAACCCAAAAACAATAGAAAAGAAGAATTGGCCTCTGACACTAGATTGATAGGT TA GTCCAAATCAATATTATCAAAAATTGTGCCCAAACCATATACAAATGATATAAATATAAC TAG AATACTTCATAAATTTAGATACGTTTGATAAAGTGCCAAGAATGGAAGAGAAGTGATCAT AT GCCGCCCTGTCCATATCGGGCTCATCAGAATGTTCCCACCATCAGACTCCAGCCCCAGGA TTATTAACTTATGTCGGCGGTGCGCGGAATGGATCTTGAAGAAAGCGGAATTTGTGTCGC C CTCACGTAACCAGGAGAACTTGGCAGGTTGACGCGCAATCGACCTCTCCAGCAAAGCAAG TCCTAGGTAGGCGCGCTTCAGCTCGCAGAGTAACCGTACTTCTGGAGGAGCAAGCGGGTG GTGATCTTGAGCCGCCGAGACGAGCAATCAATTACCGAGCGAGAAGCATCTGAGTGGAGA TGTTACCGAGCGTTCTGGAGCTCCAACTCTGCCGTCAGTGCATTGTCAGCTTGTGCTTTG C GTAGATTCGCCGGAAAGGATCAGGATCATGCACGCCAGAATTCCAGGTGTCAGAGACCAC CTAATGAAAACCATCGAGCTTTGATCAAAATAGCTCAAGATGGAATCTCCGATCACCTTG G GAGCGAAAATCTTACTAGTTTCGTTCAAGGCCAAGGTAGTTATATATTTGTGTGTGCAAT TA TTAATCATTGGCCGCTTAGCAAACTCTTATTGGTTCGATAAACATTAAAGTCATCCCCAC TTT GAGAAAAAATGCTACTTGCTACAAACCCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCAAGTAATAAATCCTTG CCGCCTCGTCCTTGTTTGGATGCTGCTTCTTGTCATCTCAAGGAGGCGTGTGGAGGGTGG TGTGTCCCCGGATCTTGTTCTCTCGGTTGGGTTTTTTCTATGGCAGGTTTATCTCAGTGG TA GTGTCACGTGAAGCAGATGACGCGACTATGCGTCTTGGTCCTCTGCGCTCTAATTCCGGC
AGATGAAGGTTGACCAGCTTCAGCCTCGTCGGGGAGCGTATGAGATTTGTGTTTTTT GACT CCATCTGGTCGGACCTGTGGCATTATGTAGACCCTCTCATGTATTCTTTTTCTCCTGGAC AG TAGTCTGTTACACAAATCGTGGTCGTCGGATCTTCTAGTCCACATCGGCGACTTTTCCGA C CGCCACTTCTACAAACTCCTTATTTTCAACAAGTCTCCTTCATTGAGATGGAAGTCCAGG CA ATGGCTTTGCTCACGGCACACTGCCGGCGGATGCAAGAGAAAGAAGACATTCGACACCCC TAAAACTATGAAGATGTAATTTTTAATCTTTGTAAGTGTTTTTGTGAGGATTTGTGCTAC TTTT AATATATTGCCTTTTGCCTTTTGCAAGGGAAAAAAAACTAAACCATTACACGAAGAGAAG AG CCATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACAGCACGGGGGCAATTCCCAG CCGTCGAAATGCGCTGCAACGGAACCAGCGCTCCCCGACACGGCGGCACATGAGCCACA ACTCGTACCATGCACCGTGCACCGGCGGTGGGCCCACTGGCCAGCGTCCCCACACGTGT ACCGGCGGCCACAATTACACGGCACGGCACGGCGACGGAGAGATATTTTAACGCCTTCCA ATTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCG CGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCGCCA SEQ ID NO: 29 >Wheat (tetrapioid) PHS1 5B (Svevo reference genome) genomic DNA
AACTAAACCATTACATGAGGCCATTCCCATCCGTCCAATAAATACGACAAAACGCTT TACAG CACGGGGGCAATTCCCAGCCGTCGAAATGCGCTGCAACGGAACTAGCTCTCCCAGACAGA CACAGCGGCACATGAGCCACGCCTCCCATCGCAACCGTGCACCGGCGGTGGGCCCACCG GCCAGCGCCCCCACACGTGTACCGGCGCCCAGCATTGCACGGCACGGCGACGGAGAGAT ATTTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGCAGC G GGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGACC ACTGCCACCAATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCAG CCGGCGGCGGCGCCGGAGGAGGAGGAGCGCACGGCGTGGGCGCCGCGGGCCGGGTCG CGCCGAGGCGGGGGCGCCGGGGCTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGG TGCGGGGCCCCGCCTCGACCGAGGAAGGTAAGCCGTCGCGCGCTCCCGTTGGCTGCGA CGACCCGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGTGCGACCTGAATT TCTGATCCTCTGACTAATGGTCTCTCTGTTCCCGGGTTTCGGGCTGTTGATTTTGTGCGC G CGCATTGCTAATCTACTGCTCTAGGATTTGTTAGGGTTCAGAGGAATCTTAAATTCACGA AT AAACTTCGACGAGTGTATTCTGAATTTACTGAAACTGGCTTAGTTTGGGGCGTCAAGTTA GA ATGGTGTATTTATGGCACAGTAGCAAATTCTGAGAAATGCTTCTGTGCGTGCACAGCCAC C CCTTGTGGATGATGCAGCAAGTGTCAACATACCGTCGCTACTATCTGTAGCTGTAACAAA A TCTTACGCCTTGTTTTCTGGTGGTACAGTAATATTATCATTTTACAAAGAGTGGCTGTAA CA AAATCTTACGCCTTGTTTTCTGGTGGCTTGCCATTGAGGAGTAGAATGCAAATGGCGATA C AGTATATTATCTACGATTTTGTGCAAGAATTTTCACACGTATCCCATGCAATTTTTAGTT TTG ACTCTCGACTGTTTATACCTACTCTTTGTAAAATTTTGGATCCTAATATTGTTGTTCTTG CAA CTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAATATCCA G CACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCCTAAAGGCTTAC C ATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAATGCAACATATGACT ATT ACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGTCTCAC AA TATACTTCCTTTAGCCTTACCTCAATACTTTTTGGTAACTGCGTTATGATGTTATATGGT TGT TATCTGTGTACTGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGG A CAATATGCAGAAGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGTA A GTGATTTTTGTCGGAGATTGGTTTTTAGTCCGTATCAAAATTATCGCAGGGCCATTATTT TTA TTTTGAAAACGCATTTTATTATTAATCATCAAGTTCCCTTTGTTTAAAAACAATCTGAAA AGAA GAAGTTTTTTTCCTGAGGGTTCCACCGTCTGACTTATGATCTTTATGCAGTTTTCAGAGT GA CTATGAAATTACGCCAAAACACGATCTTATTGATGCCTAGAGTGCCCTGTAGTTTAGAAA TC TAAGATTGGATATGTTACTTTTCTATTACAATGAAACAGGAACCAGATCCTGCCCTTGGC AA TGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCC A GCATGGGGATATGGACTTCGGTACAGATACGGCCTCTTTAAGCAAATCATAACAAAGGAT G GTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATGTGTCTTCTACTTGTTTACT G TTTGGAAGTGTTGATTATGTTGGTTTATTTATGTTCTAATGACCTGCATGTTAGCTTCTT TTC TATTTATAACTGATGCTGTATCTCCAGATGGGAAATCCATGGGAGATCGTAAGAAATGAT GT CTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAAACACTG G ATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACAAGACT A AAACTGCTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAATTTTGATTTGG GT GCCTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTAAATGCTGAAAAGGTA T TCTTTCGTCTACTTTATTTTGAATATAAACCATTAACTCTCAACAACTACTGATTCAAAG GCT AAACAGTGCTTTGCTAGTAGTTACTGTGTTTTCTCTTAGATGTCATTTCATAATGTGTGC AGT CAATCGTTTGATATACACAAACCTATTTGGATTTCTCCTGTAAAAATGTTGTCACTAGAT TTT TCATGAAAAAACTTATTGTATATGTAAATAGAAAAGCTGATGCTCAGGTGCTTTGTGTTC AA ATGACAAGTAAAAATGAACAGAGTTTGTGCACCCATAACTTCTTCACCTGGTGTGTGCAA TT CGTTCGGTTGTATTTATGCGTGTTCTTATTCTGCCTGAATGATTCTCAGCATATGCATCA GG CTACATTTCTCCTCTGTGTCAAGAAAAAATAAGTCTGACCAGATGCAGTCCAGTAGTCTT TA GATCTTTTCACATGGAAGCGTTACTCTATGATTTTTATTATACCTTTAGAGTGTTTGTGC ATT GTGCACATATACCATGAAGTATGGGTGTTCAAATGTGGTTCTCTCAAACTGTGCTAAACC CA CCACTGTTTGCCAGTATTCATTTTTCTGAACTAGCAACAACTTGACTGGACAAGATACAT TG TACTCACATCTGAAGCTGCTGGGTTGGACAGATTTGTCCGACCCGTCTTTGAACATCTCT A CTATTTAGTGTGCATGGGTTAAATTGTTGACTTTTCCTCGTTTGTTCAGTAGTAGGAGCA CT ATCTAATTATATTTATATTCAAGTTTTCTCTGTGTGTTACGTTATAAACTTTTAGTCATA CTTA CATTTGTGGACAAAAGTTATCATGTCTAACTTAAGTTTGTGACTGATCTTCTCATCATGC CA GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCTTGAA A CAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C TCATCCAACACTGTGCATTCCGGAGTTAATGAGAATACTGATGGATATAAAGGGATTGAG C TGGAATGAGGCTTGGAGTATCACAGAAAGGTACCACAGAAGGGTACTGCAGTTTGATGTG T TATGCTGTTCCTCCAGCTCCAATTAACTAATTGTATTGCCTCATCTGTCTTCCATTCAAT TAT GTACTGATTCTCTGATATCATTTGTAGAACCGTTGCATACACTAACCACACGGTACTTCC TG AAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCCCGACATGTTGAGA T CATAGAAACAATAGATGAGAAGGTAACAAATGTGCAGATAATATTATGCAATAAATTTCA TT CTCTGTAGTTCAACTTATGGTTTACTTATCAACATCGTTGATTTTAAAACAGCTGATGAA CAA CATCGTCTCAAAATATGGAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATAT GA GGATCTTAGACAACGTTGATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAG AG AAAAAGGGGAAGTTGCTTGTTGAATCTTTGGAGTCTATTGCTGAAGGTGACGAGAAAACT G AGTCACAAGAGGAGGAAAACATTCTATCCGAGACAGCAGAGAAAAAGGGCGGATCTGACT CTGAAGAAGCTCCTGATGCAGAAAAGGAGGATACTGTGTATGAGTTAGATCCATTTGCAA A ATACGATCCTCAGTTACCTAGAGTTGTTCGAATGGCAAATCTCTGTGTTGTTGGTGGACA TT CAGTTAATGGTGTGGCTGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCT TT TATGAGGTATTTGCACGATGATTTGGAAACCATGATCTTATTTTCTAACGGCTGCTATAA TC GCCAGGTAATATGGTATCACCATCTGCCATTTTCTTATGTTATTTGCCAGATGTGGCCTA CT AAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAATCCT G AATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATA AA CTTGCAGGACTGAAGAAGGTACTTCCAGTAATTTTAATTTCCAAATATCAGTAATCTAAT GC TTCCTTCACTTCCATGTTACCTGTGCCTTGTTTTTACATTGATGTGTGTTCATCCCGTAA TAG TTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAAGGAATAACAAGATG A AGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCCAGATGCAATGTTTG A TGTGCAGGTATATTTTGGACTAGAAACATGTGTCTCCTGTTAAATATGGTTTAACTCGGA TC TTGGTACTTGACACTTAGGTGAAAAGAATACATGAATATAAGCGGCAGCTGCTAAATATC CT TGGTATCGTTTACCGATACAAGAAGATGAAAGAAATGAGTGCCAAAGATAGAATAAAGAG C TTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTACGTACAGGCAAAG A GGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGACCCTGATGTTGGAG AT TTACTGAAGGTAAATTTGTGTCTTCAATGGCACTTATCATTCTAAATTTTCATGATCTGT CAC TAGCAATTTGACTCCGACTAGGGGTTACTGAAAAGCACTGAATGTTCCGAGTTTCAAAAG A AAACAGAGCAGTGAATGTTCTTAATGTCGGTGCTTGTAGAACTGAGAAACTGAATAGCCA C AGTTTTTTTTAGAGAACAATAGCCACAATTGTTCTGATGCTGATTTTGCTGATAACTAGA TGA GGTAAATACAAGGTGACAGGTTAAGTTCTGAGGATTTTTGTTTGTTATCCAGGTTGTATT TG TCCCAGATTATAATGTTAGCATTGCTGAGAAGCTCATTCCTGCCAGTGAATTGTCCCAGC AT ATCAGGTAATATTCTGGCTACTCTTATATAGTATGTGAATACAAAGAAAAGGAGTCTTAT CT GGAGGTGACTGATGTGGAAATTTTCATGTGCCTTTTTGTTTGCTACTCCCTCTGTCCCAT AA TATAAGAGCGCTTTTGACAGTACACTAGTGTCAAAAACGCTCTTATAATATGGGACGGAG G GAGTATAATATATACCTGGAGTCTTGCTGGGATAGTGACATCCAGAAGTGCCACGCCAGC T TCTATTTCTGTACAAGACGTGCATTTTTAACACTATTACTATCCATTCAGTACCGCTGGA ATG GAAGCTAGTGGAACAAGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACT TT GGATGGTGCGAATGTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGG TGCACATGCACCTGAAATCGCTGGTTTGAGGCAAGAGAGAGCTGAAGGAAAGGTACCACA CTTGTTTATAGAACTGTGCCTAACTATGAAACTAGTAAAGATCTCTCCCAAATGTGTGTT GT CCAAATAGTTTAAAGGAAGAAAAATTATAACTTTACCATTTTCCTTTGCACTTGTAGTTT GTG CCTGACCCGAGATTTGAAGAGGTTAAGGAATACGTCCGCAGTGGCATCTTTGGGACTAGC AACTATGATGAATTGATGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTAT TT TCTTGTTGGCAAGGATTTTCCCAGTTACATTGAATGCCAGCAAAAAGTTGATGAAGCATA CC GAGATCAGAAGGTAAGTTGTATAACTGACGCTGTTTGTTTTCTGGCTGAAATGCATACAG T GATTATGCAAACTTTCTGATATGTGATGGCTGCCATCAGTGTTTCACTGTGAACGACTTC AT
TGATTCGTAGGAAGGAATATTTCTTAGTTCAAAGTTTCAAACTGGATTATCTATAGG TGTGTT GGAAGATCATAAACAATCAGAGAATATAGTTGCTAGATGTGCTATACTATATGTTGGCTG TT GAGTGTTGATACATGAGCTTTGATGAACCTAGCTAGACCTTCAGGAAAGTGCTTCTATGC A TTCCAATAAAAAAAACTTATAACTTAACAAAACAATTAGGTTGAACCTCGCCAGGGAGCG TT TTGTTATCTGTTTTCTGAAATGAAAATCAGAAGCGGGGAGCTCCTCGATTCTAAAATGAA AA AAAAATCCAGGGATAACTGAGGGCCTGGTTTCCCTCTGTTGGAGCCCCAAAGAATTCTGT G GCCTTATTTTTGTAAGAGTATAAGATGCACTGCTCCGTTAAGTTTCTCCTTCATCCATTT GTT TTCATTGCTTCGTGACATTTCGTGTAACAGAGCATCCCATTTGTACACATTATTTTGCTA TCG TGGATATTTCCACTCTGTTTCATGTGTAACATTTTTAATTCGTTACATCCTTGCTAACTA CAC
ATAGTCTGATAGATTTTCAGAGTTTTGGTCCTCCTAATAACATAATATCCGTTTCTA CCTGCA
GCTATGGACAAGAATGTCCATCCTCAACACAGTTGGTTCCCCCAAGTTCAGCAGTGA CCGG
ACGATCCACGAGTATGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA ATTA
TTCAAGCAGCTTTCAGGAACATTCAGAGTCTATGAGGCGCGATTGGTCGCAATGTAT TGCT
CCCCCCAGTTTTGTATAGGCACATATGCAAGTTCTGACATGGACCTGAGTGCTGAGA GCAT
GCAGGGTAGTTCATCCTTAGCAATATTGCATGTCACTTAGCCCGTATTACCCAGGCT TATG
GTGTGTGTATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAACATACTTTGG TGTAT
CTGTAATGATAAATAAATGGTAAATGACTGAACAAAAAGTGTCGTTGGCGCACACAT GTAGT
AGAAGCATGTTTCTTCAGAGAGGCTGCAAAACTGAAAGCACCCTAGCTTTGAAGTGA GAAT
GAAGAAGTTACGGAAACATTTGTTTACATCCAAAGTGCAACGGGATTCGTCAAGACT CAAG
AGTTCAGAGTCTTCCGTCACGGCCATAGGGCATGTCCAACAGCAGCCCTATAATAGA GCA
GTAGCAGTGGTCTCTCACATTTTGAACCAAAAACGCAGCTCCAACGGCTGCCCTATT TGTA
AAATTTACTTCAAAATTTGTTGGACAGCCCGAAAATGCAACACCAG
SEQ ID NO: 30 >Wheat (tetrapioid) PHS1 5B (Svevo reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCAGCCGGCGGC G GCGCCGGAGGAGGAGGAGCGCACGGCGTGGGCGCCGCGGGCCGGGTCGCGCCGAGGC GGGGGCGCCGGGGCTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGCC CCGCCTCGACCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCG C GTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACC C CTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGGAAT GC AACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTT TT TGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATGCAG A AGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGC C CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTA A ATTATCCAGCATGGGGATATGGACTTCGGTACAGATACGGCCTCTTTAAGCAAATCATAA C AAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGAT CGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGA T GGGAGAAAACACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATT C CTGGCTACAAGACTAAAACTGCTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCAC AA AATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACATCTA A ATGCTGAAAAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTC T TCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTT TG AGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGA T GAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGAGAATACTGATGGATATAAA G GGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAACCAC A CGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCCC G ACATGTTGAGATCATAGAAACAATAGATGAGAAGCTGATGAACAACATCGTCTCAAAATA TG GAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACG TT GATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAAAGGGGAAGTTG CT TGTTGAATCTTTGGAGTCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGAGGA A AACATTCTATCCGAGACAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCTCCTGAT GCAGAAAAGGAGGATACTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTA C CTAGAGTTGTTCGAATGGCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGG C TGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCC TA CTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTGTAATC C TGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGA TA AACTTGCAGGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTG C TAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGT C AGCCCAGATGCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTG C TAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAAGAAATGAGTGCCAAAGATA GA ATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTACGTA CA GGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGACCCTGA T GTTGGAGATTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCATTGCTGAGAAG CT CATTCCTGCCAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAAC A AGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAAT GT GGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGA A
ATCGCTGGTTTGAGGCAAGAGAGAGCTGAAGGAAAGTTTGTGCCTGACCCGAGATTT GAA
GAGGTTAAGGAATACGTCCGCAGTGGCATCTTTGGGACTAGCAACTATGATGAATTG ATGG
GTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGG ATTTT
CCCAGTTACATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTA TGGA CAAGAATGTCCATCCTCAACACAGTTGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCC A CGAGTATGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 31 >Wheat (tetrapioid) PHS1 5B (Svevo reference genome) protein
MATASPPLATAFRPLAAGGGAGGGGAHGVGAAGRVAPRRGRRGFVVRSVASDREVRG PAST
EEELSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWN ATYDYYNKV
NAKQAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNG GLGRLA
SCFLDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVS YPVKF
YGKVVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTANNLRLWSTTVPSQNFDLGAF NAGDHA
KANEAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLN WEDFPSK
VAVQMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSL DIMQKLLP
RHVEIIETIDEKLMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKE KKGKLLVESL
ESIAEGDEKTESQEEENILSETAEKKGGSDSEEAPDAEKEDTVYELDPFAKYDPQLP RVVRMA
NLCVVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPEL SAIISK
WIGSDDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYVVSPDA MFDVQ
VKRIHEYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRI VKFITDVAA
TVNYDPDVGDLLKWFVPDYNVSIAEKLIPASELSQHISTAGMEASGTSNMKFAMNGC ILIGTLD
GANVEIREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGT SNYDELM
GSLEGNEGYGRADYFLVGKDFPSYIECQQKVDEAYRDQKLWTRMSILNTVGSPKFSS DRTIHE
YAKDIWDISPVIMP*
SEQ ID NO: 32 >Wheat (tetrapioid) PHS1 5B (Svevo reference genome) promoter (-2kb upstream of ATG)
GTTGGCTCCCACATGCTCCTCGATGATCTTCATCGGATGAACCACGATGTCGAGGAT TCAA GCAACCCCGTATACAATTCCCTTTGTCAATCGATACGTTACTTGCCCGAGATTCGATCGT C
GGTATCCCAATACCTCGCTCAATCTCATTACCGGCAAGTCACTTTACTCGTACCGTA ATGCA TGATCCCGTGACCAAACACTTGGTCACTTTGAGCTCATTATGATGATGCATTACCGAGTG G
GCCCAGAGATACCTCTCCGTCATACGGAGTGACAAATCCCAGTCTCGATCCGTGTCA ACC CAACAGACACTTTCGGAGATACCCGTAGTATACCTTTATAGTCACCCAGTTATGTTGTGA CG
TTTGGTACACCCAAAGCACTCCTACGGTGTCCGGGAGTTACACGATCTCATGGTCTA AGGA AAAGATACTTGACATTGGAAAAGCTCTAGCAAACGAACTACACGATCTTGTGCTATGCTT AG GATTGGGTCTTGTCCATCACATCATTCTCCTAATGATGTGATCCCGTTATCAACGACATC CA ATGTCCATAGTCAAGAAACCATGACTATCATTTGATCAACGAGCTAGTCAACTAGAGGCT TA CTAGGGACATGTTGTGGTCTATGTATTCACACATGTATTACGATTTCCGGATAACACAAT TA TAACATGAATAACAGACGATTATCATGAACAAGGAAATATAATAATAATCTTTTATTATT GCC TCTAGGGCATATTTCCAACAATTGCGGTGAGGCGCGTGGAGGTTGGCGTGTCCCCAAATC TAGTTATCTGGGTCAGGTTTTTTCTTTGGTAGGTTTATCTCAGTGGTAGTGTCCCGTGAA GC GGATGACACGACTATGTGTCTTGGTCCTCCATGCTCTAGTTGGGATGGAATCTCCGACCA C CTATCGAAAACCATCAAGCTTTCGCTAAAATTAGTCAAGATGGAATCTCCGATCACCTTG GG AGCGAAAATTTCATTGGAGGCCAAGGTAGTTATGTATTTGTGTGTGCAATTACCGACCAT TG GCCGCTTAGCAAACTCCTATTGGCTCGGTAAAGCNNNNNNNNNNGTTGGCGTGTCCCCAA ATCTAGTTATCTGGGTTGGGGTTTTTCTTTGGAAGGTTTATCTCAGTGGTAGTGTCCCGT GA AGCGGATGACACGGCTATGTGTCTTGGTCCTCCATGCTCTAGTTTTGACATGTGAAGGTT G GCCAACTTCGGCGTTGTCAGGGAGCATATGAGATTTGTGTTCCTTTACTTCGCCTGGTCA G ACCTACGGTATTATATAAGCCCTCTCATGTGTTTTCTTCTCTTGGGCAGCAGTCAGTTAC GC AAATCGTGGTCGCTGACATCTTCTAGTCCACATCGGCGACTTCCCGACCGCCGCTTCTAC A AGCTCCTTATTTTTCAACAAGTTTTCTCCACCGAGATGGAAGTCAAGAGGCGGCAACGGC T TTGCTCATGGCGCACTGCGGATGGATGCAGGAGAAGGAAGACATTCCACACCCCTCAAAC TATGAAGATGTAATTTTTAGTTTTTGTAAGTATTTTTGTGAGGATTTGTGCTACTTTTAA TATA TATGGCCTTTTGCCTTTTTCCAGGAAAGGAAAAACTAAACCATTACATGAGGCCATTCCC AT CCGTCCAATAAATACGACAAAACGCTTTACAGCACGGGGGCAATTCCCAGCCGTCGAAAT GCGCTGCAACGGAACTAGCTCTCCCAGACAGACACAGCGGCACATGAGCCACGCCTCCC
ATCGCAACCGTGCACCGGCGGTGGGCCCACCGGCCAGCGCCCCCACACGTGTACCGG C
GCCCAGCATTGCACGGCACGGCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGA AGC
TCCCCCATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTC GC
ATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCACCA
SEQ ID NO: 33 >Wheat (tetrapioid) PHS1 5A (Kronos) genomic DNA
GGCACGGCGACGGAGAGATATTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACT CACT CGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCA CACCACCGTCAGCGCGACCACTGCCGCCAATGGCGACCGCCTCGCCGCCGCTCGCCACC CCCTTCCGCCCGCTCGCCGTCGCCGGAGGTAGCGGCGGACTCGTCGTGGGCGCCAGGG CCGTGGTACCGCCGAGGCGGGGGCGGCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCG ATCGGGAGGTGCGGGGGCCCGCCCCGACCGAGGAAGGTAAGCGGCCGCGCGCTCCCAT TGGCTGCCACGACCTGGAATGCTGGTCGGCTGTAATGTGGTTCGGATTTTCGCTGGTGCG ACCTGAATTTGTGATCCTCTGACTGACGGCCTATGTGTTCCCCGGGTTTCGGGCTGTTGA T TTTGTGCGCGCGCATTGCTAATCTACTTCTCTAGGATTTGTTAGGGCTCAGAGCAATCTT AA ATTCGCGAATACACTTCGACGAATGTAGTCTGAATTTACTGAAACTAGCTTAGTTTGGCG CG TCAAGATAGACTGATGTATTTATGGTAGCAAATTCTGAAAACTTCTTCTGTGCGTGGACA GC CACCCCTCGTGGATGATGCAGCAAGTGCTAACACAGTGTTTGCTAGTACTTGCTAGTATC T GTAGCTGTAACAAAATCCTTCGAATCTGAACTAGTGAGATAAATCGTTTGAGATTTTTGC TG AAGATTACACAGTATCAATATTTCTGAAGGTGATTGGAACAGTATAGCATTTCAAACTGG CA ACCAAGCAGGATCAGTATACTGACGCAATTTTTTTCTGAAGGTGGTTGGAATACCGGTAT G AATTTTATAATCATTGAATTAAGGCGTAAGATTTGGTAACAAAAACTGCTTTTCACGGGA AA CGATGAAACATGGCACAATTCAACCTGACTTTCTTTTATGAAAAGGCTCTCAAGCTGCCT TG TACTCCCTCCGTCCCATAATATAAGAACGTTTTTGACATTAGTGTAGTACCAAAAACGTT CT TATATTATGGGACAGAGGGAGTATTTATTAATGGATAAAAAGAGAGGTACAGGGTTCTAG TA GTAACATATGCAAATGGTGATACAGTATATTATCTACGATTTTGTGCAAGAATTTTCACA CG AAAACAATATTCCATGCAGTTTTTAGTTTTGACTTTTTTTTTTGAATTTTCNACTTTTTT TTTTG AATTTTCACGCGCGCGGGGGGTGGGGGGGGGGGGAGAGAACTCCCCCACCTGAATATAT TTCTCGAACCCAACATACAGTACAATAGTACAAATGCCGGGGCGGCAGCAGAGAGAGAGG CAGGGGGCGACCCCCCCATGCTATGTTACATGCAGCTTTGTTCTACCCACGCAGAGCCTG TAGCCAGTTTTGACTCTTTTTGAATTGTTTGGATTGCTTACCACATGTGTCACGTGGTTT TA GTTTTGACTCTTGACTGTTTAATACCTACTCTTTTTAAGATTTTGGATTCTAATGTCATC TGT CTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAA A
CATCCAGCACCATGCAGACTTCACACCGTTATTCTCGCCAGAGCACTCTTCACCCCT AAAG GCTTACCATGCAACTGCCAAAAGTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACA TA TGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCA GG TCTCGCAATATACTTTCTTCAGCCTTATCTCATACCTTTTGGTAACTGCGTTATGATGTT ATA TGTTTTTTTTTGTACTGCAGGGAAGAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAA CA GGACAATATGCAGAAGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAG G TAAGCCATTTTTGTCAGAGAATGGTTTTTAGTCCATATCAAAATTATTGCCGAGCCATTT TTC TAAATAACATTTTATTATTATTCATGAACTTCCATATTTTTTTCAAATCTGCAAAAGGAG AAGT TTTTTTCCTGGGGGTTCCACCATCTGACTTATATTTGTTATGCAGTTTTCAGAGTGATTG ACT ATGAAATTACGCCAAAACACGAGCATTGCCTAGAGTGCCCTGTAGTTTTAGAAATGTTAA GA TTGGATATGTTACTTTTCTATTACAATGAAACAGGAACCAGATCCTGCCCTTGGCAATGG TG GACTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCAGCAT G GGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATGGTCA G GAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATATGTCTTCTACTTGTTTACTGTTTG G AAGCGTTGATTATGTTGGTTTCTGTTTTAATGATCTGCATGTTAGCTTCTTTTGCCATTT ATA ACTGATGCTGTATCTCCAGATGGGAAATCCATGGGAGATTGTAAGAAATGATGTCTCTTA TC CTCTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAAACACTGGATTGGAG GAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTACAAGACTAAAACTA C TAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAACTTTGATTTGGGAGCTTT TAA TGCTGGGGATCATGTCAAGGCCAACGAAGCACATCTAAATGCTGAAAAGGTATTCCTTTG T CTGCATTTATTTTGAATATAAATCATTAAATCTCAATAACTATTGATTTAAATGGTAAAC ACTA AACATTGTCTAGCTAGTAGTCACTGTGGTGTCCCTCAATGTCATTTTAGAATGTGCACAG TC AAATGTTTGATAACTTTGATTGTAATAATATGATAATATACAGAAACTTTGATTGACACT AGA TTTTTCATAAACATACTTATTGTATTTCCATGTAAATAGAAAAGCTGATGTTCAGAGGTT TGT GTGTCAAAATGAACAGAGTTTGTGCACACATAATATCTTCATCTGGTGTTTGCAGTTCTC TA GGTTGTGTTTGCGTTTTCTCCTGCTCTGCCTTAACTAATCTCAGCATATGCATCAGGCTA CA TTGCTCCTCTGTGTCTATAAAAAATAAGTCTGATGATATATGGTTGCAGTCCAGTAGTTC GT AGATGGCACAGCATGGAAGAACTTTTCACATGGAAGCCTTACCCTTATAATTTTAATTAT AC CTTTTGAGTGTTTGTGCATTGTGTACATACACCATTAAGTATGGGTGCTCAAAAATGGTT TA TCCCAAACTGTGCAAAAGCCGCCACTGTTTGTGTATTAATTTTTTTGAACTAGCTAAACA AC TTGACCGAACAAGATACATACAATTGCACTCACTTCTGAAGCAGCTCAGTTGGACGGATT T GTTCCAATCCCGTGCATGGGTTAAATTGTCGAGTTGTCCTCGCTTGTTCAGTAGTAGGAG T ACTATCTATTATATTTATACTAAAGTTTTCTTTTTGTGATACAGTACAAACTTTTAGTAA TACT TACATTTGCGACCACTGTGGACAAAATTTATTATGTCTAACTTAACTTTATCACTGGTCT TCT CATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTC T TCGCCTGAAACAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTT TG AGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGA T GAATGACACCCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAA G GGATTGAGCTGGAATGAGTCTTGGAGTATCACGGAAAGGTATCACAGAAGGGTACTGCAG TTTGATGTGTTATGCTGTTCCTCCAGTTCCAGTTAACTAATTTTATTTCCTCATCTGTCT TCC ATTCAATCATGTACTGATTTTCTGATATCCACATCATTTGTAGAACCGTTGCATACACTA ACC ACACAGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTAC C TCGACATGTTGAGATTATAGAAAGAATAGATGAGGAGGTATCAAATGTGCAGATAATATT AT ACAATTAATTTCATGCTCTGTAGTTCAACTTATGGTTTACTTGTCTTCATCGTTGATTTT AAA CAGCTGATGAACACCATCGTCTCAAAATATGGAACGGCAGATATTTCACTGTTAAAACAG AA GCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCTCTGTTGCTAAACTGTT TA TTAAACCAAAAGAGAAAAAGGGGAAGTTGCTTGTTGAATCTTTGGAGTCTATTGCTGAAG CT GACGAGAAAACTGAGTCAGAAGAGGAGGAAAACATTCTATCCGAGACAACAGAGAAAAAG GGCGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGATCCTGAGTATGAGTTA GATCCATTTACAAAATACGATCCTCAGTTACCTAGAGTTGTTCGAATGGCGAACCTCTGC GT TGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGTGAAATTGTTAAGCAAGA T GTGTTCAATAGCTTTTATGAGGTATTTGCATGATGATTTGGAAACCGTGATCTTATTTTC CAA TGGCTGCTATAATCGTCAGGTGATATGGTATTCACCATCTGCCATTTTCTTGTGTTCTTT GC CAGATGTGGCCTACTAAGTTCCAAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATC C GGTTTTGTAATCCTGAATTAAGTACAATAATTTCAAAATGGATAGGCTCTGATGACTGGA TT CTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAGTTTTATTCCCCAAT AA TCAGTAATCTAATGCTTCCTTCACTTCAATGTTAACTGTGCATTGTTTTTACATTGATGT GTG TTCATCCTGTAACAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAA A GGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCC C AGATGCAATGTTTGATGTGCAGGTATATTTTGTACTAGAAAACATGTGTCTTCTTATAAA TAT GGTTTAACTCAGCTCTTGGTACTGGACACTTAGGTGAAAAGAATACATGAATATAAGCGG C AGCTTCTAAATATCCTTGGTATCGTTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAA AA GATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACT T ACGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATG A TCCTGATATTGGAGATCTACTGAAGGTAAATTTGTGTCTTTGATGCCACTTATCAGTCTT AAT TTTCATGTTCTGTCACTGGCAAGTTGACTCCGACTGAAAAGCATTGAATGTTCAGAGTTT CA AAGGAAAACAGAGCAGTGAATGTTCTTAATGTCGGTGCTTGTAAAACTGAGCAACTGAAT A GCCACAGGTTTTTTAGAGAACAATATAGCCACAAATTTTTTTGACACTTTTATTGCTATC AGC ATCTGATGCTGATATTACTGATAACTAGATGAGGTAAATATAAGGTGACAGGTTAAGTCT GA GGATTTTTGTTTGTTATCCAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAG AC GCTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGGTAATATTCTGCCTACTCTTATAT AG TATGTGAATACATAAAAAAAGAATATTACCTGGAGTTGGCTGACATAGAATTTTTCATGT GC
CTTTTTGTTTGCTATAATATGTACCTGGAGTCTTACTGGGATAGTGCCATCCAGAAG TGCCA CCCCAGTTTCTATTTCTGTATAAGACGTGCATTTTTAACAGTATTACTATCCATTCAGTA CCG CTGGAATGGAAGCTAGTGGAACCAGCAACATGAAGTTTGCAATGAATGGCTGTATTCTTA T TGGAACTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTT CCTCTTTGGTGCACATGCACCTGAAATCGCTGGTTTGAGGCAAGAGAGAGCTGAAGGGAA GGTACCACACTTGTTTATATAGCTGTGCCTACTATCAAACTAGTAAAGATCTCTCCCATA TG TGCTGTCCAAATAGTTTAGCTGAAGGAAAATCATAACTTTACAAGTTTCCTTTGCACTTG TA GTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAGGAATACGTCCGCAGTGGCATCTTTGG GACTGGCAACTATGATGAATTGATGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGC A GATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTACATTGAATGCCAGGAAAAAGTTGAT GA
AGCATACCGAGATCAGAAGGTAAGTTGTATAACTGACGCTGTTTGTTTTCTGGCTGA AATG CTCACAGTGATTATGCAAACTTTCTGATATGTGATGGCTGACATCAGTGTTTCACTGTGA AC
GACTTCATTGATTCGTAGGAAGGAATATTTCTTAGTTTAAAGTTTCAAACTGAAGTA TCTATC
GGTGTGTTGGAAGATCATAAACAATCAGAGAATATAGTTGCTAGATGTGCTATATTA CTCAT CTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGATGTAGCTAGACCTTCAGGAAAGTGC T
TCTGTGCATTCCAATAAAAAAATAACTTATAACTTAACAAAACAAAACAATTAGGTT GAACCT CACCAGGGGGCCTTTTGTTATCTGTTTTCTGAAATGAAAATCAGAAGTGGGGAGCTCCTC A ATTCGAAAAAGAAAAAGAAAATCAGGGGGTAACTGAGGGTCTGGTTTCCCTCTGTTGGAG C CCCCAAGAATTCTGTGGTCTTATTTTTGTAAGAGTATAAGATGCACTGCTCCGTTAAGTT TC
TCCTTCACCCATTTGTTTTCATTGCTTCGTGACATTTCGTGTGACAGCTCAGATCAC CATAT GCATCCCATTTGTACACATTATTTTGCTACCGTGGATATTTCCACTCTGTTTCATGTGTG AC ATTTTTAATTCGTTACATTGTTACTCTAAACATTAGTCTAATAGATTTTCAGAGTTTTGA CCCT CCTAATCACATAATATCCGTTTCGACCTGCAGCTATGGACAAGAATGTCTATCCTCAACA CA
GCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTATGCCAAGGACATA TGG GGTATCAGCCCCGTCATCATGCCCTGAATAATTCAAGCAGCTTCCAGGAACCTTGAGAGT C TATGAGGCGCGATTGGTCGCTGATTTGCCTTGATTCAGCGAGAATGTATTGCTCCCCCCA G TTTTGTATAGGCATATGCAAGTTCTGACATGGACCTGAGTGCTGAGAGCATGCAGGGTAG T
TCATCCTTAGCAATATTGCATGTCAGTTAGCCAGTATTACCCAGGCTTATGGTGTGT GTATT TCGCTGAATAAAGTTGGAGCAGAGTCATCAAACAAACATACTTTGGTGTATCTGTAATGA TA AATAAATGGTACATGTGACTGAACAAAAAGTGTCGTTGGCGTGCACATGTAGTAGAAGCA T GTTTCTTCAGAGAGGGTGCAAAGCTGAAAGCACCCCAGCTTTGGAGTGAGAATGCAGAAG
TTACGAAAACATTTGTTTACATCCAAAGTGCAACGGGATTCGTCAAGAGTTCAGAGT CTTCC ATCACGGTCATAGAGCATGTCCAATAGCAGCCCTATAAACCTATAACAGAGCAGTAGCAG T GGTCCCTCACATTTTGAACCAAAAATGCAGCTCCAACGGCTGCCCTATTTGTAAAATTTA CT TCAAAATTTGTTGGACAGCCCTAAAATGCAACACCAACTGATGTAAATGTACAACTATTC GG
CTGCTGCCCTAAAATTTGCTCCAAGTGGCGAACCACTCAGGACCACAAGAAAACAGT TGTT TTACGTTGTGCCAAAAATGTTGCTTCAACTACTGC
SEQ ID NO: 34 >Wheat (tetrapioid) PHS1 5A (Kronos) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCCCCTTCCGCCCGCTCGCCGTCGCCGGA G GTAGCGGCGGACTCGTCGTGGGCGCCAGGGCCGTGGTACCGCCGAGGCGGGGGCGGC GGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGGCCCGCCCCGA CCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCATCCGCCATCGCATCAAACA T CCAGCACCATGCAGACTTCACACCGTTATTCTCGCCAGAGCACTCTTCACCCCTAAAGGC T TACCATGCAACTGCCAAAAGTGTTTTTGATTCTCTGATATTGAATTGGAATGCAACATAT GA CTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTATCCATGGAATTTTTGCAGGG AA GAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAGAAGCGTTAA A ACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGCAA T GGTGGACTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACCCTAAATTATCCA G CATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGATG G TCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGAAA TGATGTCTCTTATCCTCTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAA A CACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGGCTAC A
AGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCATCACAAAACT TTGATT TGGGAGCTTTTAATGCTGGGGATCATGTCAAGGCCAACGAAGCACATCTAAATGCTGAAA A GATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAAATTCTTCGCCTGAA A CAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGG G CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACA C CCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAGGGATTGAG C TGGAATGAGTCTTGGAGTATCACGGAAAGAACCGTTGCATACACTAACCACACAGTACTT C CTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATGTTG A GATTATAGAAAGAATAGATGAGGAGCTGATGAACACCATCGTCTCAAAATATGGAACGGC A GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTTGCTAAACTGTTTATTAAACCAAAAGAGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCAGAAGAGGAGGAAAACATTC T ATCCGAGACAACAGAGAAAAAGGGCGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTACAAAATACGATCCTCAGTTACCTAGAGT TG
TTCGAATGGCGAACCTCTGCGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGA TTCA
CAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAA GTTCC
AAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAAT TAAG
TACAATAATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAACACTGATAAACT TGCAG
GACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTAAAA GGAA
TAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAGCCC AGAT
GCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGCAGCTTCTAAAT ATCCT
TGGTATCGTTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAATAAA GAGC
TTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAGGCA AAGA
GGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATATTG GAGAT
CTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCATT CCTGC
CAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAACCAGCAA CAT
GAAGTTTGCAATGAATGGCTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGA GATC
AGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATC GCTG
GTTTGAGGCAAGAGAGAGCTGAAGGGAAGTTTGTGCCTGACCCGAGATTTGAAGAGG TTA
AGGAATACGTCCGCAGTGGCATCTTTGGGACTGGCAACTATGATGAATTGATGGGTT CTTT
GGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCC CAGT
TACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACA AGAA
TGTCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACG AGT
ATGCCAAGGACATATGGGGTATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 35 >Wheat (tetrapioid) PHS1 5A (Kronos) protein
MATASPPLATPFRPLAVAGGSGGLVVGARAVVPPRRGRRGFVVRSVASDREVRGPAP TEEEL SAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLILNWNATYDYYN KVNAK QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNGGLGRLA SCF LDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPLKFY GK VVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAGDHV KAN EAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDFPS KVAV
QMNDTHPTLCIPELMRILMDVKGLSWNESWSITERTVAYTNHTVLPEALEKWSLDIM QKLLPRH VEIIERIDEELMNTIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEKKGKL LVESLESI AEADEKTESEEEENILSETTEKKGESDSEEAPDAEKEDPEYELDPFTKYDPQLPRWRMAN LC VVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSTIISK WIGS DDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKWSLIRDKTGYVVSPDAMFDVQVKR IH EYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYD PDIGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCILIGTLD GANVEI REEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGTGNYDELMGS LEG NEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIHEYA KDI WGISPVIMP*
SEQ ID NO: 36 >Wheat (tetrapioid) PHS1 5A (Kronos) promoter (-2kb upstream of ATG)
ATAAATATAACTAGAATACTTCATAAATTTAGATACGTTTGATAAAGTGCCAAGAAT GGAAGA
GAAGTGATCATATGCCGCCCTGTCCATATCGGGCTCATCAGAATGTTCCCACCATCA GACT
CCAGCCCCAGGATTATTAACTTATGTCGGCGGTGCGCGGAATGGATCTTGAAGAAAG CGG
AATTTGTGTCGCCCTCACGTAACCAGGAGAACTTGGCAGGTTGACGCGCAATCGACC TCT
CCAGCAAAGCAAGTCCTAGGTAGGCGCGCTTCAGCTCGCAGAGTAACCGTACTTCTG GAG
GAGCAAGCGGGTGGTGATCTTGAGCCGCCGAGACGAGCAATCAATTACCGAGCGAGA AG
CATCTGAGTGGAGATGTTACCGAGCGTTCTGGAGCTCCAACTCTGCCGTCAGTGCAT TGTC
AGCTTGTGCTTTGCGTAGATTCGCCGGAAAGGATCAGGATCATGCACGCCAGAATTC CAG
GTGTCAGAGACCACCTAATGAAAACCATCGAGCTTTGATCAAAATAGCTCAAGATGG AATC
TCCGATCACCTTGGGAGCGAAAATCTTACTAGTTTCGTTCAAGGCCAAGGTAGTTAT ATATT
TGTGTGTGCAATTATTAATCATTGGCCGCTTAGCAAACTCTTATTGGTTCGATAAAC ATTAAA
GTCATCCCCACTTTGAGAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNTGGGGGGGGGGGGGGGGGGGGGGGGGCAATGCCATCCTTGG
CAAGTAATAAATCCTTGCCGCCTCGTCCTTGTTTGGATGCTGCTTCTTGTCATCTCA AGGAG
GCGTGTGGAGGGTGGTGTGTCCCCGGATCTTGTTCTCTCGGTTGGGTTTTTTCTATG GCA
GGTTTATCTCAGTGGTAGTGTCACGTGAAGCAGATGACGCGACTATGCGTCTTGGTC CTCT
GCGCTCTAATTCCGGCAGATGAAGGTTGACCAGCTTCAGCCTCGTCGGGGAGCGTAT GAG
ATTTGTGTTTTTTGACTCCATCTGGTCGGACCTGTGGCATTATGTAGACCCTCTCAT GTATT
CTTTTTCTCCTGGACAGTAGTCTGTTACACAAATCGTGGTCGTCGGATCTTCTAGTC CACAT
CGGCGACTTTTCCGACCGCCACTTCTACAAACTCCTTATTTTCAACAAGTCTCCTTC ATTGA
GATGGAAGTCCAGGCAATGGCTTTGCTCACGGCACACTGCCGGCGGATGCAAGAGAA AGA
AGACATTCGACACCCCTAAAACTATGAAGATGTAATTTTTAATCTTTGTAAGTGTTT TTGTGA
GGATTTGTGCTACTTTTAATATATTGCCTTTTGCCTTTTGCAAGGGAAAAAAAACTA AACCAT
TACACGAAGAGAAGAGCCATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACA GCAC
GGGGGCAATTCCCAGCCGTCGAAATGCGCTGCAACGGAACCAGCGCTCCCCGACACG GC
GGCACATGAGCCACAACTCGTACCATGCACCGTGCACCGGCGGTGGGCCCACTGGCC AG
CGTCCCCACACGTGTACCGGCGGCCACAATTACACGGCACGGCACGGCGACGGAGAG AT
ATTTTAACGCCTTCCAATTCAGAAGCTCCCCCATCACTCACTCGGGTACAGAGCGCA GCGG
GCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCACCGTCAGCGCGAC CA
CTGCCGCCA
SEQ ID NO: 37 >Wheat (tetrapioid) PHS1 5B (Kronos) genomic DNA
TTGCACGGCACGGCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGAAGCTCCCC CATC ACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGC GCTGCACACCACCGTCAGCGCGACCACTGCCACCAATGGCGACCGCCTCGCCGCCGCTC GCCACCGCCTTCCGCCCGCTCGCAGCCGGCGGCGGCGCCGGAGGAGGAGGAGCGCACG GCGTGGGCGCCGCGGGCCGGGTCGCGCCGAGGCGGGGGCGCCGGGGCTTCGTGGTGC GGAGCGTGGCGAGCGATCGGGAGGTGCGGGGCCCCGCCTCGACCGAGGAAGGTAAGCC GTCGCGCGCTCCCGTTGGCTGCGACGACCCGGAATGCTGGTCGGCTGTAATGTGGTTCG GATTTTCGCTGGTGCGACCTGAATTTCTGATCCTCTGACTAATGGTCTCTCTGTTCCCGG G TTTCGGGCTGTTGATTTTGTGCGCGCGCATTGCTAATCTACTGCTCTAGGATTTGTTAGG G TTCAGAGGAATCTTAAATTCACGAATAAACTTCGACGAGTGTATTCTGAATTTACTGAAA CT GGCTTAGTTTGGGGCGTCAAGTTAGAATGGTGTATTTATGGCACAGTAGCAAATTCTGAG A AATGCTTCTGTGCGTGCACAGCCACCCCTTGTGGATGATGCAGCAAGTGTCAACATACCG T CGCTACTATCTGTAGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGTACAGTAATAT TA TCATTTTACAAAGAGTGGCTGTAACAAAATCTTACGCCTTGTTTTCTGGTGGCTTGCCAT TG AGGAGTAGAATGCAAATGGCGATACAGTATATTATCTACGATTTTGTGCAAGAATTTTCA CA CGTATCCCATGCAATTTTTAGTTTTGACTCTCGACTGTTTATACCTACTCTTTGTAAAAT TTT GGATCCTAATATTGTTGTTCTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGAT TC TTCCGCCATCGCGTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGA G CACTCTTCACCCCTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATC AT CAATTGGAATGCAACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCT GT CCATGGAGTTTTTGCAGGTCTCACAATATACTTCCTTTAGCCTTACCTCAATACTTTTTG GTA ACTGCGTTATGATGTTATATGGTTGTTATCTGTGTACTGCAGGGAAGAGCTCTCACAAAT GC TATTGGCAATCTAGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTGGACAGAA C CTAGAGGATGTTGCTAGTCAGGTAAGTGATTTTTGTCGGAGATTGGTTTTTAGTCCGTAT CA AAATTATCGCAGGGCCATTATTTTTATTTTGAAAACGCATTTTATTATTAATCATCAAGT TCC CTTTGTTTAAAAACAATCTGAAAAGAAGAAGTTTTTTTCCTGAGGGTTCCACCGTCTGAC TT ATGATCTTTATGCAGTTTTCAGAGTGACTATGAAATTACGCCAAAACACGATCTTATTGA TG CCTAGAGTGCCCTGTAGTTTAGAAATCTAAGATTGGATATGTTACTTTTCTATTACAATG AAA CAGGAACCAGATCCTGCCCTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTG GATTCTATGGCAACCCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGATACGGC C TCTTTAAGCAAATCATAACAAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGG T AGTGATGTGTCTTCTACTTGTTTACTGTTTGGAAGTGTTGATTATGTTGGTTTATTTATG TTC
TAATGACCTGCATGTTAGCTTCTTTTCTATTTATAACTGATGCTGTATCTCCAGATG GGAAAT CCATGGGAGATCGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTT G AAGGCACTGATGGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATG ATGTTCCTATTCCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAA CA GTACCATCACAAAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGCCAAT G
AAGCACATCTAAATGCTGAAAAGGTATTCTTTCGTCTACTTTATTTTGAATATAAAC CATTAA
CTCTCAACAACTACTGATTCAAAGGCTAAACAGTGCTTTGCTAGTAGTTACTGTGTT TTCTC
TTAGATGTCATTTCATAATGTGTGCAGTCAATCGTTTGATATACACAAACCTATTTG GATTTC
TCCTGTAAAAATGTTGTCACTAGATTTTTCATGAAAAAACTTATTGTATATGTAAAT AGAAAA
GCTGATGCTCAGGTGCTTTGTGTTCAAATGACAAGTAAAAATGAACAGAGTTTGTGC ACCC
ATAACTTCTTCACCTGGTGTGTGCAATTCGTTCGGTTGTATTTATGCGTGTTCTTAT TCTGC
CTGAATGATTCTCAGCATATGCATCAGGCTACATTTCTCCTCTGTGTCAAGAAAAAA TAAGT
CTGACCAGATGCAGTCCAGTAGTCTTTAGATCTTTTCACATGGAAGCGTTACTCTAT GATTT
TTATTATACCTTTAGAGTGTTTGTGCATTGTGCACATATACCATGAAGTATGGGTGT TCAAAT
GTGGTTCTCTCAAACTGTGCTAAACCCACCACTGTTTGCCAGTATTCATTTTTCTGA ACTAG
CAACAACTTGACTGGACAAGATACATTGTACTCACATCTGAAGCTGCTGGGTTGGAC AGAT
TTGTCCGACCCGTCTTTGAACATCTCTACTATTTAGTGTGCATGGGTTAAATTGTTG ACTTTT
CCTCGTTTGTTCAGTAGTAGGAGCACTATCTAATTATATTTATATTCAAGTTTTCTC TGTGTG
TTACGTTATAAACTTTTAGTCATACTTACATTTGTGGACAAAAGTTATCATGTCTAA CTTAAG
TTTGTGACTGATCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGGGACGA ATCAT
CAGAGGGGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGC AGGA
CATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCC CTCT
AAAGTTGCAGTTCAGATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATG AGAA
TACTGATGGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGT ACCA
CAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCAGCTCCAATTAACTAAT TGTAT
TGCCTCATCTGTCTTCCATTCAATTATGTACTGATTCTCTGATATCATTTGTAGAAC CGTTGC
ATACACTAACCACACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTAT GCAG
AAACTTCTACCCCGACATGTTGAGATCATAGAAACAATAGATGAGAAGGTAACAAAT GTGC
AGATAATATTATGCAATAAATTTCATTCTCTGTAGTTCAACTTATGGTTTACTTATC AACATC
GTTGATTTTAAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGATAT TTCAC
TGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCT CTGTT
GCTAAACTGTTTATTAAACCTAAAGAGAAAAAGGGGAAGTTGCTTGTTGAATCTTTG GAGTC
TATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGAGGAGGAAAACATTCTATCCGA GACA
GCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGAT ACT
GTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGTTGTTCGA ATGGC
AAATCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAGATTCACAGTGA AATTG
TTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAAACC ATGAT
CTTATTTTCTAACGGCTGCTATAATCGCCAGGTAATATGGTATCACCATCTGCCATT TTCTTA
TGTTATTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAACGGAGTAACTCC CAGG
CGTTGGATCCGGTTTTGTAATCCTGAATTAAGTGCAATCATTTCAAAATGGATAGGC TCTGA
TGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAAT TTTAA
TTTCCAAATATCAGTAATCTAATGCTTCCTTCACTTCCATGTTACCTGTGCCTTGTT TTTACA
TTGATGTGTGTTCATCCCGTAATAGTTTGCTGATGATGAGGATCTGCAATCAGAATG GCGT
ACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGA TATG TTGTCAGCCCAGATGCAATGTTTGATGTGCAGGTATATTTTGGACTAGAAACATGTGTCT CC TGTTAAATATGGTTTAACTCGGATCTTGGTACTTGACACTTAGGTGAAAAGAATACATGA AT
ATAAGCGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAAG AAAT
GAGTGCCAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAA AGCT
TTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCT ACTG
TAAATTATGACCCTGATGTTGGAGATTTACTGAAGGTAAATTTGTGTCTTCAATGGC ACTTA
TCATTCTAAATTTTCATGATCTGTCACTAGCAATTTGACTCCGACTAGGGGTTACTG AAAAG
CACTGAATGTTCCGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGTCGGT GCTT
GTAGAACTGAGAAACTGAATAGCCACAGTTTTTTTTAGAGAACAATAGCCACAATTG TTCTG
ATGCTGATTTTGCTGATAACTAGATGAGGTAAATACAAGGTGACAGGTTAAGTTCTG AGGAT
TTTTGTTTGTTATCCAGGTTGTATTTGTCCCAGATTATAATGTTAGCATTGCTGAGA AGCTCA
TTCCTGCCAGTGAATTGTCCCAGCATATCAGGTAATATTCTGGCTACTCTTATATAG TATGT
GAATACAAAGAAAAGGAGTCTTATCTGGAGGTGACTGATGTGGAAATTTTCATGTGC CTTTT
TGTTTGCTACTCCCTCTGTCCCATAATATAAGAGCGCTTTTGACAGTACACTAGTGT CAAAA
ACGCTCTTATAATATGGGACGGAGGGAGTATAATATATACCTGGAGTCTTGCTGGGA TAGT
GACATCCAGAAGTGCCACGCCAGCTTCTATTTCTGTACAAGACGTGCATTTTTAACA CTATT
ACTATCCATTCAGTACCGCTGGAATGGAAGCTAGTGGAACAAGCAACATGAAGTTTG CAAT GAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGGT T GGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGTTTGAGGCAA G
AGAGAGCTGAAGGAAAGGTACCACACTTGTTTATAGAACTGTGCCTAACTATGAAAC TAGT
AAAGATCTCTCCCAAATGTGTGTTGTCCAAATAGTTTAAAGGAAGAAAAATTATAAC TTTACC
ATTTTCCTTTGCACTTGTAGTTTGTGCCTGACCCGAGATTTGAAGAGGTTAAGGAAT ACGTC
CGCAGTGGCATCTTTGGGACTAGCAACTATGATGAATTGATGGGTTCTTTGGAAGGA AATG
AAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTACATTG AATGC
CAGCAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGACGCT GTTT
GTTTTCTGGCTGAAATGCATACAGTGATTATGCAAACTTTCTGATATGTGATGGCTG CCATC
AGTGTTTCACTGTGAACGACTTCATTGATTCGTAGGAAGGAATATTTCTTAGTTCAA AGTTT
CAAACTGGATTATCTATAGGTGTGTTGGAAGATCATAAACAATCAGAGAATATAGTT GCTAG
ATGTGCTATACTATATGTTGGCTGTTGAGTGTTGATACATGAGCTTTGATGAACCTA GCTAG
ACCTTCAGGAAAGTGCTTCTATGCATTCCAATAAAAAAAACTTATAACTTAACAAAA CAATTA
GGTTGAACCTCGCCAGGGAGCGTTTTGTTATCTGTTTTCTGAAATGAAAATCAGAAG CGGG
GAGCTCCTCGATTCTAAAATGAAAAAAAAATCCAGGGATAACTGAGGGCCTGGTTTC CCTC
TGTTGGAGCCCCAAAGAATTCTGTGGCCTTATTTTTGTAAGAGTATAAGATGCACTG CTCC
GTTAAGTTTCTCCTTCATCCATTTGTTTTCATTGCTTCGTGACATTTCGTGTAACAG AGCATC
CCATTTGTACACATTATTTTGCTATCGTGGATATTTCCACTCTGTTTCATGTGTAAC ATTTTT
AATTCGTTACATCCTTGCTAACTACACATAGTCTGATAGATTTTCAGAGTTTTGGTC CTCCTA
ATAACATAATATCCGTTTCTACCTGCAGCTATGGACAAGAATGTCCATCCTCAACAC AGTTG
GTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTATGCCAAGGACATATGGG ATA
TCAGCCCCGTCATCATGCCCTGAATTATTCAAGCAGCTTTCAGGAACATTCAGAGTC TATG AGGCGCGATTGGTCGCAATGTATTGCTCCCCCCAGTTTTGTATAGGCACATATGCAAGTT C
TGACATGGACCTGAGTGCTGAGAGCATGCAGGGTAGTTCATCCTTAGCAATATTGCA TGTC
ACTTAGCCCGTATTACCCAGGCTTATGGTGTGTGTATTTCGCCGAATAAAGTTGGAG CAGA
ATCATCAAACAAACATACTTTGGTGTATCTGTAATGATAAATAAATGGTAAATGACT GAACAA AAAGTGTCGTTGGCGCACACATGTAGTAGAAGCATGTTTCTTCAGAGAGGCTGCAAAACT G AAAGCACCCTAGCTTTGAAGTGAGAATGAAGAAGTTACGGAAACATTTGTTTACATCCAA AG
TGCAACGGGATTCGTCAAGA
SEQ ID NO: 38 >Wheat (tetrapioid) PHS1 5B (Kronos) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCAGCCGGCGGC G GCGCCGGAGGAGGAGGAGCGCACGGCGTGGGCGCCGCGGGCCGGGTCGCGCCGAGGC GGGGGCGCCGGGGCTTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGCC
CCGCCTCGACCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCA TCGC
GTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTC ACCC
CTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATCAATTGG AATGC
AACATATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGA GTTTT
TGCAGGGAAGAGCTCTCACAAATGCTATTGGCAATCTAGAGCTAACAGGACAATATG CAGA
AGCGTTAAAACAACTTGGACAGAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCC TGCC
CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCAACC CTAA
ATTATCCAGCATGGGGATATGGACTTCGGTACAGATACGGCCTCTTTAAGCAAATCA TAAC
AAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGA GAT
CGTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCAC TGAT
GGGAGAAAGCACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCT ATT
CCTGGCTACAAGACTAAAACTACTAATAATCTTCGTCTTTGGTCAACAACAGTACCA TCACA
AAATTTTGATTTGGGTGCCTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACA TCTA
AATGCTGAAAAGATATGCCACGTATTGTATCCAGGGGACGAATCATCAGAGGGGAAA ATTC
TTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGACATTATTTCTC GTTTT
GAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTT CAG
ATGAATGACACTCATCCAACACTGTGCATTCCGGAGTTAATGAGAATACTGATGGAT ATAAA
GGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAA CCA
CACGGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCT ACCC
CGACATGTTGAGATCATAGAAACAATAGATGAGAAGCTGATGAACAACATCGTCTCA AAATA
TGGAACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGA CAAC
GTTGATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAAAGGGG AAGTT
GCTTGTTGAATCTTTGGAGTCTATTGCTGAAGGTGACGAGAAAACTGAGTCACAAGA GGAG GAAAACATTCTATCCGAGACAGCAGAGAAAAAGGGCGGATCTGACTCTGAAGAAGCTCCT GATGCAGAAAAGGAGGATACTGTGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAG T
TACCTAGAGTTGTTCGAATGGCAAATCTCTGTGTTGTTGGTGGACATTCAGTTAATG GTGTG
GCTGAGATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATG TGGCC
TACTAAGTTCCAAAACAAAACAAACGGAGTAACTCCCAGGCGTTGGATCCGGTTTTG TAAT
CCTGAATTAAGTGCAATCATTTCAAAATGGATAGGCTCTGATGACTGGATTCTGAAC ACTGA
TAAACTTGCAGGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCG TACT
GCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATAT GTTG
TCAGCCCAGATGCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAGCGGC AGCT
GCTAAATATCCTTGGTATCGTTTACCGATACAAGAAGATGAAAGAAATGAGTGCCAA AGATA
GAATAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTT ACGTA
CAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGAC CCTG
ATGTTGGAGATTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCATTGCTG AGAAG
CTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGT GGAA
CAAGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTG CGAAT
GTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCA CCT
GAAATCGCTGGTTTGAGGCAAGAGAGAGCTGAAGGAAAGTTTGTGCCTGACCCGAGA TTT GAAGAGGTTAAGGAATACGTCCGCAGTGGCATCTTTGGGACTAGCAACTATGATGAATTG A
TGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCA AGGA
TTTTCCCAGTTACATTGAATGCCAGCAAAAAGTTGATGAAGCATACCGAGATCAGAA GCTAT GGACAAGAATGTCCATCCTCAACACAGTTGGTTCCCCCAAGTTCAGCAGTGACCGGACGA TCCACGAGTATGCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 39 >Wheat (tetrapioid) PHS1 5B (Kronos) protein
MATASPPLATAFRPLAAGGGAGGGGAHGVGAAGRVAPRRGRRGFVVRSVASDREVRG PAST
EEELSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIINWN ATYDYYNKV
NAKQAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNG GLGRLA
SCFLDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVS YPVKF
YGKVVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAF NAGDHA
KANEAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLN WEDFPSK
VAVQMNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSL DIMQKLLP
RHVEIIETIDEKLMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKE KKGKLLVESL
ESIAEGDEKTESQEEENILSETAEKKGGSDSEEAPDAEKEDTVYELDPFAKYDPQLP RWRMA
NLCVVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPEL SAIISK
WIGSDDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYVVSPDA MFDVQ
VKRIHEYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRI VKFITDVAA
TVNYDPDVGDLLKVVFVPDYNVSIAEKLIPASELSQHISTAGMEASGTSNMKFAMNG CILIGTLD
GANVEIREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGIFGT SNYDELM
GSLEGNEGYGRADYFLVGKDFPSYIECQQKVDEAYRDQKLWTRMSILNTVGSPKFSS DRTIHE
YAKDIWDISPVIMP*
SEQ ID NO: 40 >Wheat (tetrapioid) PHS1 5B (Kronos) promoter (-2kb upstream of ATG)
CCCACCCGGTGGACCCCCGGGACCCTTCCGGTGGTTCCAGTACAATACCGGTGACCC CC
GAAACTTTCCCGATGGCCGAAACCTGACTTCCTATATATAATTCTTTACCTCCAAAC CATTTT
GGAACTCCTCGAACGTCCGGGATCTCATCCGGGACTCCGAACAACTTTCGGTTTACT GCAT
ACTCATATCTCTACAACCCTAGCGTCACCGAACCTTAAGTGTGTAGACCCTACGGAT TCGG
GAGACACGCAGACATGACCGAGACGGCTCTCCGGTCAATAACCAACAGCGGGATCTA GAT
ACCCATGTTGGCTCCCACATGCTCCTCGATGATCTTCATCGGATGAACCACGATGTC GAGG
ATTCAAGCAACCCCGTATACAATTCCCTTTGTCAATCGATACGTTACTTGCCCGAGA TTCGA
TCGTCGGTATCCCAATACCTCGCTCAATCTCATTACCGGCAAGTCACTTTACTCGTA CCGTA
ATGCATGATCCCGTGACCAAACACTTGGTCACTTTGAGCTCATTATGATGATGCATT ACCGA
GTGGGCCCAGAGATACCTCTCCGTCATACGGAGTGACAAATCCCAGTCTCGATCCGT GTC
AACCCAACAGACACTTTCGGAGATACCCGTAGTATACCTTTATAGTCACCCAGTTAT GTTGT
GACGTTTGGTACACCCAAAGCACTCCTACGGTGTCCGGGAGTTACACGATCTCATGG TCTA
AGGAAAAGATACTTGACATTGGAAAAGCTCTAGCAAACGAACTACACGATCTTGTGC TATG
CTTAGGATTGGGTCTTGTCCATCACATCATTCTCCTAATGATGTGATCCCGTTATCA ACGAC
ATCCAATGTCCATAGTCAAGAAACCATGACTATCATTTGATCAACGAGCTAGTCAAC TAGAG GCTTACTAGGGACATGTTGTGGTCTATGTATTCACACATGTATTACGATTTCCGGATAAC AC
AATTATAACATGAATAACAGACGATTATCATGAACAAGGAAATATAATAATAATCTT TTATTAT
TGCCTCTAGGGCATATTTCCAACAATTGCGGTGAGGCGCGTGGAGGTTGGCGTGTCC CCA
AATCTAGTTATCTGGGTCAGGTTTTTTCTTTGGTAGGTTTATCTCAGTGGTAGTGTC CCGTG
AAGCGGATGACACGGCTATGTGTCTTGGTCCTCCATGCTCTAGTTTTGACATGTGAA GGTT
GGCCAACTTCGGCGTTGTCAGGGAGCATATGAGATTTGTGTTCCTTTACTTCGCCTG GTCA
GACCTACGGTATTATATAAGCCCTCTCATGTGTTTTCTTCTCTTGGGCAGCAGTCAG TTACG
CAAATCGTGGTCGCTGACATCTTCTAGTCCACATCGGCGACTTCCCGACCGCCGCTT CTAC
AAGCTCCTTATTTTTCAACAAGTTTTCTCCACCGAGATGGAAGTCAAGAGGCGGCAA CGGC
TTTGCTCATGGCGCACTGCGGATGGATGCAGGAGAAGGAAGACATTCCACACCCCTC AAA
CTATGAAGATGTAATTTTTAGTTTTTGTAAGTATTTTTGTGAGGATTTGTGCTACTT TTAATAT
ATATGGCCTTTTGCCTTTTTCCAGGAAAGGAAAAACTAAACCATTACATGAGGCCAT TCCCA
TCCGTCCAATAAATACGACAAAACGCTTTACAGCACGGGGGCAATTCCCAGCCGTCG AAAT
GCGCTGCAACGGAACTAGCTCTCCCAGACAGACACAGCGGCACATGAGCCACGCCTC CC
ATCGCAACCGTGCACCGGCGGTGGGCCCACCGGCCAGCGCCCCCACACGTGTACCGG C
GCCCAGCATTGCACGGCACGGCGACGGAGAGATATTTTTAACGCCTTCCAATTCAGA AGC
TCCCCCATCACTCACTCGGGTACAGAGCGCAGCGGGCCGCGTGACTCTCGCGTCGTC GC
ATCGCGCGCGCTGCACACCACCGTCAGCGCGACCACTGCCACCA
SEQ ID NO: 41 >Barley PHS1 (Morex reference genome) genomic DNA
AAAAAAAGTGCTAAGACTTCTTAAAGAAACAGGGCGTGAGATGGAAGTGCAGAGGCG GCA GCGAATTAGCTCACGGGGCACTACCGGTGGATGTAGGAAAAAGAAGGCATTTGACACCTC
TCAAAACTATGAAGATGTAAATTTTAATTTCTAAGTGTTTTTGTGAGGATTTGTGCT ATTTAAT ACTCCCTCAATTCCAAAATAAATGTTTTAATTTTGTATTATCTCTAATACAAAGTTATAC TAAC TTTAAGATACCTATTTTAAAACAGAGGAAATATATGCATTTTGCCTTTCTCACAAAAAAA ACC CCTAAACCATCACACGAGGGGAAGAGCCATTCCCAGCCGTCCAATAAACACGACAAAAAG
CTGTACAGCACGGGAGCAATTCCCAGCCGTCGAAATGCGCTGCAACGGAAGTAGCGC TCC
CTGACACAGCGGCACATGAGCCACGCCTCGCACCACCGCACCGTGCACCGGCGCGGG G CCCACCGGCCAGCGTCCCCACACGCGTACCGGCGGCCAGCATTGCGCGGCACGGCGAC
GGAGAGATATTTTTAACGCCTTCCAGTTCGGAAGCGCCCCCCATCACTCACTCGGGT ACAG GGCGGGCCGCGTGAGTCTCGCGTCGTCGCATCGCGCGCGCTGCACACCCACCGTCGTCA GCGCGGACCACCGGCGCCCCACCCAATGGCGACCGCCTCGCCGCCGCTCGCCACCGCC
TTCCGCCCGCTCGCCGCCGCCGGAGGCAGCGGGGGGCTCGTCGTGGGCGCCGGGGCC GGGGCGGCGCCGAGGCGGGGGCGGCGGGGGCTCGTGGTGCGGAGCGTGGCGAGCGAT
CGGGAGGTGCGGGGCCCCGCCTCCACCGAGGAAGGTAAGCGGCCGCGCGCTCCCGTT G GCAGCGACGACACGGAATGCCGGTCGGCTGTAATGTGGTTTGGGTTCCCGCTGGTGCGG
CCCGTGAATTCGTGATCCTCTGACTGATTGTGCTCCCGGGTTTCGGTGGCGCGGGAG CTT
TGGGTGGGCGCGCGCATTGCTAATGTACTGCTCTAGGATTTGTTAGGGTTCAGAGGA AAC TTAAATTCGCGAATAAACTTCAACGAATGTATTCTGAGTTTCTGAAGCTGGCTTAGTTTG GC
GCGTCAAGTTAGAATGTGTATTTATGGTAGCAAATTCTGAGAAATTCTTCTATGCGT GGAGA GCCACCCCCCTGTGGATGCTGCAGCAAGCGCTAATATACTGTTTGCTAGTATCTGTAGCT G
TAACAAAATCCTTCAAATCTGAACTAGCGAAATAAATATGAGATTTTTGCTGAAGAT TATACA GTGTCAATATTTCTGAAGGTGGTTGGGACAGTATAGCACTTCAAACCAGCAACCAGCAGG A TCAGTATACCGACTCATCTTTTTTTTCTGAAGGTGGTTGGAATACCGGTATCGTTTTTAT AAT TATTAAATTAAGGCGTAAGATTTGGTAGCAAAAACTATTTTCACCAGAAACGATGAAACA TG
GCACAATTCAACCTGAGTACCTGACTACCTTTTTTGATGAAAAGGCTATGGGAGCTC CCTTT
CATTTATTAATTAATGGATAAAAAGAGAGGTGGAGGGTTCTAGTAGAATGCAAATGG TGATA CAGTATATTATCTACTATTTTGTACAAGATTTTTCACACGAAAACAATATCCCATGCAGT TTT
TTGTTTCGACTCTTGACTGTTTAATACCTACTCTTTATAAGGTTTCGGATCCTAATA TTGCTG TTCTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTC AA ATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCTTAA A GGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATAATGAATTGGAATGCAAC AT
ATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTT TGCAG GTCTCGCAATATACTTCCTTTAGCTTTATCTCATACCTTTTGGTAACTGCGTTACGATGT TAT
ATGGTTATTTTTTGTGTACTGCAGGGAAGAGCTCTTACAAATGCTATTGGCAACCTA GAACT AACAGGACAATATGCAGAAGCATTAAAACAACTTGGACATAACCTAGAGGATGTTGCTAG T
CAGGTAAGCGTTTTTTGTCAGAGATTGGTTTTTAGTCCGTATATAAATTATCATTGA ACCATT ATCTAAAAAACACATTTTATTATTAATCATCAAGTTCCTTTTGTTTAAAAACTATCTGAA AAGA AGAAGTTTTTTTCCTGGGGGCCACCATCTGACTTATATTCTTTATGCGTTTTCCAGAGTG AG TGACTATAAAATTACGCCAAAACACGAGCTTATTGATGCCTAGAGTGCCCTGTAGTTTAT AA ATCTTAAGATTGGATATGATACTTTTCTATTACAATGAAACAGGAACCAGATCCTGCCCT TG GCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTCCTGGATTCTTTGGCAACCCTAAATT A TCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAA G GATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATATGTCTTCTACTTGTT T ACTGTTTGGAATTGTTGATTATGTTGGTTTATGTTTTAATGACCTGCATGTTAGCTTCTT TTG CCATTAACTGATGCTGTATCTCCAGATGGGAAATCCATGGGAGATTGTAAGAAATGATGT C TCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGAAAACACTGG A TTGGAGGAGAGAATATCAAGGCTGTGGCACATGACGTTCCTATTCCTGGCTACAAGACTA A AACTACTAACAATCTTCGTCTTTGGTCGACAACAGTACCATCACAAAATTTTGATTTGGG AG CTTTTAATGCTGGGGACCATGCCAAGGCCAATGAAGCGCATCTAAATGCTGAAAAGGTAT T CTTTCGTCTACATTTTTTTTTGAATATAAACCATTAACTCTCAATAACTATCGATTCAAA GGCT AAACAGTGTTTTGCTAGTAGTTACTGTGTTTTCATTTTAGAATGTGTGCAGTCAATCTTT TAA TAACTTTGATTGTAATAATATACTGATATACACAAACCTATTTGAATTTCTCGTGTACAA ATG TCGTCACTAGATTTTTCATGAAAATACTTATTGTATTTTCATGTAAATAGAAAAGCTGAT GGT TAGATGTTTTGTGTCAAGTGACAAATGAAAATGAACAGTGTTTGTACACCCATAACTTCT TC ACATGGTGTTTGCAATTCTTTCGGTTGTATTTATGTGTTTTCTTATTCTGAGTTAATTAA TCTC AGCATATGCATCAGGCTACATTACTCCTGTGTGTCAAGAAAAAATAAGTCTGACGAGGTG C AGTCCAGTAGTGTGTAGATGGCACATTGTAGAATAGCTTTTCTCATGGAAGCGTTACTTT AT GATTTTTGTTATACCTTTTGAGTGTTTGTGCATTGTGCACATATACCATTAAGTATGAGT GTT CAAAAGTGGTTCTCTCAAACTGTGCAAAACCCGCCACTGTTTGCGAGTATTCATTTATCT GA ACTAGTTAAACAACTTGACTGGACAAGATGCATTGTACTGACATCTGAAGCTGCCGGGTT G GACAGATTTGTGCTCACATCTATTTAGCGTACATGGATTAAATTGTTCAATTTTCCTCAT TTG TTTAGCTTGAGGAGTACTATCTAATTATATTTATATTCAAATTTTCTTTGTGTGTTACGT TATA AACTTCTAGTCATACTTACATTTGTGGCCATTGTGGACAAAATTTATCATGTCTAACTTA AGT TTGTGATTGGTCTTCTCATCATGCCAGATATGCCACGTATTGTATCCAGGTGACGAATCA TC AGAGGGGAAAATTCTTCGCTTGAAACAACAATATACATTATGCTCGGCCTCCCTGCAGGA C ATCATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCT A AAGTTGCAGTCCAGATGAATGACACTCATCCAACACTGTGCATTCCCGAGTTAATGAGAA T ACTGATGGATATAAAGGGATTGAGCTGGAATGAGGCTTGGAGTATCACAGAAAGGTACCA C AGAAGGGTACTGCAGTTTGATGTGTTATATGCTGTTCCTCCAGTTCTATTTACCTAATTT TAT TTCCTCATCTGTCTTCCATTCAATTATGTACTGATTTTCTGATATCATTTGTAGAACCGT TGC ATACACTAACCACACGGTATTGCCTGAAGCTCTAGAGAAGTGGAGCTTGGACATTATGCA G AAACTTCTACCTCGACATGTTGAGATCATAGAAACAATAGATGAGGAGGTAACAAATGTG C AAATAATATTATACAATTAATTTCATGCTCTGTAGTTCAACTTATGGTTTACTTATCTTA ATCA TTGATTTAAAAACAGCTGATGAACAACATCGTCTCAAAATATGGAACGGCAGATATTTCA CT
GTTAAAGCAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTTCCAGCCTC TGTTG CTAAACTGTTTATTAAACCTAAAGAGAAAAGGGGAAAGTTGCTTGTCGAATCTTTGGAGT CT ATTGCTGAAGCTGACGAGAAAACTGAGTCACAAGAGGTGGAAAACATTCTATCTGAGACA A CAGAGAAAAAGGCTGAATCTGACTCTGAAGAAGCTCCTGATGCAGAAAAGGAGGATCCTG AGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTTCCTAGAGTTGTTCGAATGG CA AACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAAATTCACAGTGAAATT GT TAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACGATGATTTGGAAACTGTGA TC TTATTTTCCAATGGCTGCTATAATCGTCAGGTGGTATGGTATTCACCATCTGCCATTTTG TT GTGTTCTTTGCCAGATGTGGCCTACTAAGTTCCAAAACAAAACAAATGGAGTGACCCCCA G GCGTTGGATCCGATTTTGTAATCCTGAATTAAGTACAATCATTTCAAAATGGATAGGCTC TG ATGACTGGATCCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTACTTCCAGTAGTTT T ATTTTCCATTAATCAGTAATCTAATGCTTCCTTCACTTCTATGTTACCTGTGCCTTGTTA CAT TGATGTGTGTTCATCCTGTAACAGTTTGCTGATGATGAGGATCTACAATCAGAATGGCGT A CTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATA T TGTCAGCCCAGATGCGATGTTTGATGTGCAGGTATATTTTATACTAGAAAACATGTGTTT TC TTATAAATATGGTTTAGCTCAGCTCTTGGTACTGGACACTTAGGTGAAAAGAATACATGA GT ATAAGCGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGCTACAAGAAGATGAAAGAAA T GAGTGCAAAAGATAGAAGAAAGAGCTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGC T TTTGCCACTTACGTGCAGGCAAAGAGGATTGTGAAGTTTATCACAGATGTTGCAGCTACT G TAAATTATGATCCTGATATTGGAGATTTACTGAAGGTAAATTTGTGTCTTTGATGTCACT TGT CGTTCTTAATTCTCATGTTCTGTCACTGGCAATTTGACTCCGACTAGAGTTACTGAAAAG CA
GTGAATGTTCAGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCGTAATGTCGGTGC TTGC
AAAACTGAGAAACTGAATAGCCACAGATTTTTATAGAGAACAATATAGCCACAATTG TTTTG
ACCCTTTTATTGCTATCAAGCATCTGATGCTTATATTACTGATAACTAGATGAGGTA AATACA
AGGAGACACGCTAAGTTCTGAGGATTTTTGTTTGTTATCCAGGTTGTATTTGTCCCA GATTA
TAATGTTAGCGTTGCTGAGACGCTCATTCCTGCCAGTGAATTGTCCCAGCATATCAG GTAA
TATTCTGGCTACTCTTTTTCATAGCATGTGAATACATAGAGAAAGAATCTTAGCTGG AGTTG
GCTGATGTGGAAATTTTCATGTGCCTTTTTGTTTGCTCTAATATATACCTGGAGTCT TACCA
GGATAGTGCCACCCCAGTTTCTATTTCTGTACAAGACGTGCATTTTTAACACTATTA CTATC
CATTCAGTACTGCTGGAATGGAAGCTAGTGGAACCAGCAACATGAAGTTTGCAATGA ATGG
TTGTCTTCTTATTGGAACTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGGTTGG AGAG
GAGAATTTTTTCCTCTTTGGTGCACATGCACCTGAAATCGCTGGTTTGAGGCAAGAG AGAG
CTGAAGGAAAGGTAGGTACCACACTTGTTTATATAGCCGTGCCCACGATCAAACTAG CAAA
GATCTCTCCCAAATGTGTGCCGCCCAAGTAGTTTATCTGAAGGAAAATCATAACTTT ACCAT
CTTCCTTTGCACTTGTAGTTTGTGCCTGACCTGAGATTTGAAGAGGTTAAGGAATAC GTCC
GCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTAATGGGTTCTTTGGAAGGAA ATGA
AGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCCAGTTACATTGA ATGCC
AGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGGTAAGTTGTATAACTGACGCCT TTTC
TTTTATGACTGAAATGCACACAGTGATTGTGTGACTTCCTGGTATGTGATGGCTGAC ATCAG
TGTTTCACTATGAACGATTTCACTGATTTGTAGGAATATTTCTTAATTCAAACTGAA GTATCT
ATTGATCTATTAGTATGTTGGAATAACATCATAAACAATCAGAGAATACGGTTGCTA GATGT
GATATATGAGCTTTAAGGAACCTAGCTAGACATTCAGGGAAGTGTTTCTATGCATTT CAATC
CAAACATAAACTGACGGAACAATTAAGTTTAAACTCACCGCGGGGCTTTGTGTTATT GGGTT
TTCTGAAATAAAATCTGGAGCGAGGAGCTCCTCAATTCGAAAAGAAGGCCGAAACAA TTCA
GTGTAACTGGGGACCTGGTTCCCTCTGTTGGAGTCTCCCAAGAATTCTGCAGTCTTG TTTC
TGGGAGAGTATAAAATGCACCACCCTGTTCAGATTCCCCTTCACCCACTTGGTTTTC TTGCT
TGGCGACATTTTATGAGACGGCTTAGATTATCATTTGTACACTATGGATATTTCCAC TCTGT
TTCAAGTGTGATTAACAGTGTTAATTTGCTACTAACCAGACATAGACTGATAGATAG TTGGA
GTGTTTAATCCTCCTAATCACATGATATCCGTCTCTACCTGCAGCTATGGACAAGAA TGTCT
ATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTAC GCC
AAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGAGTAATTCAAGCAGTTTCC AGGA
ACCTCGAGTCTGTGAGGCGTCACAGATTTGCCTCACTCCTCAGTGAGAGGGTCTTGT TGG
GCCCAGTTTTGTATAGGCATATGCAAGCTCTGACATGAACCTGAGAGCTGAGAGCAT GCAG
GATAGTTCATCTTTAACAAACTTGCATGCCGGTTAGCCAGTGTCACCCGGGCTTATA GTGT
GCGAATTTCGCCGAATAATGTTGGAGCAGAATCATCAAACAAACAAACTTTGGTGTA TCTGT
GATGATAAATAAATGGCAAATGATTGAATGAGAAGTGTCACTGGCGCACACATGTAG AAGC
ATGTTTCTTCGGAGAGGGTGCAAAGCGGAAACACCCCAGCTTTGAAGTGAGAATGCA GAA
ATTACTGCCACGTTTGGTACCATGCATCATCAGGGCAGTCAGGCATACTAAATGCAG TTTT
GGCCTGTTTGGTTGACCGAGTTGCATCAAAATCCTGGTCCGCATGAATATTACTCCC TCCG
TTTTCTAAATATAAATCTTTTTAAAGATTTCACTAAGAGACTACATACATATGTATA TGGATAT
GCTTTAGAAAGTAGATTCATTCATTTTACTTTGTATGTAGTCCTTTAATCAAATCTC TAAAAA
GACTTATATTTAGGAACCGAGGGAGTAAAACACCTCTGGTCAAGAGATGTTTGAATC GGCC
GTTTTTTCACAGCCAGGCTCAAGCCATGCACCTCGAGGCATGTGGGCCATACCTGGC CAA
ACGGGCCGGGCCGGGCCGGGCCGGGCCGGCCTGGCCCGGCCATAAACAGGCCGACAA
GACCCGCCCAGACCCGAATATTAGCCGGACCGTGCCGTGCCGGCTCATGTACGCAGC CTT
TCGGCCCAGGCACGACACGGGA
SEQ ID NO: 42 >Barley PHS1 (Morex reference genome) CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGCCGGA G GCAGCGGGGGGCTCGTCGTGGGCGCCGGGGCCGGGGCGGCGCCGAGGCGGGGGCGG CGGGGGCTCGTGGTGCGGAGCGTGGCGAGCGATCGGGAGGTGCGGGGCCCCGCCTCCA CCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGCGTCAAATA T CCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCTTAAAGGC T TACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATAATGAATTGGAATGCAACATAT GA CTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTTTGCAGGG AA GAGCTCTTACAAATGCTATTGGCAACCTAGAACTAACAGGACAATATGCAGAAGCATTAA AA
CAACTTGGACATAACCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGCCCTTGGC AATG GTGGTCTAGGCCGTCTAGCGTCCTGTTTCCTGGATTCTTTGGCAACCCTAAATTATCCAG C
ATGGGGATATGGACTTCGGTACAGATATGGCCTCTTTAAGCAAATCATAACAAAGGA TGGT
CAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATTGTAAGA AAT
GATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGAGA AAAC
ACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGACGTTCCTATTCCTGGCT ACAA
GACTAAAACTACTAACAATCTTCGTCTTTGGTCGACAACAGTACCATCACAAAATTT TGATTT
GGGAGCTTTTAATGCTGGGGACCATGCCAAGGCCAATGAAGCGCATCTAAATGCTGA AAA
GATATGCCACGTATTGTATCCAGGTGACGAATCATCAGAGGGGAAAATTCTTCGCTT GAAA
CAACAATATACATTATGCTCGGCCTCCCTGCAGGACATCATTTCTCGTTTTGAGTCG AGGG
CTGGTGATTCTCTCAACTGGGAGGACTTCCCCTCTAAAGTTGCAGTCCAGATGAATG ACAC
TCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATATAAAGGGATT GAGC
TGGAATGAGGCTTGGAGTATCACAGAAAGAACCGTTGCATACACTAACCACACGGTA TTGC
CTGAAGCTCTAGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCGACATG TTGA
GATCATAGAAACAATAGATGAGGAGCTGATGAACAACATCGTCTCAAAATATGGAAC GGCA
GATATTTCACTGTTAAAGCAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGAT CTTCC
AGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAAGGGGAAAGTTGCTTGT CGAAT
CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAACTGAGTCACAAGAGGTGGAAAACA TTCT
ATCTGAGACAACAGAGAAAAAGGCTGAATCTGACTCTGAAGAAGCTCCTGATGCAGA AAAG
GAGGATCCTGAGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTTCCTAGA GTTGT
TCGAATGGCAAACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTGGCTGAAAT TCAC
AGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTACTAAG TTCCA
AAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGATTTTGTAATCCTGAATT AAGT
ACAATCATTTCAAAATGGATAGGCTCTGATGACTGGATCCTGAACACTGATAAACTT GCAG
GACTGAAGAAGTTTGCTGATGATGAGGATCTACAATCAGAATGGCGTACTGCTAAAA GGAA
TAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATATTGTCAGCCC AGAT
GCGATGTTTGATGTGCAGGTGAAAAGAATACATGAGTATAAGCGGCAGCTGCTAAAT ATCC
TTGGTATCGTTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAAGAA AGAG
CTTTGTTCCAAGGGTATGCATATTTGGTGGGAAAGCTTTTGCCACTTACGTGCAGGC AAAG
AGGATTGTGAAGTTTATCACAGATGTTGCAGCTACTGTAAATTATGATCCTGATATT GGAGA
TTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTCAT TCCTG
CCAGTGAATTGTCCCAGCATATCAGTACTGCTGGAATGGAAGCTAGTGGAACCAGCA ACAT
GAAGTTTGCAATGAATGGTTGTCTTCTTATTGGAACTTTGGATGGTGCGAATGTGGA GATC
AGAGAAGAGGTTGGAGAGGAGAATTTTTTCCTCTTTGGTGCACATGCACCTGAAATC GCTG
GTTTGAGGCAAGAGAGAGCTGAAGGAAAGTTTGTGCCTGACCTGAGATTTGAAGAGG TTAA
GGAATACGTCCGCAGTGGCGTCTTTGGGACTAGCAACTATGATGAATTAATGGGTTC TTTG
GAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTTCCC AGTTA
CATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGACAAG AATG TCTATCCTCAACACAGCGGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTAC GCCAAGGACATATGGGATATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 43 >Barley PHS1 (Morex reference genome) protein
MATASPPLATAFRPLAAAGGSGGLVVGAGAGAAPRRGRRGLVVRSVASDREVRGPAS TEEEL
SAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIMNWNATYD YYNKVNAK
QAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGHNLEDVASQEPDPALGNGGLG RLASCF
LDSLATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYPV KFYGKV
VEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNAGD HAKANE
AHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWEDF PSKVAVQ
MNDTHPTLCIPELMRILMDIKGLSWNEAWSITERTVAYTNHTVLPEALEKWSLDIMQ KLLPRHVE
IIETIDEELMNNIVSKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEKRGK LLVESLESIAE
ADEKTESQEVENILSETTEKKAESDSEEAPDAEKEDPEYELDPFAKYDPQFPRVVRM ANLCVV
GGHSVNGVAEIHSEIVKQDVFNSFYEMWPTKFQNKTNGVTPRRWIRFCNPELSTIIS KWIGSDD
WILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYIVSPDAMFDVQV KRIHEYK
RQLLNILGIVYRYKKMKEMSAKDRRKSFVPRVCIFGGKAFATYVQAKRIVKFITDVA ATVNYDPD
IGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCLLIGTL DGANVEIR
EEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDLRFEEVKEYVRSGVFGTSNYDELM GSLEGN
EGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDRTIHE YAKDIW
DISPVIMP SEQ ID NO: 44 >Barley PHS1 (Morex reference genome) promoter (-2kb upstream of ATG)
TCGAGAACTATGTAGACATGACCCGAGAGATTCCTCGGTCAATATCCAATAGCGGGA CCTG
GATGCCCATATTGGATCCTACGTATTCTACGAAGATCTTATCGTTTGAACCTCAGTG CCAAG
GATTCATATAATCCCGTATGTCATTCCCTTTGTCCTTCGGTATGTTACTTGCCCGAG ATTCG
ATCGTCAGTATCCGTATACGTATTTCAATCTCGTTTACCGGCAAGTCTCTTTACTCG TTCCG
TAATACAAGATCCCGCAACTTACACTAAGTCACATTGCTTGCAAGGCTTGTGTGTGA TGTTG
TATTACCGAGTGGGCCCCGAGATACCTCTCCGTCACACGGAGTGACAAATCCCAGTC TCG
ATCCATACTAACTCAATGAACACCTTCGGAGATACCTGTAGAGCATCTTTATAGTCA CCCAG
TTACGTTGCGACGTTTGATACACACAAAGTATTCCTCCGGTGTCAGTGAGTTATATG ATCTC
ATGGTCATAGGAACAAATACTTGACACGCAGAAAACTGTAGCAACAAAATGACACGA TCAA
CATGCTACGTCTATTAGTTTGGGTCTAGTCCATCACGTGATTCTCCTAATGACGTGA TCCAG
TTATCAAGCAACAACACCTTGTTTATAATCAGAAGACACTGACTATCTTTGATCAAC TGGCT
AGCCAACTAGAGGCTTGCTAGGGACAGTGTTTTGTCTATGTATCCACACATGTATAT AAGTC
TTCATTCAATACAATTATAGCATGGATAATAAACGATTATCTTGATACAGGAATTAT AATAAT
AACTATATTATTATTGCCTCTAGGACATAATTCCAACAATATATGCCATCTTCCGTG TTTTAT
TCTTCTGGGCAGCGGTCTGTTATGCAAAACGTGGTCGCCGATATCTTCTAGTCTATA TCGA
TGACTTTTCGACAACTGCATCTGCAAGCTCCTTAGTTTCAACAAGTTTATTCCACTG AGGGG
GTGTTTCTTTACAGGGACTTTTTGCTGTAGGGACTAAAAAACGTCCCTTTTAGTCCC ATGTG
AAAGAAACAGGAGAGACTTTTAGGGACTAAAAGGGGGTATTTGGAACTAAAGGAAGA AGTC
CCTATGGGAGGGACTTTTTGAGACTTTTTTGACACTTTTTACAACAGTGCCCCTACT TTTAT
CATGTCATTTAATAACTTCTAGTACATTACTAGGGGTAACATGGTCTTTTTACATGT CATTTA
ATGACTTCCAATCCCTATTTAGTCCCTGGAAAGAAACGGGTAGGGACTAGAGACTTT TTAG
TTGGTATTAAAAAAAAAGTGCTAAGACTTCTTAAAGAAACAGGGCGTGAGATGGAAG TGCA
GAGGCGGCAGCGAATTAGCTCACGGGGCACTACCGGTGGATGTAGGAAAAAGAAGGC AT
TTGACACCTCTCAAAACTATGAAGATGTAAATTTTAATTTCTAAGTGTTTTTGTGAG GATTTG
TGCTATTTAATACTCCCTCAATTCCAAAATAAATGTTTTAATTTTGTATTATCTCTA ATACAAA
GTTATACTAACTTTAAGATACCTATTTTAAAACAGAGGAAATATATGCATTTTGCCT TTCTCA
CAAAAAAAACCCCTAAACCATCACACGAGGGGAAGAGCCATTCCCAGCCGTCCAATA AACA
CGACAAAAAGCTGTACAGCACGGGAGCAATTCCCAGCCGTCGAAATGCGCTGCAACG GAA
GTAGCGCTCCCTGACACAGCGGCACATGAGCCACGCCTCGCACCACCGCACCGTGCA CC
GGCGCGGGGCCCACCGGCCAGCGTCCCCACACGCGTACCGGCGGCCAGCATTGCGCG G
CACGGCGACGGAGAGATATTTTTAACGCCTTCCAGTTCGGAAGCGCCCCCCATCACT CAC
TCGGGTACAGGGCGGGCCGCGTGAGTCTCGCGTCGTCGCATCGCGCGCGCTGCACAC CC
ACCGTCGTCAGCGCGGACCACCGGCGCCCCACCCA
SEQ ID NO: 45 >Rye PHS1 genomic DNA
CATCGATGACTTCCAGACCGCCGCTTCTACAAGCTCCTTATTTTCAAGTCGCTTCTA CAAGC
TCCATATTTTCAACAAGTTTCCTTGGCAACGGCTTTGCTCATGGCGCACTGCCCGTG GTTG
CAAGAGAAAGAATACATTCGACACCCCTAAAACCATGCAGATGTAATTTCTTTTTGT AAATG
TTTTTGTGACCTTTTGCCTTTTTCCAAAAAAACAAACAAACTAAACCATCACACGAG GAGAA
GAGCCATTCCCAGCCGTCCAATAAACACGACAAAACGCTTTACAGCACGGGGGCAAC TCC
CAGCCGTCGAAAATGCATTGCAACGGAACTACTAGCTCTCCCTGACACAGCGGCACA TGA
GCCACGCCTCGCACCGCAACCGTGCACCGGCGGTGGGCCCACCGGCCAGTGTCCCCC A
CACGTGTACCGGCGGCCAGCATTGCACGGCACGGCGACGGAGAGATATTTTAACGCC TTC
CAGTTCAGAAGCACCCCCCATCACTCACTCGGGTACAGAGCGGGGCCGCGTGACTCT CG
CGTCGTCGCATCGCGCGCGCTGCACACCACCGTCGTCAGTGCGACCACTGGCACCAA TG
GCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGGCAGCGGC G
GCACCCGGGGAGGAGGAGAAGGAGCGCACGCCGCGGGCCGGGTCGCGCCGAGGCGGG
GGCGCCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGAGAGGTGCGGGGGCCCG
CCTCGACCGAGGAAGGTAAGCGGCCGCGCGCTCCCGTTGGCTTCGACAACCCGGATG CT
GGCCGGCTGTAATGTGGTTTGGATTTCCGCTGGTGCGACCCGTGAATTTATGATCCT CTGA
CTGATGGTCTCTGTGTTCCCGGGTTTCGGGCTATTGGTTTTGTGCGCGCGCATTGCT AATC
TACTGCTCTAGGATTTGTTAGGGTTCAGAGGAATCTTAATTCGCGAATAAACTTTCG AGGAA TGTGTTCTGAATTTACTGAAACTGGTTTAGTTTGGTGCGTCAAGTTAGAATGATGTATTT AT GGTACAGTAGCAAACTCTGAGTAATTCTACTGTGCGTGGACAGCCACCCCTGGTGGATGA T GCAGCAAGGGTCAACATACTGTCGCTACTATCTGTAGCTGCAACAAAATACTTGGAATCT G AACTAGTGAAATAAATCATTTAAGATTTTTGCTGAAGATTACACAGTATCAATGTTTCTG AAG GTGATTGGAACAGTATAGCACTTCAAAATGGCAACCAAGCAGGATCAGTATACTGACACA T CTTTTTTCTAAAGGTGGTTCGAATACCGGTATGAAGTTTATAATTATTTAATTAAGGCGT AAG ATTTGGTAGCAAAAACTATTTTTACGGGAAATGATGAAACATGGCACAAAAACTCTAGGA GC TGCCTTTCATTTATTAATCGATAAAAAGAGAGGTGCAGGGTTCTAGTAGAATGCAAATGG TG ATACAGTATATTATCTACCATTTTGTGCAAGCATTTTCACACGAAAACAATATCCCATGC AGT TTTTAGTTTTGACTCTCGACTGTTTAATACCTACTCTTTGTAAGATTTCAGATCCTAATA TTG CTGTTCTTGCAACTCAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCGC GT CAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACCCC T AAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATGAATTGGAATGC AA CGTATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTTTT TG CAGGTCTCGCAACATACTTCCTTTAGCCTTATCTCATACGTTTTGGTAACTGCGTTATGA TG TTATATGTTTTTTTTTTTTTTTGTACTGCAGGGAAGAGCTCTTACAAATGCTATTGGCAA CCT AGAGCTAACAGGACAATATGCAGAAGCGTTAAAACAACTTGGACAGAATCTAGAGGATGT T GCTAGTCAGGTAAGCGATTTCTGTCAAAGAATAGTTTTTAGTCCATATCAAAATTATTGC CG AGCCATTTTTTTTTGAAAAAGTAACATTTTATTATTATTCATCAACTTCCATTTTATTCA AAAC TATCTGCAAAAGGAGAAAATTTTCTCCTGGGGGTTCCACCATCTGTCTTATATTTGTTAT GC AGTTTTTAGAGTGATTGACTATGAAATTACGCCAAAACATGAGCTTTGCCTAGAGTGCCC TG TAGTTTAGAAATCTTAAGATTGGATATGTTACTTTTCTATTACAATGAAACAGGAACCAG ATC CTGCCCTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCTA C CCTAAATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTTTTTAAGCAAAT CA TAACGAAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGGTAGTGATATGTTTTC T ACTTGTTTACTGTTTGGAAGTGTTGATTATGTTGGTTTCAGTTTTAATGACCTGCACGTT AG CTTCTTTTGCCATTTATAACTGTTGCTGTATCTCCAGATGGGAAATCCATGGGAGATTGT AA GAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGATGGGA G AAAACACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTCCTGG C TACAAGACTAAAACTACTAATAATCTTCGCCTTTGGTCAACAACAGTACCATCACAAAAC TTT GATTTGGGAGCTTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACACCTAAATGCT G AAAAGGTATTCTTTTGTCTGCATTTATTTTGAATATAAATCATTAAATCTCAATAACTAT TGAT TTGAATGGTAAACACTAAACAATGTCTAGCTAGTAGTCACCACGGTGTCCCTTGATGTCA TT TTAGAATGTGCGCAGTCAAATGTTTGATAACTTTGATTGTAATAATATACAAATATTCAG AAA CTTTGATTGACACTAGATTTTTCATAAACATACTTATTGTATTTCCATGTAAATAGAAAA GCT GATGTTCAGAGGTTTGTGTGTCAAAATGACAAGTAAATATGAACAGAGTTTGTGCACACA TA ACCTCTTCATCTGGTGTTTGCAGTTCTCTAGGTTGTGTTTGTGTTTTCTCCTGCTCTACC TTA ACTAATCTCAGCATATGCATCAGGTACATTGCTCCTCTGTGTCTATAAAAAATAAGTCTG AT GAGATATGGTTGCAGTCCAGTAGTTCGTAGTTGGCACAGCATGGAAGAACTTTTCACATG G AAGCCTTACCCTTATGATTTTAATTATACCTTCGGAGTGTTTGTGCATCGTGTACATAGA CC ATTAAGTATGGGTGTTCAAAAATAGTTTTCCCAAACTGTGTAAAAGCCACCACTGTTTGT GT ATTCATTTTTTCGAACTAGCTAAACAACTTGACCGGAGAAGATACATACAATTGTACTCA CA TCTGAAGCTGCTCAGTTGGACAGATTTGTTCCAATCCCGTGCATGGGTTAAATTGTCGAC G TTTCCTCACTTGTTCAGTAGTAGGAGTACTATCTATTATATTTATACTAAAGTTTTCTTT TTGT GATATGGTACAAACTTTTAATAATACTTACATTTGCAAACACTGTGGACAAAATTTATTA TGT CTAACTTAACTTTATGACTGGTCTTCGCATCATGCCAGATATGCCACGTATTGTATCCAG GG GACGAATCGTCAGAGGGGAAAATTCTTCGCCTGAAACAACAATATACACTATGCTCGGCC T CCCTGCAGGACATTATTTCTCGTTTTGAGTCGAGGGCTGGTGATTCTCTCAACTGGCAGG A CTTCCCCTCTAAAGTTGCAGTTCAGATGAATGACACCCATCCAACACTGTGCATTCCCGA G TTAATGAGAATACTGATGGATGTTAAGGGATTGAGCTGGACTGAGTCTTGGAGTATCACG G AAAGGTATCACAGAAGGGTACTGCAGTTTGATGTGTTATGCTGTTCCTCCTGTTCTAACT AA TTTTATTTCCTCATCAGTGTTCCATTCAATCATGTACTGATTTTCTGACACCCACATCAT TTG TAGAACCGTTGCATACACTAACCACACAGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTT G GACATTATGCAGAAACTTCTACCTCGACATGTTGAGATTATAGAAAGAATAGATGAGGAG G TATCAAATGTGCAGATAATATTATACGATTAATTTCATTCTCTGTATTTCAACTTATGGT TTAC TTGTCTTCATCGTTGATTTTAAACAGCTAATGAACACCATCATCTCAAAATATGGAACGG CA GATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGTTGATCTT CC AGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAAAGGGGAAGTTGCTTGTTGA AT CTTTGGAGTCTATTGCTGAAGCTGACGAGAAAAGTGAGTCACAAGAGGAGGAAAACATTC A ATCTGAGACAACAGAGAAAAAGGGTGAATCTGACTCTGAAGAAGCCGCTGATGCAGAAAA GGAGGATCCTGAGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACCTAGAGT T GTTCGTATGGCAAACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTTGCCGAGATT C ACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGGTATTTGCACAATGAT TT GGAAACCGTGATCTTATTTTTCAATGACTGCTATAATCGTCAGGTGATATGCTATTCACC AT CTGCCATTTTCTTGTGTTCTTTGCCAGATGTGGCCTGCTAAGTTCCAAAACAAAACAAAT GG AGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTGAATTAAGTACAATCATTTCAAA AT GGATAGGCTCCGATGACTGGATTCTGAACACTGATAAACTTGCAGGACTGAAGAAGGTAC T TCCAGTAGCTTTATTCCCCAATAATCAGTAATCTAATGCTTCCTTCACTTCAATGTTAAC TGT ACCTTGTTTTTAAATTGATGTGTTTTCATCCTGTAACAGTTTGCTGATGATGAGGATCTG CAA TCAGAATGGCGTACTGCTAAAAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGAC A AGACTGGATATGTTGTCAGCCCAGATGCAATGTTTGATGTGCAGGTATATTTTGTACTAG AA AACATGCGTCTTTTTATAAATATGGTTTGACTCAGCTCCTGGTACTGGACACTTAGGTGA AA AGAATACATGAATATAAACGGCAGCTGCTAAATATCCTTGGTATCGTTTACCGCTACAAG AA GATGAAAGAAATGAGTGCAAAAGATAGAATAAAGAGCTTTGTTCCAAGGGTATGCATATT TG GCGGGAAAGCTTTTGCCACTTACGTACAGGCAAAGAGGATTGTGAAGTTTATCACAGATG T CGCAGCTACTGTAAATTATGATCCTGATATTGGAGATTTACTGAAGGTAAATTTGTGTCT TT GATGCCACTTATCGATTTTAATTTTCATGTTCTGTCATTGGCAAGTTGACTCAGACTGAA AA GCAGTGAATGTTCAGAGTTTCAAAAGAAAACAGAGCAGTGAATGTTCTTAATGCCGGTGC T TGTACAACTGAGAAACTGAATAACCGCAGTTCTTTTGGAGAACAATATAGCCACAATTGT TT TGACCCTTTTATTGCTATCAAGCATCTGATGCTGATATTATTGATAACTAGATGAGGTAA ATA CAAGGCGACAGGTTAAGTCTGAGGATTTTTGTTTGTTATCCAGGTTGTATTTGTCCCAGA TT ATAATGTTAGCGTTGCTGAGACGCTCATTCCTGCCAGTGAATTGTCCCAGCATATCAGGT A ATATTCTGCCTATTCTTATATAGTACTCCCTCTGTAAACAAATATAAGAGCATTTAGATC ACT ATTTTAGTGATCTAAACGCTCTTATGTTTGTTTACACAGGGAGTATGTGAGTACATCAAA AA AGAATATTACCTTTTCGTTTGCTATAATATATACCTAGAGTCTTACTGGGATAGTGCCAC CC CAGTTTCTATTTCTGTACAAGATGTGCATTTTTAACAGTTACTATCCATTCAGTACCGCT GGA ATGGAAGCTAGTGGAACCAGCAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGA A CTTTGGATGGTGCGAATGTGGAGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCT T TGGTGCACATGCACCTGAAATCGCTGGTTTGCGGCAAGAGAGGGCTGAAGGAAAGGTACC ACACTTGTTTATATAGCTGTGCCTACTATCAAACTAGTAAAGATATCTCCCATATGTGTG CT GTCCAAATAGTTTAGCTGAGGGAAAATCATAACTTTACCATTTTCCTTTGCACTTGTAGT TTG TGCCTGACCCGAGATTTGAAGAGGTTAAGGAATACGTCCGCAGTGGCGTCTTTGGGACTA GTAACTATGATGAATTGATGGGTTCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATT A TTTTCTTGTTGGCAAGGATTTTCCCAGTTACATTGAATGCCAGGAAAAAGTTGATGAAGC AT ACCGAGATCAGAAGGTAAGTTGTATAACTGACGCTGTTTGTTTTCTGGCTGAAATGCACA T GGTGATTATGCAAACTTTCTGATATGTGATGGCTGACATCAGTGTTTCGCTGTGAACGAC TT CACTGATTCGTAGGAAGGAATATTTCTTAGTTCAAAGTTTCAAACCGGAGTATCTATCGA TG TGTTGGAACTTGGAAGATCATAAACAATCAGAGAATATAGTTGCTAGATGTGCTATACTA CT CATCTGTTGGCTGTTGAGTGTTGATACATGAGCTTTGATGAACCTAGCTAGACCTTCATG AA AATGCTTCTATGCATTCCAATAAAAAAAATAACTTTATAACTTAACAAAACAATTAGGTT GAA CCTCGCCAGGGGGCCTTTTGTAATCTGTTTTCTGAAATGAAAATCAGAAGCAGGGAGCTC C TCGATTCTAAAATGAAAAATAAAATCAGGGATAACTGAGGATCTGGTTTTCCTCTGTTGG AG CCCCAAAGAATTCTGTGGTCTTATTATTGTAAGAGTATAAGATGCACTGCTCCGATAAGT TT CTCCTTCATCCATTTGTTTTTATTGCTTTGTGACATTTCGTGTGACAGCTCAGATCACCA TG CATCCCAGTTGTACACATTATTTTGCTATCGTGGATATTTCCACTCTGTTTCATGTGCGA CA TTTTTAATTTGTTACATTGTTACTAACTACACATACTCTGATAGATTTTCAGAGTTTTGG TCCT CCTAATCACATAATATCTGTCTCTACCTGCAGCTATGGACAAGAATGTCTATCCTCAACA CA GCTGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCACGAGTACGCCAAGGACATATGG GACATCAGCCCCGTCATCATGCCCTGAATAATTCAAGCAGCTTCCTGGAACCTTGAGAGT C TATGAGGCGCGATTGGTCGCAGATTTGCCTCCATTCAGCGAGAATGTATTGCGCCCCCCA GTTTTGTATAGGCATATGCAAGTTCTGACATGGACCTGAGTGCTGAGAGCATGCAGGGTA G TTCATCCTTAGCAATATTGCATGTCAGTTAGCCAGTAAGTATTACCCAGGCTTATGGTGT GT GTATTTCGCCGAATAAAGTTGGAGCAGAATCATCAAACAAACATACTTTGGTGTATCTGT AA TGATAAATAAATGGTAAATGACTGAACAAAAAGTGTCGTTGGCGCACACATGTAGTAGAA G CATGTTTCTTCCGGAGAGGGTGCAAAGCTGAAAGCACCCCAGTTTTGAAGTGAGAATGCA GAAGTTACAAAAACATTTGTTTACATGCAAAGTGCAACGGGATTCGTCAAGAGTTCAGGG C
TCCTTTGATTCAAAGGATTTGCATAGGAATTTTGGAGGATTAGAGTCCTTAGGAATT TTTTCT ACGTTGAGGCCTCATTTGATTTATAGGATAGAAATATCATAGGAATAGGAAAATCATAGT AA
GTGAGATGATATGC
SEQ ID NO: 46 >Rye PHS1 CDS
ATGGCGACCGCCTCGCCGCCGCTCGCCACCGCCTTCCGCCCGCTCGCCGCCGGCAGC G GCGGCACCCGGGGAGGAGGAGAAGGAGCGCACGCCGCGGGCCGGGTCGCGCCGAGGC GGGGGCGCCGGGGGTTCGTGGTGCGGAGCGTGGCGAGCGATCGAGAGGTGCGGGGGC CCGCCTCGACCGAGGAAGAGCTTTCAGCCGTGCTAACTTCCATTGATTCTTCCGCCATCG C GTCAAATATCCAGCACCATGCAGACTTCACACCGTTGTTCTCGCCAGAGCACTCTTCACC C CTAAAGGCTTACCATGCAACTGCTAAAAGTGTTTTTGATTCTCTGATCATGAATTGGAAT GC AACGTATGACTATTACAACAAAGTGAATGCAAAGCAAGCTTATTACCTGTCCATGGAGTT TT TGCAGGGAAGAGCTCTTACAAATGCTATTGGCAACCTAGAGCTAACAGGACAATATGCAG A AGCGTTAAAACAACTTGGACAGAATCTAGAGGATGTTGCTAGTCAGGAACCAGATCCTGC C CTTGGCAATGGTGGTCTAGGCCGTCTAGCGTCCTGTTTTCTGGATTCTATGGCTACCCTA A ATTATCCAGCATGGGGATATGGACTTCGGTACAGATATGGCCTTTTTAAGCAAATCATAA CG AAGGATGGTCAGGAGGAGGTAGCTGAGAATTGGCTAGAGATGGGAAATCCATGGGAGATT GTAAGAAATGATGTCTCTTATCCTGTGAAATTCTATGGCAAAGTGGTTGAAGGCACTGAT G GGAGAAAACACTGGATTGGAGGAGAGAATATCAAGGCTGTGGCACATGATGTTCCTATTC C TGGCTACAAGACTAAAACTACTAATAATCTTCGCCTTTGGTCAACAACAGTACCATCACA AA ACTTTGATTTGGGAGCTTTTAATGCTGGGGATCATGCCAAGGCCAATGAAGCACACCTAA A TGCTGAAAAGATATGCCACGTATTGTATCCAGGGGACGAATCGTCAGAGGGGAAAATTCT T CGCCTGAAACAACAATATACACTATGCTCGGCCTCCCTGCAGGACATTATTTCTCGTTTT GA GTCGAGGGCTGGTGATTCTCTCAACTGGCAGGACTTCCCCTCTAAAGTTGCAGTTCAGAT G AATGACACCCATCCAACACTGTGCATTCCCGAGTTAATGAGAATACTGATGGATGTTAAG G GATTGAGCTGGACTGAGTCTTGGAGTATCACGGAAAGAACCGTTGCATACACTAACCACA C AGTACTTCCTGAAGCTCTGGAGAAGTGGAGCTTGGACATTATGCAGAAACTTCTACCTCG A CATGTTGAGATTATAGAAAGAATAGATGAGGAGCTAATGAACACCATCATCTCAAAATAT GG AACGGCAGATATTTCACTGTTAAAACAGAAGCTTAAAGATATGAGGATCTTAGACAACGT TG ATCTTCCAGCCTCTGTTGCTAAACTGTTTATTAAACCTAAAGAGAAAAAGGGGAAGTTGC TT GTTGAATCTTTGGAGTCTATTGCTGAAGCTGACGAGAAAAGTGAGTCACAAGAGGAGGAA A ACATTCAATCTGAGACAACAGAGAAAAAGGGTGAATCTGACTCTGAAGAAGCCGCTGATG C AGAAAAGGAGGATCCTGAGTATGAGTTAGATCCATTTGCAAAATACGATCCTCAGTTACC TA GAGTTGTTCGTATGGCAAACCTCTGTGTTGTTGGTGGACATTCAGTTAATGGTGTTGCCG A GATTCACAGTGAAATTGTTAAGCAAGATGTGTTCAATAGCTTTTATGAGATGTGGCCTGC TA AGTTCCAAAACAAAACAAATGGAGTGACCCCCAGGCGTTGGATCCGGTTTTGTAATCCTG A ATTAAGTACAATCATTTCAAAATGGATAGGCTCCGATGACTGGATTCTGAACACTGATAA AC TTGCAGGACTGAAGAAGTTTGCTGATGATGAGGATCTGCAATCAGAATGGCGTACTGCTA A AAGGAATAACAAGATGAAGGTAGTTTCGCTGATAAGAGACAAGACTGGATATGTTGTCAG C CCAGATGCAATGTTTGATGTGCAGGTGAAAAGAATACATGAATATAAACGGCAGCTGCTA A ATATCCTTGGTATCGTTTACCGCTACAAGAAGATGAAAGAAATGAGTGCAAAAGATAGAA TA AAGAGCTTTGTTCCAAGGGTATGCATATTTGGCGGGAAAGCTTTTGCCACTTACGTACAG G CAAAGAGGATTGTGAAGTTTATCACAGATGTCGCAGCTACTGTAAATTATGATCCTGATA TT GGAGATTTACTGAAGGTTGTATTTGTCCCAGATTATAATGTTAGCGTTGCTGAGACGCTC AT TCCTGCCAGTGAATTGTCCCAGCATATCAGTACCGCTGGAATGGAAGCTAGTGGAACCAG CAACATGAAGTTTGCAATGAATGGTTGTATTCTTATTGGAACTTTGGATGGTGCGAATGT GG AGATCAGAGAAGAGGTTGGAGAGGAGAACTTTTTCCTCTTTGGTGCACATGCACCTGAAA T CGCTGGTTTGCGGCAAGAGAGGGCTGAAGGAAAGTTTGTGCCTGACCCGAGATTTGAAGA GGTTAAGGAATACGTCCGCAGTGGCGTCTTTGGGACTAGTAACTATGATGAATTGATGGG T TCTTTGGAAGGAAATGAAGGTTATGGACGTGCAGATTATTTTCTTGTTGGCAAGGATTTT CC CAGTTACATTGAATGCCAGGAAAAAGTTGATGAAGCATACCGAGATCAGAAGCTATGGAC A AGAATGTCTATCCTCAACACAGCTGGTTCCCCCAAGTTCAGCAGTGACCGGACGATCCAC GAGTACGCCAAGGACATATGGGACATCAGCCCCGTCATCATGCCCTGA
SEQ ID NO: 47 >Rye PHS1 protein
MATASPPLATAFRPLAAGSGGTRGGGEGAHAAGRVAPRRGRRGFWRSVASDREVRGP AST
EEELSAVLTSIDSSAIASNIQHHADFTPLFSPEHSSPLKAYHATAKSVFDSLIMNWN ATYDYYNK VNAKQAYYLSMEFLQGRALTNAIGNLELTGQYAEALKQLGQNLEDVASQEPDPALGNGGL GRL ASCFLDSMATLNYPAWGYGLRYRYGLFKQIITKDGQEEVAENWLEMGNPWEIVRNDVSYP VK FYGKVVEGTDGRKHWIGGENIKAVAHDVPIPGYKTKTTNNLRLWSTTVPSQNFDLGAFNA GDH AKANEAHLNAEKICHVLYPGDESSEGKILRLKQQYTLCSASLQDIISRFESRAGDSLNWQ DFPS KVAVQMNDTHPTLCIPELMRILMDVKGLSWTESWSITERTVAYTNHTVLPEALEKWSLDI MQKL LPRHVEIIERIDEELMNTIISKYGTADISLLKQKLKDMRILDNVDLPASVAKLFIKPKEK KGKLLVES LESIAEADEKSESQEEENIQSETTEKKGESDSEEAADAEKEDPEYELDPFAKYDPQLPRV VRMA NLCVVGGHSVNGVAEIHSEIVKQDVFNSFYEMWPAKFQNKTNGVTPRRWIRFCNPELSTI ISK WIGSDDWILNTDKLAGLKKFADDEDLQSEWRTAKRNNKMKVVSLIRDKTGYVVSPDAMFD VQ VKRIHEYKRQLLNILGIVYRYKKMKEMSAKDRIKSFVPRVCIFGGKAFATYVQAKRIVKF ITDVAA TVNYDPDIGDLLKVVFVPDYNVSVAETLIPASELSQHISTAGMEASGTSNMKFAMNGCIL IGTLD GANVEIREEVGEENFFLFGAHAPEIAGLRQERAEGKFVPDPRFEEVKEYVRSGVFGTSNY DEL MGSLEGNEGYGRADYFLVGKDFPSYIECQEKVDEAYRDQKLWTRMSILNTAGSPKFSSDR TIH EYAKDIWDISPVIMP
SEQ ID NO: 48 >Rye PHS1 promoter (-2kb upstream of ATG)
TTGCTCGATCGATAGAGGGGCGCGCTATACTCGGGACGGTCTTCACGGGGCCTGCGG AG GCGCGTGCAACCGACGGGCCAGGCCGGTCGCACCGCGTAGATGAGGCAGATCGCGGCG ACGGGCGGGTGACCAATTCGATCGCGCGCGAGGTCGGGGACACTGCTCGGTGGAGGTG GGGTGGCGGCGGAGTCCGAGATGAAGCCGGCGATGACAAACGTCGTGGGACATCGTGCG GACGAGCGCGTCAGCACCGGCGCCCGGGCAACGAGGCCCGAGCGACCAAGGTAGCAGC
GGCGCCCGAGCCCGACGCAACCGACGAGCCTGACGCAGCCGGCCCGACACAGCCGAG G ACGAGGCCGACGAGGTAACACGCAGGGTCGCCGTGTAGACGCGACGAAGACGGATGGC GTGAATCGATTGACGTGCCGGCACGCAAGCGACGGCTATGATTGGCCGCCGCATGGCGC GAGGCCGCCGGCTGGGGGCGATGCAGGTGCCCCGGTCGGAGCAGGGCGGTGACGGCC GGCGCGGGGCGACGGGCGTGCCGGCGCGCGAACGACGGCCTTGAGGGGCCGCCTGTC
GTGCGAGGCCGCCGGGTTGGTGAAGTAGCCGGCCAGTCGCGTCGGTGTTAGAGTAGA GG CGCGCGGATGTCAACGGCGGCGGGTGGACGACGCAAACCGATCTTAGATCGGAAAACCA AAAAGTAAACGCCGATCAAAGCGACCGGTGAGAGAGAGAAAAAACTGGAGCAAAGGCTCG GGAAAAGACTCTCTAGGGAAGTCGGTCAACACAACTGACGGACGAACCCTAGGTATGGGT GACGCGGCCCCCAGCGGCGGTCATGGAGGCCGACCGCTCCGGGGGCGGCGCGCGGGT
GCGGAAGCCGCGGCGGTGGCTAGGGTTTAGGATCGATTAGGCTGATACCATGTCAGA AG GGAGATAGAGGTTTGGCGGAATAGTTGTTGTATTGCTTGAGCCTCGTGGGCATATATATA T AGAAGTACATGGTCATCTTGAAGTACAAGGCAAGGTAAAATAATATCCTATGCTATCCTA GC TTTCCTAATAATCACGATACTCAACATTGTCATTGTAGTGAGGCATGTGAAGGGTGGTTT GT CCCAAATCTAGTTAGATGGGTTGGGGTTTTTCTTTGGCAGGTTTATCTCAGTGGTAGTGT C
CCGTGAAGCGGATGACACGACTAGGTGTCTTGGTCCTTCACGCTCTAGTTTTGGCAG GTG AAGGTTGGTCAGCTTCGCCTCGTCGGGGAGCTTATGAGATTTGTGTATCTTTACTTCGTC C GGTCAAACCTGTGGCATTATATAAGCACTCTCATGTGTTTTCTTCTTTTGGACAGCAGTC TG TTGTGTAAATCGTAGTCGTCGACATCTTCTAGTCCACATCGATGACTTCCAGACCGCCGC T TCTACAAGCTCCTTATTTTCAAGTCGCTTCTACAAGCTCCATATTTTCAACAAGTTTCCT TGG
CAACGGCTTTGCTCATGGCGCACTGCCCGTGGTTGCAAGAGAAAGAATACATTCGAC ACC CCTAAAACCATGCAGATGTAATTTCTTTTTGTAAATGTTTTTGTGACCTTTTGCCTTTTT CCA AAAAAACAAACAAACTAAACCATCACACGAGGAGAAGAGCCATTCCCAGCCGTCCAATAA A CACGACAAAACGCTTTACAGCACGGGGGCAACTCCCAGCCGTCGAAAATGCATTGCAACG GAACTACTAGCTCTCCCTGACACAGCGGCACATGAGCCACGCCTCGCACCGCAACCGTGC
ACCGGCGGTGGGCCCACCGGCCAGTGTCCCCCACACGTGTACCGGCGGCCAGCATTG CA CGGCACGGCGACGGAGAGATATTTTAACGCCTTCCAGTTCAGAAGCACCCCCCATCACTC ACTCGGGTACAGAGCGGGGCCGCGTGACTCTCGCGTCGTCGCATCGCGCGCGCTGCACA CCACCGTCGTCAGTGCGACCACTGGCACCA
As a full knockout of PHS1 gene function is most desirable in wheat, the most ideal target sequences are those that are common to all three homeologs of PHS1 (i.e., so that the PHS1 5A, 5B and 5D copies can be edited simultaneously). Suitable target sequences can be found using resources such as the Wheat CRISPR tool htps://crispr.bioinfo.nrc.ca/WheatCrispr/ (Cram et al., 2019. BMC Plant Biology 19:474).
Examples of four ideal target sequences are given below, where the PAM sequence (NGG) is underlined. Moreover, the first two target sequences can also target PHS1 in barley, and the first three target sequences can also target PHS1 in rye.
CRISPR constructs
SEQ ID NO: 49 Wheat PHS1 target sequence 1 (also works in barley, rye) ACCCTAAATTATCCAGCATGGGG
SEQ ID NO: 50 Wheat PHS1 target sequence 2 (also works in barley, rye) CAGGAACCAGATCCTGCCCTTGG
SEQ ID NO: 51 Wheat PHS1 target sequence 3 (also works in rye)
GTTGCATGGTAAGCCTTTAGGG
SEQ ID NO: 52 Wheat PHS1 target sequence 4
CAGGGGACGAATCATCAGAGGGG
SEQ ID NO: 53 Wheat PHS1 protospacer sequence 1
ACCCTAAATTATCCAGCATG
SEQ ID NO: 54 Wheat PHS1 protospacer sequence 2
CAGGAACCAGATCCTGCCCT
SEQ ID NO: 55 Wheat PHS1 protospacer sequence 3
GTTGCATGGTAAGCCTTTA
SEQ ID NO: 56 Wheat PHS1 protospacer sequence 4
CAGGGGACGAATCATCAGAG
SEQ ID NO:57 tracrRNA sequence
GUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAGUCCGUUAUCAACUUGAAAA AG UGGCACCGAGUCGGUGCUUUUUUU
SEQ ID NO: 58 Wheat PHS1 complete sgRNA-encoding nucleic acid sequence 1
ACCCTAAATTATCCAGCATGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTA GTCCG
TTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT
SEQ ID NO: 59 Wheat PHS1 complete sgRNA-encoding nucleic acid sequence 2
CAGGAACCAGATCCTGCCCTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTA GTCC
GTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT
SEQ ID NO: 60 Wheat PHS1 complete sgRNA-encoding nucleic acid sequence 3 GTTGCATGGTAAGCCTTTAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCC GT
TATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT
SEQ ID NO: 61 Wheat PHS1 complete sgRNA-encoding nucleic acid sequence 4
CAGGGGACGAATCATCAGAGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTA GTCC
GTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT
SEQ ID NO: 62 Wheat PHS1 complete sgRNA RNA sequence 1
ACCCUAAAUUAUCCAGCAUGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUA GU
CCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUUU
SEQ ID NO: 63 Wheat PHS1 complete sgRNA RNA sequence 2
CAGGAACCAGAUCCUGCCCUGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUA GU
CCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUUU
SEQ ID NO: 64 Wheat PHS1 complete sgRNA RNA sequence 3
GUUGCAUGGUAAGCCUUUAGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUAG UC
CGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUUU
SEQ ID NO: 65 Wheat PHS1 complete sgRNA RNA sequence 4
CAGGGGACGAAUCAUCAGAGGUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAGGCUA GU
CCGUUAUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUUUUU
SEQ ID NO:66 Cas9 sequence
ATGGCTCCTAAGAAGAAGCGGAAGGTTGGTATTCACGGGGTGCCTGCGGCTGACAAG AAG
TACTCCATCGGCCTCGACATCGGCACCAACAGCGTCGGCTGGGCGGTGATCACCGAC GA
GTACAAGGTCCCGTCCAAGAAGTTCAAGGTCCTGGGCAACACCGACCGCCACTCCAT CAA
GAAGAACCTCATCGGCGCCCTCCTCTTCGACTCCGGCGAGACGGCGGAGGCGACCCG CC
TCAAGCGCACCGCCCGCCGCCGCTACACCCGCCGCAAGAACCGCATCTGCTACCTCC AG
GAGATCTTCTCCAACGAGATGGCGAAGGTCGACGACTCCTTCTTCCACCGCCTCGAG GAG
TCCTTCCTCGTGGAGGAGGACAAGAAGCACGAGCGCCACCCCATCTTCGGCAACATC GTC
GACGAGGTCGCCTACCACGAGAAGTACCCCACTATCTACCACCTTCGTAAGAAGCTT GTTG
ACTCTACTGATAAGGCTGATCTTCGTCTCATCTACCTTGCTCTCGCTCACATGATCA AGTTC
CGTGGTCACTTCCTTATCGAGGGTGACCTTAACCCTGATAACTCCGACGTGGACAAG CTCT
TCATCCAGCTCGTCCAGACCTACAACCAGCTCTTCGAGGAGAACCCTATCAACGCTT CCGG
TGTCGACGCTAAGGCGATCCTTTCCGCTAGGCTCTCCAAGTCCAGGCGTCTCGAGAA CCT
CATCGCCCAGCTCCCTGGTGAGAAGAAGAACGGTCTTTTCGGTAACCTCATCGCTCT CTCC
CTCGGTCTGACCCCTAACTTCAAGTCCAACTTCGACCTCGCTGAGGACGCTAAGCTT CAGC
TCTCCAAGGATACCTACGACGATGATCTCGACAACCTCCTCGCTCAGATTGGAGATC AGTA
CGCTGATCTCTTCCTTGCTGCTAAGAACCTCTCCGATGCTATCCTCCTTTCGGATAT CCTTA
GGGTTAACACTGAGATCACTAAGGCTCCTCTTTCTGCTTCCATGATCAAGCGCTACG ACGA
GCACCACCAGGACCTCACCCTCCTCAAGGCTCTTGTTCGTCAGCAGCTCCCCGAGAA GTA
CAAGGAGATCTTCTTCGACCAGTCCAAGAACGGCTACGCCGGTTACATTGACGGTGG AGC
TAGCCAGGAGGAGTTCTACAAGTTCATCAAGCCAATCCTTGAGAAGATGGATGGTAC TGAG
GAGCTTCTCGTTAAGCTTAACCGTGAGGACCTCCTTAGGAAGCAGAGGACTTTCGAT AACG
GCTCTATCCCTCACCAGATCCACCTTGGTGAGCTTCACGCCATCCTTCGTAGGCAGG AGG
ACTTCTACCCTTTCCTCAAGGACAACCGTGAGAAGATCGAGAAGATCCTTACTTTCC GTATT
CCTTACTACGTTGGTCCTCTTGCTCGTGGTAACTCCCGTTTCGCTTGGATGACTAGG AAGT
CCGAGGAGACTATCACCCCTTGGAACTTCGAGGAGGTTGTTGACAAGGGTGCTTCCG CCC
AGTCCTTCATCGAGCGCATGACCAACTTCGACAAGAACCTCCCCAACGAGAAGGTCC TCC
CCAAGCACTCCCTCCTCTACGAGTACTTCACGGTCTACAACGAGCTCACCAAGGTCA AGTA
CGTCACCGAGGGTATGCGCAAGCCTGCCTTCCTCTCCGGCGAGCAGAAGAAGGCTAT CGT TGACCTCCTCTTCAAGACCAACCGCAAGGTCACCGTCAAGCAGCTCAAGGAGGACTACTT
CAAGAAGATCGAGTGCTTCGACTCCGTCGAGATCAGCGGCGTTGAGGACCGTTTCAA CGC
TTCTCTCGGTACCTACCACGATCTCCTCAAGATCATCAAGGACAAGGACTTCCTCGA CAAC
GAGGAGAACGAGGACATCCTCGAGGACATCGTCCTCACTCTTACTCTCTTCGAGGAT AGG
GAGATGATCGAGGAGAGGCTCAAGACTTACGCTCATCTCTTCGATGACAAGGTTATG AAGC
AGCTCAAGCGTCGCCGTTACACCGGTTGGGGTAGGCTCTCCCGCAAGCTCATCAACG GTA
TCAGGGATAAGCAGAGCGGCAAGACTATCCTCGACTTCCTCAAGTCTGATGGTTTCG CTAA
CAGGAACTTCATGCAGCTCATCCACGATGACTCTCTTACCTTCAAGGAGGATATTCA GAAG
GCTCAGGTGTCCGGTCAGGGCGACTCTCTCCACGAGCACATTGCTAACCTTGCTGGT TCC
CCTGCTATCAAGAAGGGCATCCTTCAGACTGTTAAGGTTGTCGATGAGCTTGTCAAG GTTA
TGGGTCGTCACAAGCCTGAGAACATCGTCATCGAGATGGCTCGTGAGAACCAGACTA CCC
AGAAGGGTCAGAAGAACTCGAGGGAGCGCATGAAGAGGATTGAGGAGGGTATCAAGG AG
CTTGGTTCTCAGATCCTTAAGGAGCACCCTGTCGAGAACACCCAGCTCCAGAACGAG AAG
CTCTACCTCTACTACCTCCAGAACGGTAGGGATATGTACGTTGACCAGGAGCTCGAC ATCA
ACAGGCTTTCTGACTACGACGTCGACCACATTGTTCCTCAGTCTTTCCTTAAGGATG ACTC
CATCGACAACAAGGTCCTCACGAGGTCCGACAAGAACAGGGGTAAGTCGGACAACGT CCC
TTCCGAGGAGGTTGTCAAGAAGATGAAGAACTACTGGAGGCAGCTTCTCAACGCTAA GCT
CATTACCCAGAGGAAGTTCGACAACCTCACGAAGGCTGAGAGGGGTGGCCTTTCCGA GCT
TGACAAGGCTGGTTTCATCAAGAGGCAGCTTGTTGAGACGAGGCAGATTACCAAGCA CGT
TGCTCAGATCCTCGATTCTAGGATGAACACCAAGTACGACGAGAACGACAAGCTCAT CCGC
GAGGTCAAGGTGATCACCCTCAAGTCCAAGCTCGTCTCCGACTTCCGCAAGGACTTC CAG
TTCTACAAGGTCCGCGAGATCAACAACTACCACCACGCTCACGATGCTTACCTTAAC GCTG
TCGTTGGTACCGCTCTTATCAAGAAGTACCCTAAGCTTGAGTCCGAGTTCGTCTACG GTGA
CTACAAGGTCTACGACGTTCGTAAGATGATCGCCAAGTCCGAGCAGGAGATCGGCAA GGC
CACCGCCAAGTACTTCTTCTACTCCAACATCATGAACTTCTTCAAGACCGAGATCAC CCTC
GCCAACGGCGAGATCCGCAAGCGCCCTCTTATCGAGACGAACGGTGAGACTGGTGAG AT
CGTTTGGGACAAGGGTCGCGACTTCGCTACTGTTCGCAAGGTCCTTTCTATGCCTCA GGTT
AACATCGTCAAGAAGACCGAGGTCCAGACCGGTGGCTTCTCCAAGGAGTCTATCCTT CCA
AAGAGAAACTCGGACAAGCTCATCGCTAGGAAGAAGGATTGGGACCCTAAGAAGTAC GGT
GGTTTCGACTCCCCTACTGTCGCCTACTCCGTCCTCGTGGTCGCCAAGGTGGAGAAG GGT
AAGTCGAAGAAGCTCAAGTCCGTCAAGGAGCTCCTCGGCATCACCATCATGGAGCGC TCC
TCCTTCGAGAAGAACCCGATCGACTTCCTCGAGGCCAAGGGCTACAAGGAGGTCAAG AAG
GACCTCATCATCAAGCTCCCCAAGTACTCTCTTTTCGAGCTCGAGAACGGTCGTAAG AGGA
TGCTGGCTTCCGCTGGTGAGCTCCAGAAGGGTAACGAGCTTGCTCTTCCTTCCAAGT ACG
TGAACTTCCTCTACCTCGCCTCCCACTACGAGAAGCTCAAGGGTTCCCCTGAGGATA ACGA
GCAGAAGCAGCTCTTCGTGGAGCAGCACAAGCACTACCTCGACGAGATCATCGAGCA GAT
CTCCGAGTTCTCCAAGCGCGTCATCCTCGCTGACGCTAACCTCGACAAGGTCCTCTC CGC
CTACAACAAGCACCGCGACAAGCCCATCCGCGAGCAGGCCGAGAACATCATCCACCT CTT
CACGCTCACGAACCTCGGCGCCCCTGCTGCTTTCAAGTACTTCGACACCACCATCGA CAG
GAAGCGTTACACGTCCACCAAGGAGGTTCTCGACGCTACTCTCATCCACCAGTCCAT CACC
GGTCTTTACGAGACTCGTATCGACCTTTCCCAGCTTGGTGGTGATAAGCGTCCTGCT GCCA
CCAAAAAGGCCGGACAGGCTAAGAAAAAGAAGTAG
SEQ ID NO: 67 Csy4 endoribonuclease nucleic acid sequence
ATGGACCACTACCTCGACATCAGGCTCAGGCCAGACCCAGAGTTCCCACCAGCCCAG CTC
ATGTCCGTCCTCTTCGGCAAGCTCCACCAGGCCCTCGTGGCCCAGGGCGGCGACAGG AT
CGGCGTGTCCTTCCCAGACCTCGACGAGTCCAGGTCCAGGCTCGGCGAGAGGCTCCG CA
TCCACGCCTCCGCCGACGACCTCAGGGCCCTCCTCGCCAGGCCGTGGCTGGAGGGCC TC
AGGGACCACCTCCAGTTCGGCGAGCCAGCCGTGGTGCCACACCCAACCCCATACAGG CA
AGTGTCCAGGGTGCAAGCCAAGTCCAACCCAGAGAGGCTCAGGAGGAGGCTCATGAG GA
GGCACGACCTCTCCGAGGAAGAGGCCAGGAAGCGCATCCCAGACACCGTGGCCAGGG CC
CTCGACCTCCCATTCGTGACCCTCAGGTCCCAGTCCACCGGCCAGCACTTCCGCCTC TTC
ATCAGGCACGGCCCACTCCAGGTGACCGCCGAGGAGGGCGGCTTTACCTGCTACGGC CT
CTCCAAGGGCGGCTTCGTGCCGTGGTTC
SEQ ID NO: 68 Wheat U6 promoter GACCAAGCCCGTTATTCTGACAGTTCTGGTGCTCAACACATTTATATTTATCAAGGAGCA CA
TTGTTACTCACTGCTAGGAGGGAATCGAACTAGGAATATTGATCAGAGGAACTACGA GAGA
GCTGAAGATAACTGCCCTCTAGCTCTCACTGATCTGGGTCGCATAGTGAGATGCAGC CCA
CGTGAGTTCAGCAACGGTCTAGCGCTGGGCTTTTAGGCCCGCATGATCGGGCTTTTG TCG
GGTGGTCGACGTGTTCACGATTGGGGAGAGCAACGCAGCAGTTCCTCTTAGTTTAGT CCC
ACCTCGCCTGTCCAGCAGAGTTCTGACCGGTTTATAAACTCGCTTGCTGCATCAGAC TTG
SEQ ID NO: 69 Maize Ubiquitinl promoter
TGCAGTGCAGCGTGACCCGGTCGTGCCCCTCTCTAGAGATAATGAGCATTGCATGTC TAA
GTTATAAAAAATTACCACATATTTTTTTTGTCACACTTGTTTGAAGTGCAGTTTATC TATCTTT
ATACATATATTTAAACTTTACTCTACGAATAATATAATCTATAGTACTACAATAATA TCAGTGT
TTTAGAGAATCATATAAATGAACAGTTAGACATGGTCTAAAGGACAATTGAGTATTT TGACA
ACAGGACTCTACAGTTTTATCTTTTTAGTGTGCATGTGTTCTCCTTTTTTTTTGCAA ATAGCT
TCACCTATATAATACTTCATCCATTTTATTAGTACATCCATTTAGGGTTTAGGGTTA ATGGTT
TTTATAGACTAATTTTTTTAGTACATCTATTTTATTCTATTTTAGCCTCTAAATTAA GAAAACTA
AAACTCTATTTTAGTTTTTTTATTTAATAATTTAGATATAAAATAGAATAAAATAAA GTGACTA
AAAATTAAACAAATACCCTTTAAGAAATTAAAAAAACTAAGGAAACATTTTTCTTGT TTCGAG
TAGATAATGCCAGCCTGTTAAACGCCGTCGACGAGTCTAACGGACACCAACCAGCGA ACC
AGCAGCGTCGCGTCGGGCCAAGCGAAGCAGACGGCACGGCATCTCTGTCGCTGCCTC TG
GACCCCTCTCGAGAGTTCCGCTCCACCGTTGGACTTGCTCCGCTGTCGGCATCCAGA AAT
TGCGTGGCGGAGCGGCAGACGTGAGCCGGCACGGCAGGCGGCCTCCTCCTCCTCTCA C
GGCACCGGCAGCTACGGGGGATTCCTTTCCCACCGCTCCTTCGCTTTCCCTTCCTCG CCC
GCCGTAATAAATAGACACCCCCTCCACACCCTCTTTCCCCAACCTCGTGTTGTTCGG AGCG
CACACACACACAACCAGATCTCCCCCAAATCCACCCGTCGGCACCTCCGCTTCAAG
SEQ ID NO: 84 Barley PHS1 target sequence
CGGAAGGCGGTGGCGAGCGGCGG
SEQ ID NO: 85 Barley PHS1 target sequence 2
GAGCGTGGCGAGCGATCGGGAGG
SEQ ID NO: 86 Barley PHS1 protospacer sequence 1
CGGAAGGCGGTGGCGAGCGG
SEQ ID NO: 87 Barley PHS1 protospacer sequence 2
GAGCGTGGCGAGCGATCGGG
SEQ ID NO: 88 Barley PHS1 complete sgRNA-encoding nucleic acid sequence 1
CGGAAGGCGGTGGCGAGCGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTA GTC
CGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT
SEQ ID NO: 89 Barley PHS1 complete sgRNA-encoding nucleic acid sequence 2
GAGCGTGGCGAGCGATCGGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGC
TAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTTT