Categories
biology Genetics Plants

James and the Tiny Corn Part 3: Even Tinier

Back in 2015 we were one of the first groups to get to try out Fast Flowering Mini-Maize (FFMM) [1]. The plants were about two feet tall, flowered in five weeks, and were ready to harvest only 61 days after we planted them. But what if I told you that the same genotype could be even smaller?

This past summer a technician in the lab rediscovered our carefully guarded stash of FFMM seeds and we decided it was time to increase them. While we did most of the increase in the greenhouse, the idea came up at the same time we were finalizing the plans for our summer nursery* so we decided to plant the line in the field as well.

And this was the result:

Fast Flowering Mini Maize in the field in Lincoln Nebraska in the summer of 2021.
Fast Flowering Mini Maize in the field in Lincoln Nebraska in the summer of 2021. Planted May 13th. Photo July 1st. Non-fast flowering non-mini maize in the background was planted approximately one week earlier.
Categories
Genetics genomics Genotyping Plant breeding

Resequencing the sorghum association panel

A really nice thing about many crop plants is that through natural self pollination it is possible to create true breeding inbred lines. Inbred lines plants that are homozygous across all or nearly all of their genomes. If the same inbred plant is the used as the mother and father to produce new seeds, all those seeds will be genetically identical to the parent plant. Just like identical twins. And like identical twins, inbred lines make it possible to understand a LOT more about the interplay of genetics and environment since we have a chance to see how different or similar the characteristics of genetically identical individuals turn out to be.

Categories
Genetics genomics Genotyping

Correcting genotyping errors when constructing genetic maps from genotyping by sequencing — GBS — data.

When doing anything even vaguely related to quantitative genetics I would chose more missing data over more genotyping errors any day of the week. There are lots of approaches to making missing data less of a pain. The most straightforward of these is called imputation. Imputation essentially means using the genetic markers where you do have information to guess what the most likely genotypes would be at the markers where you don’t have any direct information on what the genotype is. This is possible because of a phenomenon known as linkage disequilibrium or “LD.” Both imputation and LD deserve their own entire write ups and they are on the list of potential topics for when I have another slow Sunday afternoon. For now the  only thing you have to know about them is that, when information on a specific genetic marker is missing, it is often possible to guess with fairly high accuracy what that missing information SHOULD be. But when the information on a specific genetic marker is WRONG… well it’s usually a bit more of a mess (but I think the software solutions for this are getting better! Details at the end of the post.)

Figure 1: Genotype calls along chromosome 1 for six recombinant inbred lines (RILs).

Categories
Genetics genomics

Hybrid vigor and missing genes

Thinking about defining the number of genes present in the maize genome reminded me of an old* story about the trouble of defining what truly represents a gene and how really awesome ideas can sometimes come years before the data needed to support them.

The year is 2002. The first complete version of the human genome is still a year away. The genomes of two plant species have already been published (rice and arabidopsis) but in terms of shere genome size, both species are a drop in the bucket compared to the human genome, or other plant genomes like corn or wheat. But none of this is particularly important except to set the stage.

Two researchers at Rutgers University were sequencing a tiny piece of the maize genome (~0.01%) that surrounded a single gene call bronze1 — the fifth most studied gene in maize — when they found something unexpected.

They had previously 10 identified genes in a single stretch of 32-kb of the maize genome. (A similar gene density throughout the remainder of the maize genome would have resulted in a maize genome containing more than 700,000 genes!) However it was already known that the maize genome was split between small gene-rich islands and vast desolate expanses of transposons (referred to as transposon nests**), and in fact the same study identified a couple of these nests of transposons on either side of their gene rich island (see part A of the second picture in this post).

Below I'll use cartoons, but here's a real and to scale example of a gene rich island I picked at random from maize chromosome 3. Genes and intergenic spaces are to scale. Base image generated with GenomeViewer, part of the CoGe toolkit. http://www.genomevolution.org/CoGe/

Their initial sequencing used DNA from a breed of corn called McC, which I must admit I’ve only ever read about in this particular paper. However, when they decided to sequenced the same region from the genome of B73*** they made three discoveries which I’ve listed in increasing order of strangeness:

Categories
biology Genetics genomics

Welcome to transposon week here at James and the Giant Corn!

I’m just about wrapped up with the big project I’ve been working on recently. Hope to be able to say more about it in the not-too-distant future. Having to be secretive in science sucks.

But there’s a lot of be happy about! I’m done teaching for a long time. As much as I enjoyed working with the kids in my class, the other responsibilities of teaching (grading, sitting through lectures without the chance to break in for the discussions and arguments that make academia so fun, grading, designing assignments, grading) were really starting to wear me down.

And I’m only three weeks (June 22nd) from either passing my qualifying exam or becoming a beaten and broken shell of a man. For three hours four professors will question me on everything I’ve learned (or should have learned but didn’t) in my education up to this point, and everything I propose to spend the next few years of my life doing. This may not sound like a good thing, but it is. Because my qualifying exam has been hanging over my head all semester,

The lab has a new paper in press, having run the sequential gauntlets of Peer Review, Editorial Evaluation, and finally (and perhaps most dreaded) Your-Figures-Aren’t-High-Resolution-Enough e-mails from the journal’s publication department. But more on the details of that whenever the paper actually shows up.

But what was the point of this entry again? Oh yeah. Transposons. I have a soft spot from transposons (I’m guessing most people who work with maize genetics do). Today we may know that transposons are found in practically every genome under the sun, but they were discovered first in maize using old school genetics (breeding plants together and counting traits in the offspring), before DNA sequencing was a gleam in its inventor’s eye.

And on top of that, some delightfully high-copy number transposons are in the middle of proving a major scientific point for me, so I figured the least I could do was devote a week to them here on the site.

If you’re not a geneticist, should you still care about transposons? Absolutely! Transposons are one of the best arguments, not for why genetic engineering is safe, but for why, if anyone worried about hypothetical unintended consequences of genetic engineering should be worried about any food with DNA in it (and as far as I know, that’s all food.) To paraphrase a seinfield character: “No food for you!”

The week’s schedule:

Categories
agriculture biofortified Genetics Plants

Where the superpowers of superweeds come from

Superman had the yellow sun of earth, spiderman had a radioactive spider-bite, but what about superweeds, where does their super power (surviving application of Round-up/glyphosate) come from?

To understand how superweeds survive, we first have to understand why normal weeds (the Jimmy Olsens and Lois Lanes of the plant world) die. <– last superhero reference of this post I promise.

Categories
agriculture Genetics Plants

Don’t judge the genetic diversity of a species by its cover

Photo: ekpatterson, flickr (click photo to see in original context)

There are more differences in the genomes of two unrelated corn plants than between the genomes of a human and a chimpanzee (two species separated by 3.5 million years of evolution).

On the other hand, two unrelated human beings, members of the same species, have more than four times as many genetic differences as two unrelated heirloom tomatoes.

Genetic Diversity:

Corn vs. Corn > Human vs. Chimpanzee >> Human vs. Human >> Heirloom Tomato vs. Heirloom Tomato

Now the fact that any two human beings are more closely related to each other than either is to a chimpanzee should be obvious to anyone who gives it a moments thought.

I plan to poll my sections tomorrow to see how many of them would put corn and heirloom tomatoes in the opposite positions, but many have figured out my feelings about corn, so they’ll probably guess it’s a trap.

Categories
biology Genetics

Helitron Capture Creating New Genes?

One of the things that has made annotating genes in the maize genome so difficult (there are currently two sets of gene models one with only 32,000 genes, which is low estimate, and the other with 100,000 is far too many) is the presence of large numbers of gene fragments that have been captured and duplicated by a class of transposon called helitrons (yes I know that sounds like a character from Transformers).

The helitron captured fragments are copied from real genes (often multiple pieces are captured from different genes) which is why many gene annotation programs (trained to recongize the difference between genes and non-coding DNA) will identify the fragments being genes themselves.

What if some of those fragments actually are genes? By combining pieces from completely different genes, helitrons could be a whole new source of crazy new genes that natural selection could act upon.

That is the question the authors of this poster are trying to get at, by identifying more helitron fragments and checking to see if those fragments were actually expressed in the genome.

Allison Barbaglia et al. “Accessing the transcriptional activity of Helitron-captured genes of maize” Poster #243 2010 Maize Meeting

Categories
biology Genetics genomics

Missing Genes on a Massive Scale

Edit: stripped out all the numbers as they clearly applied to an earlier version of the data and I don’t know if the new ones are intended for public release yet.

Last november when the maize genome was published, one of the companion papers looked at genes where a different number of copies were found in different breds of maize (this is called Copy Number Variation) and genes found in B73 (the variety of maize that was sequenced) but completely missing from the genomes of other varietes. There’s a great post on that paper written up by Mary at OpenHelix.

A few months later, it sounds like this dataset has grown substantially. Over XXXX B73 genes (that’s X% of the filtered B73 gene set!) that appear to be lost (or have sequences so different they no longer register) in at least some varities of maize. And because the new dataset incorporates data from XX different maize breds and XX different teosinte* lines they’re able to identify some of the losses as older because they’re found in multiple comparisons, while some appear to be lost in only a single breed, and might represent more recent losses.

Sit back and think about that for a second. At least X% of the genes in corn sometimes go missing. This could have implications for everything from inbreeding depressions and hybrid vigor, to the kind of basic research I’m actually working on myself.

As you can imagine I’d love to get my hands on this dataset myself, but the next best thing will be to take furious notes when Nathan Springer talks about the project on Friday morning**, and being sure to swing by Steven Eichten’s poster soak in the awesomeness.

Ruth A. Swanson-Wagner et al. “Combined Analysis of genomic structural variation and gene expression variation between maize and teosinte populations” Talk #1 2010 Maize Meeting (Presented by Nathan Spinger)

Steven R. Eichten et al. “Extenisve Copy Number Variation Among Maize Lines” Poster #139 2010 Maize Meeting

*Teosinte is the wild species from which maize/corn was domesticated.

**And he’s talking at 8:30 AM on a day when I still plan on being heavily jet lagged.

Categories
biology Genetics

Abnormal Chromosome 10

There is a piece of DNA that is sometimes found on the end of the tenth maize chromosome. In plants that possess this extra chromosome segment, chromosome knobs* (including one that’s a part of the extra segment included in abnormal chromosome 10) start to act like centromeres**. But this story graduates from odd to downright weird when I tell you that possessing this extra centromere-like activity gives a chromosome an unfair advantage in being passed on to the next generation.

Plants, like animals, possess two complete genome copies, one from each parent. They’ll only pass on one copy (mixtures of pieces from each parent) to their offspring. Any given sequence has a 50% chance of being passed on which seems fair given the plant is passing on 50% of its total genetic material. But abnormal chromosome ten cheats (using those extra centromere-like sequences I mentioned earlier). It has up to an 83% chance of being passed on.

Since the breed of corn (B73) the maize genome was based on has the normal version of chromosome 10, we know very little about the extra DNA found in abnormal chromosome 10. The authors of this poster are going to correct that oversight, by sequencing the region, figuring out how (and how long ago) abnormal chromosome 10 came into being, and hopefully identifying the genes within the region that make chromosome-knobs act like centromeres.

*Knobs are dense segments of DNA that scientists have been able to spot visually within chromosomes since before we knew for sure that chromosomes carried genetic information.

**Centromeres are the part of the chromosomes that bind together during cell division (the center of the X in the traditional drawing of a chromosome). They’re also the place where the molecular machinery that pulls chromosomes apart at the end of the process of cell division.

Lisa Kanizay and Kelly R. Dawe “Uncovering the sequence and structure of maize abnormal chromosome 10” Poster #165 2010 Maize Meeting.