<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>James and the Giant Corn</title>
	<atom:link href="http://www.jamesandthegiantcorn.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jamesandthegiantcorn.com</link>
	<description>Genetics: Studying the Source Code of Nature</description>
	<lastBuildDate>Fri, 11 Jan 2013 22:19:55 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5</generator>
		<item>
		<title>Headed to PAG</title>
		<link>http://www.jamesandthegiantcorn.com/2013/01/11/headed-to-pag/</link>
		<comments>http://www.jamesandthegiantcorn.com/2013/01/11/headed-to-pag/#comments</comments>
		<pubDate>Fri, 11 Jan 2013 22:19:55 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2134</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>This will be my third year attending the Plant and Animal Genome conference in sunny San Diego. I&#8217;ve been fortunate enough to get to experience the conference in a bunch of different roles.</p>
<ul>
<li>My first year I was an overwhelmed young grad student with a poster and the silly idea that I could pack my schedule full of sessions all day every day without suffering melting of the brain. (You really need to pick and choose at PAG. It&#8217;s like an all-you-can-eat buffet of science; it is all too easy to go overboard.)</li>
<li>My second year I returned to PAG as an actual presenter giving two talks to packed sessions (which isn&#8217;t an endorsement of my own science; I was sandwiched between successful scientists who also happened to be gifted speakers both times).</li>
<li>And now in my third year I&#8217;ll get to see PAG through the eyes of an exhibitor. No, this doesn&#8217;t represent my post-PhD career path. This year PAG happened to fall in the break between filing my dissertation and the start of my next &#8220;real&#8221; job.</li>
</ul>
<p>Anyway, my plane is about to board so I should wrap this up. To all the rest of you who are coming to the conference: I hope you have a great time, don&#8217;t push yourselves too hard, and drop me a line if you&#8217;d like me to hook you up with a free t-shirt. <img src='http://www.jamesandthegiantcorn.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2013/01/11/headed-to-pag/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Changes in Perspective</title>
		<link>http://www.jamesandthegiantcorn.com/2012/12/19/changes-in-perspective/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/12/19/changes-in-perspective/#comments</comments>
		<pubDate>Thu, 20 Dec 2012 01:45:39 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2130</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>An old PhD Comics strip explains the change in perspective that comes with graduating:</p>
<p style="text-align: left;"><a href="http://www.phdcomics.com/comics/archive.php?comicid=281" rel="attachment wp-att-2131"><img class="aligncenter size-full wp-image-2131" alt="phd020802s" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/12/phd020802s.gif" width="600" height="271" /></a>My transformation obviously isn&#8217;t complete yet, though. Lab meetings with pizza sound like a wonderful idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/12/19/changes-in-perspective/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Completion</title>
		<link>http://www.jamesandthegiantcorn.com/2012/12/18/completion/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/12/18/completion/#comments</comments>
		<pubDate>Wed, 19 Dec 2012 00:53:56 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2122</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>Over the last couple of years my posts here have really dropped off. It hasn&#8217;t been because I ran out of material or lost interest in blogging but simply because more and more of my time and energy have been consumed by a single goal&#8230; graduating.</p>
<p>So it gives me great pleasure to report that, as of December 14th (last Friday), I have reached that goal.</p>
<div id="attachment_2124" class="wp-caption aligncenter" style="width: 378px"><a href="http://www.jamesandthegiantcorn.com/2012/12/18/completion/a99b99zcyai2-si/" rel="attachment wp-att-2124"><img class=" wp-image-2124" alt="A99B99zCYAI2-Si" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/12/A99B99zCYAI2-Si.jpeg" width="368" height="277" /></a><p class="wp-caption-text">Behold! The lollipop handed to every newly minted Berkeley PhD when their thesis is accepted.</p></div>
<p style="text-align: left;">What was my thesis about, you ask? Well, I still don&#8217;t have a good elevator speech, so let me simply say that the first part of my thesis has to do with how plant genomes change over time and the second part demonstrated a new method for learning the function of pieces of DNA which don&#8217;t code for proteins but instead determine where and when neighboring genes will be turned on or off.</p>
<p style="text-align: left;">So what&#8217;s next? This whole site traces its origins back to travel posts I put up to let friends and family know how I was doing as I interviewed at various graduate schools. So I suppose there would be a fair bit of symmetry to shutting it down as I leave grad school, but I don&#8217;t want to do that. Now that I&#8217;m finished with my PhD, I&#8217;m looking forward to rediscovering the things I used to do for fun, and I remember writing updates here used to be a lot of fun.</p>
<p style="text-align: left;">On a more practical level, what comes next for me is a 2,000-mile drive from California to the Midwest (with all my worldly possessions packed into the back of my car) to visit family for the holidays. I am suddenly very conscious of the fact I haven&#8217;t driven on snow in more than four years. After that it&#8217;ll be onward to a post-doc.</p>
<p style="text-align: left;">If you&#8217;ve left an unanswered comment in the last six months or so and are still interested in me getting back to you, let me know.</p>
<p style="text-align: left;">For now&#8230; it is good to be back.</p>
<p style="text-align: left;"><a href="http://www.jamesandthegiantcorn.com/2012/12/18/completion/grad_school/" rel="attachment wp-att-2125"><img class="aligncenter size-full wp-image-2125" alt="grad_school" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/12/grad_school.png" width="593" height="219" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/12/18/completion/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Guide to Reconstructing The Maize Subgenomes.</title>
		<link>http://www.jamesandthegiantcorn.com/2012/06/29/guide-to-reconstructing-the-maize-subgenomes/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/06/29/guide-to-reconstructing-the-maize-subgenomes/#comments</comments>
		<pubDate>Fri, 29 Jun 2012 17:13:10 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2117</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p><em>I&#8217;m writing this guide because I get so many questions about this step in one of my published papers. (Well, more accurately, my PI gets questions about this step and he sometimes forwards them on to me for an answer.) The paper referred to in this guide is <a href="http://www.pnas.org/content/108/10/4069.abstract">this one</a>.</em></p>
<p>There are two completely different steps to reconstructing maize subgenomes: 1) putting together ancestral chromosome pairs, and 2) grouping one copy of each ancestral chromosome together into subgenome 1 and the other copy of each ancestral chromosome into subgenome 2.</p>
<p><strong>Ancestral chromosome pair reconstruction:<span id="more-2117"></span></strong></p>
<p>This step hinges on a single assertion: inversions and other forms of within-chromosome rearrangement significantly outnumber translocations (movement of sequence from one chromosome to another). This observation has been backed up by a number of previous studies.</p>
<p>Given that fact (within-chromosome rearrangements are more common than between-chromosome rearrangements), when two different pieces of the same maize chromosome are orthologous to different parts of the same sorghum chromosome, we can say that those parts of the maize chromosome originated from the same copy of that sorghum chromosome in the twenty-chromosome tetraploid ancestor of maize (which carried two chromosomes equivalent to each single chromosome of sorghum).</p>
<p>A good example of this is the comparison between maize chromosome 1 and sorghum chromosome 1 in Figure 1 of the paper you are inquiring about. There are actually two separate segments of maize chromosome 1 which are orthologous to different parts of sorghum chromosome 1. In theory each segment could have come from a different copy of sorghum chromosome 1 in the tetraploid maize ancestor, with the two later combined on one maize chromosome through translocations, but the most parsimonious explanation is that both segments came from a single ancestral copy of that chromosome.</p>
<p>Once we reassemble that first copy of sorghum chromosome 1 (the two pieces on maize chromosome 1), we know that any remaining pieces of the maize genome orthologous to sorghum chromosome 1 must have come from the second copy of that chromosome in the tetraploid ancestor of maize. In this case there is one piece on maize chromosome 5 and one piece on maize chromosome 9, but since we have already reconstructed one whole copy of the chromosome, the only source for these two segments is the second chromosome copy.</p>
<p>Repeat this logic 9 more times and you have reconstructed ten pairs of ancestral maize chromosomes, just like we did in the paper. (And as cited in our PNAS paper, we weren&#8217;t the first to come up with this idea. Before the maize genome was even complete or the sorghum genome was published, another group was able to derive approximately the same ten chromosome pairs by comparing a genetic map of maize to the rice genome: <a href="http://www.plosgenetics.org/article/info%3Adoi%2F10.1371%2Fjournal.pgen.0030123">http://www.plosgenetics.org/article/info%3Adoi%2F10.1371%2Fjournal.pgen.0030123</a> )</p>
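<p>For anyone who thinks better in code, here is a minimal Python sketch of the pairing logic described above. It is not the pipeline used in the paper. The block coordinates are invented, the function name <em>reconstruct_pairs</em> is hypothetical, and seeding the first copy with the maize chromosome that covers the most sorghum sequence is a simplifying assumption of mine standing in for the parsimony argument.</p>

```python
from collections import defaultdict

# Hypothetical syntenic blocks: (maize chromosome, sorghum chromosome,
# start and end of the block on the sorghum chromosome, in Mb).
# The coordinates are invented; only the chromosome layout follows the text.
BLOCKS = [
    ("maize1", "sorghum1", 0, 30),
    ("maize1", "sorghum1", 30, 70),
    ("maize5", "sorghum1", 0, 45),
    ("maize9", "sorghum1", 45, 70),
]


def reconstruct_pairs(blocks):
    """Split the maize blocks matching each sorghum chromosome into two copies."""
    by_sorghum = defaultdict(lambda: defaultdict(list))
    for maize_chr, sorghum_chr, start, end in blocks:
        by_sorghum[sorghum_chr][maize_chr].append((start, end))

    pairs = {}
    for sorghum_chr, by_maize in by_sorghum.items():
        # Assumption: seed the first copy with the maize chromosome carrying
        # the most sorghum sequence, since rearrangements within a chromosome
        # are taken to be more common than translocations between chromosomes.
        def covered(maize_chr):
            return sum(end - start for start, end in by_maize[maize_chr])

        seed = max(sorted(by_maize), key=covered)
        first = [(seed, span) for span in sorted(by_maize[seed])]
        # Everything left over must come from the second ancestral copy.
        second = [
            (maize_chr, span)
            for maize_chr in sorted(by_maize)
            if maize_chr != seed
            for span in sorted(by_maize[maize_chr])
        ]
        pairs[sorghum_chr] = (first, second)
    return pairs


pairs = reconstruct_pairs(BLOCKS)
first_copy, second_copy = pairs["sorghum1"]
print("first copy:", first_copy)
print("second copy:", second_copy)
```

<p>With this toy input the two maize chromosome 1 segments land in the first copy and the segments on maize chromosomes 5 and 9 land in the second, matching the sorghum chromosome 1 example above. Real data would need proper handling of overlapping or nested blocks, which this sketch ignores.</p>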
<p><strong>Assigning ancestral chromosomes to subgenomes</strong></p>
<p>In this step we examined the number of maize-sorghum orthologs shared between each maize ancestral chromosome and the orthologous sorghum chromosome. This is Figure 2 from our 2011 PNAS paper. As you can see in that figure, for each pair of maize ancestral chromosomes, one copy retains a lower percentage of genes found in sorghum along its entire length and the other copy retains a higher percentage of genes found in sorghum along its entire length. (To avoid being thrown off by new/transposed genes in sorghum we excluded sorghum genes without orthologs in rice, but you can get the same results without that filtering step; the main difference is that the % retained is lower.)</p>
<p>The maize ancestral chromosome copy which retained a higher percentage of genes also found in sorghum was assigned to maize subgenome 1 and the maize ancestral chromosome copy which retained a smaller percentage of genes also found in sorghum was assigned to maize subgenome 2.</p>
<p>Then we went back and colored maize subgenome 1 blue in Figures 1 &amp; 2 and maize subgenome 2 red in those same two figures. I think this is one of the things that confuses people: the red and blue color coding in Figure 1 shows a distinction we didn&#8217;t actually make until after the analysis shown in Figure 2.</p>
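<p>The assignment rule itself is simple enough to sketch in a few lines of Python. Again, this is only an illustration: the gene counts are invented rather than taken from the paper, and the names <em>PAIRS</em> and <em>assign_subgenomes</em> are hypothetical.</p>

```python
# Hypothetical counts for two ancestral chromosome pairs: for each copy, how
# many sorghum genes were examined and how many still have a maize ortholog.
# The numbers are invented for illustration and are not from the paper.
PAIRS = {
    "sorghum1": {"copy_a": (1000, 620), "copy_b": (1000, 410)},
    "sorghum2": {"copy_a": (800, 300), "copy_b": (800, 520)},
}


def assign_subgenomes(pairs):
    """Label the copy retaining more sorghum genes as subgenome 1."""
    assignment = {}
    for sorghum_chr, copies in pairs.items():
        # Fraction of examined sorghum genes retained on each copy.
        retention = {
            name: retained / examined
            for name, (examined, retained) in copies.items()
        }
        # Exactly two copies per pair are assumed; highest retention first.
        high, low = sorted(retention, key=retention.get, reverse=True)
        assignment[sorghum_chr] = {"subgenome1": high, "subgenome2": low}
    return assignment


assignment = assign_subgenomes(PAIRS)
for sorghum_chr in sorted(assignment):
    print(sorghum_chr, assignment[sorghum_chr])
```

<p>Note that which copy gets called subgenome 1 is decided pair by pair, using nothing but the retention percentages. The colors only come afterwards.</p>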
<p>Schnable, James C., Nathan M. Springer, and Michael Freeling. <a href="http://www.pnas.org/content/108/10/4069.full">“Differentiation of the Maize Subgenomes by Genome Dominance and Both Ancient and Ongoing Gene Loss.”</a> Proceedings of the National Academy of Sciences 108, no. 10 (March 8, 2011): 4069–4074.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/06/29/guide-to-reconstructing-the-maize-subgenomes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Success in Grad School</title>
		<link>http://www.jamesandthegiantcorn.com/2012/05/05/success-in-grad-school/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/05/05/success-in-grad-school/#comments</comments>
		<pubDate>Sat, 05 May 2012 17:13:52 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2114</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>Success in grad school doesn&#8217;t come from working incredibly hard.</p>
<p>It comes from setting unrealistically fast deadlines for yourself. And then meeting them.</p>
<p>Sometimes that means working early mornings, late nights, and weekends. Sometimes it means coming up with a new approach, getting the results in three hours, and sneaking out of lab at 3:30. But the point is the results are what matter. If you can find ways to be unexpectedly productive you&#8217;re much less likely to burn out entirely than if you can only ever meet your own deadlines by burning the midnight oil at both ends<em> (mixed metaphor intended)</em>.</p>
<p>Working hard for the sake of appearing to work hard (either to others or to yourself) is the surest road to burnout and lack of results.</p>
<p>P.S. Productivity goes up at least 5-fold when not also teaching. <img src='http://www.jamesandthegiantcorn.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':-D' class='wp-smiley' /> </p>
<p>P.P.S. If the reagents you are working with are as old as you are, you need to worry. <img src='http://www.jamesandthegiantcorn.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  (That falls into the working hard but not getting results category.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/05/05/success-in-grad-school/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pretend Grant Deadlines</title>
		<link>http://www.jamesandthegiantcorn.com/2012/04/29/pretend-grant-deadlines/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/04/29/pretend-grant-deadlines/#comments</comments>
		<pubDate>Mon, 30 Apr 2012 06:42:45 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2108</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>No chance of getting actual funding, just a silly course I signed up for this semester before I realized how crazy everything was going to be between TAing, trying to teach myself how to make RNA-seq libraries, and at least half a dozen collaborations (all of them urgent). I&#8217;ve been writing and analyzing and figure making for the past two days straight and turned in my final grant proposal at 10:50 tonight with a good 70 minutes to spare.</p>
<p>And all I can say is&#8230;.</p>
<p><em>what a rush! </em>This is why I love what I do for a living. Two days of improvising and lit-searching and throwing different approaches against the wall to see what would stick. And in the last 24 hours I finally managed to turn my proposal into a project I would actually enjoy carrying out.</p>
<p>The only problem is that now I kind of want to spend next weekend doing the same thing. Ideally with a shot at actually getting some cash if I successfully sold people on the value of my research. It&#8217;s been a couple of months but I&#8217;ve finally been re-bitten by the science bug! Speaking of which, I should wrap this up. My alarm is set for 7 AM tomorrow so I can get to lab in time to squeeze in an RNA extraction before class. I&#8217;m taking yet another shot at building a proper sequencing library. Wish me luck!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/04/29/pretend-grant-deadlines/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>In which I apologize to R</title>
		<link>http://www.jamesandthegiantcorn.com/2012/04/23/in-which-i-apologize-to-r/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/04/23/in-which-i-apologize-to-r/#comments</comments>
		<pubDate>Mon, 23 Apr 2012 21:47:55 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[biology]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2102</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>R, you may be a confusing and hard to understand language where every package comes with its own set of quirks and foibles. You may make me feel less like a programmer and more like a not-very-well trained magician fumbling around for the right incantation to make magic happen.</p>
<p>But when you work, you do awesome things.</p>
<div id="attachment_2103" class="wp-caption aligncenter" style="width: 520px"><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/04/awesome2.png"><img class=" wp-image-2103 " title="awesome2" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/04/awesome2.png" alt="" width="510" height="512" /></a><p class="wp-caption-text">Sex-specific splicing of a gene of unknown function that is syntenically conserved in all grass species.</p></div>
<p>With only four days&#8217; work I was able to go from a giant pile of reads (from the still not properly appreciated Davidson 2011 The Plant Genome) to figures like the one above.</p>
<p>So what is the figure above showing you? One of a large number of genes which show a different pattern of splicing in male and female reproductive organs in maize.* The region &#8220;E8&#8221; is usually treated as exonic in female reproductive tissues but is spliced out like an intron in male reproductive tissues. What does it mean (if anything)? I have no idea yet! But it would have been a real pain to try to re-invent the wheel for identifying these differentially spliced genes in Python. In R, once I figured out the right incantation, it&#8217;s practically plug and play for any gene you could possibly be interested in. Including the software for the (actually quite useful) visualization shown above.</p>
<p>So thank you R. What you do &#8212; once I can figure out how to make you do it &#8212; you do incredibly well.</p>
<p>*Maize makes it easy for us by separating female and male flowers into two entirely different organs (the ear and tassel respectively).</p>
<p>Data from:</p>
<div class="csl-bib-body" style="line-height: 2; padding-left: 2em; text-indent: -2em;">
<div class="csl-entry"><span style="font-variant: small-caps;">Davidson</span> R. M., <span style="font-variant: small-caps;">Hansey</span> C. N., <span style="font-variant: small-caps;">Gowda</span> M., <span style="font-variant: small-caps;">Childs</span> K. L., <span style="font-variant: small-caps;">Lin</span> H., <span style="font-variant: small-caps;">Vaillancourt</span> B., <span style="font-variant: small-caps;">Sekhon</span> R. S., <span style="font-variant: small-caps;">Leon</span> N. <span style="font-variant: small-caps;">de</span>, <span style="font-variant: small-caps;">Kaeppler</span> S. M., <span style="font-variant: small-caps;">Jiang</span> N., <span style="font-variant: small-caps;">Buell</span> C. R., 2011  Utility of RNA Sequencing for Analysis of Maize Reproductive Transcriptomes. Plant Genome <strong>4</strong>: 191–203. doi:<a href="http://dx.doi.org/10.3835/plantgenome2011.05.0015">10.3835/plantgenome2011.05.0015</a>.</div>
</div>
<p>Analyzed using the R package DEXSeq:</p>
<p>Anders S, Reyes A, Huber W. 2012 <a href="http://precedings.nature.com/documents/6837/version/2/files/npre20126837-2.pdf">Detecting differential usage of exons from RNA-Seq Data</a>. Unpublished. (Link is to a PDF)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/04/23/in-which-i-apologize-to-r/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I/O Limited: Assorted Updates</title>
		<link>http://www.jamesandthegiantcorn.com/2012/04/22/io-limited-assorted-updates/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/04/22/io-limited-assorted-updates/#comments</comments>
		<pubDate>Sun, 22 Apr 2012 20:56:31 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2099</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>I doubt this will be of interest to that many people but here&#8217;s the list of what I&#8217;m working on this Sunday (each item is a separate project/collaboration):</p>
<ul>
<li>Downloading, decompressing and quality/adapter trimming more than 800 million RNA-seq reads (four full HiSeq 2000 lanes).</li>
<li>Attempting to make my very own transcriptome assembly for a species where the genome is available but doesn&#8217;t look to be published anytime soon.</li>
<li>Figuring out how to look at differential use of exons in maize between male and female floral structures. (Later on this will involve using some R packages. I&#8217;m not looking forward to that part. R always makes me feel like I&#8217;m coding with one hand tied behind my back.)</li>
</ul>
<p>The surprising part is that I&#8217;m not being held up by a lack of processors to throw at the problem (the usual problem in computational work), nor a limited supply of RAM (probably the biggest problem in bioinformatics specifically). Instead I&#8217;m hitting the limit of how fast all these various programs can read data off of hard drives and write results back. Right now I am waiting for a little surplus capacity to free up.</p>
<p>It&#8217;s hard to believe that eight months from now this will all be over. I started my education back in 1990. If they kept numbering years in school after high school I&#8217;d be a 20th grader right now. But my adviser has informed me that I need to have graduated by this December, so that&#8217;s what I have to make happen. Next week is my last as a graduate student instructor. This summer and part of the fall will be a mad sprint to finish up various projects and collaborations and get them written up for publication; then thesis writing, signing, and submitting are all that stand between me and (hopefully) the last degree I&#8217;ll ever need to earn.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/04/22/io-limited-assorted-updates/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>qTeller part 2: Eye candy!</title>
		<link>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-part-2-eye-candy/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-part-2-eye-candy/#comments</comments>
		<pubDate>Tue, 13 Mar 2012 23:31:36 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2087</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>qTeller isn&#8217;t just for <a href="http://www.jamesandthegiantcorn.com/2012/03/13/qteller-a-way-to-find-candidate-genes/">generating spreadsheets full of data on genes within a genomic region</a>. It can also visualize published expression data for a single gene. For example, here is the expression pattern of a gene called golden plant2, involved in regulating photosynthetic development in maize, which was first described <a href="http://www.maizegdb.org/cgi-bin/displayrefrecord.cgi?id=13314">all the way back in 1926 in an article in The American Naturalist</a>:</p>
<p><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/g2.png"><img class="aligncenter size-large wp-image-2088" title="g2" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/g2-1024x614.png" alt="" width="580" height="347" /></a></p>
<p>As you can see, golden plant2 is expressed at high levels in photosynthetic tissues and not expressed at all in tissues like roots, endosperm, and pollen. Do you know how long it would have taken me to profile the plant-wide expression pattern of a gene this comprehensively by isolating RNA from different tissues and running qPCR? WEEKS! Do you know how long it took for me to get the same level of insight with qTeller? 90 seconds!</p>
<p>Do you know how long it&#8217;ll take you to regenerate this same analysis? 30 seconds. Just click <a href="http://qteller.com/qteller3/bar_chart.php?name=GRMZM2G087804&amp;info=">this link</a>. There have been <a href="http://dx.doi.org/10.1105/tpc.111.092668">so</a> <a href="http://dx.doi.org/10.3835/plantgenome2011.05.0015">many</a> <a href="http://dx.doi.org/10.1038/ng.703">awesome</a> <a href="http://dx.doi.org/10.1105/tpc.109.065714">RNA-seq</a> <a href="http://dx.doi.org/10.1371/journal.pgen.1000737">papers</a> coming out recently for maize. I know when I arrive in Portland on Thursday for the <a href="http://www.maizegdb.org/maize_meeting/2012/">Maize Genetics Conference</a> I&#8217;m going to see a whole lot more even bigger/better RNA-seq datasets which people haven&#8217;t finished writing up yet. Some of these datasets have been on posters since the very first maize meeting I went to back in 2009 when I was a wide-eyed first year and may <em>never</em> get published.* But others will be published weeks or months from now, making this visualization all the more powerful.</p>
<p>But for now, MORE EYE CANDY:</p>
<p><a href="http://www.maizegdb.org/cgi-bin/displaylocusrecord.cgi?id=12048">Anther ear1</a></p>
<div id="attachment_2089" class="wp-caption aligncenter" style="width: 590px"><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/an1.png"><img class="size-large wp-image-2089" title="an1" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/an1-1024x614.png" alt="" width="580" height="347" /></a><p class="wp-caption-text">Expression of anther ear1, a mutant in the gibberellic acid biosynthetic pathway</p></div>
<p><a href="http://qteller.com/qteller3/bar_chart.php?name=GRMZM2G081554&amp;info=">Link to regenerate analysis</a></p>
<p><a href="http://www.maizegdb.org/cgi-bin/displaylocusrecord.cgi?id=12242">Glossy1</a></p>
<p><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/gl1b.png"><img class="aligncenter size-large wp-image-2090" title="gl1b" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/gl1b-1024x614.png" alt="" width="580" height="347" /></a></p>
<p><a href="http://qteller.com/qteller3/bar_chart.php?name=GRMZM2G114642&amp;info=">Link to regenerate analysis. </a></p>
<p><em>Glossy1</em> mutants change the type of wax produced on the leaves of developing maize seedlings, so it makes sense that the gene shows high expression in both maize seedlings and mature leaves. I can even sort of explain away the high expression in developing seeds and embryos, since the primordia which will eventually become the first leaves of the next generation of corn plants are beginning to form. But why in the world does <em>glossy1</em> show such high levels of expression in anthers?</p>
<p>*Here is the relevant excerpt from <a href="http://www.jamesandthegiantcorn.com/2011/07/13/what-not-to-do-with-your-fresh-rna-seq-dataset-a-rant/">my previous rant on data analysis</a>:</p>
<blockquote><p>I recently did the math on a PLoS Genetics paper published in late 2009 based on a single in-depth analysis of RNA-seq comparisons of mutant and non-mutant siblings. Today we could generate the same dataset, with twice the depth of sequencing, for less than $1,000 (INCLUDING reagent costs). The takeaway lesson here:<strong> just because your dataset was expensive to generate doesn’t mean you don’t have to worry about the competition stealing the glory if you take more than a year to publish. </strong></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-part-2-eye-candy/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>qTeller: an easier way to find candidate genes</title>
		<link>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-a-way-to-find-candidate-genes/</link>
		<comments>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-a-way-to-find-candidate-genes/#comments</comments>
		<pubDate>Tue, 13 Mar 2012 22:13:18 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.jamesandthegiantcorn.com/?p=2075</guid>
		<description><![CDATA[ [...]]]></description>
				<content:encoded><![CDATA[<p>Hunting for good candidate genes is something biologists spend a lot of their time doing. Here are a couple of hypothetical examples:</p>
<p>A) Suzzy the grad student is mapping a recessive mutant which makes the pollen of corn plants shrivel up and die. By examining a bunch of known genetic markers in plants with dead pollen and in their siblings that produce normal pollen, she has narrowed the location of the gene responsible for her trait down to a region of only a couple of megabases on the fifth chromosome of maize. Since the whole maize genome contains over 2,300 megabases of sequence, that means she&#8217;s already ruled out 99.9% of the genome. But her region still contains, say, a dozen genes, and she needs to know which one she should check first to see if a mutation in it is responsible for her mutant phenotype.</p>
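<p>As a rough check on that 99.9% figure, here is a minimal Python sketch of the arithmetic. The 2 megabase interval is an assumed stand-in for a couple of megabases, and the function name is made up for illustration:</p>

```python
def fraction_ruled_out(interval_mb, genome_mb):
    """Return the fraction of the genome lying outside the mapped interval."""
    return 1 - interval_mb / genome_mb


# Assumed values: a 2 Mb interval within a 2,300 Mb maize genome.
ruled_out = fraction_ruled_out(2, 2300)
print(f"{ruled_out:.1%} of the genome ruled out")
```

<p>Any interval of 2.3 megabases or less excludes at least 99.9% of a 2,300 megabase genome, so the exact size of the interval does not change the conclusion much.</p>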
<p>B) Johnny is another grad student. He wants to understand how corn plants genetically regulate how wide their leaves will grow to be. By measuring a lot of plants descended from two parents, each with known genotypes, he can identify regions of the genome where inheriting information from one parent or the other seems to be correlated with either wider or narrower leaves. He calls these regions quantitative trait loci (or QTLs). Now he has picked the genetic region that seems to have the biggest effect, and he wants to know what gene within the region is actually responsible for the effect.</p>
<p>There are a number of ways for both Johnny and Suzzy to narrow down their lists to the genes most likely responsible for the changes they are each observing in corn plants:<span id="more-2075"></span></p>
<ol>
<li>An okay candidate might contain a protein domain with a function related to the plant trait they are studying.</li>
<li>A good candidate might show a specific pattern of expression in the part of the plant where they observe their phenotype (pollen for Suzzy and developing leaves for Johnny).</li>
<li>A GREAT candidate would be a gene which has already been studied by another group and has a mutant phenotype related to the phenotype currently observed.*</li>
</ol>
<p>Checking out the protein domains of genes under a QTL or mutant interval can be accomplished in almost any genome browser (for example the ones provided by <a href="http://www.maizegdb.org/">MaizeGDB</a>, or <a href="http://genomevolution.org/CoGe/">CoGe</a>).</p>
<p>Checking the expression patterns of genes is harder. One option for plants is a website called <a href="http://www.plexdb.org/modules/PD_browse/experiment_browser.php?experiment=ZM37">PlexDB</a> which lets you look up the expression of individual genes in different people&#8217;s microarray experiments. The biggest problem with microarrays is that it really is impossible to compare expression between different people&#8217;s microarray experiments.</p>
<p>Identifying known mutant genes within a genomic interval used to be a real pain. Now Suzzy and Johnny could search <a href="http://genomevolution.org/wiki/index.php/Classical_Maize_Genes">the classical maize gene list</a> to see if any of the genes in their intervals appear on it, but it is still not what you&#8217;d call efficient.</p>
<p>Now imagine <a href="http://qteller.com/qteller3/">a website that let you check all those things in one step</a>!</p>
<div id="attachment_2076" class="wp-caption aligncenter" style="width: 556px"><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/spreadsheet_example.png"><img class=" wp-image-2076 " title="spreadsheet_example" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/spreadsheet_example-1024x262.png" alt="" width="546" height="140" /></a><p class="wp-caption-text">Example of data on genes in a region 10 megabases from the start of maize chromosome 1. Click to zoom in and make the text big enough to read.</p></div>
<p>The automated annotations of protein domains are the same ones you&#8217;d find in any of the genome browsers above, but instead of checking each gene in turn, qTeller reports them all in a handy sortable spreadsheet (which you can either view online or download to your computer).</p>
<p>It also incorporates measurements of gene expression using RNA-seq data from papers published by the maize community.** To make the numbers as comparable as possible I went back to the raw reads provided by NCBI&#8217;s Sequence Read Archive and took each dataset through the same analytical pipeline.</p>
<p>And, of course, it reports any classical maize genes which lie within the interval a researcher is studying (taken from the version two classical maize gene list).</p>
<p>Plus syntenic orthologs in other species. Because, at least in the grasses, genes with mutant phenotypes are <a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0017855">disproportionately likely to have been retained at the same location in the genome of lots of different grass species</a>.</p>
<p><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/Front_page.png"><img class="aligncenter size-full wp-image-2077" title="Front_page" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/Front_page.png" alt="" width="1055" height="767" /></a></p>
<p>Suzzy found a gene within her interval which was expressed at much higher levels in pollen than in leaf tissue:</p>
<p><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/Suzzy-candidate.png"><img class="aligncenter size-full wp-image-2082" title="Suzzy-candidate" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/Suzzy-candidate.png" alt="" width="638" height="158" /></a></p>
<p>And Johnny realized that his QTL contained the gene <a href="http://www.maizegdb.org/cgi-bin/displaylocusrecord.cgi?id=969623">milkweed pod1</a>, a classical maize mutant known from previously published papers to be involved in regulating leaf development.</p>
<p><a href="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/JohnnyCandidate.png"><img class="aligncenter size-full wp-image-2083" title="JohnnyCandidate" src="http://www.jamesandthegiantcorn.com/wp-content/uploads/2012/03/JohnnyCandidate.png" alt="" width="603" height="212" /></a></p>
<p><em>So yeah, <a href="http://qteller.com/qteller3/">qTeller</a> is what I&#8217;ve been working on for the past few months. Please let me know if you run into any bugs or have any questions.  -James</em></p>
<p>*Admittedly this would be rather disappointing news for the grad students involved (it&#8217;s a lot harder to get a splashy paper out of rediscovering a known mutant), but it&#8217;s much better to find out you might be studying a known gene early so you can check and cut your losses if it turns out to be true, instead of after you sink another couple of years into recloning and recharacterizing it.</p>
<p>**The authors of all these papers deserve a whole bunch of credit for generating these datasets:</p>
<ul>
<li>Waters AJ, Makarevitch I, Eichten SR, Swanson-Wagner RA, Yeh C-T, et al. (2011) Parent-of-Origin Effects on Gene Expression and DNA Methylation in the Maize Endosperm. <em>The Plant Cell</em> doi:<a href="http://dx.doi.org/10.1105/tpc.111.092668">10.1105/tpc.111.092668</a>.</li>
<li>Davidson RM, Hansey CN, Gowda M, Childs KL, Lin H, et al. (2011) Utility of RNA Sequencing for Analysis of Maize Reproductive Transcriptomes. <em>The Plant Genome</em> 4: 191-203. doi:<a href="http://dx.doi.org/10.3835/plantgenome2011.05.0015">10.3835/plantgenome2011.05.0015</a></li>
<li>Li, P., Ponnala, L., Gandotra, N., Wang, L., Si, Y., et al. (2010) The developmental dynamics of the maize leaf transcriptome. <em>Nature Genetics</em> 42: 1060-1067. doi:<a href="http://dx.doi.org/10.1038/ng.703">10.1038/ng.703</a></li>
<li>Wang, X., Elling, A.A., Li, X., Li, N., Peng, Z., et al. (2009) Genome-Wide and Organ-Specific Landscapes of Epigenetic Modifications and Their Relationships to mRNA and Small RNA Transcriptomes in Maize. <em>Plant Cell</em> 21: 1053-1069. doi:<a href="http://dx.doi.org/10.1105/tpc.109.065714">10.1105/tpc.109.065714</a></li>
<li>Jia, Y., Lisch, D.R., Ohtsu, K., Scanlon, M.J., Nettleton, D., et al. (2009) Loss of RNA Dependent RNA Polymerase 2 (RDR2) Function Causes Widespread and Unexpected Changes in the Expression of Transposons, Genes, and 24-nt Small RNAs. <em>PLoS Genet</em> 5: e1000737. doi:<a href="http://dx.doi.org/10.1371/journal.pgen.1000737">10.1371/journal.pgen.1000737</a></li>
<li><a href="http://maizegametophyte.org/">The Maize Gametophyte Project</a>: Unpublished Dataset <a href="http://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP006965">SRP006965</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.jamesandthegiantcorn.com/2012/03/13/qteller-a-way-to-find-candidate-genes/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
