Thank you for your detailed and patient explanation

Several of my suspicions are confirmed

I must point out that at the outset I already knew that comparing original and shuffled DNA would NOT allow identification and/or elimination of the false positives. But only a calculation of the percentage of elements that are likely to be false positives. So your final reply confirms that unequivocally

You assert that because there is no predictive power due to the shuffling, this is useless. As a biologist, I would argue against that claim. When you need to experimentally verify a set of predictions, I would take a method that yields ~ 10% FDR over another method that suffers from ~ 40%. That way time, energy and resources are better utilized. So it does not matter so much which ones are real are not, if a large enough sample size is cross-verified experimentally, it should check out as per theoretical FDR predictions. IF it does not, something is wrong with the computation pipeline or the experimental verification protocol or both.

What is interesting however is your clear statement that since one sample is original and another is shuffled, the FDR calculation as # elements found in shuffled DNA vs. the original DNA is completely bogus! :) In my experience, this is how FDRs for sequence based analyses have been reported in published literature. I do not know if there are other viable methods to assess FDR. But your statement is of concern, in terms of any disconnect existing between the theory of FDR and how it might be applied by biologists

In any case, your replies were all enlightening. Thank you for taking the time to reply with patient explanations. I have enough fodder to go beyond my confusion and proceed with my analyses. Cheers!


In reply to Re^8: Window size for shuffling DNA? by onlyIDleft
in thread Window size for shuffling DNA? by onlyIDleft

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.