comment on

This produces a slightly different output for your sample than you provided--but I think mine is right :)

(You'll have to explain to me why it isn't, if it isn't.)

This could probably be made quite a bit more efficient with further thought, but I wanted to check if it is correct first.

#! perl -slw
use strict;

my $k = 25;
my %repo = (
    "readA" => "GCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCA",
    "readB" => "TACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAAC",
    "readC" => "GCTGAGGCAGGAGAATTGCTTGAACTTAGGGGATG",
    "readD" => "TACTCGGGAGGCTGAGGCAGGAGAATTGCTTGAAC",
);
my @order = ( "readA_1", "readB_2", "readC_1", "readD_2");

my( @heads, @tails, $common );
while( @order ) {
    my( $s1, $p1, $s2, $p2 ) = map split( '_', shift @order ), 1 .. 2;
    ( $s1, $s2 ) = ( $s2, $s1 ) if $p1 > $p2;
    push @heads, substr $repo{ $s2 }, 0, length( $repo{ $s2 } ) - $k;
    push @tails, substr $repo{ $s1 }, $k;
    $common = substr $repo{ $s1 }, -$k unless $common;
}

my $head = '';
for my $p ( 0 .. length( $heads[0] )-1 ) {
    my %uniq;
    ++$uniq{ substr $heads[ $_ ], $p, 1 } for 0 .. $#heads;
    if( keys %uniq > 1 ) {
        $head .= '(' . join( ',', keys %uniq ) . ')';
    }
    else {
        $head .= each %uniq;
    }
}

my $tail = '';
for my $p ( 0 .. length( $tails[0] )-1 ) {
    my %uniq;
    ++$uniq{ substr $tails[ $_ ], $p, 1 } for 0 .. $#tails;
    if( keys %uniq > 1 ) {
        $tail .= '(' . join( ',', keys %uniq ) . ')';
    }
    else {
        $tail .= each %uniq;
    }
}
print $head, $common, $tail;

__END__
c:\test>868716
TACTC(A,G)GGAGGAGAATTGCTTGAACCTGGGAGGCA(T,C)T(A,G)GG(A,G)G(A,G)(T,C)(A
+,G)
[download]

Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.

"Science is about questioning the status quo. Questioning authority".

In the absence of evidence, opinion is indistinguishable from prejudice.

RIP an inspiration; A true Folk's Guy

In reply to Re: Mustering Reads by BrowserUk
in thread Mustering Reads by neversaint

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.