in reply to Re^3: mismatching characters in dna sequence
in thread mismatching characters in dna sequence

ok, i think inline::c is a little beyond me. i'm trying it out with the following code:

# V5: Inline::C #!/opt/local/bin/perl use strict; use Inline C => Config => BUILD_NOISY => 1; use Inline C => <<'END_C', NAME => 'dnacmp', CLEAN_AFTER_BUILD => 0; void dnacmp( char *a, char *b ) { int n = 0; Inline_Stack_Vars; Inline_Stack_Reset; while( *a && *b ) { if( *a != *b ) { Inline_Stack_Push( sv_2mortal( newSVpvf( "%d:%c:%c", n, *a +, *b ) ) ); } a++; b++; n++; } Inline_Stack_Done; return; } SV *dnacmp2( char *a, char *b ) { SV *res = newSVpv( "", 0 ); int n = 0; while( *a && *b ) { if( *a != *b ) sv_catpvf( res, "%d:%c:%c ", n, *a, *b ); ++a, ++b, ++n; } return res; } END_C use Bio::Seq; use Bio::DB::Sam; my $bamfile = Bio::DB::Sam->new( -bam => "eg1.bam" ); my @allReads = $bamfile->features( -type => 'match' ); for my $read (@allReads) { ( $ref, $matches, $query ) = $read->padded_alignment; print $read->qname, "\n", dnacmp2('$ref', '$query'); }

first, i extract some information from the .bam file. then, i create the target and the test sequence on the fly in the subsequent for loop. $ref represents the target and $query represents the test (both are just strings). i then try to feed these two variables into dnacmp2 and print a few more things before it ($read->qname represents a sequence identifier), but it throws the following error:

Global symbol "$query" requires explicit package name at mutations.pl +line 133. Execution of mutations.pl aborted due to compilation errors. Pradeep-Bandarus-MacBook-Pro:test pradeepbandaru$ perl mutations.pl Global symbol "$ref" requires explicit package name at mutations.pl li +ne 133. Global symbol "$matches" requires explicit package name at mutations.p +l line 133. Global symbol "$query" requires explicit package name at mutations.pl +line 133. Execution of mutations.pl aborted due to compilation errors.
sorry if this is a rookie question, but what's going on with this?

Replies are listed 'Best First'.
Re^5: mismatching characters in dna sequence
by BrowserUk (Patriarch) on Dec 30, 2011 at 20:42 UTC

    Hm. The first and most obvious problem is you've forgotten my here:

    my( $ref, $matches, $query ) = $read->padded_alignment; #...^^

    A more subtle potential problem is not having the shebang line as the first line of the file:

    # V5: Inline::C #!/opt/local/bin/perl

    I don't know what that first line is meant to do, but either delete it or move it below the Inline C code.

    Make those two corrections and then see what happens.

    You do have a C compiler correctly installed don't you?


    With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority".
    In the absence of evidence, opinion is indistinguishable from prejudice.

    The start of some sanity?

      thanks...this is really quick! it was really the missing "my" that did me in. is there a quick way to tabulate all the respective transitions? e.g.
      A:T = 234 A:G = 492 A:C = 273 T:A = 400 ...
      and so on... basically output the total number of each type of transition?
        basically output the total number of each type of transition?

        Per sequence compared, or for all the sequences?


        With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
        Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
        "Science is about questioning the status quo. Questioning authority".
        In the absence of evidence, opinion is indistinguishable from prejudice.

        The start of some sanity?