Renyulb28 has asked for the wisdom of the Perl Monks concerning the following question:
Hi Monks, I have DNA data for a number of individuals at a few thousand markers. The data is in a matrix format with the rows being individuals and columns being markers. For example:
3851 A A G G T T A A C C ... 3854 A A G G T T A A C C ... 3871 A A G G T T A A G G ...
The first column is the individual ID, and each column after that is a marker observation. I need to count the number of occurrence of each character per column, and then output a text file such as:
column 2 3 A column 10 2 C 1 G
I only know how to do this per column using the unix grep command, but that would be extremely time consuming and necessary. The output does not need to be as specific as I have put, as long as it gives the column number and how many occurrences of each character. Thank you for any advice.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Count # of occurrences per column
by ikegami (Patriarch) on May 11, 2011 at 17:58 UTC | |
|
Re: Count # of occurrences per column
by wind (Priest) on May 11, 2011 at 17:48 UTC | |
by ikegami (Patriarch) on May 11, 2011 at 18:07 UTC | |
by wind (Priest) on May 11, 2011 at 18:40 UTC | |
|
Re: Count # of occurrences per column
by LanX (Saint) on May 12, 2011 at 09:53 UTC |