PerlMonks  

Re: how to identify a fixed width file

by reasonablekeith (Deacon)
on May 14, 2008 at 15:33 UTC


in reply to how to identify a fixed width file

Why don't you try running through the file counting up the number of times a line of a given length is seen...
    my %line_count_by_length;
    while (<DATA>) {
        my $line_length = length($_);    # note: includes the trailing newline
        $line_count_by_length{$line_length}++;
    }
If one line length (or a small handful of them) accounts for a large percentage of the total line count, you could guess that the file is fixed width. You could also weight that guess by how many distinct line lengths appear in the file, compared with how many you would expect given the file's length.
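To make the "big percentage" idea concrete, here is a rough sketch of how the counts might be analysed. The helper name dominant_length, the in-memory sample data and the 0.7 cutoff are all assumptions made for illustration, not something from the original question, so tune them against real files.

```perl
use strict;
use warnings;

# Hypothetical helper: returns the most common line length, the fraction
# of lines that have that length, and the number of distinct lengths seen.
sub dominant_length {
    my ($fh) = @_;
    my %line_count_by_length;
    my $total = 0;
    while (my $line = <$fh>) {
        $line =~ s/\r?\n\z//;    # strip the line ending before measuring
        $line_count_by_length{ length $line }++;
        $total++;
    }
    return if !$total;           # empty input: nothing to guess from

    # Most frequent length first; ties go to the longer length.
    my ($best) = sort {
        $line_count_by_length{$b} <=> $line_count_by_length{$a}
            || $b <=> $a
    } keys %line_count_by_length;

    return ($best,
            $line_count_by_length{$best} / $total,
            scalar keys %line_count_by_length);
}

# Made-up sample: three 10-character records and one stray short line.
my $sample = "AAAA BBBBB\nCCCC DDDDD\nEEEE FFFFF\nshort\n";
open my $fh, '<', \$sample or die "Cannot open in-memory file: $!";
my ($length, $share, $distinct) = dominant_length($fh);
close $fh;

my $threshold = 0.7;    # assumed cutoff, not a recommended value
printf "length %d covers %.0f%% of lines (%d distinct lengths): %s\n",
    $length, 100 * $share, $distinct,
    $share >= $threshold ? 'probably fixed width' : 'probably not fixed width';
```

The number of distinct lengths is returned as well, so it can feed the weighting idea: a file with many lines but only a few distinct lengths looks more like fixed width than one where nearly every line has its own length.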
---
my name's not Keith, and I'm not reasonable.

Replies are listed 'Best First'.
Re^2: how to identify a fixed width file
by ftumsh (Scribe) on May 14, 2008 at 15:48 UTC

    My initial stab was a count of record lengths, which was fine until the different length files cropped up.

    I think bringing that back, along with some analysis of the counts and tachyon/mortitz' text OR, should go a long way to solving this.
