Three options for you:
- Build an index of line numbers to offsets.
- Use DB_File's DB_RECNO interface to tie the text
file to a hash.
- Read through the file each time.
Indexing is easy (code untested, as usual. You can
find tested similar code in the Perl Cookbook):
open(IDX, "> index") or die;
open(DAT, "< datafile") or die;
$pos = 0;
do {
printf IDX "%8d" $pos;
$line = <DAT>;
$pos = tell(DAT);
} while (defined $line);
close(IDX);
close(DAT);
and to use it:
open(IDX, "< index") or die;
open(DAT, "< datafile") or die;
# assume $lnum is the line number to go to
seek(IDX, 8*$lnum, 0) or die;
read(IDX, $offset, 8) or die;
seek(DAT, $offset, 0) or die;
# now DAT filehandle is positioned to read the
# record you wanted
The DB_RECNO interface from DB_File is certainly easier
to use, but it's probably not as efficient as it can't
keep the index on disk as my code here does.
Hope this gives you some pointers.
Nat
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.