in reply to Re: HTML-Parser: a newbie question: need to extract exactly line 999 out of 5000 files..
in thread HTML-Parser: a newbie question: need to extract exactly line 999 out of 5000 files..

Are you absolutely sure that this is true, “in the general case?”
Of course it is not true in the "general case" and nobody ever claimed that.

I understand the question to be a one-off task to move the content of 5000 machine-generated html-files into a database.

That in such a scenario you use all available information to make the task easier is only natural.

  • Comment on Re^2: HTML-Parser: a newbie question: need to extract exactly line 999 out of 5000 files..

Replies are listed 'Best First'.
Re^3: HTML-Parser: a newbie question: need to extract exactly line 999 out of 5000 files..
by Perlbeginner1 (Scribe) on Oct 16, 2010 at 09:29 UTC
    hello Morgon hello sundialsvc4 -

    thanks for the kind words and for opening my eyes for the power of PERL !

    that is just amazing! Well i get a headace when i see to browse 5000 files and do all the work by hand. This would take more than several weeks.

    so i decide to use PERL - since it is very very powerful - i try to nail down the issues while using PERL.

    See one of the example sites: http://www.kultusportal-bw.de/servlet/PB/menu/1188427/index.html?COMPLETEHREF=http://www.kultus-bw.de/did_abfrage/detail.php?id=04313488
    in the grey shadowed block you see the wanted information: 17 lines that are wanted. Note - i have 5000 different HTML-files - that all are structured in the very same way!

    That means i would be happy to have a template that can be runned with HTML::TokeParser::Simple and DBI.
    That would be great!!

    ,,, and now i try to get more infos about HTML::TokeParser::Simple and DBI.... I have a manual of DBI - The book of Tim Bunce and aligator xy! At the moment i am on page 25... ;-)

    @Morgon, sundialsvc4: i love to hear from you....

    regards
    perlbeginne1