in reply to question about extracting data tables

You can extract tabular data from HTML files, XML files, CSV files, Fixed Width Files, and many other formats using AnyData (a tied-hash interface) or DBD::AnyData (a DBI/SQL interface) modules. For HTML tables, both use the excellent HTML::TableExtract module that b10m mentioned.
  • Comment on Re: question about extracting data tables

Replies are listed 'Best First'.
Re^2: question about extracting data tables
by dchandler (Sexton) on Dec 27, 2004 at 00:01 UTC
    dumb question... is XML the same as SGML? I think these files are SGML, will anydata work on them?
      is XML the same as SGML?

      XML is a subset of SGML. So, generally speaking, an SGML tool will probably work with XML data, but an XML tool might not work with SGML data. Your specific case might not be a problem, though, so it's worth a shot.

      Yes, as revdiablo said, XML is a subset of SGML, so is HTML. Some SGML can be parsed with an XML parser, it all depends.