After a long hiatus into the dark and scary world of PHP, I have returned with a question.

I have a project involving the parsing of a large (30MB) tab separated text file. The code was an existing project that I'm new to, the original author used Text::xSV for the parsing, so I don't really have the option to use something else.

The problem is the fields are not quoted in the text file but some of the data contains quotes. What i need is a way for Text::xSV to simply ignore them. I know I can use filtering to change them to some other character and then change them back once they are separated, but I'm hoping there's a better way.

I can't include the actual line due to non-disclosure agreements but its basically somethng like this

00001     Widget     ACTIVE     widget maker's inc     15"x15"

Here is the code I'm using to parse, bascially the same as the example in the pod

my $csv = Text::xSV->new(sep=>"\t" ); $csv->open_file("xxxxx.csv"); $csv->read_header(); # Make the headers case insensitive foreach my $field ($csv->get_fields) { if (lc($field) ne $field) { $csv->alias($field, lc($field)); } } while( $csv->get_row() ) { # save to a db }
Thanks!

In reply to Text::xSV quote problem by cfreak

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.