You could say more about your data set.
How many lines are we talking about?
Are the names and titles all english or anglified(sp)?
How many
odd names are there?
Massaging messy data into formal structures
is often easier if you do not attempt a complete
algorithmic solution. Exploring the data is much of
the problem, so solving it as you explore it is a
possibility.
Often it is easier to solve the problem bit by bit.
Copying out all the two-field lines and solving them
is probably trivial.
This warms you up to do the three field lines, or maybe
you see that the titles are not very varied, and decide
to handle that aspect first.
By the time you get to the hard cases your remaining data
set may be quite small.
The approach I'm proposing is efficient in certain
situations. Your situation may or may not be such.
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.