Hello Venerable Monks
I have 500 flat text files. Each contains a narrative that may mention product names (zero, one or more per narrative), helpfully set IN CAPITALS. I would like to anonymise the data a little by replacing the current product names with other names chosen at random. I have a randomised array of product names, which contains far more names than I will need. Is there a shortcut for doing this with regular expressions, or by some other means?
As an example:
"The respondent uses the following products XXX, YYYYYYYYY, ZZZZZZZ around the house and they are considering using QQQQQQQ too. They are particularly impressed with ZZZZZZZ."
which I would like to change to:
"The respondent uses the following products AAA, BBBBBB, CCCCCC around the house and they are considering using DDDDDDDD too. They are particularly impressed with CCCCCC."
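To make the goal concrete, here is a minimal sketch of one possible approach, not a tested solution for the real data. It assumes that a product name is any whole word of three or more capital letters, and that a repeated product should get the same replacement every time. The names @pool, %alias, alias_for and anonymise are made up for illustration, and @pool stands in for the randomised array of names.

```perl
use strict;
use warnings;

# Hypothetical stand-in for the randomised array of replacement names.
my @pool = qw(AAA BBBBBB CCCCCC DDDDDDDD EEEEE);

# Original product name => replacement, so repeats stay consistent.
my %alias;

# Return the replacement for a product name, taking a fresh one
# from the pool the first time that name is seen.
sub alias_for {
    my ($name) = @_;
    unless (exists $alias{$name}) {
        die "Ran out of replacement names\n" unless @pool;
        $alias{$name} = shift @pool;
    }
    return $alias{$name};
}

# Assumption: a product name is a whole word of 3+ capital letters.
sub anonymise {
    my ($text) = @_;
    $text =~ s/\b([A-Z]{3,})\b/alias_for($1)/ge;
    return $text;
}

my $in = 'The respondent uses the following products XXX, YYYYYYYYY, ZZZZZZZ '
       . 'around the house and they are considering using QQQQQQQ too. '
       . 'They are particularly impressed with ZZZZZZZ.';
my $out = anonymise($in);
print "$out\n";
```

Because %alias lives outside the subroutine, the same mapping would carry over from one file to the next if anonymise were called on each file in turn. The pattern would also catch any other all-capital words, such as acronyms, so it may need tightening for the real narratives.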
Any help would be really appreciated, as I am slowly getting up to speed with Perl but not fast enough!
Thanks in advance,
Stevee