I'm working on tagging a large linguistic corpus, but being a perl/programming newbie I'm having some problems. The corpus is divided by line, at the start of each line there is a delimiter, for instance:
il yadayadayada
df yadayadayada
What I need to do is add a tag around each line (<il> or <df> in the above cases) where the contents of the tag match the two character string at the head of each line:
<il> il yadayada <il>
Does anyone have a snippet that would give me a clue as to how I get just the first two letter string out and into a tag?
Thanks in advance.
In reply to tagging question by bagerson
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |