I don't think your solution works well, you are showing us a mix of multiple columns.
As long as the fonts are not scrambled, you can use a proper solution like described here:
Parsing PDFs by text position?
Cheers Rolf
(addicted to the Perl Programming Language :)
Wikisyntax for the Monastery
In reply to Re: regex for unicode email addresses
by LanX
in thread regex for unicode email addresses
by Aldebaran
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |