It doesn't look like there is. If you have few unique words (by taking smaller portions of a bigger string if necessary) you could always replace each word by a char and run the character version of the algorithm on it:

use v5.14; use Data::Dump qw/pp/; my @chars = ('0'..'9', 'a'..'z', 'A'..'Z'); # Up to scalar(@chars) different words (actually @chars+1 because of u +ndef, but that wouldn't help readability) $_ = <<STR; Jack and Jill went up the hill to fetch a pail of water Jack fell down and broke his crown and Jill came tumbling after STR my @words = /\w+/g; my %replace; my $asChars = join '', map { $replace{$_}//=shift(@chars) } @words; # +'defined-orcish manoeuver' :D # say pp \%replace; say $asChars; my %reverse = reverse %replace; say join ' ', map $reverse{$_}, split //, $asChars; __DATA__ 0123456789abc0de1fgh12ijk Jack and Jill went up the hill to fetch a pail of water Jack fell down + and broke his crown and Jill came tumbling after

Edit: for the comparison to work, you have to use the same %replace hash for all strings. And the $h{$_}//=NewVal() idiom (Orcish Maneuver) means that any word that's already known will be replaced by the existing substitute, while an unknown word will add a new entry in the hash. Here I use // instead of || because otherwise '0' would be an invalid (false) character.


In reply to Re: Levenstein distance transcription by Eily
in thread Levenstein distance transcription by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.