in reply to Re^6: Tips on how to perform this regex query
in thread Tips on how to perform this regex query

Edit distance as a measure of loanword adaptation.

That links to a corrupted (or non-)pdf.

Bioinformatics uses a bit more complex edit distance measures than the traditional one.

That's really understating the difference. From the link you provided:

In a constant gap penalty, every gap receives some predetermined constant penalty, regardless of its length. Thus, the insertion or deletion of 1000 contiguous symbols is penalized equally to that of a single symbol.

That "constant gap penalty" completely changes the dynamics of the algorithm, and thus makes the modified algorithm useful for finding alignments of subsequences, which the standard edit distance is completely useless for.
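To make the difference concrete, here is a minimal Python sketch comparing unit-cost Levenshtein distance with a global alignment scored under a constant gap penalty. The function names, the gap cost of 2, the mismatch cost of 1, and the sample strings are all assumptions chosen for illustration; real alignment tools typically use substitution matrices and more elaborate scoring.

```python
import math

def levenshtein(a, b):
    """Unit-cost edit distance: every inserted or deleted symbol costs 1."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]

def constant_gap_distance(a, b, gap=2, mismatch=1):
    """Global alignment cost where each contiguous gap costs `gap`,
    regardless of its length (illustrative cost values, not a standard)."""
    inf = math.inf
    n, m = len(a), len(b)
    # M: last column aligns a[i-1] with b[j-1]
    # X: last column aligns a[i-1] with a gap
    # Y: last column aligns b[j-1] with a gap
    M = [[inf] * (m + 1) for _ in range(n + 1)]
    X = [[inf] * (m + 1) for _ in range(n + 1)]
    Y = [[inf] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(n + 1):
        for j in range(m + 1):
            if i and j:
                sub = 0 if a[i - 1] == b[j - 1] else mismatch
                M[i][j] = min(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + sub
            if i:
                # Opening a gap costs `gap`; extending an open one is free.
                X[i][j] = min(M[i - 1][j] + gap, X[i - 1][j], Y[i - 1][j] + gap)
            if j:
                Y[i][j] = min(M[i][j - 1] + gap, Y[i][j - 1], X[i][j - 1] + gap)
    return min(M[n][m], X[n][m], Y[n][m])

short = "GATTACA"
padded = "GATT" + "T" * 8 + "ACA"   # same sequence with one 8-symbol insertion
print(levenshtein(short, padded))            # grows with the length of the insertion
print(constant_gap_distance(short, padded))  # pays for the insertion once
```

Under these assumed costs, lengthening the inserted run raises the Levenshtein distance by one per symbol but leaves the constant-gap cost unchanged, which is why the two measures behave so differently on sequences that contain long insertions.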

Typos: an article about finding likely typos with edit distance.

The Damerau–Levenshtein algorithm is quite different from the Levenshtein algorithm: it allows (and measures) transpositions of adjacent characters, which plain Levenshtein does not.
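A small Python sketch of that difference follows. Note the assumption: for brevity it implements the restricted variant usually called optimal string alignment (OSA), which counts a swap of two adjacent characters as one edit but never edits the same substring twice; it is not the full unrestricted Damerau–Levenshtein distance. The function name and the `transpositions` flag are made up for this example.

```python
def edit_distance(a, b, transpositions=False):
    """Levenshtein distance, or the OSA (restricted Damerau-Levenshtein)
    distance when transpositions=True."""
    n, m = len(a), len(b)
    d = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        d[i][0] = i          # delete all of a[:i]
    for j in range(m + 1):
        d[0][j] = j          # insert all of b[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # match or substitution
            # The only extra rule: swapping two adjacent characters is one edit.
            if (transpositions and i > 1 and j > 1
                    and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[n][m]

print(edit_distance("teh", "the"))                       # plain Levenshtein
print(edit_distance("teh", "the", transpositions=True))  # with transpositions
```

For the typo "teh", plain Levenshtein needs two substitutions, while the transposition-aware version needs a single swap.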


With the rise and rise of 'Social' network sites: 'Computers are making people easier to use everyday'
Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
"Science is about questioning the status quo. Questioning authority".
In the absence of evidence, opinion is indistinguishable from prejudice.

Re^8: Tips on how to perform this regex query
by no_slogan (Deacon) on Jan 11, 2014 at 15:31 UTC
    That links to a corrupted (or non-)pdf.

    It's fine. It has a compressed crossreference table, and you should update to a PDF reader that understands that.

    Crossreference compression is interesting -- it uses the PNG predictor algorithm. But this is getting really off-topic.
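For the curious, a rough Python sketch of undoing that predictor is below. It assumes the stream has already been inflated, that each row is prefixed with a PNG filter-type byte, and that only filter types 0 (None) and 2 (Up) occur; the function name, the `columns` parameter, and the sample bytes are illustrative, not taken from the PDF in question.

```python
def undo_png_predictor(data, columns):
    """Reverse PNG row filtering on already-inflated bytes.
    Each row is one filter-type byte followed by `columns` data bytes.
    Only filter types 0 (None) and 2 (Up) are handled in this sketch."""
    row_len = columns + 1
    if len(data) % row_len:
        raise ValueError("data length is not a whole number of rows")
    prev = bytearray(columns)   # the row above the first row is all zeros
    out = bytearray()
    for start in range(0, len(data), row_len):
        tag = data[start]
        row = bytearray(data[start + 1:start + row_len])
        if tag == 2:
            # Up: each byte was stored as a difference from the byte above it.
            for k in range(columns):
                row[k] = (row[k] + prev[k]) & 0xFF
        elif tag != 0:
            raise NotImplementedError("filter type %d not handled here" % tag)
        out += row
        prev = row
    return bytes(out)

# Three rows of three bytes, each stored as a delta against the row above.
encoded = bytes([2, 1, 0, 10,
                 2, 0, 0, 5,
                 2, 0, 1, 250])
print(list(undo_png_predictor(encoded, columns=3)))
```

Because consecutive cross-reference entries tend to differ only slightly, storing row-to-row differences gives the deflate step long runs of small values to compress.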