in reply to Re^10: Counting bytes in a Unicode document
in thread Counting bytes in a Unicode document

And encoding errors are replaced with U+FFFD, which may have a different length than the original.

  • Comment on Re^11: Counting bytes in a Unicode document