last time I convert mdict to stardict xml, before convert xml to stardict ifo, need to dump html with w3m, after that \u200b remaining become a problem. I found when use utf8::all, \s will match \u200b, none any \p{..} could match it if donot set 'use utf8::all'.