in reply to Sorting/Cleansing a Duplicate File

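The most common way to do this, as shown, is to dump all of the values into a hash, using each value as a key (you only care about the keys, not what they map to), then read the keys back out with sort( keys( %varname ) ). Because a hash can hold each key only once, the duplicates disappear on their own. Unicode can be a wrinkle: the default sort compares by code point rather than in any linguistic order, and two strings that look the same on screen but are encoded or normalized differently will count as different keys.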
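
Here is a minimal sketch of that idiom, just to make it concrete. The names @lines, %seen and @unique and the sample data are made up for illustration; in the real case the values would come from the file being cleaned.

```perl
use strict;
use warnings;

# Hypothetical sample data standing in for the lines of the input file.
my @lines = ( "pear", "apple", "pear", "fig", "apple" );

# Use each value as a hash key. Assigning the same key twice just
# overwrites it, so duplicates collapse into one entry.
my %seen;
$seen{$_} = 1 for @lines;

# Read the unique values back out, sorted (default string comparison).
my @unique = sort( keys( %seen ) );

print "$_\n" for @unique;
```

If the data is Unicode, one possible approach (an assumption about your input, not something the original thread states) is to decode it on the way in and run each value through NFC() from the core Unicode::Normalize module before using it as a key, so that equivalent strings end up as the same key.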