I want to process a unicode file (2 bytes per character) and want to capture some text I wanted. However I cannot get it right with open(), is open() support unicode ? Howabout read(), is it only read byte-by-byte only ? Any other module or API should I use ?