I am really sorry guys. I did not state the problem correctly.
Typically we update the securities once a day and our transaction feeds run through out the day. After the process runs that updates security master, I am building a cache file (huge -- close to couple of million). I want to use this cache file to store the asset_id as a new column in my transaction feeds, so I can avoid caching (lookup) while loading the warehouse. I can work on a flag to see whether my cache file is ready to be used or not.
Should I be using hash to look up cusip/sedol/isin and pick up asset_id and how does this perform with large data files?
Thanks and appreciate your response.
Cheers.
Raj