Hello monks,
I was seeking for a help. Actually I have a list of drugs that are related to thrombosis. And what I need to do is list out the proteins that are targeted by those drugs. What I did till now is, I extracted the PubMed IDs that refers to those drugs. I have a file containing the proteins that are related to blood coagulation. If any of the proteins in the file are mentioned in the abstract, I listed them. But the problem is, in the abstract, there are few words that is a protein name but is not actually a protein. The word refers to some other context. For example, "Xa" is also a protein(Factor Xa) but sometimes Xa is found in the abstract in some other context. I need to eliminate such words. And I am confused how can I do this. Any helps are appreciated.