>tr|Q8PV56|Q8PV56_METMA PEP-CTERM sorting domain-containing protein OS +=Methanosarcina mazei (strain ATCC BAA-159 / DSM 3647 / Goe1 / Go1 / +JCM 11833 / OCM 88) OX=192952 GN=MM_2118 PE=4 SV=1
You know the Q8PV56 part is already a unique UniProt identifier, right? UniProt calls it an 'Accession Number'. There would seem to be no need to use the whole line as a key.
Here is the info in case you want to excise the accession number with a regular expression: accession_numbers
There can exist also multiple isoforms, which have similar accession numbers but postfixed with dash+integer, like so: P68250-3
In reply to Re: Potential hash key ids limitations
by erix
in thread Potential hash key ids limitations
by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |