in reply to Re: Help on format or better way to do..?
in thread Help on format or better way to do..?
Thanks
I actually started with the hash you mentioned and decided to go for the mixture because lets say if we type Bill and the actual person's name for Bill could be William so I thought I have to handle that scenario..
example
Main Reson that I changed the structure of the hash is below mention concept.. Do you think I am on the wrong path or does it make sense...
A => B, C, D, E
B => A, F, G, H
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^3: Help on format or better way to do..?
by graff (Chancellor) on Nov 28, 2007 at 00:43 UTC | |
If "B,C,D,E" are all nicknames for "A", and "A,F,G,H" are all nicknames for "B", and if the purpose of this data structure is to provide a one-shot lookup for a given string (i.e. to get the "immediate nickname set" for that string), then okay, that structure makes sense. (Well, sort of, I guess... but are you saying you have cases where A is a nickname for B and B is also a nickname for A? I'm having trouble with that.) But if the purpose is to pursue all possible "respelling" relations in a set (e.g. "A" can be respelled as any of "B,C,D,E", and for each of those, use the same structure to find all possible respellings), then you have a problem of circularity: A can be respelled as B, which can be respelled as A, which can be respelled as B, which... (infinite loop). Actually, it's not at all clear now what you are really trying to do, so I'm not sure what advice to give about the data structure. There are two basic directions that seem to be at issue:
If the structure you are looking for is not one of those two, then you need to be more clear about what kind of structure you are looking for and how you want it to organize things. You seem to be giving us simplified fake examples, and maybe they are too simple or maybe they don't accurately reflect your data or your task. What are you really trying to do? UPDATE: There is a third direction you might be thinking about: many-to-many relations, e.g. a nickname like "Chas" might relate to both "Charles" and "Chastity", but "Chuck" is also a nickname for "Charles". This is another hash of arrays, where some array values (the "real" names in this case) can occur with two or more hash keys. In any case, the important thing is that the hash keys are one set of entities (e.g. nicknames), and the hash values, whether scalars or arrays, are a distinct set of entities (e.g. real names). Just don't get them confused. | [reply] |
by learnperl (Acolyte) on Nov 29, 2007 at 22:59 UTC | |
Thanks Graff, Sorry I didn't get a chance to reply for this, My Grandmother passed away and it has been a very emotional time since she had a major impact on my life.
Answer to your question which was "What am I trying to do here "
So in mathematically I am thinking of a set theory implementation I am sorry I miss lead you in my original post. I didn't put entire code since it has other subroutines that have no impact on this situation also I am new to perl so the code is 200 lines long. This is the entire code that I have which is related to the issue, sorry I should have put this before rather than putting a part of the code.
Thanks LearnPerl | [reply] [d/l] |
by graff (Chancellor) on Nov 30, 2007 at 02:03 UTC | |
the concept of the hash I am trying to have is, It strikes me as a bit wasteful to have so many copies of all the names. And it would be worthwhile to figure out an easy way to initialize the complete structure from the simplest possible listing of name sets. Here's what I would propose: That plays a little game with the name strings to make sure that you can keep track of different name sets containing the same nickname: just add spaces at the end of a previously seen nickname until it becomes a unique key in the altnames hash. Then when searching the hash, make sure to look for the target name optionally followed by spaces. (If the results are going to a web page, the extra spaces won't affect the display.) Note that by storing multiple references to the same array in the hash, you are not using extra memory to store copies of the names -- each nameset is stored exactly once. | [reply] [d/l] [select] |