Alexander Johannesen wrote:
>
>
> We're using a number of indexes that pulls all sorts of interesting
> stuff out of the MARC records, and do analysis and clustering on top,
Are these indexes really browsable, up and down? That is, do they
display not just the lines that somehow match the user's input, but
also the entries before and after them, as far as the user wants to go?
>
> We've been toying with other means of "fixing" misspellings that
> briefly dips its toe into latent semantic parsing, lexical parsing and
> wordnet integration
All of that, besides being tied to one language, will not work for
personal or corporate names. What any decent catalog should display
is something like this (the arrows point to the authority form, where
one has been linked; in this case it differs from LC's):
...
1 shoskes, henry
1 shostack moskewicz, lorraine -> moskewicz, lorraine s
1 shostack, albert l
1 shostack, g l
23 shostak, arthur b
7 shostak, jerome
11 shostak, marjorie
1 shostak, peter
1 shostak, robert
2 shostak, robert e
1 shostak, stanley
93 shostakovic, dimitri -> sostakovic, dmitrij d
9 shostakovich, d -> sostakovic, dmitrij d
113 shostakovich, dimitri -> sostakovic, dmitrij d
1 shostakovich, dimitri d -> sostakovic, dmitrij d
1 shostakovich, dimitry -> sostakovic, dmitrij d
1 shostakovich, dmitri
25 shostakovich, dmitri -> sostakovic, dmitrij d
106 shostakovich, dmitrii -> sostakovic, dmitrij d
2 shostakovich, dmitrij d -> sostakovic, dmitrij d
1 shostakovich, dmitry -> sostakovic, dmitrij
96 shostakovich, dmitry -> sostakovic, dmitrij d
1 shostakovich, maxim
2 shostakovitch, dimitri -> sostakovic, dmitrij d
1 shostakovitch, dmitri -> sostakovic, dmitrij d
2 shostakovskii, mikhail f -> sostakovskij, michail f
1 shosteck, robert
2 shostrom, everett
17 shostrom, everett l
2 shotam, nirmala puru -> puru shotam, nirmala
1 shotbolt, charles r
1 shoter, l
...
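
To make the idea concrete, here is a minimal sketch in Python of such a
browse index. It is not the code behind the listing above; the function
names (build_index, browse, format_line) are hypothetical, and it assumes
that headings arrive already normalized to lowercase, that the leading
number is the count of records per heading, and that the authority links
come as a plain mapping from variant to authority form.

```python
import bisect
from collections import Counter


def build_index(headings, authority=None):
    """Build a sorted browse index from normalized heading strings.

    headings:  iterable of heading strings, one per record (assumed lowercase).
    authority: optional dict mapping a variant heading to its authority form.
    Returns a list of (heading, record_count, authority_form_or_None),
    sorted by heading.
    """
    counts = Counter(headings)
    authority = authority or {}
    return [(h, counts[h], authority.get(h)) for h in sorted(counts)]


def browse(index, term, before=2, after=5):
    """Return the slice of the index around the place where term would sort.

    The term does not have to match any heading: a misspelled input still
    lands among its alphabetical neighbours. Raising before/after moves the
    window further up or down the list.
    """
    keys = [heading for heading, _, _ in index]
    pos = bisect.bisect_left(keys, term.lower())
    start = max(0, pos - before)
    return index[start:pos + after]


def format_line(entry):
    """Render one entry as 'count heading', plus ' -> authority' if linked."""
    heading, count, auth = entry
    line = f"{count} {heading}"
    if auth:
        line += f" -> {auth}"
    return line
```

Calling browse(index, "shostakovitch") on an index that only holds
"shostakovich" headings still returns the entries next to the point where
the input would sort, which is what lets the user see the neighbouring
spellings without any language-specific correction.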
If this looks too exotic, try finding Stephen Hawking's works in
LibraryThing. [Clue: try Hawkin, Hawkins and Hawkings as well.]
B. Eversberg
Received on Wed Jun 21 2006 - 07:53:59 EDT