Re: [Scid-users] Relying on a Tournament File

Alexander Wagner Wed, 24 Jun 2009 11:08:15 -0700

Benoit St-Pierre wrote:

Hi!


> When trying to maintain a huge database (something akin to
> a master game repository), I encounter an annoying
> problem.  Doublechecking names easily lead to a nightmare
> : I have to choose some name of a player without knowing
> exactly who to pick.  Szabo, for instance, seems to be a
> popular name for a chessplayer.

It is. It is just a popular name anyway. VIAF lists 827
persons of that name. And VIAF is only built for persons who
published some book or the like...

> So I got this idea : why not use a filename of tournaments
> ?  Ideally, this file would contain the list of all its
> players.  Doublechecking names should get easier, then.

What you suggest would be to build up a database of all
tournaments and their participants.

> Besides, once the historical work has been done,

This will be a historical work, indeed.

> it would be independant of any database.  Since
> collections of games are no real database

Eh? Why is a collection of games no real database? Do you
mean in the IBM sense? (DB2 is a real database, Access not.)

BTW: I'd surely collect those metadata in a real database.

> (i.e.  the instances are devoid of any rigid designator, a
> unique ID, say), we need to work around this limitation.
> Working outside the database seems to me the only way to
> go, if we want a historical record of actual games, with
> less and less data error.

Well, working within a central database would be in fact the
only way to go. But you suggest to build up an authority
file (or registry if you prefer that naming) for all chess
games of the world. Even if you just start out with
important tournaments of the past and work through them,
this is a huge effort.

> Of course, if one has not enough information about a game,
> that solution can't help.

Right.

> But I don't see any solution to the lack of information,
> except trying to build a tournament file that would be
> orthogonal to a file of chessplayers' names.

Acutally, the real solution would be something like VIAF.

There you build up an authority file based upon the
publications of a person. For your project you'd build up
the authority file containing tournaments and persons based
upon the games of chess they played. Its very similar (just
to avoid the word "identical ;)

Still, you miss a point here: you'll have to individualize
the players and the games. You'd have to end up at an
authority record for each tournament connected to the
authority records of each participating player and all that
would be connected to the actual games being the backend of
the whole thing.

> That's just food for thought, of course.  I thought you
> might like it, so here it is.  I also thought it was a
> good idea to share.

It is a good idea, but it is a huge effort. Have a look at
VIAF's record for Garry Kasparov e.g.

    http://viaf.org/24621810

You might want to unfold the MARC-21 section, check out the
400's and 700's, just to get an idea about the spellings for
his name. And just to get a vague idea about the size of the
project you might just call up http://viaf.org to see who is
participating in this game.  And it's only about the
names part... (Plus using databases of books we already have
in huge databases built by autopsy using trained personel.
Creating such a catalogue record for one book from scratch
takes about 15min on the average.)

> PS:  I am on vacations until end of July, so I won't
> contribute to the debate, if there is one.  There should :
> Alex being for, Pascal seeing technical problems and no
> real relevance, etc.  ;-)

Debate is over. I know that I'm right and I've quite some
forces behind me. ;) LoC, BNB, DNB, BNF (just to name the
really big ones) can't be wrong.

Still I see the difficulty to get it done in a spare time
project. I see a possibility to gradually work in linkages
to VIAF.

Franz: getting VIAF record IDs into the spelling file is the
way to go to get linkage up into Wikipedia and whatever.
VIAF-ID is persistend, of course.  (Try to search for
118721097 at de.wikipedia.org. :) At the moment they use
PND-IDs as german wikipedia mainly set this up with DNB but
thats about to change and they go for VIAF-IDs (there's a
1:1 mapping PND <-> VIAF) plus if I got them right this
should start to spread out over the whole wikipedia.
Wikipedia wants to reuse as much data from authority files
as possible. Guess why ;) The national libraries are
currently investigating if they could use data from
Wikipedia to enrich their authority files. There was just
recently a talk about this issue at the Bibliothekartag in
Erfurt.

-- 

Kind regards,                /                 War is Peace.
                             |            Freedom is Slavery.
Alexander Wagner            |         Ignorance is Strength.
                             |
                             | Theory     : G. Orwell, "1984"
                            /  In practice:   USA, since 2001

------------------------------------------------------------------------------
_______________________________________________
Scid-users mailing list
Scid-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scid-users

Re: [Scid-users] Relying on a Tournament File

Reply via email to