PDA

View Full Version : Question about Baseball Statistics Database


Hagbard Celine
03-01-2002, 01:39 PM
I forget who always talks about this (Pseu, Obi?).

First of all, did I download the correct one? After running a search on Google I went to http://www.redsoxdiehard.com and downloaded something called baseballdb.exe. It seems to have everything.

My question: Is there a way to show stats by "career" rather than by season. For instance, as a simple example, could I run a query for most Home Runs in a career, rather than just most Home Runs in a Season?

I pretty much figured out how the database works, but I haven't figured this out yet and it's bugging me.

If I choose, say Hank Aaron, I see 22 seasons listed with the totals given underneath. Is there any way to eliminate the Year field so I just get the totals?

Pseudolus
03-01-2002, 01:43 PM
I'll check out that site, but the database I use is lahman_ma_40.zip from http://www.baseball1.com/statistics/ . No 2001 stats, but we should see those Real Soon Now. This is an Accel db with multiple tables. Career stats aren't in their explicitly, so you have to play with it and tell it to group by player ID (don't use names, there are multiple players with the same name) to roll them up.

Hagbard Celine
03-01-2002, 01:48 PM
That one seems...much bigger!!

I have a feeling it'll be a lot better though because it's in Access. The other one is not and you don't have nearly as much functionality as you do in Access.

Compare a download time of 1.5 mins to 9.5 mins!

Thanks for the link!

Pseudolus
03-01-2002, 01:56 PM
You're welcome! The Lahman database is great because you have total control over it. It's not as easy to jump right in and run queries, though, because you have to construct everything from scratch. It doesn't even have rate stats like BA or SLG, just counting ones like AB, H, BB, etc.

The basic structure is to have one line of data for each player, for each season, for each team he played for. So John Smith, playing for the same team all of 1999 will have one line of 1999 numbers, but if he was traded twice in 2000 he'll have 3 lines for that year. A good first thing to do is to write queries to sum up individual seasons and entire careers, so you can later query off of these.

Have fun!

<font size=-1>[ This Message was edited by: Pseudolus on 2002-03-01 13:56 ]</font>