Just curious, what is PCA?
Announcement
Collapse
No announcement yet.
What is PCA?
Collapse
X

Originally posted by Honus Wagner RulesJust curious, what is PCA?
PCA is short for "Pythagorean Comparitive Analysis". It is being created by out very own Matt Souders (SABR Matt). If you have any questions as to specifics, ask him.
Comment

There hasn't been up to this point enough information out there about the system, and I still refer to it in discussion about players...for that I apologize.
PCA is a work in progress...answers to specific questions are likely to get you a "we're working on this idea.." at present...The first results from the new SFBT (SoudersFiato Baseball Translations) system may be coming very soon...we're starting with Strength of Schedule and Park analysis which should be interesting.
Comment

Originally posted by SABR MattThere hasn't been up to this point enough information out there about the system, and I still refer to it in discussion about players...for that I apologize.
PCA is a work in progress...answers to specific questions are likely to get you a "we're working on this idea.." at present...The first results from the new SFBT (SoudersFiato Baseball Translations) system may be coming very soon...we're starting with Strength of Schedule and Park analysis which should be interesting.
Thanks,
HWRStrikeouts are boring! Besides that, they're fascist. Throw some ground balls  it's more democratic.Crash Davis
Comment

HWR...a short description of PCA is nearly impossible....but boiled down to simplest terms...
PCA determines how many wins a team was actually worth (not always exactly the same as how many they actually had...this is dependent on strength of schedule, luck, and the consistancy of the team among many other factors) and distributes those wins to the players through a series of calculations based on an extension of Pythagorean logic...I used the pythagorean to place the margin (minimal expected production for a major league player) and to compare the player's actual performance with that margin, and divide up credit to various skill units on a team and the various players within skill units on a team.
OK...that wasn't terribly simple...LOL
PCA is a combination of the ideas put forth by Pete Palmer, Bill James, and Baseball Prospectus on the best ways to rate the players along with my own adaptations...it results in Wins Created ratings for each player in each season.
Comment

Originally posted by SABR MattHWR...a short description of PCA is nearly impossible....but boiled down to simplest terms...
PCA determines how many wins a team was actually worth (not always exactly the same as how many they actually had...this is dependent on strength of schedule, luck, and the consistancy of the team among many other factors) and distributes those wins to the players through a series of calculations based on an extension of Pythagorean logic...I used the pythagorean to place the margin (minimal expected production for a major league player) and to compare the player's actual performance with that margin, and divide up credit to various skill units on a team and the various players within skill units on a team.
OK...that wasn't terribly simple...LOL
PCA is a combination of the ideas put forth by Pete Palmer, Bill James, and Baseball Prospectus on the best ways to rate the players along with my own adaptations...it results in Wins Created ratings for each player in each season.Last edited by Honus Wagner Rules; 09072005, 03:19 PM.Strikeouts are boring! Besides that, they're fascist. Throw some ground balls  it's more democratic.Crash Davis
Comment

It is based on what the team actually did do...adjusted so that they appear to have faced average competition...so indirectly yes.
I'm in the process of reworking some aspects of the methodology...as new information becomes available you will hear about it here.
Comment

HWR...I give a complete description of PCA already. Or rather...I make the description available to anyone who wants it. If you're a member of the Yahoo group baseballdatabank, it's in the file list (it's called The PCA Manifest) or I can email it to you.
Comment

Gee, HWR sure got very curious about PCA all of a sudden :hyper:
Matt, how far down the road can we expect some type of LQ adjustment? I realize that you have bigger (and more) fish to fry as of right now, but down the road, do you see yourself tackling that issue? And if you do, am I correct in assuming that not all players will be affected equally, since clearly the upper elites would still be just that. Thanks.
Comment

The grave (IMHO) mistake of most league quality adjustments is that they treat every part of the quality distribution as though it can be manipulated in identical ways to make sense of the quality spectrum.
In answer to your question...league quality remains on the agenda for the overhaul which I expect to begin in earnest early this summer. There are three steps to total normalization (the process by which any baseball player can have his performance altered so that it can be seen to have occured on a level playing field with the rest of baseball).
1) We need to assess the strength of schedule each player faced. I'm in the process of working with a colleague of mine who is cooperating with baseballreference.com in the hopes of acquiring and digitizing the boxscore database (which goes all the way back to 1876 though the level of detail is not particularly great). We will be able to use that data (we hope) to see how many PA or batters faced each player had against each team. Since the FSIA has already been used to calculate how intrinsically skilled each team was relative to their league, we can use that information to rate each player's strength of schedule faced.
2) We need to normalize PCA wins created (which themselves have already been adjusted using SoS calculations) rates through curve fitting so that all leagues and all players see their performance placed on one curve that defines the leaguerelative talent distribution identically for all time.
3) We need to see how players move from year to year along the alltime talent curve to judge how the quality of the league changes from season to season. If players from 1941 who also played in 1942 see their percentile ranking improve overall in 1942, we can conclude that 1942 is a weaker league and put a figure on how much weaker it is. This method has been done by the Hardball Times, but they only used production metrics normalized for league average (not totally normalized as I described above), and reached what IMHO is an overly smoothed looking nearly linear league quality curve.
I think if you do all of the outlined adjusting before you get to the last step, you're going to get something more useful...but I could be wrong. I will find out when I reach that point in the analysis.
Comment

One important problem I will need to correct for is the fact that good players stick and bad players don't. I may need to group the data and see if there are different performance change curves for different subsets of the MLB population.
Comment

Originally posted by Sultan_18951948 View PostGee, HWR sure got very curious about PCA all of a sudden :hyper:
Matt, how far down the road can we expect some type of LQ adjustment? I realize that you have bigger (and more) fish to fry as of right now, but down the road, do you see yourself tackling that issue? And if you do, am I correct in assuming that not all players will be affected equally, since clearly the upper elites would still be just that. Thanks.Strikeouts are boring! Besides that, they're fascist. Throw some ground balls  it's more democratic.Crash Davis
Comment

Originally posted by SABR Matt View PostHWR...I give a complete description of PCA already. Or rather...I make the description available to anyone who wants it. If you're a member of the Yahoo group baseballdatabank, it's in the file list (it's called The PCA Manifest) or I can email it to you.Strikeouts are boring! Besides that, they're fascist. Throw some ground balls  it's more democratic.Crash Davis
Comment
Ad Widget
Collapse
Comment