The more enthused, the much more likely to enter into the Open and report data. This highlights a general limitation with this data. Yet, after excluding people who didn’t do any self-reporting, men made up 67.2% of the pool. Out of the data I extracted, men made up 58.8% of the pool. Notice the much larger percentage of men reporting marks than women. After excluding athletes from the data that didn’t report any marks, there were 1,935 athletes, 1,301 of which were male, 634 female. The majority of these athletes did not self-report their marks. Of these athletes, 3,159 were male, 2,217 were female. This method led to the extraction of data from 5,376 athletes. #FMINER BUTTON SOFTWARE#Once the athlete’s profile page loaded, the software would extract the self-reported marks from the athlete.Once a url of a CrossFit team loaded, the software would load every athlete url of the team’s “open team roster.”.However, some of the urls didn’t exist or didn’t have any athletes listed on those teams. In theory, this means the project would have loaded the urls of 300 CrossFit teams. It took a look at the urls of every twentieth CrossFit team starting from 1 through 6,000.In FMiner, I designed a data extraction project that worked like this: What I needed to do is scrape data off of as many athletes’ pages within reason. To collect data off of the CrossFit Games’ website, I needed to use a technique called “web scraping.” Web scraping takes a look at web pages and collects specific text/data on those web pages and translates that data collection into spreadsheets and tables. So I needed to pull data from the CrossFit Games’ website. Knowing what a weak time/weight is, what’s average and what’s good can help you focus on your weaknesses, help you understand your strengths, and help trainers evaluate their athletes, to better tailor training and ensure safety. While the goal in working out is to improve your own self, it’s still nice to know how you stack up compared to the rest of the world. What I wanted to know is using these self-reported marks, what does the normal distribution look like for these marks? What’s an average time/weight? What’s a good time/weight? Best times/marks for classic CrossFit workouts Fran, Helen, Grace, Filthy 50, and Fight Gone Bad.Īthletes also self-report their gender, height, weight and age.One-rep max lifts for clean and jerks, snatch, deadlift, and back squat.On a CrossFit Games athlete profile page, athletes can self-report their best marks for the following: This blog post will cover the methodology I used to pull data from to see what the normal distribution looks like for classic CrossFit workouts, one-rep maxes, sprints and other stuff listed on a CrossFit Games athlete profile page.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |