1.
a.
Consider the data in the following table representing world record fastest times for the 100 m sprint and the year in which each occurred, from 1912 through 2002:
Using software such as Excel, Python, or similar tools, the regression line can be found. For this data, the linear model would be , where is years since 1900. Compute the MAE, MAPE, MSE, and RMSE for this model.
| Year | Time |
| 1912 | 10.6 |
| 1921 | 10.4 |
| 1930 | 10.3 |
| 1936 | 10.2 |
| 1956 | 10.1 |
| 1960 | 10 |
| 1968 | 9.95 |
| 1983 | 9.93 |
| 1988 | 9.92 |
| 1991 | 9.9 |
| 1991 | 9.86 |
| 1994 | 9.85 |
| 1996 | 9.84 |
| 1999 | 9.79 |
Table
6.10
100-Meter Spring Records
b.
Use the model to predict the world record fastest time for the 100 m spring in 2023. In August of 2023, Noah Lyles ran the 100 m sprint in 9.83 seconds. How does your prediction compare to this value?
2.
If today is a cloudy day, then there is 68% chance of rain tomorrow. If today is not cloudy, then there is only 15% chance of rain tomorrow. Build a discrete logistic regression model based on this data.