1.
a.
Consider the data in the following table representing world record fastest times for the 100 m sprint and the year in which each occurred, from 1912 through 2002:
Using software such as Excel, Python, or similar tools, the regression line can be found. For this data, the linear model would be , where is years since 1900. Compute the MAE, MAPE, MSE, and RMSE for this model.
Year | Time |
1912 | 10.6 |
1921 | 10.4 |
1930 | 10.3 |
1936 | 10.2 |
1956 | 10.1 |
1960 | 10 |
1968 | 9.95 |
1983 | 9.93 |
1988 | 9.92 |
1991 | 9.9 |
1991 | 9.86 |
1994 | 9.85 |
1996 | 9.84 |
1999 | 9.79 |
b.
Use the model to predict the world record fastest time for the 100 m spring in 2023. In August of 2023, Noah Lyles ran the 100 m sprint in 9.83 seconds. How does your prediction compare to this value?
2.
If today is a cloudy day, then there is 68% chance of rain tomorrow. If today is not cloudy, then there is only 15% chance of rain tomorrow. Build a discrete logistic regression model based on this data.