Skip to ContentGo to accessibility pageKeyboard shortcuts menu
OpenStax Logo
Principles of Data Science

Quantitative Problems

Principles of Data ScienceQuantitative Problems

1.
 
a.
Consider the data in the following table representing world record fastest times for the 100 m sprint and the year in which each occurred, from 1912 through 2002:
Year Time
1912 10.6
1921 10.4
1930 10.3
1936 10.2
1956 10.1
1960 10
1968 9.95
1983 9.93
1988 9.92
1991 9.9
1991 9.86
1994 9.85
1996 9.84
1999 9.79
Table 6.10 100-Meter Spring Records
Using software such as Excel, Python, or similar tools, the regression line can be found. For this data, the linear model would be y ^ = 0.00769 x + 10.55 , where x is years since 1900. Compute the MAE, MAPE, MSE, and RMSE for this model.
b.
Use the model to predict the world record fastest time for the 100 m spring in 2023. In August of 2023, Noah Lyles ran the 100 m sprint in 9.83 seconds. How does your prediction compare to this value?
2.
If today is a cloudy day, then there is 68% chance of rain tomorrow. If today is not cloudy, then there is only 15% chance of rain tomorrow. Build a discrete logistic regression model based on this data.
3.
Compute the information contained in the events.
a.
Obtaining a head and a tail on a flip of two fair coins
b.
Obtaining all heads on the flip of three fair coins
c.
Rolling 10 fair 6-sided dice and obtaining all 1s
4.
Compute the entropy of the following discrete random variables.
a.
The distribution X whose values are given in the following table.
X 1 2 3 4
p ( x ) 0.7 0.15 0.05 0.05
Table 6.11 Distribution X Values
b.
The binomial distribution with p = 1 / 3 and n = 4 . Recall, the binomial distribution is defined by:
P ( x ) = ( n x ) p x ( 1 p ) n x
Citation/Attribution

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution-NonCommercial-ShareAlike License and you must attribute OpenStax.

Attribution information
  • If you are redistributing all or part of this book in a print format, then you must include on every physical page the following attribution:
    Access for free at https://openstax.org/books/principles-data-science/pages/1-introduction
  • If you are redistributing all or part of this book in a digital format, then you must include on every digital page view the following attribution:
    Access for free at https://openstax.org/books/principles-data-science/pages/1-introduction
Citation information

© Dec 19, 2024 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.