摘要: |
Many areas of machine learning and data mining focus on point estimates of key parameters. In transportation, however, the inherent variance, and, critically, the need to understand the limits of that variance and the impact it may have, have long been understood to be important. Indeed, variance and other risk measures that capture the cost of the spread around the mean, are critical factors in understanding how people act. Thus they are critical for prediction, as well as for purposes of long term planning, where controlling risk may be equally important to controlling the mean (the point estimate).
There has been tremendous progress on large scale optimization techniques to enable the solution of large scale machine learning and data analytics problems. Stochastic Gradient Descent and its variants is probably the most-used large-scale optimization technique for learning. This has not yet seen an impact on the problem of statistical inference - namely, obtaining distributional information that might allow us to control the variance and hence the risk of certain solutions. |