Abstract
Stochastic gradient descent (SGD) is commonly used for optimization in large-scale machine learning problems. Langford et al. (2009) introduce a spars......
小提示:本篇文献需要登录阅读全文,点击跳转登录