Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
Distributed Lasso Algorithm Based on Alternating Direction Method of Multipliers
ICRSA '24: Proceedings of the 2024 7th International Conference on Robot Systems and ApplicationsPages 82–86https://doi.org/10.1145/3702468.3702483Lasso is widely known as a sparse estimation method for regression coefficients in linear regression models, and the Alternating Direction Method of Multipliers (ADMM) is one of the representative computational methods for lasso. In this paper, we propose ...
- research-articleMarch 2024
Dimension reduction in contextual online learning via nonparametric variable selection
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 136, Pages 6246–6329We consider a contextual online learning (multi-armed bandit) problem with high-dimensional covariate x and decision y. The reward function to learn, f(x, y), does not have a particular parametric form. The literature has shown that the optimal regret is ...
- research-articleSeptember 2022
Logistic Regression via Excel Spreadsheets: Mechanics, Model Selection, and Relative Predictor Importance
INFORMS Transactions on Education (ITED), Volume 23, Issue 1Pages 1–11https://doi.org/10.1287/ited.2021.0263Logistic regression is one of the most fundamental tools in predictive analytics. Graduate business analytics students are often familiarized with implementation of logistic regression using Python, R, SPSS, or other software packages. However, an ...
- research-articleJanuary 2022
Soybean price forecasting based on Lasso and regularized asymmetric ν-TSVR
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 43, Issue 4Pages 4859–4872https://doi.org/10.3233/JIFS-212525Asymmetric ν-twin Support vector regression (Asy-ν-TSVR) is an effective regression model in price prediction. However, there is a matrix inverse operation when solving its dual problem. It is well known that it may be not reversible, therefore a ...
- research-articleSeptember 2021
Personalized Dynamic Pricing with Machine Learning: High-Dimensional Features and Heterogeneous Elasticity
We consider a seller who can dynamically adjust the price of a product at the individual customer level, by utilizing information about customers’ characteristics encoded as a d-dimensional feature vector. We assume a personalized demand model, parameters ...
-
- research-articleJuly 2021
A Cross-Sectional Machine Learning Approach for Hedge Fund Return Prediction and Selection
We apply four machine learning methods to cross-sectional return prediction for hedge fund selection. We equip the forecast model with a set of idiosyncratic features, which are derived from historical returns of a hedge fund and capture a variety of fund-...
- research-articleJanuary 2021
Estimating the Lasso's effective noise
The Journal of Machine Learning Research (JMLR), Volume 22, Issue 1Article No.: 276, Pages 12658–12689Much of the theory for the lasso in the linear model Y = Xβ* + ε hinges on the quantity 2∥XΤε∥∞/n, which we call the lasso's effective noise. Among other things, the effective noise plays an important role in finite-sample bounds for the lasso, the ...
- research-articleJanuary 2021
Lasso Hyperinterpolation Over General Regions
SIAM Journal on Scientific Computing (SISC), Volume 43, Issue 6Pages A3967–A3991https://doi.org/10.1137/20M137793XThis paper develops a fully discrete soft thresholding polynomial approximation over a general region, named Lasso hyperinterpolation. This approximation is an $\ell_1$-regularized discrete least squares approximation under the same conditions of ...
- research-articleApril 2020
Online Risk Monitoring Using Offline Simulation
INFORMS Journal on Computing (INFORMS-IJOC), Volume 32, Issue 2Pages 356–375https://doi.org/10.1287/ijoc.2019.0892Estimating portfolio risk measures and classifying portfolio risk levels in real time are important yet challenging tasks. In this paper, we propose to build a logistic regression model using data generated in past simulation experiments and to use the ...
- research-articleJuly 2019
ET-Lasso: A New Efficient Tuning of Lasso-type Regularization for High-Dimensional Data
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningPages 607–616https://doi.org/10.1145/3292500.3330910The $L_1 $ regularization (Lasso) has proven to be a versatile tool to select relevant features and estimate the model coefficients simultaneously and has been widely used in many research areas such as genomes studies, finance, and biomedical imaging. ...
- review-articleJune 2019
Bayesian regularization: From Tikhonov to horseshoe
Bayesian regularization is a central tool in modern‐day statistical and machine learning methods. Many applications involve high‐dimensional sparse signal recovery problems. The goal of our paper is to provide a review of the literature on penalty‐based ...
Comparison of geometry of a unit ball induced by Normal, Laplace, Cauchy and Horseshoe priors. image image
- articleJanuary 2019
Semi-analytic resampling in Lasso
An approximate method for conducting resampling in Lasso, the l1 penalized linear regression, in a semi-analytic manner is developed, whereby the average over the resampled datasets is directly computed without repeated numerical sampling, thus enabling ...
- research-articleAugust 2018
Integration of Cancer Data through Multiple Mixed Graphical Model
BCB '18: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health InformaticsPages 341–350https://doi.org/10.1145/3233547.3233557The state of the art in bio-medical technologies has produced many genomic, epigenetic, transcriptomic, and proteomic data of varied types across different biological conditions. Historically, it has always been a challenge to produce new ways to ...
- research-articleJune 2018
Understanding and Predicting the Legislative Process in the Chamber of Deputies of Brazil
SBSI '18: Proceedings of the XIV Brazilian Symposium on Information SystemsArticle No.: 23, Pages 1–8https://doi.org/10.1145/3229345.3229371In this article, based on of open legislative data mining, we propose a methodology to create a model capable of indicating which characteristics have a positive or negative impact on the approval of a bill by the Chamber of Deputies. Added to the ...
- short-paperMay 2018
Natural language or not (NLON): a package for software engineering text analysis pipeline
MSR '18: Proceedings of the 15th International Conference on Mining Software RepositoriesPages 387–391https://doi.org/10.1145/3196398.3196444The use of natural language processing (NLP) is gaining popularity in software engineering. In order to correctly perform NLP, we must pre-process the textual information to separate natural language from other information, such as log messages, that are ...
- articleJanuary 2018
On tight bounds for the Lasso
We present upper and lower bounds for the prediction error of the Lasso. For the case of random Gaussian design, we show that under mild conditions the prediction error of the Lasso is up to smaller order terms dominated by the prediction error of its ...
- research-articleJanuary 2018
A Highly Efficient Semismooth Newton Augmented Lagrangian Method for Solving Lasso Problems
SIAM Journal on Optimization (SIOPT), Volume 28, Issue 1Pages 433–458https://doi.org/10.1137/16M1097572We develop a fast and robust algorithm for solving large-scale convex composite optimization models with an emphasis on the $\ell_1$-regularized least squares regression (lasso) problems. Despite the fact that there exist a large number of solvers in the ...
- research-articleDecember 2017
Laplacian Black Box Variational Inference
BDIOT '17: Proceedings of the International Conference on Big Data and Internet of ThingPages 91–95https://doi.org/10.1145/3175684.3175700Black box variational inference (BBVI) is a recently proposed estimation method for parameters of statistical models. BBVI is an order of magnitude faster than Markov chain Monte Carlo (MCMC). The computation of BBVI is similar to maximum a posteriori ...
- articleMarch 2017
Post selection shrinkage estimation for high-dimensional data analysis
In high-dimensional data settings where pï źï źï źn, many penalized regularization approaches were studied for simultaneous variable selection and estimation. However, with the existence of covariates with weak effect, many existing variable selection ...
- articleJanuary 2017
Saturating splines and feature selection
We extend the adaptive regression spline model by incorporating saturation, the natural requirement that a function extend as a constant outside a certain range. We fit saturating splines to data via a convex optimization problem over a space of ...