We consider the problem of binary class prob-ability estimation (CPE) when one class is rare compared to the other. It is well known that stan-dard algorithms such as logistic regression do not perform well in this setting as they tend to under-estimate the probability of the rare class. Com-mon fixes include under-sampling and weight-ing, together with various correction schemes. Recently, Wang & Dey (2010) suggested the use of a parametrized family of asymmetric link functions based on the generalized extreme value (GEV) distribution, which has been used for modeling rare events in statistics. The approach showed promising initial results, but combined with the logarithmic CPE loss implicitly used in their work, it results in a non-co...
Estimation of rare-event probabilities in high-dimensional settings via importance sampling is a dif...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The univariate generalized extreme value (GEV) distribution is the most commonly used tool for analy...
This paper develops a method for modelling binary response data in a regression model with highly un...
We aim at proposing a new model for binary rare events, i.e. binary dependent variable with a very s...
We aim at proposing a new model for binary rare events, i.e. binary depen- dent variable with a ver...
What are the natural loss functions for binary class probability estimation? This question has a sim...
A new bivariate Generalised Linear Model (GLM) is proposed for binary rare events, i.e. binary depen...
We aim at proposing a new model for binary rare events, i.e. binary dependent variable with a very ...
We investigate to which extent one can recover class probabilities within the empirical risk minimiz...
In order to model credit defaults we propose a Generalized Linear Model (McCullagh and Neleder, 1989...
We aim at proposing a Generalized Additive Model (GAM) for binary rare events, i.e. binary dependen...
The generalised extreme value (GEV) distribution is a three parameter family that describes the asym...
Rare events represent a great analytical challenge. The maximum likelihood-based (ML) binary logit m...
A boosting-based machine learning algorithm is presented to model a binary response with large imbal...
Estimation of rare-event probabilities in high-dimensional settings via importance sampling is a dif...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The univariate generalized extreme value (GEV) distribution is the most commonly used tool for analy...
This paper develops a method for modelling binary response data in a regression model with highly un...
We aim at proposing a new model for binary rare events, i.e. binary dependent variable with a very s...
We aim at proposing a new model for binary rare events, i.e. binary depen- dent variable with a ver...
What are the natural loss functions for binary class probability estimation? This question has a sim...
A new bivariate Generalised Linear Model (GLM) is proposed for binary rare events, i.e. binary depen...
We aim at proposing a new model for binary rare events, i.e. binary dependent variable with a very ...
We investigate to which extent one can recover class probabilities within the empirical risk minimiz...
In order to model credit defaults we propose a Generalized Linear Model (McCullagh and Neleder, 1989...
We aim at proposing a Generalized Additive Model (GAM) for binary rare events, i.e. binary dependen...
The generalised extreme value (GEV) distribution is a three parameter family that describes the asym...
Rare events represent a great analytical challenge. The maximum likelihood-based (ML) binary logit m...
A boosting-based machine learning algorithm is presented to model a binary response with large imbal...
Estimation of rare-event probabilities in high-dimensional settings via importance sampling is a dif...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The univariate generalized extreme value (GEV) distribution is the most commonly used tool for analy...