Innovative Techniques in Imbalanced Data, Causal Inference, and Semi-Supervised Learning

The recent developments in the research area indicate a strong focus on addressing challenges related to imbalanced data, causal inference, and semi-supervised learning. A significant trend is the exploration of innovative sampling techniques and model calibration methods to handle class imbalance, which is critical for improving the performance of classification algorithms in real-world applications. Additionally, there is a growing interest in leveraging probabilistic principles and domain-specific insights to enhance machine learning models, particularly in regression and classification tasks. The integration of network information and disentangled representation learning for treatment effect estimation from observational data is also gaining traction, offering new avenues for causal inference. Furthermore, advancements in semi-supervised learning are being driven by the need to optimize the use of both labeled and unlabeled data, with a particular emphasis on risk control and prediction-powered inference. These developments collectively point towards a more nuanced and sophisticated approach to machine learning, where theoretical rigor and practical applicability are balanced to address complex, real-world problems.

Sources

Class flipping for uplift modeling and Heterogeneous Treatment Effect estimation on imbalanced RCT data

EvoSampling: A Granular Ball-based Evolutionary Hybrid Sampling with Knowledge Transfer for Imbalanced Learning

RegMixMatch: Optimizing Mixup Utilization in Semi-Supervised Learning

Semi-Supervised Risk Control via Prediction-Powered Inference

ABC3: Active Bayesian Causal Inference with Cohn Criteria in Randomized Experiments

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Probability-Informed Machine Learning

Leveraging Group Classification with Descending Soft Labeling for Deep Imbalanced Regression

A Conformal Approach to Feature-based Newsvendor under Model Misspecification

Mediation Analysis for Probabilities of Causation

Treatment Effects Estimation on Networked Observational Data using Disentangled Variational Graph Autoencoder

GBRIP: Granular Ball Representation for Imbalanced Partial Label Learning

Built with on top of