
Optimal Semi-supervised Subsampling via Predictive Inference

Abstract:

In the big data era, subsampling or sub-data selection techniques are often adopted to extract a fraction of informative individuals from massive data. Existing subsampling algorithms focus mainly on obtaining a representative subset that achieves the best estimation accuracy under a given class of models. In this talk, we consider a semi-supervised setting in which a small or moderately sized “labeled” data set is available in addition to a much larger “unlabeled” data set. The goal is to sample from the unlabeled data under a given budget to obtain informative individuals that are characterized by their unobserved responses. I will introduce an optimal subsampling procedure that maximizes the diversity of the selected subsample while controlling the false selection rate (FSR), allowing us to explore as much reliable information as possible.

Time: 2022-12-20 (Tuesday) 16:30-18:00
Venue: Tencent Meeting: 37586125504; Economics Building N302
Language: Chinese
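Organizers: School of Economics and Management, University of Chinese Academy of Sciences; Center for Forecasting Science, Chinese Academy of Sciences; Gregory and Paula Chow Institute for Studies in Economics, Xiamen University; NSFC Basic Science Center for "Econometric Modeling and Economic Policy Studies"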
Hosted by:
Event website:
Contact: 许老师 (Xu), 0592-2182991