TY - GEN
T1 - Handling Class Imbalance in Black-Box Unsupervised Domain Adaptation with Synthetic Minority Over-Sampling
AU - Zou, Yawen
AU - Gu, Chunzhi
AU - Wang, Zi
AU - Li, Guang
AU - Yu, Jun
AU - Zhang, Chao
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Black-box unsupervised domain adaptation (BBUDA) is a challenging task that transfers knowledge from the source domain to the target domain without access to the source data and source model, thus alleviating public concerns about data security. However, BBUDA requires the source model to function as a black-box predictor for the target data, and the pseudo-labels often exhibit class imbalance, which degrades the performance. To tackle this problem, we propose employing the synthetic minority oversampling technique (SMOTE) and adaptive sampling to rebalance data. Given that predictions often contain errors, we first select reliable high-confidence data before using SMOTE to generate synthetic samples for the minority class. Second, we incrementally select high-confidence data from the remaining low-confidence data with an adaptive sampling rate for each class, in which the minority class (with the fewest samples) is assigned a higher sampling rate and the majority class (with the most samples) is assigned a lower sampling rate. The experimental results demonstrate that our method can mitigate the class imbalance and further improve the performance of the target model.
AB - Black-box unsupervised domain adaptation (BBUDA) is a challenging task that transfers knowledge from the source domain to the target domain without access to the source data and source model, thus alleviating public concerns about data security. However, BBUDA requires the source model to function as a black-box predictor for the target data, and the pseudo-labels often exhibit class imbalance, which degrades the performance. To tackle this problem, we propose employing the synthetic minority oversampling technique (SMOTE) and adaptive sampling to rebalance data. Given that predictions often contain errors, we first select reliable high-confidence data before using SMOTE to generate synthetic samples for the minority class. Second, we incrementally select high-confidence data from the remaining low-confidence data with an adaptive sampling rate for each class, in which the minority class (with the fewest samples) is assigned a higher sampling rate and the majority class (with the most samples) is assigned a lower sampling rate. The experimental results demonstrate that our method can mitigate the class imbalance and further improve the performance of the target model.
KW - Black-Box Unsupervised Domain Adaptation (BBUDA)
KW - Class Imbalance
KW - Pseudo-Labels
UR - http://www.scopus.com/inward/record.url?scp=85218194211&partnerID=8YFLogxK
U2 - 10.1109/VCIP63160.2024.10849930
DO - 10.1109/VCIP63160.2024.10849930
M3 - 会議への寄与
AN - SCOPUS:85218194211
T3 - 2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
BT - 2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
Y2 - 8 December 2024 through 11 December 2024
ER -