Zifeng Cheng

Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling

Jul 20, 2023
Zifeng Cheng, Qingyu Zhou, Zhiwei Jiang, Xuemin Zhao, Yunbo Cao, Qing Gu

Few-shot sequence labeling aims to identify novel classes from only a few labeled samples. Existing methods address data scarcity mainly by designing token-level or span-level labeling models based on metric learning. However, these methods are trained at a single granularity (i.e., either token level or span level) and inherit the weaknesses of that granularity. In this paper, we unify token-level and span-level supervision for the first time and propose a Consistent Dual Adaptive Prototypical (CDAP) network for few-shot sequence labeling. CDAP contains a token-level network and a span-level network, trained jointly at different granularities. To align the outputs of the two networks, we further propose a consistent loss that enables them to learn from each other. During inference, we propose a consistent greedy inference algorithm that first adjusts the predicted probability and then greedily selects non-overlapping spans with maximum probability. Extensive experiments show that our model achieves new state-of-the-art results on three benchmark datasets.
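The greedy selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the probability adjustment step is omitted because the abstract does not specify it, and the `Span` tuple layout and the function name `greedy_select` are assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical span record: (start, end inclusive, label, probability).
Span = Tuple[int, int, str, float]


def greedy_select(spans: List[Span]) -> List[Span]:
    """Pick spans in descending probability, skipping any span that
    overlaps one already chosen. Returns the chosen spans by position."""
    chosen: List[Span] = []
    for cand in sorted(spans, key=lambda s: s[3], reverse=True):
        start, end = cand[0], cand[1]
        # Two inclusive spans are disjoint when one ends before the other starts.
        if all(end < c[0] or start > c[1] for c in chosen):
            chosen.append(cand)
    return sorted(chosen, key=lambda s: s[0])


candidates = [
    (0, 1, "PER", 0.90),
    (1, 2, "LOC", 0.80),  # overlaps (0, 1)
    (3, 4, "ORG", 0.70),  # overlaps (4, 4)
    (4, 4, "LOC", 0.95),
]
selected = greedy_select(candidates)
```

Here the two lower-probability candidates are dropped because each shares a token with a higher-probability span that was selected first.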

* Accepted by ACM Transactions on Information Systems

Controlling Class Layout for Deep Ordinal Classification via Constrained Proxies Learning

Mar 01, 2023
Cong Wang, Zhiwei Jiang, Yafeng Yin, Zifeng Cheng, Shiping Ge, Qing Gu

For deep ordinal classification, learning a well-structured feature space specific to ordinal classification helps to capture the ordinal nature among classes. Intuitively, when the Euclidean distance metric is used, an ideal ordinal layout in feature space is one in which the sample clusters are arranged in class order along a straight line. However, forcing samples to conform to a specific layout in the feature space is a challenging problem. To address this problem, in this paper, we propose a novel Constrained Proxies Learning (CPL) method, which learns a proxy for each ordinal class and then adjusts the global layout of classes by constraining these proxies. Specifically, we propose two kinds of strategies: a hard layout constraint and a soft layout constraint. The hard layout constraint is realized by directly controlling the generation of proxies to force them into a strict linear layout or semicircular layout (i.e., two instantiations of a strict ordinal layout). The soft layout constraint is realized by requiring that the proxy layout always produce a unimodal proxy-to-proxies similarity distribution for each proxy (i.e., a relaxed ordinal layout). Experiments show that the proposed CPL method outperforms previous deep ordinal classification methods under the same feature extractor setting.
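As a rough illustration of the two layout ideas, the sketch below places proxies at equal steps along a line and checks that each proxy's similarity to all proxies is unimodal. It is a sketch under stated assumptions, not the CPL method itself: the equal spacing, the use of negative Euclidean distance as the similarity, and the helper names `linear_proxies`, `similarities`, and `is_unimodal` are all hypothetical.

```python
import math
from typing import List


def linear_proxies(num_classes: int, direction: List[float],
                   spacing: float = 1.0) -> List[List[float]]:
    """Place one proxy per class at equal steps along a unit direction
    (assumed form of a strict linear layout)."""
    norm = math.sqrt(sum(d * d for d in direction))
    unit = [d / norm for d in direction]
    return [[k * spacing * u for u in unit] for k in range(num_classes)]


def similarities(proxies: List[List[float]], k: int) -> List[float]:
    """Similarity of proxy k to every proxy, assumed here to be the
    negative Euclidean distance."""
    return [-math.dist(proxies[k], p) for p in proxies]


def is_unimodal(values: List[float]) -> bool:
    """True if values rise weakly to a single peak and then fall weakly."""
    peak = values.index(max(values))
    rising = all(values[i] <= values[i + 1] for i in range(peak))
    falling = all(values[i] >= values[i + 1]
                  for i in range(peak, len(values) - 1))
    return rising and falling


proxies = linear_proxies(5, [3.0, 4.0])
all_unimodal = all(is_unimodal(similarities(proxies, k)) for k in range(5))
```

A strict linear layout satisfies the unimodality check for every proxy, which is consistent with the abstract's description of the soft constraint as a relaxation of the strict ordinal layout.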

* Accepted by AAAI 2023

Learning to Classify Open Intent via Soft Labeling and Manifold Mixup

Apr 16, 2022
Zifeng Cheng, Zhiwei Jiang, Yafeng Yin, Cong Wang, Qing Gu

Open intent classification is a practical yet challenging task in dialogue systems. Its objective is to accurately classify samples of known intents while at the same time detecting those of open (unknown) intents. Existing methods usually combine outlier detection algorithms with a K-class classifier to detect open intents, where K is the number of known intent classes. In contrast, in this paper, we consider an approach that does not use outlier detection algorithms. Specifically, we directly train a (K+1)-class classifier for open intent classification, where the (K+1)-th class represents open intents. To address the challenge of training a (K+1)-class classifier with training samples from only K classes, we propose a deep model based on Soft Labeling and Manifold Mixup (SLMM). In our method, soft labeling is used to reshape the label distribution of the known intent samples, aiming to reduce the model's overconfidence on known intents. Manifold mixup is used to generate pseudo samples for open intents, aiming to better optimize the decision boundary of open intents. Experiments on four benchmark datasets demonstrate that our method outperforms previous methods and achieves state-of-the-art performance. All the code and data of this work can be obtained at https://github.com/zifengcheng/SLMM.
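The two ingredients named in the abstract can be sketched roughly as follows. This is an illustrative guess rather than the SLMM implementation: the abstract does not say how the label distribution is reshaped or how pseudo samples are labeled, so moving a fixed `open_mass` to the (K+1)-th class, mixing hidden vectors by linear interpolation, and assigning mixed samples the open label are all assumptions, as are the function names.

```python
from typing import List


def soft_label(true_class: int, num_known: int,
               open_mass: float = 0.1) -> List[float]:
    """Label over K+1 classes for a known-intent sample: most mass on the
    true class, a small assumed share on the open class (last index)."""
    label = [0.0] * (num_known + 1)
    label[true_class] = 1.0 - open_mass
    label[num_known] = open_mass
    return label


def mixup_hidden(h_a: List[float], h_b: List[float],
                 lam: float) -> List[float]:
    """Linear interpolation of two hidden representations."""
    return [lam * a + (1.0 - lam) * b for a, b in zip(h_a, h_b)]


def open_label(num_known: int) -> List[float]:
    """One-hot label for the open class (assumed target for pseudo samples)."""
    label = [0.0] * (num_known + 1)
    label[num_known] = 1.0
    return label


K = 3
y_known = soft_label(true_class=1, num_known=K, open_mass=0.2)
h_pseudo = mixup_hidden([1.0, 0.0], [0.0, 1.0], lam=0.5)
y_pseudo = open_label(K)
```

With these definitions, a classifier with K+1 outputs could be trained on known samples paired with `y_known`-style targets and on mixed hidden vectors paired with `y_pseudo`, even though no real open-intent samples are available.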

* Accepted by IEEE/ACM Transactions on Audio Speech and Language
