ACL2021

On the Cost-Effectiveness of Stacking of Neural and Non-Neural Methods for Text Classification: Scenarios and Performance Prediction

Christian Gomes, Marcos André Gonçalves, Leonardo Rocha, Sérgio D. Canuto

Abstract

Nowadays Neural Network algorithms have excelled in Automatic Text Classification (ATC). However, such enhanced performance comes at high computational costs. Stacking of simpler classifiers that exploit algorithmic and representational complementarity has also been shown to produce superior performance in ATC, enjoying high effectiveness and potentially lower computational costs than complex neural networks. In this master's thesis, we present the first and largest comparative study to exploit the cost-effectiveness of Stacking in ATC, consisting of Transformers and non-neural algorithms. We investigate cost-effective ensemble vs. the best model and propose a low-cost oracle-based prediction method.