Deep Learning for Tabular and Categorical Data

So far, tree-based models like XGBoost are still the state-of-the-art for tabular data, despite the success of deep learning in many areas. Our goal is to develop competitive deep learning methods for tabular data, as common in many real-world data science tasks. One problem is the fact that tabular data typically contains high-cardinality categorical features, which are challenging for deep learning methods. Therefore, a focus of this project is to develop scalable methods that can handle such features. One option are so-called mixed-effects neural networks.

In addition, we are working on gradient descent methods for learning tree-based models, combining the strengths of deep learning models and tree-based models.

Contact

Stefan Lüdtke

stefan.luedtke2@uni-rostock.de