ML2R became the Lamarr Institute – find all current information here!

Coordinator

Helena Kotthaus
Dr. Helena Kotthaus

Coordinator

Helena Kotthaus
Dr. Helena Kotthaus

Challenges and Open Research Questions

With the heterogeneity of modern computing architectures and the massive amount of data generated by edge devices, Machine Learning research is facing new challenges. Especially in time-critical or resource-constrained environments, targeted coordination of model and execution platform is indispensable. In particular, FPGAs (Field Programmable Gate Arrays) are well suited for IoT applications to save energy. At the same time, it is necessary to adapt new implementations of known ML algorithms to specific hardware requirements. To enable reliable application, resource requirements of several implementations for one algorithm need to be characterized. For example, even the well-known decision tree algorithm still offers novel research questions: Which implementation is best suited for which computing architecture and application requirements?

Current Research Activities

We investigate models of Machine Learning for distributed and strongly limited computer architectures and evaluate modern hardware regarding its suitability for Machine Learning. Learning procedures are examined and, if necessary, adapted to improve their executability on strongly resource-limited devices. Some of these adjustments go hand in hand with an approximation which must be theoretically justified and have error barriers. The results are used directly for algorithm curation, which includes describing limits of applicability as well as computational and energy requirements. Our current resource-aware ML research covers the following aspects:

Analysis of ML resource-requirements:

    • Resource-utilization labels including runtime, memory, and energy requirements of ML algorithms
    • Enable the certification process for resource-constrained and safety-critical ML applications

Investigating hard- and software for resource-constrained scenarios:

    • Resource-aware and distributed on-device learning
    • Decentralized data generation on edge devices
    • Resource-efficient implementations of anomaly detection algorithms
    • End-to-end learning from astroparticle image streams using deep learning on FPGAs
    • Image recognition-based tracking of logistic entities
    • C Code-generation for fast inference on IoT systems

Optimization of algorithms:

    • Resource-aware data acquisition through active class selection
    • Resource-aware hyperparameter-optimization for ML algorithms

Recommended Reading and Selected Publications

From_ML2R

P. Welke, F. Alkhoury, C. Bauckhage, S. Wrobel: Decision Snippet Features. ICPR, 2021. More_

From_ML2R

M. Bunse, D. Weichert, A. Kister, K. Morik: Optimal Probabilistic Classification in Active Class Selection. ICDM, 2020. Video_

From_ML2R

S. Buschjäger, P.-J. Honysz, K. Morik: Randomized Outlier Detection with Trees. In: International Journal of Data Science and Analytics, 2020. More_

From_ML2R

S. Buschjäger, L. Pfahler, J. Buss, K. Morik, W. Rhode: On-Site Gamma-Hadron Separation with Deep Learning on FPGAs. ECML PKDD, 2020. To_document Video

From_ML2R

O. Urbann, S. Camphausen, A. Moos, I. Schwarz, S. Kerner, M. Otten: A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems. C4ML Workshop at CGO, 2020. More_

Recommended

B. Li, S. Cen, Y. Chen, Y. Chi: Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction. In: JMLR 21(180), 2020. More_

Recommended

H. Rakhshani, H. I. Fawaz, L. Idoumghar, G. Forestier, J. Lepagnot, J. Weber, M. Brévilliers, P. Muller: Neural Architecture Search for Time Series Classification. IJCNN, 2020. More_

Recommended

P. Kairouz, et al.: Advances and Open Problems in Federated Learning. In: Foundations and Trends in Machine Learning 4(1), 2019. More_

Recommended

C. C. Aggarwal: Outlier Analysis. Springer, 2016. More_

Recommended

C. Bockermann, K. Brügge, J. Buss, A. Egorov, K. Morik, W. Rhode, T. Ruhe: Online Analysis of High-Volume Data Streams Astroparticle Physics. ECML-PKDD, 2015. More_

Recommended

D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young, J. Crespo, D. Dennison: Hidden technical debt in machine learning systems. NIPS, 2015. More_