To obtain a transparent and clear idea of what to expect when writing a thesis with us, please consider the thesis process detailed here.
There are two ways to determine a potential topic for a thesis with us:
Own topic and personal initiative: Students may propose their own topic and submit a detailed description of their proposed project and the methods they intend to use, emphasizing the close connection to the interests and research fields of the institute's scientific members. These interests are often related to our teaching activities and include machine learning, neural networks, reinforcement learning, control theory, information theory, robotics, statistical learning theory, information geometry, and algebraic statistics.
Topic offered by us: Students may choose one of the thesis topics offered on our website (see the topic descriptions below) and apply for it.
A natural and convenient way to learn about our research and teaching interests is to attend our lectures or seminars. However, you can generally also apply if you have not participated in our teaching activities. It is also worth looking at our team's publications to learn about the research interests of individual team members.
In both cases, you can apply by sending the following two documents to our office:
A transcript of the grades you have received so far. If you apply for a Master’s thesis, please also send us your Bachelor’s grades.
A filled-out version of this template, to provide us with a detailed description of your thesis ideas. If you select a topic offered on our website, please also fill out all sections of the template as completely as possible, so that we can see whether you have understood the topic correctly.
We will only accept applications in German or English (English preferred) that strictly follow the template (max. 2 pages) and include a transcript of your grades.
Based on your application documents, we will decide whether to consider your application further. It is unlikely that we will consider your application if we get the impression that you lack commitment. However, if you have prepared your template with care, there is a good chance that we will invite you for a personal interview and presentation (~10 min). Based on your presentation and the interview, we will make a final decision on whether to accept you as a candidate. Our decision will be based on how we perceive your commitment and eagerness, on your previous knowledge in the field of your chosen topic, and on how close your topic is to the research focus of our team. If you prepare your presentation well, and if the topic is close to the active research of at least one of our team members, there is a very good chance that we will accept your application.
We highly encourage you to write your thesis in LaTeX. Please use this template. We recommend that you copy the template directly into Overleaf and write your thesis there. This has the advantage that you do not need to install any software and that your supervisor can directly see the progress of your thesis and give feedback. However, you are generally free to use any text editor you like, as long as you use the same formatting as our template.
For your presentation(s) and the defense, we have a PowerPoint template here.
Supervisor: Dr. Manfred Eppe
Topic Description:
Reinforcement learning (RL) is often studied in virtual environments. This makes sense because physical hardware is expensive and often not available. However, to harness the full capabilities of RL, it must be applied to the real world. We therefore invite students to apply our virtual robotics lab Scilab-RL to real-world reinforcement learning applications. This can be done in collaboration with an industry partner, as long as the student uses our virtual lab Scilab-RL.
The student should have a clear idea of the real-world application. Most importantly, this includes knowledge of the sensory data the application receives and the actions it can take.
The focus of the work can also be on bridging the sim2real gap: in this case, the student will develop or integrate a simulated version of the application. If time permits, the student can then transfer the result to the real-world application, but it is also acceptable to keep the focus on the simulated environment only.
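As a rough illustration of what knowing the sensory data and actions amounts to in practice, here is a minimal sketch of a Gymnasium-style environment wrapper around a hypothetical hardware interface. All device methods (read_sensors, send_command, compute_reward, task_done) and the space dimensions are placeholders for illustration, not part of Scilab-RL.

```python
# Hypothetical sketch: wrapping a real-world application as a Gymnasium-style
# environment so that standard RL algorithms can interact with it.
import numpy as np
import gymnasium as gym
from gymnasium import spaces


class RealWorldEnv(gym.Env):
    """Minimal environment skeleton for a real-world application (placeholder)."""

    def __init__(self, device):
        self.device = device  # placeholder interface to the real hardware
        # Sensory data: e.g. 6 floating-point sensor readings per step.
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(6,), dtype=np.float32)
        # Actions: e.g. 2 continuous motor commands in [-1, 1].
        self.action_space = spaces.Box(-1.0, 1.0, shape=(2,), dtype=np.float32)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        obs = np.asarray(self.device.read_sensors(), dtype=np.float32)
        return obs, {}

    def step(self, action):
        self.device.send_command(action)               # apply the action to the hardware
        obs = np.asarray(self.device.read_sensors(), dtype=np.float32)
        reward = float(self.device.compute_reward())   # task-specific reward, placeholder
        terminated = bool(self.device.task_done())
        return obs, reward, terminated, False, {}
```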
Requirements:
Extensive experience in Python programming
Experience with at least one Python-based neural-network library (TensorFlow or PyTorch)
Access to a decent laptop or computer. A gaming PC is great, but a recent notebook with at least an i5 processor or comparable CPU and 8 GB of memory is sufficient. A GPU is not required. The operating system must be Windows 11 (not 10!), macOS, or Ubuntu Linux.
Nice-to-have:
Experience in Reinforcement Learning
Experience with simulated or real robots
Experience with Git
Participation in our seminar “Introduction to RL” is encouraged
Literature:
Reinforcement Learning: An Introduction by Sutton and Barto: https://www.andrew.cmu.edu/course/10-703/textbook/BartoSutton.pdf
A first intuitive overview of SAC: https://spinningup.openai.com/en/latest/algorithms/sac.html
A first intuitive overview of PPO: https://spinningup.openai.com/en/latest/algorithms/ppo.html
Supervisor: Dr. Manfred Eppe
Topic Description:
Our virtual robotics lab Scilab-RL currently supports only single-agent training. We invite students to experiment with training multiple agents in the same environment. To this end, the student is expected to derive a multi-agent version of an existing RL algorithm such as PPO or SAC. The approach is then to be evaluated in a multi-agent environment, similar to the multi-agent hide-and-seek application by OpenAI: https://openai.com/research/emergent-tool-use
We have already implemented a simple multi-agent environment with two differential-drive robots that navigate through a maze. You can start your work with this and extend it if needed.
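One possible starting point, sketched below under the assumption of a parallel multi-agent interface that returns dictionaries keyed by agent id (PettingZoo-style), is independent learners: each agent keeps its own policy and is updated only from its own observations, actions, and rewards. This is merely a baseline scheme; the actual multi-agent extension of PPO or SAC is the subject of the thesis, and Scilab-RL's real API may differ.

```python
# Sketch of "independent learners" as a first multi-agent baseline (assumptions only).
def train_independent_agents(env, agents, episodes=100):
    """agents: dict mapping agent_id -> object with act() and update() methods."""
    for _ in range(episodes):
        observations, _ = env.reset()                  # {agent_id: observation}
        done = {aid: False for aid in agents}
        while not all(done.values()):
            # Each agent chooses its action from its own observation only.
            actions = {aid: agents[aid].act(observations[aid])
                       for aid in agents if not done[aid]}
            next_obs, rewards, terminations, truncations, _ = env.step(actions)
            for aid, agent in agents.items():
                if done[aid]:
                    continue
                # Each agent is trained only on its own transition.
                agent.update(observations[aid], actions[aid],
                             rewards[aid], next_obs[aid])
                done[aid] = terminations[aid] or truncations[aid]
            observations = next_obs
```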
Requirements:
Extensive experience in Python programming
Experience with at least one Python-based neural-network library (TensorFlow or PyTorch)
Access to a decent laptop or computer. A gaming PC is great, but a recent notebook with at least an i5 processor or comparable CPU and 8 GB of memory is sufficient. A GPU is not required. The operating system must be Windows 11 (not 10!), macOS, or Ubuntu Linux.
Nice-to-have:
Experience in Reinforcement Learning
Experience with simulated or real robots
Experience with Git
Participation in our seminar “Introduction to RL” is encouraged
Literature:
Reinforcement Learning: An Introduction by Sutton and Barto: https://www.andrew.cmu.edu/course/10-703/textbook/BartoSutton.pdf
A first intuitive overview of SAC: https://spinningup.openai.com/en/latest/algorithms/sac.html
A first intuitive overview of PPO: https://spinningup.openai.com/en/latest/algorithms/ppo.html
Supervisor: Carlotta Langer
Topic Description:
In our simple experimental setup, a simulated agent moves inside a racetrack while the information flows between the agent and its environment are evaluated with information-theoretic measures. At the moment, these agents learn to avoid touching the walls of the racetrack using an information-geometric algorithm. The implementation of the framework can be found in this GitHub repository.
In this project, the student would use the existing framework and additionally implement reinforcement learning algorithms. The impact of the different algorithms on the agents should then be assessed by analyzing the changes in behavior and in various information-theoretic measures.
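As a small, self-contained illustration of the kind of analysis involved (not code from the existing framework), the following sketch estimates the mutual information between discretized sensor readings and actions from a logged interaction. The variable names, the discretization into integer bins, and the toy interaction log are purely illustrative assumptions.

```python
# Sketch: empirical mutual information I(S; A) between discretized sensors and actions.
import numpy as np


def mutual_information(sensor_bins, action_bins):
    """Empirical mutual information (in bits) of two discrete integer sequences."""
    joint, _, _ = np.histogram2d(sensor_bins, action_bins,
                                 bins=(sensor_bins.max() + 1, action_bins.max() + 1))
    p_sa = joint / joint.sum()                    # joint distribution
    p_s = p_sa.sum(axis=1, keepdims=True)         # marginal over sensors
    p_a = p_sa.sum(axis=0, keepdims=True)         # marginal over actions
    nonzero = p_sa > 0
    return float(np.sum(p_sa[nonzero] * np.log2(p_sa[nonzero] / (p_s @ p_a)[nonzero])))


# Example: a random interaction log with 8 sensor bins and 4 discrete actions,
# where the action is partially determined by the sensor reading.
rng = np.random.default_rng(0)
sensors = rng.integers(0, 8, size=10_000)
actions = (sensors // 2) % 4
print(mutual_information(sensors, actions))
```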
Requirements:
Nice-to-have:
Literature:
Supervisor: Adwait Datar
Topic Description:
Embodied intelligence, defined in (Roy et al., 2021) as the purposeful exchange of energy and information with a physical environment, is an active area of research. The physical environment here may be either the real world with real physical constraints or a simulated environment with artificially enforced constraints. A natural question that arises is how to choose a controller architecture that is well suited to the physical constraints of the agent. The choice of architecture depends on the span of possible movements as well as on the class of desired behaviours expected from the agent. (Ay, 2015) proposes geometric design principles to address some of these questions in a very general setting.
In this project, we follow up on these ideas and apply them to concrete examples of mobile robots with neural-network architectures. An example of a desired behaviour for mobile robots is the ability to move from one arbitrary position in space to another. We start by looking at agents with simple linear physical constraints and progressively increase the complexity of the constraints (for example, by introducing non-holonomic constraints). For each class of physical constraints, we study a variety of control architectures and investigate if and how knowledge of the physical constraints can be incorporated into the design of a suitable controller architecture. A concrete project revolving around these ideas will be formalized in discussion with the student, based on the format of the thesis (project work / Bachelor's thesis / Master's thesis) and the student's interests.
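To make the setting concrete, the sketch below implements a standard unicycle model as an example of a non-holonomic constraint (the robot cannot move sideways), together with a tiny feedforward network that maps a goal-relative state to bounded velocity commands. The architecture, state encoding, and random weights are illustrative assumptions only; in the project, the controller parameters would be learned or designed rather than drawn at random.

```python
# Sketch: non-holonomic unicycle dynamics driven by a small neural-network controller.
import numpy as np


def unicycle_step(state, control, dt=0.05):
    """state = (x, y, theta); control = (v, omega); forward/turn motion only."""
    x, y, theta = state
    v, omega = control
    return np.array([x + dt * v * np.cos(theta),
                     y + dt * v * np.sin(theta),
                     theta + dt * omega])


def controller(state, goal, weights):
    """One-hidden-layer network mapping the goal error to (v, omega)."""
    w1, b1, w2, b2 = weights
    features = np.array([goal[0] - state[0], goal[1] - state[1],
                         np.sin(state[2]), np.cos(state[2])])
    hidden = np.tanh(w1 @ features + b1)
    return np.tanh(w2 @ hidden + b2)      # bounded velocity commands in (-1, 1)


# Roll out the closed loop with (untrained) random controller weights.
rng = np.random.default_rng(0)
weights = (rng.normal(size=(16, 4)), np.zeros(16), rng.normal(size=(2, 16)), np.zeros(2))
state, goal = np.zeros(3), np.array([1.0, 1.0])
for _ in range(100):
    state = unicycle_step(state, controller(state, goal, weights))
```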
Requirements:
Nice-to-have:
Literature:
Supervisor: Adwait Datar
Topic Description:
It is well known that learning dynamics in recurrent neural networks (RNNs) suffer from a variety of problems such as vanishing and exploding gradients. In order to make progress towards understanding some of these problems, some recent works (Li et al., 2020, Hardt et al., 2018) study linear RNNs which are essentially linear time-invariant (LTI) systems.
System identification (Ljung, 1998; Qin, 2006) of LTI systems is a well-developed field with powerful tools at our disposal. In particular, subspace identification tools allow us to identify state-space models from data without relying on iterative techniques such as gradient descent.
This project takes these ideas up by investigating and comparing the performance of subspace identification methods with that of iterative methods such as gradient descent on benchmark examples. The key aspects of this investigation include:
- Computational complexity: How does computational cost depend on the system size?
- Sample complexity: How is learning improved as data-size increases?
- Speed of convergence (learning) with gradient descent learning
- Effect of different parameterizations: What effect does parameterization have on gradient descent learning?
A concrete project revolving around these ideas will be formalized in discussion with the student, based on the format of the thesis (project work / Bachelor's thesis / Master's thesis) and the student's interests.
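For intuition about the non-iterative flavour of such methods, the following numpy sketch performs a Ho-Kalman-style realization of a small LTI system from its impulse response (Markov parameters) via an SVD of the associated Hankel matrix. Full subspace identification from general input-output data involves additional steps, so this is only an illustrative simplification with made-up system matrices.

```python
# Sketch: Ho-Kalman-style realization of an LTI system from its Markov parameters.
import numpy as np

rng = np.random.default_rng(0)
n, m = 3, 10  # true state dimension, Hankel block size

# Ground-truth SISO system: x_{t+1} = A x_t + B u_t,  y_t = C x_t.
A = np.diag([0.9, 0.5, -0.3])
B = rng.normal(size=(n, 1))
C = rng.normal(size=(1, n))

# Markov parameters (impulse response) h_k = C A^{k-1} B for k = 1..2m.
markov = [C @ np.linalg.matrix_power(A, k - 1) @ B for k in range(1, 2 * m + 1)]

# Hankel matrix of Markov parameters and its SVD.
H = np.block([[markov[i + j] for j in range(m)] for i in range(m)])
U, s, Vt = np.linalg.svd(H)
r = int(np.sum(s > 1e-8))                       # estimated state dimension
O = U[:, :r] * np.sqrt(s[:r])                   # observability factor
Q = np.sqrt(s[:r])[:, None] * Vt[:r]            # controllability factor
C_hat, B_hat = O[:1, :], Q[:, :1]
A_hat = np.linalg.pinv(O[:-1, :]) @ O[1:, :]    # shift-invariance of O

# The identified (A_hat, B_hat, C_hat) reproduces the impulse response up to similarity.
h_hat = [C_hat @ np.linalg.matrix_power(A_hat, k - 1) @ B_hat for k in range(1, 2 * m + 1)]
print(np.max(np.abs(np.array(markov) - np.array(h_hat))))
```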
Requirements:
Nice-to-have:
Literature:
Supervisor: Jan Benad
Topic Description:
In reinforcement learning, an agent gathers experience by interacting with its environment. This data is stored in a so-called replay buffer and reused later for training the agent. Usually, experience transitions are sampled uniformly from the replay buffer, regardless of their significance. Prior work (Schaul et al., 2016), however, showed that some form of prioritization can be beneficial.
We consider environments that change over time, so experience gained early on quickly becomes outdated. Prioritized sampling therefore seems advisable. For exactly this scenario, the student is expected to develop a sampling mechanism that takes changes in the environment into account.
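As one possible, purely hypothetical starting point rather than the expected solution, the sketch below implements a replay buffer whose sampling probabilities decay exponentially with the age of a transition, so that data collected before the environment changed is replayed less often.

```python
# Sketch: a replay buffer that biases sampling towards recent transitions.
import numpy as np


class RecencyWeightedBuffer:
    def __init__(self, capacity, decay=0.999):
        self.capacity, self.decay = capacity, decay
        self.transitions, self.insert_step = [], []
        self.step = 0

    def add(self, transition):
        """Store a transition; drop the oldest one when the buffer is full."""
        self.step += 1
        if len(self.transitions) >= self.capacity:
            self.transitions.pop(0)
            self.insert_step.pop(0)
        self.transitions.append(transition)
        self.insert_step.append(self.step)

    def sample(self, batch_size, rng=None):
        """Sample a batch; older transitions get exponentially smaller probability."""
        if rng is None:
            rng = np.random.default_rng()
        age = self.step - np.asarray(self.insert_step)
        probs = self.decay ** age
        probs /= probs.sum()
        idx = rng.choice(len(self.transitions), size=batch_size, p=probs)
        return [self.transitions[i] for i in idx]
```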
Requirements:
Nice-to-have:
Literature:
Supervisor: Frank Röder
Topic Description:
Intelligent agents require predictive capabilities to forecast future states of the world and plan ahead before acting. Current methods in reinforcement learning (RL) utilize learned models of the world to solve a multitude of challenging problems [1-8]. However, it remains unclear how to model the world accurately without capturing unnecessary details, while retaining enough information for effective learning through imagination and advance planning.
This thesis offers students the opportunity to work on projects related to model-based RL methods that approximate a model of the world to improve overall agent learning. The focus can be on applying a specific model-based method to a new problem domain or reimplementing a prior idea that could be enhanced by refining the mathematical formulation of the objective.
We provide a selection of reference materials related to decision-time planning [1, 3] and background planning [2, 4-7]. Further ideas include exploring the mathematics involved in latent dynamic models [2, 3, 5, 7] or comparing the architectures discussed in the provided literature.
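For orientation, the following PyTorch sketch shows the simplest ingredient shared by many of these methods: a one-step forward model trained by regression on logged transitions. The dimensions and the toy transition data are placeholders; latent world models in the cited literature add encoders, stochastic latents, and multi-step objectives on top of this basic idea.

```python
# Sketch: training a one-step forward model f(s_t, a_t) -> s_{t+1} by regression.
import torch
import torch.nn as nn

state_dim, action_dim = 4, 2
forward_model = nn.Sequential(
    nn.Linear(state_dim + action_dim, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, state_dim),
)
optimizer = torch.optim.Adam(forward_model.parameters(), lr=1e-3)

# Placeholder transitions; in practice these come from the agent's replay buffer.
states = torch.randn(1024, state_dim)
actions = torch.randn(1024, action_dim)
next_states = states + 0.1 * actions.sum(dim=1, keepdim=True)  # toy dynamics

for _ in range(200):
    pred = forward_model(torch.cat([states, actions], dim=1))
    loss = nn.functional.mse_loss(pred, next_states)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```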
Requirements:
Nice-to-have:
Literature:
Student: Leon Sierau
Supervisor: Dr. Manfred Eppe
Thesis type: B.Sc. thesis
Date: November 2024
Abstract:
This thesis investigates the ability of forward models, implemented as multi-layer perceptrons (MLPs), to learn deterministic transition dynamics in various simulated reinforcement learning (RL) environments. Through supervised learning, different MLP architectures (deterministic MLPs, probabilistic MLPs, and ensembles of deterministic MLPs) are evaluated in environments with dynamics of increasing complexity. The results indicate that, while simple MLPs can accurately represent the transition dynamics in low- to medium-complexity environments, their performance decreases significantly in more challenging environments, where even extensive training does not result in an accurate representation of the transition dynamics.
A key finding is that MLPs seem capable of learning basic state transitions but struggle to represent contextual features such as obstacles, which are critical for complex tasks. The experiments further demonstrate that there is no one-size-fits-all solution for training forward models; the choice of architecture and training method depends heavily on the specific environment and task.
Recursively applying one-step models leads to significant compounding errors, even in simpler environments, rendering this approach less effective for long-horizon planning. However, ensembles of models show potential in stabilizing compounding errors and could play an essential role in controlling them.
Our results provide some support for an association between the predictive disagreement of a model ensemble and the ensemble's prediction error. Yet, this evidence remains inconclusive and on its own does not justify using increasing ensemble disagreement as an indication of worsening predictions.
Student: Moustafa Alsayd Ahmad
Supervisor: Prof. Dr. Nihat Ay
Thesis type: B.Sc. thesis
Date: November 2024
Abstract:
This thesis investigates the modeling capacity of Restricted Boltzmann Machines (RBMs) under different configurations, in particular with respect to the number of hidden neurons and label neurons as well as the influence of the Hamming distance. The aim of the thesis is to examine to what extent an RBM is able to learn the underlying data distribution and to generate high-quality data, even when the number of hidden neurons is strongly reduced.
For the experiments, a self-created 4x4-pixel dataset as well as the MNIST dataset were used. Models with varying numbers of hidden neurons were trained in order to assess the influence of reducing the model capacity. The results show that a larger number of hidden neurons improves the quality of the generated data, but that models with a reduced number of hidden neurons can also deliver stable results of acceptable quality.
In addition, the influence of the number of label neurons and of the Hamming distance on the generated data was examined. Models with more label neurons and a larger Hamming distance achieved higher-quality results, as they allowed a better differentiation between the digits.
The thesis shows that RBMs are a promising method for modeling and generating data, where the correct choice of parameters such as the number of hidden and label neurons as well as the Hamming distance is of decisive importance for generating high-quality data.
Student: Fin Michael Armbrecht
Supervisor: Frank Röder
Thesis type: B.Sc. thesis
Date: February 2024
Abstract:
The field of machine learning has received a lot of attention recently, with revolutionary breakthroughs achieved, for example, in autonomous driving. But it is not only autonomous vehicles that work with machine learning and can be improved by it; many industrial processes and production systems can benefit from it as well. Researchers are therefore constantly looking for new ways to improve and accelerate the learning processes of artificial intelligence and to make them more skillful and efficient by imitating the way humans learn. The ability to remember past events is one of the keys to this learning process. Reinforcement learning exploits this by interacting with the environment to achieve valuable learning results and, through the addition of Hindsight Experience Replay, by remembering past events. The approach of this thesis is to evaluate the results achieved so far and to improve the learning efficiency of artificial intelligence with the help of a newly developed prioritization process, which selects important events more often for replay. Our results show that the newly developed prioritization process has a significant impact on the learning efficiency of artificial intelligence and outperforms other existing state-of-the-art approaches.
Student: Luis Scheuch
Supervisor: Prof. Dr. Nihat Ay
Thesis type: B.Sc. thesis
Date: October 2023
Abstract:
This thesis examines the information content as well as the context size of English and Aviation English using common measures from information theory, such as entropy, mutual information, and conditional entropy. It then interprets the impact on the prediction network (PN) of the current state-of-the-art speech-to-text (STT) deep learning architecture RNN-Transducer (RNN-T). The corpora used are the Corpus of Contemporary American English (COCA), the Cornell Movie-Dialogs (CMD) corpus, and an internal air traffic control (ATC) corpus, which are presented and critically analyzed for their relevance and statistical significance. The results are used to estimate the context size of Aviation English and whether Aviation English is easier to predict than common English. It is analyzed to what extent English and Aviation English are comparable and what this might imply for the transferability of research from other domains to ATC. Following that, I suggest a modification to the RNN-T PN architecture proposed by Albesano, Andrés-Ferrer, Ferri, et al., which is optimized for natural language, in order to improve its performance in the ATC domain. Additionally, I investigate the structure of ATC speech in comparison with the structure of English by modifying the given ATC corpus and analyzing the effect of introduced placeholders.
The main results are that Aviation English is generally easier to predict than common English, that the internal ATC corpus has a most meaningful context size of 9 words, and that results from other domains cannot be directly transferred to ATC.
Student: Shreya Purkayastha
Supervisor: Prof. Dr. Nihat Ay
Thesis type: M.Sc. thesis
Date: July 2023
Abstract:
With the increasing prevalence of short texts such as tweets and search queries on the internet, there is a growing interest in analyzing these texts to extract valuable insights. However, analyzing short texts comes with its own set of challenges, including sparseness, non-standardization, and noise. This research study focuses on exploring topic modelling techniques for short texts using two algorithms: Latent Dirichlet Allocation (LDA) and Neural Topic Model (NTM). The study assesses the performance of these algorithms using four different topic coherence measures: C_V, C_UCI, C_UMass, and C_NPMI.
In order to compare the effectiveness of the algorithms, it is crucial to select an appropriate topic coherence measure. The study reveals that the coherence metric C_V is not reliable for evaluating topic modelling on short texts, while C_UCI, C_UMass, and C_NPMI are considered reliable measures. Therefore, it is recommended to use any of these three reliable coherence metrics, or a combination of them, when evaluating topic modelling on short texts. Based on the findings, the study concludes that LDA is a more suitable algorithm for topic modelling on short texts than NTM.
Student: Stella Wit
Supervisor: Prof. Dr. Nihat Ay
Thesis type: B.Sc. thesis
Date: May 2023
Student: Nico Bartocha
Supervisor: Prof. Dr. Nihat Ay
Thesis type: B.Sc. thesis
Date: March 2023
Abstract:
This thesis examines the possibilities of applying information-theoretic concepts to neural networks in order to make data-processing procedures more efficient. Discrete neural networks have the property that, via different construction methods, they can solve every classification problem on arbitrary data sets. At the same time, the typical set is a concept from information theory that makes it possible to reduce the size of possible data sets while retaining a very high probability of occurrence. It is obtained with the help of the entropy of stationary ergodic processes. These insights are used to construct one- and two-layer neural networks that can represent every classification of the typical set. Using the properties of the typical set, it is then shown that this procedure can approximate classifications of the entire data set. The different construction methods are compared by contrasting their ability to classify general data sets as well as their numbers of hidden neurons. Finally, it is shown that, thanks to the reduced number of hidden neurons, theoretical bounds on the size of the neural network required for universal approximation can be undercut considerably for processes with low entropy.
Student: Luis Pohl
Supervisor: Dr. Manfred Eppe
Thesis type: B.Sc. thesis
Date:
Abstract:
The aim of this thesis is to find an appropriate method and a corresponding norm to capture the distance (dissimilarity) between one recently solved evaluation episode and n previously solved evaluation episodes in the field of goal-conditioned reinforcement learning (GCRL). Using this distance metric, the generalizability of the agent is inferred. The insights gained through the distance metric can be used to make predictions about the agent's ability to solve similar future problems. Furthermore, by dividing the distance metric by the number of steps the agent took to solve the problem, the transfer-learning velocity is computed, quantifying the difficulty of the generalization. Based on the performance metric and background data, it is possible to determine whether the current configuration underperforms or overperforms in transfer learning, making it possible to track whether adjustments improve or worsen the generalizability. To address this, three methods to capture the distance were devised: Mean, Nearest, and Eppe. They rest on two distinct perspectives: whether it is more important that the agent has solved a problem that is closest to the new task, or whether the distance should be gauged relative to the mean of all n previously solved points (the latter covering two of the methods). Additionally, three different norms were employed (Euclidean, Manhattan, cosine similarity) to quantify the distance between the problems. After investigation, the conclusion is that the most appropriate combination of method and norm is the Nearest method with the Euclidean norm. This indicates that leveraging n previously solved similar problems and measuring their distance using the Euclidean norm can enable faster problem-solving.