PhD - Gent | Just now
General research domain:
Artificial Intelligence
Description:
Semiconductor manufacturing processes could benefit from the integration of Artificial Intelligence (AI) solutions to perform process monitoring and control, such as yield forecasting and detecting process inefficiencies or defects towards performing process optimizations. These AI solutions aim to learn typical process behaviours under various conditions based on historical data. However, such classical AI systems are unable to take advantage of the rich, tacit domain expertise of experienced semiconductor process engineers pertaining to these behaviours. Neurosymbolic AI aims to resolve this by enabling the representation of this domain expertise as Knowledge Graphs and fusing it into the machine learning pipelines. As such, the AI can learn from both the historical data as well as the domain knowledge to perform optimal process monitoring and steering [1]. However, current hybrid AI solutions have found limited adoption due to (a) the tedious process of transforming the tacit process knowledge into Knowledge Graphs, (b) the limited availability of neurosymbolic methods that perform accurate fusion of sequential knowledge (e.g. a process consisting of different steps) into machine learning methods, and (c) the lack of explainability of the neurosymbolic predictions, leading to lack of trust in the outcomes. To overcome these challenges, this PhD aims to explore the design of user-friendly, explainable and process-aware neurosymbolic methods to accelerate and enhance the integration of AI in semiconductor manufacturing processes. The research will establish a foundational framework for applying neurosymbolic AI in this domain, emphasizing three core pillars: knowledge-driven modelling, explainability, and result reliability:
1.
Ontology-Based Knowledge
Representation
The first pillar focuses on lowering the barrier towards capturing tacit
domain knowledge on process flows, parameters and their interdependencies in a
Knowledge Graph. The goal is to bypass the need for extensive interactions and
synchronization between domain experts and knowledge engineers by employing
Large Language Models (LLMs). Through automated extraction it needs to be
investigated how LLMs can form a baseline ontological representation of the
available process knowledge (manuals, flow charts, reports, etc.), which can
then be subsequently enhanced through limited feedback loops between the LLM
and domain expert. The end result is an ontology that captures the hierarchical
and relational structure of semiconductor manufacturing processes, enabling a
shared and machine-interpretable knowledge base.
2. Design of explainable techniques for embedding process knowledge into machine learning
The second pillar focuses on developing methods to integrate knowledge graphs into machine learning methods, in such a manner that the temporal and sequential nature of the process knowledge and properties are maintained. This includes designing mechanisms to leverage structured domain knowledge during training and inference, as well as discovering new relationships from data to enrich the knowledge graph. To ensure the trustworthiness of the resulting neurosymbolic method, this embedding process should be transparent, allowing to attribute back the predictions towards the features in the Knowledge Graph that mostly steered it. Most existing embedding methods are uninterpretable, as they assume latent feature spaces, therefore the focus within this research will be on enhancing more transparent embedding techniques, such as binary features and walk-based techniques.
3.
Application Layer: Trustworthy
Predictive and Optimization Tasks
The third pillar applies the designed algorithms and framework to
real-world semiconductor manufacturing scenarios. This stage will demonstrate
how the combined Knowledge Graph and Neurosymbolic approach can address
predictive tasks, such as yield forecasting, defect detection, and process
optimization, ensuring both accuracy and explainability. Additionally, this
stage will explore how LLMs can act as an interface between the neurosymbolic
predictions, the Knowledge Graph and the end users to accurately contextualize
and translate the outputs to the process engineers.
[1] Yuwei Wan, et al. “Making knowledge
graphs work for smart manufacturing: Research topics, applications and
prospects”, Journal of Manufacturing Systems, Volume 76, 2024, Pages 103-132, https://doi.org/10.1016/j.jmsy.2024.07.009.
Expected Contributions:
The research will be conducted in the PreDICT (https://predict.idlab.ugent.be/) research team, which is part of the IDLab (https://idlab.ugent.be/) research group of Ghent University, and embedded into AI and Algorithms horizontal program of imec. The PhD will be supervised by Prof. Sofie van Hoecke, Prof. Femke Ongenae, Francisco Souza & Stefan Lefever. To ensure close alignment with the semiconductor manufacturing processes and research roadmap, close collaboration with other imec stakeholders will be maintained during the PhD, e.g. Dries Dictus, Bappaditya Dey, and Joris Vanderschrick.
The PhD student will be located both in Ghent (IDLab research group, Technologiepark Zwijnaarde) and Leuven (imec Leuven).
Required background: AI / ML computer science background
Type of work: 10% literature study, 60% AI modelling, 30% experimentation/evaluation
Supervisor: Femke Ongenae
Co-supervisor: Francisco Souza
Daily advisor: Stefan Lefever, Sofie Van Hoecke
The reference code for this position is 2026-058. Mention this reference code on your application form.