Xiaoge Zhang – Xiaoge Zhang's personal website

About Xiaoge Zhang

Xiaoge Zhang received his B.S. degree from Chongqing University of Posts and Telecommunications in 2011, and the M.S. degree from Southwest University in 2014, both in Chongqing, China; and Ph.D. from Vanderbilt University in May 2019, Nashville, TN. Currently, he works as a postdoctoral research scholar at Vanderbilt University, Nashville, TN, USA. From August to December in 2016, he interned at the National Aeronautics and Space Administration (NASA) Ames Research Center (ARC), Moffett Field, CA, working at the Prognostics Center of Excellence (PCoE) led by Dr. Kai Goebel. He was a recipient of the Chinese Government Award for Outstanding Self-financed Students Abroad in 2017. He has published more than 30 research papers in leading academic journals, such as IEEE Transactions on Cybernetics, Risk Analysis, IEEE Transactions on Reliability, Decision Support Systems, Information Sciences, Reliability Engineering and System Safety, Annals of Operations Research, and International Journal of Production Research, among others. His current research interests include machine learning, uncertainty quantification, reliability assessment, network optimization, and data analytics. He is a student member of IEEE, INFORMS, and SIAM.

Entries by

Research paper accepted by Transportation Research Part E

June 4, 2025 in Publication /by Xiaoge Zhang

Understanding causal relationships between traffic states throughout the system is of great significance for enhancing traffic management and optimization in urban traffic networks. Unfortunately, few studies in the literature have systematically analyzed causal structure characterizing the evolution of traffic states over time and gauged the importance of traffic nodes from a causal perspective, particularly in the context of large-scale traffic networks. Moreover, the dynamic nature of traffic patterns necessitates a robust method to reliably discover causal relationships, which are often overlooked in existing studies. To address these issues, we propose a Spatio-Temporal Causal Structure Learning and Analysis (STCSLA) framework for analyzing large-scale urban traffic networks at a mesoscopic level from a causal lens. The proposed framework comprises three main components: decomposition of spatio-temporal traffic data into localized traffic subprocesses; a Bayesian Information Criterion-guided spatio-temporal causal structure learning combined with temporal-dependencies preserving sampling for deriving reliable causal graph to uncover time-lagged and contemporaneous causal effects; establishing several causality-oriented indicators to identify causally critical nodes, mediator nodes, and bottleneck nodes in traffic networks. Experimental results on both a synthetic dataset and the real-world Hong Kong traffic dataset demonstrate that the proposed STCSLA framework accurately uncovers time-varying causal relationships and identifies key nodes that play various causal roles in influencing traffic dynamics. These findings underscore the potential of the proposed framework to improve traffic management and provide a comprehensive causality-driven approach for analyzing urban traffic networks.

Prof. Olga Fink gave a talk on “Integrating Domain Knowledge and Physics in AI: Harnessing Inductive Bias for Advanced PHM Solutions”

May 6, 2025 in DRSS /by Xiaoge Zhang

In the field of prognostics and health management, the integration of machine learning has enabled the development of advanced predictive models that ensure the reliable and safe operation of complex assets. However, challenges such as sparse, noisy, and incomplete data necessitate the integration of prior knowledge and inductive bias to improve model generalization, interpretability, and robustness.

Inductive bias, defined as the set of assumptions embedded in machine learning models, plays a crucial role in guiding these models to generalize effectively from limited training data to real-world scenarios. In PHM applications, where physical laws and domain-specific knowledge are fundamental, the use of inductive bias can significantly enhance a model’s ability to predict system behavior under diverse operating conditions. By embedding physical principles into learning algorithms, inductive bias reduces the reliance on large datasets, ensures that model predictions are physically consistent, and enhances both the generalizability and interpretability of the models.

This talk will explore various forms of inductive bias tailored for PHM systems, with a particular focus on heterogenous-temporal graph neural networks, as well as physics-informed and algorithm-informed graph neural networks. These approaches will be applied to virtual sensing, modelling multi-body dynamical systems and anomaly detection.

Review paper on AI system reliability is accepted by Journal of Reliability Science and Engineering

April 18, 2025 in Publication /by Xiaoge Zhang

As the potential applications of AI continue to expand, a central question remains unresolved: will users trust and adopt AI-powered technologies? Since AI’s promise closely hinges on the perceptions of its trustworthiness, how to guarantee the reliability and trustworthiness of AI plays a fundamental role in fostering its broad adoptions in practice. However, the theories, mathematical models, and methods in reliability engineering and risk management have not kept pace with the rapid technological progress in AI. As a result, the lack of essential components (e.g., reliability, trustworthiness) in the resultant models has emerged as a major roadblock to regulatory approval and widespread adoptions of AI-powered solutions in high-stakes decision environments, such as healthcare, aviation, finance, nuclear power plant, to name a few. To fully harness AI’s power for automating decision making in these safety-critical applications, it is essential to manage expectations for what AI can realistically deliver to build appropriate levels of trust. In this paper, we focus on functional reliability of AI systems developed through supervised learning and discuss the unique characteristics of AI systems that necessitate the development of specialized reliability engineering and risk management theories and methods to create functionally reliable AI systems. Next, we thoroughly review five prevalent engineering mechanisms in the existing literature for approaching functionally reliable and trustworthy AI, including uncertainty quantification (UQ) composed of model-based UQ and model-agnostic conformal prediction, failure prediction, learning with abstention, formal verification, and knowledge-enabled AI. Furthermore, we outline several research challenges and opportunities related to the development of reliability engineering and trustworthiness assurance methods for AI systems. Our research aims to deepen the understanding of reliability and trustworthiness issues associated with AI systems, and spark researchers in the field of risk and reliability engineering and beyond to contribute to this area of study with emerging importance.

Research paper accepted by European Journal of Operational Research

April 2, 2025 in Publication /by Xiaoge Zhang

It is common for multiple firms\textemdash such as manufacturers, retailers, and third-party insurers\textemdash to coexist and compete in the aftermarket for durable products. In this paper, we study price competition in a partially concentrated aftermarket where one firm offers multiple extended warranty (EW) contracts while the others offer a single one. The demand for EWs is described by the multinomial logit model. We show that, at equilibrium, such an aftermarket behaves like a combination of monopoly and oligopoly. Building upon this base model, we further investigate sequential pricing games for a durable product and its EWs to accommodate the ancillary nature of after-sales services. We consider two scenarios: one where the manufacturer (as the market leader) sets product and EW prices \emph{simultaneously}, and another where these decisions are made \emph{sequentially}. Our analysis demonstrates that offering EWs incentivizes the manufacturer to lower the product price, thereby expanding the market potential for EWs. Simultaneous product-EW pricing leads to a price concession on EWs compared to sequential pricing, effectively reducing the intensity of competition in the aftermarket. Overall, the competitiveness of an EW hinges on its ability to deliver high value to consumers at low marginal cost to its provider. While our focus is on EWs, the proposed game-theoretical pricing models apply broadly to other ancillary after-sales services.

Dr. Xiaoge Zhang delivered a talk on “Bayesian Deep Learning for Aircraft Hard Landing Safety Assessment” at East China Normal University, China

March 14, 2025 in Invited Talk /by Xiaoge Zhang

Landing is generally cited as one of the riskiest phases of a flight, as indicated by the much higher accident rate than other flight phases. In this talk, we focus on the hard landing problem (which is defined as the touchdown vertical speed exceeding a predefined threshold) and build a probabilistic deep learning model to forecast the aircraft’s vertical speed at touchdown using DASHlink data. Previous studies have treated hard landing as a classification problem, in which the vertical speed is represented as a categorical variable based on a predefined threshold. In this talk, we develop a machine learning model to predict the touchdown vertical speed during aircraft landing. Probabilistic forecasting is used to quantify the uncertainty in model prediction to support risk-informed decision-making. A Bayesian neural network approach is leveraged to build the predictive model. The overall methodology consists of five steps. First, a clustering method based on the minimum separation between different airports is developed to identify flights in the dataset that landed at the same airport. Secondly, identifying the touchdown point itself is not straightforward; in this paper, it is determined by comparing the vertical speed distributions derived from different candidate touchdown indicators. Thirdly, a forward and backward filtering (filtfilt) approach is used to smooth the data without introducing the phase lag. Next, a minimal-redundancy-maximal-relevance (mRMR) analysis is used to reduce the dimensionality of input variables. Finally, a Bayesian recurrent neural network is trained to predict the touchdown vertical speed and quantify the uncertainty in the prediction. The model is validated using several flights in the test dataset, and computational results demonstrate the satisfactory performance of the proposed approach.

Welcome Shuaiqi Yuan to join as a postdoctoral research scholar!

February 20, 2025 in Personnel /by Xiaoge Zhang

We are pleased to welcome Dr. Shuaiqi Yuan, who recently joined our group as a postdoctoral research scholar. Dr. Yuan holds a PhD in Safety and Security Science from Delft University of Technology in the Netherlands.

Research paper accepted by IEEE Transactions on Automation Science and Engineering

December 13, 2024 in Publication /by Xiaoge Zhang

The demand for disruption-free fault diagnosis of mechanical equipment under a constantly changing operation environment poses a great challenge to the deployment of data-driven diagnosis models in practice. Extant continual learning-based diagnosis models suffer from consuming a large number of labeled samples to be trained for adapting to new diagnostic tasks and failing to account for the diagnosis of heterogeneous fault types across different machines. In this paper, we use a representative mechanical equipment – rotating machinery — as an example and develop an uncertainty-aware continual learning framework (UACLF) to provide a unified interface for fault diagnosis of rotating machinery under various dynamic scenarios: class continual scenario, domain continual scenario, and both. The proposed UACLF takes a three-step to tackle fault diagnosis of rotating machinery with homogeneous-heterogeneous faults under dynamic environments. In the first step, an inter-class classification loss function and an intra-class discrimination loss function are devised to extract informative feature representations from the raw vibration signal for fault classification. Next, an uncertainty-aware pseudo labeling mechanism is developed to select unlabeled fault samples that we are able to assign pseudo labels confidently, thus expanding the training samples for faults arising in the new environment. Thirdly, an adaptive prototypical feedback mechanism is used to enhance the decision boundary of fault classification and diminish the model misclassification rate. Experimental results on three datasets suggest that the proposed UACLF outperforms several alternatives in the literature on fault diagnosis of rotating machinery across various working conditions and different machines.

Prof. Tong Wang gave a talk on “Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach”

October 30, 2024 in DRSS /by Xiaoge Zhang

Large language models (LLMs) like GPT-4 or LlaMa 3 provide superior performance in complex human-like interactions. But they are costly, or too large for edge devices such as smartphones and harder to self-host, leading to security and privacy concerns. This paper introduces a novel interpretable knowledge distillation approach to enhance the performance of smaller, more economical LLMs that firms can self-host. We study this problem in the context of building a customer service agent aimed at achieving high customer satisfaction through goal-oriented dialogues. Unlike traditional knowledge distillation, where the “student” model learns directly from the “teacher” model’s responses via fine-tuning, our interpretable “strategy” teaching approach involves the teacher providing strategies to improve the student’s performance in various scenarios. This method alternates between a “scenario generation” step and a “strategies for improvement” step, creating a customized library of scenarios and optimized strategies for automated prompting. The method requires only black-box access to both student and teacher models; hence it can be used without manipulating model parameters. In our customer service application, the method improves performance, and the learned strategies are transferable to other LLMs and scenarios beyond the training set. The method’s interpretabilty helps safeguard against potential harms through human audit.

Dr. Xiaoge Zhang delivered a talk on “Reliability Engineering in the Era of AI: An Uncertainty Quantification-Based Framework” at National University of Singapore, Singapore

October 14, 2024 in Invited Talk /by Xiaoge Zhang

Establishing trustworthiness is fundamental for the responsible utilization of medical artificial intelligence (AI), particularly in cancer diagnostics, where misdiagnosis can lead to devastating consequences. However, there is currently a lack of systematic approaches to resolve the reliability challenges stemming from the model limitations and the unpredictable variability in the application domain. In this work, we address trustworthiness from two complementary aspects—data trustworthiness and model trustworthiness—in the task of subtyping non-small cell lung cancers using whole side images. We introduce TRUECAM, a framework that provides trustworthiness-focused, uncertainty-aware, end-to-end cancer diagnosis with model-agnostic capabilities by leveraging spectral-normalized neural Gaussian Process (SNGP) and conformal prediction (CP) to simultaneously ensure data and model trustworthiness. Specifically, SNGP enables the identification of inputs beyond the scope of trained models, while CP offers a statistical validity guarantee for models to contain correct classification. Systematic experiments performed on both internal and external cancer cohorts, utilizing a widely adopted specialized model and two foundation models, indicate that TRUECAM achieves significant improvements in classification accuracy, robustness, fairness, and data efficiency (i.e., selectively identifying and utilizing only informative tiles for classification). These highlight TRUECAM as a general wrapper framework around medical AI of different sizes, architectures, purposes, and complexities to enable their responsible use.

Prof. Lei Ma gave a talk on “Towards Building the Trust of Complex AI Systems in the LLM Era”

October 4, 2024 in DRSS /by Xiaoge Zhang

In recent years, deep learning-enabled systems have made remarkable progress, powering a surge in advanced intelligent applications. This growth and its real-world impact have been further amplified by the advent of large foundation models (e.g., LLM, Stable Diffusion). Yet, the rapid evolution of these AI systems often proceeds without comprehensive quality assurance and engineering support. This gap is evident in the integration of standards for quality, reliability, and safety assurance, as well as the need for mature toolchain support that provides systematic and explainable feedback of the development lifecycle. In this talk, I will present a high-level overview of our team’s ongoing initiatives to lay the groundwork for Trustworthy Assurance of AI Systems and its industrial applications, e.g., including (1) AI software testing and analysis, (2) our latest trustworthiness assurance efforts for AI-driven Cyber-physical systems with an emphasis on sim2real transition. (3) risk and safety assessment for large foundational models, including those akin to large language models, and vision transformers.