Event Details
A Reinforcement Learning Framework for Simulating Textual Data Drift in Cybersecurity Text.
Presenter: Hadeer Ahmed
Supervisor:
Date: Fri, September 5, 2025
Time: 14:00:00 - 00:00:00
Place: Zoom - see below.
ABSTRACT
Abstract : Data drift refers to a change in the distribution of input data over time, posing a critical threat to the reliability of machine learning models in cybersecurity, where adversaries continually adapt their tactics. Existing drift simulation approaches primarily focus on numerical features and lack support for text-based data and controlled drift dynamics. We introduce DriftRL, a reinforcement learning framework designed to generate labelled text datasets exhibiting four drift patterns: sudden, gradual, incremental-step, and recurring. We formulate drift simulation as a sequential decision-making process in which an agent learns to apply targeted text transformations, such as synonym/antonym substitutions, misspellings, and prompt-based rewriting, following a predefined pattern. The framework generates datasets with specific drift patterns, enabling reproducible experimentation and fair model evaluation. We evaluated the generated datasets using drift-based metrics and employed external drift detection tools to verify the presence and structure of the induced drift, confirming both fidelity and alignment with the intended patterns.
Join Zoom Meeting
https://uvic.zoom.us/j/7609989225?pwd=YVRPT2xZTVBYaUtIVVM3K21Hb0FmUT09&omn=83927402956
Meeting ID: 760 998 9225
Password: 063869
Note: Please log in to Zoom via SSO and your UVic Netlink ID.