AI in Cybersecurity Education

Faculty Development Summer Institute 2026

Security Classifier Lab Design Studio

This afternoon activity uses a standalone IDS classifier lab as the starting point for a lab-design studio. Participants first review a student-facing lab package, then use it as a reference point for designing related AI/ML security labs for their own classrooms.

The activity has three parts:

Goals

By the end of the activity, participants should be able to:

Student Lab Package

The downloadable lab is written as a student-facing assignment. It uses an IDS-style classifier over network log records and is designed for Google Colab. Students create derived features, choose at least three classifier types to compare, evaluate a train/validation split and five-fold cross-validation, designate one best model in best_model.ipynb, and explain how the model output would fit into an IDS workflow. In the student materials, “validation” refers to the local train/validation split, while “test” is reserved for the private competition test set used for bonus scoring.

To use the starter package:

  1. Download and unzip the starter package to get the topic-06-ids-lab folder.
  2. Upload the whole topic-06-ids-lab folder to your Google Drive, ideally at the top of My Drive (so it lives at My Drive/topic-06-ids-lab).
  3. Open ids_colab_explorer.ipynb in Google Colab and run the first cell. It mounts your Drive and sets DATA_DIR = '/content/drive/MyDrive/topic-06-ids-lab/data'; edit DATA_DIR if you uploaded the folder elsewhere.
  4. Run the notebook cells, complete the code TODOs, and replace notebook Answer here: placeholders.
  5. Transfer the selected final feature pipeline, classifier, threshold, and model name into best_model.ipynb.
  6. Replace the Answer here: placeholders in evaluation_notes.md, incident_review.md, and reflection.md.
  7. Submit your files through the course submission Google Form. ← SUBMISSION LINK

The starter package includes a small development set and a 5,000-row synthetic training set. The private competition test set used for bonus scoring is the real Kaggle Cybersecurity Threat Detection Dataset, shares the same 13 columns as the training file, and is not included in the starter package.

The student submission package includes:

AI Agent Use In The Student Lab

Students may use AI agents for debugging, code explanation, and brainstorming model or feature choices. The starter package includes AGENTS.md, which describes the intended boundaries for agent help. The goal is not to make those boundaries impossible to bypass; the goal is for students to practice using assistance while preserving their own reasoning, security interpretation, and final submitted code.

From Lab Review To Lab Design

After reviewing the student lab, we will discuss how the lab is structured:

The synthetic training data uses feature-separable attack categories with moderate overlap. The competition test set is a real public dataset that is more imbalanced and noisier. This gap keeps the task approachable with standard classifiers while preserving useful discussion about false positives, false negatives, thresholds, distribution shift, and the limits of synthetic data.

Guided Lab Design Sprint

After the exemplar, small groups will design a lab that could fit one of their own courses. Groups may use an AI agent to draft structure, task wording, rubric ideas, or starter-code scaffolding. The learning goals, security framing, and assessment choices should remain instructor-led.

Choose one prompt or adapt one to your course:

Lab Design Criteria

Each group should sketch a lab that includes:

Dataset Feasibility

Use this guide when scoping a lab:

Every group should answer:

Outbrief

Each group will share:

The outbrief will use a light competition framing. The best design is not the most technically elaborate design; it is the one that is easiest to teach well.

Suggested criteria:

Student-Facing Rubric

Use this rubric as a design target while building the lab.

Category Points
Learning objective and security task 15
Dataset strategy 15
Student task and deliverable 20
Evaluation and metrics 15
Tradeoff and limitation 15
AI-agent use boundary 10
Communication and classroom fit 10