AI in Cybersecurity Education

Faculty Development Summer Institute 2026

AI Application Security Lab Design Studio

This afternoon activity uses a packaged red-team lab against a deliberately weak AI application as the starting point for a lab-design studio. Participants first run a participant-facing lab end to end, then use that experience as a reference point for rapidly prototyping similar mini-labs they could deploy in their own classrooms.

The activity has three parts:

Goals

By the end of the activity, participants should be able to:

Distributed Lab Package

The downloadable lab is a deliberately weak retrieval-based study and quiz assistant, written as a participant-facing assignment. Participants run the application in Google Colab, attempt three classes of attack (direct leakage, guardrail bypass, retrieval abuse), and document their findings.

To use the starter package:

  1. Download and unzip the starter package to get the topic-07-ai-app-security-lab folder.
  2. Upload the whole topic-07-ai-app-security-lab folder to your Google Drive, ideally at the top of My Drive (so it lives at My Drive/topic-07-ai-app-security-lab).
  3. Open lab.ipynb from that Drive folder in Google Colab (File → Open notebook → Google Drive), then choose Runtime → Change runtime type → T4 GPU.
  4. Run the cells top to bottom. Section 2 mounts your Drive and copies the starter into /content/lab; edit STARTER_DIR in that cell if you uploaded the folder elsewhere. Later sections install an in-Colab Ollama server and pull llama3.1:8b (a one-time ~3–5 minute download).
  5. Use the smoke-test section to confirm the four CLI paths return output, then work through the three attack families in the notebook’s attack workspace.
  6. Document attempts in attack_log.md and write a root-cause analysis in analysis.md (both live in /content/lab/ in the Colab file browser).
  7. Run the notebook’s final cell to bundle and download your deliverables; your Drive is already mounted if you prefer to copy them there instead.

The starter package includes a working application, a small mixed-trust corpus (with obviously fake LAB7-CANARY-* markers), prompt templates, deliverable scaffolds, and a detailed COLAB_SETUP.md covering the Colab and Google Drive setup steps and troubleshooting.

The participant submission package includes:

AI Agent Use In The Lab

Participants may use AI agents for brainstorming attack ideas, critiquing whether an attack log is specific enough, comparing leakage paths, or reviewing the clarity of a defense explanation. The starter includes AGENTS.md, which describes the intended boundaries for agent help. The goal is not to make those boundaries impossible to bypass; the goal is for participants to practice using assistance while preserving their own reasoning, security interpretation, and final submitted code.

LLM Setup (Runs in Google Colab)

This lab runs entirely in Google Colab so participants do not need an institutional API key or a local install. The notebook installs and starts a self-hosted Ollama server inside the Colab runtime (an OpenAI-compatible API at http://localhost:11434/v1) and pulls the model for you:

A T4 GPU runtime (Runtime → Change runtime type → T4 GPU) is recommended. The guardrails under test live in the prompt templates, not the model, so swapping models changes how well the same guardrail holds — which is the point of the optional comparison section. Full setup and troubleshooting steps are in the starter’s COLAB_SETUP.md.

From Lab Review To Lab Design

After running the red-team lab, the conversation shifts from completing the assignment to analyzing its design:

Guided Rapid-Prototyping Sprint

Small groups choose one of five seed prompts (or adapt one) and rapidly sketch a mini-lab they could deploy in their own course. Groups may use an AI agent to draft structure, prompt templates, rubric ideas, or starter-code scaffolding. The learning objective, security framing, and assessment criteria stay instructor-led.

Choose one prompt or adapt one to your course:

The five seeds are deliberately scoped to cover the morning’s attack-family taxonomy (direct injection, indirect injection, leakage, instruction hijack, output trust).

Mini-Lab Sketch Template

Each group should sketch a mini-lab that includes:

Outbrief

Each group will share, in 60 - 90 seconds:

The outbrief uses light competition framing. The best design is not the most technically elaborate; it is the one that is easiest to teach well.

Suggested criteria:

Lab Rubric (For The Distributed Lab)

Category Points
Direct leakage evaluation 20
Guardrail-bypass evaluation 20
Retrieval-abuse evaluation 20
Root-cause analysis and defense reasoning 25
Documentation, honesty of evidence, and agent-use disclosure 15

Mini-Lab Design Rubric (For The Sprint)

Category Points
Learning objective and security task 15
Dataset strategy 15
Student task and deliverable 20
Evaluation and metrics 15
Tradeoff and limitation 15
AI-agent use boundary 10
Communication and classroom fit 10

AI Assistance Acknowledgment

This lab and the surrounding institute activity were developed and tested with substantial assistance from AI coding and writing tools. Specifically, AI assistance contributed to drafting the lab write-up, generating and refining the participant-facing scaffolding, building the synthetic mixed-trust corpus and seeded canary content, drafting the rapid-prototyping seed prompts, producing the reference solution and instructor facilitation notes, and exercising the application’s CLI paths during testing.

The institute team reviewed and revised the materials, made the final design decisions, and is responsible for the activity’s content, security framing, and pedagogical choices. AI was used as a bounded prototyping aid, not as a replacement for that judgment.

The shipped application and overall lab structure are adapted from the GW course AI Application Security (Lab 4), repackaged for institute use with a local-LLM-first backend.

If you notice unclear instructions, install issues, factual errors, or rough edges, please report them so the lab can be improved.