Instructor-Led Training Evaluation Survey Template
Use this instructor-led training (ILT) evaluation to capture clear, post-session feedback you can act on the same day: course value, instructor effectiveness, delivery/logistics, and intent to apply. The question set maps cleanly to Kirkpatrick Levels 1-3 (with a simple follow-up plan for Level 3/4), so you can report results without claiming ROI from a single "smile sheet."
When to Run an ILT Evaluation (3 Best Moments)
Immediately after a single-session ILT
Use this section to capture honest Level 1 reaction while the experience is fresh. If you are running a one-and-done session (in-person or virtual), do this first: show a QR code and drop the link in chat during the last 2 minutes, before people log off or leave.
End of each day/module for multi-day workshops
Use this timing to pinpoint exactly where pacing, activities, or content breaks. If you are running multi-day training, do this first: run a 1-2 minute pulse at the end of each day/module, then keep a slightly longer wrap-up survey for the final day.
Pilot (cohort 1) before you scale
Use the first cohort to validate your instructor approach, exercises, and logistics before rolling out broadly. If you are about to scale a program, do this first: treat cohort 1 as a pilot and review results within 24 hours to lock the run-of-show.
Do this next: Schedule a separate 30-60 day follow-up pulse (a different survey) to check real-world application (Kirkpatrick Level 3) rather than trying to force that into a same-day evaluation; a useful critique of how Kirkpatrick levels get overinterpreted is Alliger and Janak's analysis.
Who Should Take This ILT Evaluation (and How to Keep It Anonymous)
Use this section to get complete, comparable feedback right after your instructor-led session. If you are launching today, do this first: invite every attendee to respond (not just volunteers) and keep the survey short enough to finish before they switch tasks.
Audience: send the evaluation to all learners who attended the session live (including virtual attendees). If you run multiple training types, keep this template as your default and browse training survey templates when you need a specialized version (onboarding, compliance, workshops, follow-up).
Identifiers: collect session details that help you roll results up by cohort and instructor without collecting names by default. Make these fields either required or pre-filled:
- Session code (copy/paste label: "Session code")
- Session date ("Session date")
- Delivery mode ("In-person" / "Virtual")
- Instructor ("Instructor name" or "Facilitator")
Anonymous (default): Use when you want candid feedback, you are comparing instructors, or you are worried about learners holding back. Say it plainly in the invite: "Responses are anonymous. We will report results in summary (by session/instructor) and will not try to identify you."
Confidential/identified (only when needed): Use when you must follow up for coaching, remediation, certification, or accommodations. If you collect names/emails, state exactly who can see them and what you will do next, consistent with the transparency expectations in OMB information collection guidance.
Do this next: Decide "anonymous" vs "confidential/identified" before you send the link, then put the privacy promise in both the email and the first line of the survey.
Kirkpatrick-Aligned ILT Evaluation Questions (What to Ask and Why)
"Overall, this training was a good use of my time."
Why it matters: This is your Level 1 anchor for perceived value. Trend it over time and use it to prioritize which sessions need attention first.
When to use: Include in every run (single-session and multi-day final survey).
"The instructor explained concepts clearly and checked for understanding."
Why it matters: Clarity plus check-ins is the core of instructor effectiveness. It separates content problems from delivery problems.
When to use: Include whenever you want facilitator coaching inputs or you rotate trainers.
Scales: keep the same response options each session so your reporting stays comparable, and follow Likert scale best practices to standardize labels for cleaner trending.
"The pace of the session was right (not too fast, not too slow)."
Why it matters: Pace is one of the fastest levers you can fix next run. Low pace scores usually explain weak confidence and weak intent to apply.
When to use: Include in multi-day training (daily pulses) and in technical training where cognitive load is high.
"Examples and activities felt relevant to my job."
Why it matters: Relevance is the bridge between Level 1 reaction and Level 3 application. If relevance is low, "good instructor" scores will not translate into use on the job.
When to use: Include in every run, especially for mixed-audience cohorts.
"I had enough practice (exercises, role plays, labs) to use what I learned."
Why it matters: Practice time is a leading indicator for skill transfer. It also tells you whether you need fewer slides and more doing.
When to use: Use for leadership, customer, and technical skills courses; skip or simplify for pure awareness sessions.
"The materials and job aids will be useful after the session."
Why it matters: Job aids support Level 3 behavior after the class. If this scores low, you can fix it without changing the entire curriculum.
When to use: Include when you provide guides, templates, SOPs, or reference sheets.
"The room/virtual platform and logistics supported my learning."
Why it matters: A/V issues, seating, chat moderation, and breakout setup can tank otherwise good training. This item helps you separate logistics fixes from instructor fixes.
When to use: Include in every run; segment heavily by delivery mode and location.
"I can describe the key concepts/steps taught in this session."
Why it matters: This gives you a Level 2 self-report signal without pretending you ran a test. Pair it with confidence to spot "I get it" vs "I can do it" gaps.
When to use: Include when you cannot do a knowledge check in the session but still need a learning readout.
"I feel confident applying what I learned in my role."
Why it matters: Confidence is the best same-day predictor you can collect for later use. If it drops, add practice, coaching, and clearer "what good looks like" examples.
When to use: Include in every run; use it to decide whether you need follow-up support.
"I intend to use what I learned in the next 30 days."
Why it matters: Intent-to-apply is your Level 3 bridge item. Treat it as a directional signal, not proof of behavior change.
When to use: Include in every run and trend it by instructor/session; then verify with a 30-60 day follow-up pulse.
"What might prevent you from applying what you learned? (Select all that apply)"
Why it matters: Barriers turn vague feedback into an action list (time, tools, manager support, unclear process). This is where you find issues training cannot fix alone.
When to use: Include when your stakeholders want a practical Level 3 plan, not just scores.
Open ends: keep them few and optional so you stay in the 3-7 minute range, and use focused wording patterns from open-ended questions to avoid long, hard-to-code responses. Shorter surveys often get better completion and less drop-off, as shown in experiments such as Cottrell et al. on questionnaire response.
"What was the most useful part of the session for your job?"
Why it matters: This tells you what to protect when you revise the course. It also gives you quotable evidence for stakeholders without cherry-picking.
When to use: Include in every run; keep it optional to reduce fatigue.
"What is one change that would improve this session for the next group?"
Why it matters: A single-change prompt reduces rambling and gives you clear fixes (timing, examples, practice time, materials).
When to use: Include in pilots and whenever you are iterating the curriculum.
Keep vs swap (fast customization rules):
- Keep the anchor items (overall value, instructor clarity, relevance, confidence, intent-to-apply) so you can trend across cohorts and instructors.
- Swap the content-specific item based on course type: for compliance/safety use "I understand the required policy/process"; for leadership use "I can name 1-2 behaviors I will practice"; for technical use "I can complete the key task in the system/tool."
- Swap logistics wording by modality: in-person (room, audio, seating) vs virtual (platform stability, breakout setup, chat moderation).
Do this next: Lock your scale labels and your 5-7 "always-on" questions, then customize only 2-4 items per course so your results stay comparable.
Fast Deployment Checklist (Timing, Channels, Reminders, Deduping)
- Close the session with an in-the-moment link: Show a QR code in the room or drop the link in virtual chat during the final minutes, then pause so people can submit before they leave.
- Email the survey within 1-3 hours: Use a clear subject line ("Training evaluation - 3 minutes") and restate your privacy promise in the first sentence.
- Send one reminder in 24-48 hours: Remind only non-responders and stop after one nudge; guidance such as Dillman's tailored design method emphasizes limited, purposeful contacts to protect goodwill in internal programs.
- Deduplicate to one response per attendee: Default to unique links (best) or a simple access code shown at the end of the session (good). If you allow open access, add a "One response per attendee" instruction and watch for duplicates.
- Require a session ID field: Make "Session code" required so you can report clean rollups by cohort, instructor, and modality without collecting names.
- Standardize your scale labels: Use the same endpoints every time (for example: Strongly disagree to Strongly agree) so trending and dashboards do not drift.
- Keep open-ends optional and limited: Use 1-2 open text prompts max in the same-day survey. Add more only in a follow-up pulse when you can afford the time.
- Track outcomes consistently: Log sent, started, completed, and partials the same way each session so you can compare response patterns over time; AAPOR's survey outcome rate guidance provides standard definitions you can reuse in internal reporting.
Small cohorts: if attendance is low or you get only a handful of completes, treat the results as directional and lean more on patterns across sessions. Use sample size guidance to decide when to aggregate multiple cohorts before you publish comparisons.
Do this next: Put the QR code on your final slide now, and draft your email invite + one reminder before you teach the session.
Results Guide: Turn ILT Feedback Into Improvements (Kirkpatrick L1-L3)
- Step 1: Clean the file and segment it the way you will actTag each response with session code, date, delivery mode, and instructor, then scan for duplicates and empty responses. If you cannot take action at the individual level, do not report at the individual level; report by session/instructor rollups.
- Step 2: Summarize Level 1 (reaction) and instructor effectivenessStart with your two anchors: overall value + instructor clarity. If overall value drops but instructor is strong, fix content/relevance; if instructor drops across sessions, prioritize coaching and a tighter run-of-show.
- Step 3: Review Level 2 self-reported gains and confidenceCompare "I can describe the key concepts" vs "I feel confident applying this" to spot a practice gap. If knowledge looks fine but confidence is low, add guided practice, worked examples, and a clear post-session job aid.
- Step 4: Flag Level 3 intent-to-apply and the top barriersTreat intent-to-apply and barriers as your handoff to the business. If barriers cluster around time, tools, or manager support, capture that as an action item for sponsors because training alone will not fix it.
- Step 5: Turn patterns into an action plan (and schedule the follow-up)Convert repeat themes into owners and deadlines, then close the loop in the next session. Use these common "if X, do Y" paths:
- If pacing is too fast: shorten lecture segments, add timeboxes, and move one topic to pre-work or a follow-up module.
- If relevance is low: swap in role-specific examples, add a "choose your scenario" exercise, and ask learners up front which use case they need.
- If confidence is low after training: add more practice, a checklist, and a 2-week office hours slot; then verify behavior with a 30-60 day learner pulse.
- If logistics/platform scores are low: fix A/V, room layout, and breakout instructions; assign a producer/moderator for virtual delivery.
Do this next: Create a 3-line summary for sponsors (Top strength, Top issue, Next change) and schedule your separate 30-60 day follow-up pulse before the course momentum fades.
Frequently Asked Questions
How long should an instructor-led training evaluation be?
Keep the default evaluation to 3-7 minutes (about 10-15 core items) with only 1-2 optional open-ends. If you need more depth, add one strong open question and use targeted multiple-choice follow-ups (barriers, modules, or topics) instead of stacking several long text prompts.
Should ILT evaluations be anonymous or identified?
Default to anonymous responses when you want honest feedback about the instructor, pacing, or relevance. Switch to confidential/identified only when you have a clear follow-up need (coaching, remediation, certification), and state exactly who can see identifiers and how you will use them.
When should I send the survey for multi-day training?
Use a 1-2 minute daily pulse at the end of each day/module to catch pacing and module-specific issues, then send a final survey at course end for the overall view. If time is tight, send only the end-of-course survey, but expect less precision on which day/module caused problems.
How do I use Kirkpatrick levels without turning this into a complex study?
Map your same-day evaluation to Level 1 (reaction) and a light Level 2 signal (self-reported knowledge/confidence), then stop. Run Level 3 as a separate 30-60 day follow-up pulse about actual use and barriers, and treat Level 4 as supporting evidence by pairing with operational metrics (association, not proof of causation).
What is a good response rate for post-training evaluations?
No single benchmark fits every course, audience, or delivery mode, so focus on consistent trend tracking across sessions. You will reliably lift completion by collecting responses in the moment (QR/chat at close), keeping the survey short, stating the purpose, and sending one reminder to non-responders.
How do I act on open-ended comments without getting overwhelmed?
Tag comments into 5-7 buckets (pacing, relevance, instructor delivery, activities, materials, logistics, other) and count themes. Prioritize fixes that show up across multiple sessions/instructors or that align with low scores on your anchor items.
Related Survey Templates
FREE TO START -- NO CREDIT CARD REQUIRED