What are the eight Annex III high-risk categories under the EU AI Act?

They cover biometrics, critical infrastructure, education and vocational training, employment and worker management, essential private and public services (including credit and insurance), law enforcement, migration, asylum, and border control, and administration of justice and democratic processes. An AI system intended for one of these purposes is classified as high-risk under Article 6(2), unless a narrow exception applies.

Is every AI tool used in HR automatically high-risk under the AI Act?

Not automatically, but most are. An agent that screens, ranks, or scores candidates falls inside the employment and worker management category. A tool that only pre-sorts applications by keyword for a human recruiter to review, without scoring or ranking, may qualify for the Article 6(3) exception, but the line is thin and most legal teams treat borderline cases as high-risk by default.

What is the Article 6(3) exception to Annex III classification?

It exempts a system from high-risk classification if it poses no significant risk of harm to health, safety, or fundamental rights, including by not materially influencing the decision outcome. That is the case when the system performs a narrow procedural task, improves the result of a previously completed human activity, detects decision-making patterns without replacing human assessment, or performs a preparatory task for an assessment covered by Annex III. The exception does not apply if the system profiles natural persons, which pulls it back into high-risk status regardless of how narrow its stated task is.

What obligations apply once a system is classified as high-risk?

High-risk systems require a lifecycle risk management system, data governance controls, technical documentation, automatic logging under Article 12, transparency and instructions for deployers, human oversight built into the design, and accuracy, robustness, and cybersecurity requirements. Deployers also inherit obligations, including ensuring human oversight is actually exercised in operation.

EU AI Act Annex III: Classifying High-Risk AI Systems

Most teams building AI agents assume the EU AI Act's toughest rules apply to someone else, usually a defense contractor or a biometric surveillance vendor. Then they look closely at Annex III and find their HR screening bot or credit-scoring assistant sitting squarely inside a high-risk category they had not considered.

Annex III is the list that decides which AI systems trigger the AI Act's heaviest obligations: conformity assessments, mandatory logging, human oversight design, and registration in an EU database, among others. Getting the classification right at the design stage is far cheaper than retrofitting compliance onto a system already in production.

The eight Annex III categories

Under Article 6(2), an AI system intended for one of these purposes is classified as high-risk, unless a narrow exception applies:

Biometrics — remote biometric identification, biometric categorization based on sensitive attributes, and emotion recognition systems
Critical infrastructure — safety components in the management of critical digital infrastructure, road traffic, or water, gas, heating, and electricity supply
Education and vocational training — systems that determine access to institutions, or assess and evaluate students
Employment and worker management — recruitment, hiring, promotion, termination, task allocation, or performance monitoring
Essential private and public services — creditworthiness evaluation, insurance risk assessment and pricing for life and health insurance, and eligibility for public assistance benefits
Law enforcement — risk assessment, evidence evaluation, and profiling in criminal contexts
Migration, asylum, and border control — risk assessment, examination of applications, and detection of specific individuals
Administration of justice and democratic processes — systems assisting judicial authorities in researching or interpreting facts and law
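
As a rough illustration, a team could encode the eight areas above and tag each of its own systems against them. The sketch below is only that: the purpose tags and the mapping in `PURPOSE_TO_CATEGORY` are hypothetical examples, not terms from the Act, and a real inventory would need legal review of each entry.

```python
from enum import Enum
from typing import Optional


class AnnexIIICategory(Enum):
    """The eight Annex III areas, named as in the list above."""
    BIOMETRICS = "Biometrics"
    CRITICAL_INFRASTRUCTURE = "Critical infrastructure"
    EDUCATION = "Education and vocational training"
    EMPLOYMENT = "Employment and worker management"
    ESSENTIAL_SERVICES = "Essential private and public services"
    LAW_ENFORCEMENT = "Law enforcement"
    MIGRATION = "Migration, asylum, and border control"
    JUSTICE_DEMOCRACY = "Administration of justice and democratic processes"


# Hypothetical purpose tags a team might attach to its own systems.
# Both the tags and the mapping are illustrative assumptions.
PURPOSE_TO_CATEGORY = {
    "resume_screening": AnnexIIICategory.EMPLOYMENT,
    "performance_monitoring": AnnexIIICategory.EMPLOYMENT,
    "credit_scoring": AnnexIIICategory.ESSENTIAL_SERVICES,
    "life_insurance_pricing": AnnexIIICategory.ESSENTIAL_SERVICES,
    "student_assessment": AnnexIIICategory.EDUCATION,
    "emotion_recognition": AnnexIIICategory.BIOMETRICS,
}


def annex_iii_category(purpose_tag: str) -> Optional[AnnexIIICategory]:
    """Return the mapped category, or None when the tag is not in the mapping."""
    return PURPOSE_TO_CATEGORY.get(purpose_tag)


for tag in ("resume_screening", "billing_faq"):
    category = annex_iii_category(tag)
    print(tag, "->", category.value if category else "no Annex III match")
```

A `None` result only means the tag is missing from this illustrative mapping, not that the system is outside Annex III.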

The category that catches most business software builders off guard

Employment and worker management is the one companies underestimate most often, because it does not sound like the dramatic "high-risk AI" the headlines describe. But an agent that screens résumés, ranks candidates, drafts performance review language, or flags employees for termination review falls inside this category almost by definition, regardless of how conversational or "assistive" the tool feels to the person using it.

The same applies to essential services. A support agent that only answers billing questions is low risk. The moment that same agent's output feeds into a decision about credit terms, life or health insurance pricing, or loan eligibility, even as one input among several, the system it is embedded in likely needs high-risk classification. The trigger is the purpose the output serves, not how autonomous the AI feels.

The exception clause most guidance skips

Article 6(3) carves out a narrow exception: a system in an Annex III category is not high-risk if it poses no significant risk of harm to health, safety, or fundamental rights, including by not materially influencing the outcome of decision-making. The article lists four conditions, any one of which is enough: the system performs a narrow procedural task, improves the result of a previously completed human activity, detects decision-making patterns without replacing or influencing the human assessment, or performs a preparatory task for an assessment covered by Annex III. None of this applies if the system profiles natural persons: profiling pulls the system back into high-risk status regardless of how narrow its stated task is.

This exception matters because it means not every AI touching an Annex III domain is automatically high-risk. A tool that pre-sorts résumés by keyword match for a human recruiter to review, without scoring or ranking candidates, may fall under this carve-out. A tool that assigns each candidate a fit score does not, because scoring is squarely the kind of evaluative output the exception excludes. The line is thin enough that most legal teams treat borderline cases as high-risk by default rather than betting on the exception holding up under review.
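
The order of the test matters: the disqualifier is checked first, and only then the qualifying conditions. Here is a minimal sketch of that logic, assuming a team records its answers as simple yes/no fields. The class and field names are hypothetical. Treating scoring and ranking as disqualifying follows the conservative reading above (the Act's text names profiling), and `preparatory_task_only` reflects the condition in Article 6(3)(d).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExceptionCheck:
    """Yes/no answers for one system in an Annex III area (hypothetical fields)."""
    # Disqualifier: the Act names profiling; counting scoring and ranking
    # here as well is the conservative reading, not the statutory wording.
    scores_ranks_or_profiles_individuals: bool
    # Qualifying conditions: any single one is enough.
    narrow_procedural_task: bool = False
    improves_completed_human_activity: bool = False
    detects_patterns_without_replacing_assessment: bool = False
    preparatory_task_only: bool = False  # Article 6(3)(d)


def exception_arguable(check: ExceptionCheck) -> bool:
    """True if the carve-out is at least arguable, False if the system stays high-risk."""
    # The disqualifier overrides every qualifying condition.
    if check.scores_ranks_or_profiles_individuals:
        return False
    return any((
        check.narrow_procedural_task,
        check.improves_completed_human_activity,
        check.detects_patterns_without_replacing_assessment,
        check.preparatory_task_only,
    ))


keyword_presort = ExceptionCheck(False, narrow_procedural_task=True)
fit_scorer = ExceptionCheck(True, narrow_procedural_task=True)
print(exception_arguable(keyword_presort))  # True
print(exception_arguable(fit_scorer))       # False
```

A `True` result means the exception is worth arguing, not that it will hold up under review.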

What high-risk classification actually requires

Once a system is classified as high-risk, the obligations are substantial, not just paperwork:

A risk management system covering the full lifecycle
Data governance ensuring training, validation, and testing data meets quality criteria
Technical documentation demonstrating compliance before market placement
Automatic logging throughout operation (Article 12)
Transparency and instructions for use enabling deployers to operate the system correctly
Human oversight measures built into the system design, not bolted on afterward
Accuracy, robustness, and cybersecurity requirements appropriate to the system's purpose

Providers bear most of this burden, but deployers (the companies actually running the system) inherit real obligations too, including ensuring human oversight is exercised in practice and monitoring the system's operation.
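
One simple way to keep these obligations visible during development is a gap list that shows which items still lack evidence. The sketch below assumes evidence is tracked as a free-text reference per obligation (a document link, a ticket ID). The `Obligation` enum mirrors the seven items above, and `open_gaps` is a hypothetical helper.

```python
from enum import Enum
from typing import Dict, List


class Obligation(Enum):
    """The seven obligations listed above, in the same order."""
    RISK_MANAGEMENT = "risk management system covering the full lifecycle"
    DATA_GOVERNANCE = "data governance for training, validation, and testing data"
    TECHNICAL_DOCUMENTATION = "technical documentation before market placement"
    AUTOMATIC_LOGGING = "automatic logging throughout operation (Article 12)"
    TRANSPARENCY = "transparency and instructions for use"
    HUMAN_OVERSIGHT = "human oversight built into the design"
    ACCURACY_ROBUSTNESS_SECURITY = "accuracy, robustness, and cybersecurity"


def open_gaps(evidence: Dict[Obligation, str]) -> List[Obligation]:
    """Return obligations with no evidence reference recorded, in list order."""
    return [o for o in Obligation if not evidence.get(o, "").strip()]


# Example: two obligations have evidence so far, the reference strings are made up.
evidence = {
    Obligation.AUTOMATIC_LOGGING: "design doc: event log schema",
    Obligation.HUMAN_OVERSIGHT: "design doc: reviewer approval step",
}
for gap in open_gaps(evidence):
    print("missing evidence:", gap.value)
```

A list like this tracks whether evidence exists, not whether it is sufficient. That judgment stays with whoever signs off on the assessment.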

A practical classification checklist

Before building or buying an AI agent for any employment, credit, insurance, education, or public-service use case:

Identify what decision or output the AI feeds into, not just what task it performs
Check whether that decision area appears in the eight Annex III categories
If it does, test the Article 6(3) exception carefully: does the system score, rank, or profile individuals, or does it only assist a human who retains full judgment?
If borderline, document the reasoning either way, because the classification decision itself needs to be defensible
If high-risk, build logging, human oversight, and documentation into the system from day one rather than retrofitting them before a deadline
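
The checklist above can be captured as a small record, so each classification decision carries its own reasoning. This is a sketch under stated assumptions: `ClassificationRecord` and its fields are hypothetical, the three outcome labels are illustrative, and a "borderline" result is a prompt for legal review, not a conclusion.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ClassificationRecord:
    """One documented pass through the checklist (hypothetical structure)."""
    system_name: str
    decision_fed: str                   # step 1: the decision the output feeds into
    annex_iii_area: Optional[str]       # step 2: matching Annex III area, if any
    scores_ranks_or_profiles: bool      # step 3: disqualifies the exception
    human_retains_full_judgment: bool   # step 3: needed for the exception
    reasoning: str                      # step 4: written justification

    def __post_init__(self) -> None:
        # Step 4: any system touching Annex III needs documented reasoning.
        if self.annex_iii_area is not None and not self.reasoning.strip():
            raise ValueError("document the reasoning for any Annex III system")

    def outcome(self) -> str:
        if self.annex_iii_area is None:
            return "outside Annex III"
        if self.scores_ranks_or_profiles or not self.human_retains_full_judgment:
            return "high-risk"
        # Exception is arguable; many teams still treat this as high-risk.
        return "borderline"


record = ClassificationRecord(
    system_name="resume assistant",
    decision_fed="which applicants a recruiter interviews",
    annex_iii_area="Employment and worker management",
    scores_ranks_or_profiles=True,
    human_retains_full_judgment=True,
    reasoning="Assigns a fit score per candidate, so the exception is not relied on.",
)
print(record.outcome())  # high-risk
```

Because the record refuses to exist without reasoning, the justification gets written at classification time and not reconstructed later.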

Building on infrastructure that assumes this from the start

Retrofitting Article 12 logging, human oversight gates, and documentation onto a system already in production is a multi-month project most teams underestimate. Choosing a platform where audit trails, approval gates, and role-based access are default behavior rather than features you bolt on later removes that risk before it exists. See how AgentWorks approaches AI Act readiness for agents touching employment, credit, and other Annex III domains.
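
To make "audit trail plus approval gate" concrete, here is a minimal sketch of the pattern in plain Python. It does not show AgentWorks's API or any specific platform: `log_event` and `gated_action` are hypothetical helpers, the JSON-lines format is an assumption, and nothing here is claimed to satisfy Article 12 on its own.

```python
import io
import json
from datetime import datetime, timezone
from typing import Callable, Optional, TextIO


def log_event(stream: TextIO, event: str, **details: object) -> None:
    """Append one timestamped JSON line; details must be JSON-serializable."""
    record = {"ts": datetime.now(timezone.utc).isoformat(), "event": event, **details}
    stream.write(json.dumps(record, sort_keys=True) + "\n")


def gated_action(
    stream: TextIO,
    action_name: str,
    proposal: dict,
    approve: Callable[[dict], bool],
    execute: Callable[[dict], object],
) -> Optional[object]:
    """Run `execute` only after `approve` says yes, logging every step."""
    log_event(stream, "proposed", action=action_name, proposal=proposal)
    if not approve(proposal):
        log_event(stream, "rejected", action=action_name)
        return None
    log_event(stream, "approved", action=action_name)
    result = execute(proposal)
    log_event(stream, "executed", action=action_name)
    return result


# Usage with an in-memory log; a human reviewer would stand behind `approve`.
audit_log = io.StringIO()
gated_action(
    audit_log,
    "advance_candidate",
    {"candidate_id": "c-1"},
    approve=lambda proposal: False,   # stand-in for a reviewer declining
    execute=lambda proposal: "advanced",
)
print(audit_log.getvalue())
```

The point of the pattern is that the agent cannot act without passing through the gate, and the log records the rejection as well as the approval.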

Classification is not a one-time exercise either. Adding a new capability to an existing agent, such as letting an HR assistant also rank candidates instead of just summarizing applications, can move a system from low-risk to high-risk overnight. Re-check the classification every time the agent's scope changes, not just when it launches.
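
A lightweight way to enforce that re-check is to diff the agent's capabilities on every release. The sketch below assumes capabilities are tracked as plain string tags. The tags and `REVIEW_WATCHLIST` are hypothetical, and the watchlist is a prompt for review, not a legal test.

```python
from typing import Dict, List, Set

# Hypothetical capability tags that should always trigger a fresh
# classification review when they appear.
REVIEW_WATCHLIST = frozenset({
    "rank_candidates",
    "score_candidates",
    "flag_for_termination_review",
})


def scope_change_review(previous: Set[str], current: Set[str]) -> Dict[str, List[str]]:
    """List newly added capabilities and the subset on the watchlist."""
    added = current - previous
    return {
        "added": sorted(added),
        "likely_high_risk": sorted(added & REVIEW_WATCHLIST),
    }


before = {"summarize_applications"}
after = {"summarize_applications", "rank_candidates"}
print(scope_change_review(before, after))
```

Any non-empty `added` list is a reason to revisit the classification. The watchlist only highlights the changes most likely to flip the result.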
