DE EN
Visual for Learning Learning Learning Learning maps how Nova turns observed evidence into reviewed guidance across TOB, LCJ, ALP, CLS, PKS, OK, and ETM. Overview Areas Learning chain Learning modes Where it belongs Agent Tools

Learning

Learning in Nova does not mean that an agent keeps arbitrary memories. Nova separates observed work, learning candidates, reviewed website knowledge, current operational state, and concrete task runs so later guidance stays useful and checkable.

This overview is the map for Nova’s learning area. Tool Observation Bus (TOB) records observed tool evidence. Lightweight Candidate Journal (LCJ) keeps learning evidence. Agent Learning Pipeline (ALP) reviews candidates and promotes the useful ones. Closed-Loop System (CLS) verifies action outcomes. Phenomenological Knowledge Store (PKS) stores curated website phenomena. Operational Knowledge (OK) keeps current service state. Episodic Task Memory (ETM) remembers task runs, progress, and completion conditions.

TUC stands for Task URL Coverage. TUC belongs inside ETM. It checks whether relevant URLs or work units were actually covered before a task with an exhaustive completion claim is accepted as complete.

Adjacent memory surfaces stay separate from the learning pillars. Browser Memory keeps opt-in user context for a domain, Operator Notes keep reusable cross-session notes, and Evidence Verification Mode (EVM) guides research-style claim checking through task instructions.

In short

  • Learning starts with observed evidence, not with an agent claim.
  • Automatic learning can prepare candidates, but candidates are not active trusted knowledge yet.
  • ALP decides whether repeated evidence is useful enough to be promoted.
  • CLS checks whether an action actually produced the expected result before the next step trusts it.
  • PKS, OK, and ETM keep different kinds of knowledge so guidance does not blur together.
  • TUC belongs to ETM because it checks coverage for concrete task runs.
  • Browser Memory and Operator Notes are adjacent memory surfaces, not replacements for PKS, OK, LCJ, or ETM.

Learning Areas

Each learning area answers a different question. The point is not to store more memory everywhere, but to keep evidence, review, current state, and task progress in the right place.

Observation layer

TOB

Question
What was actually observed?
What it does
TOB records tool evidence from real execution: reads, clicks, inputs, navigation, failures, blocked calls, and coverage-relevant observations. It gives later systems something stronger than an agent claim.
Open this when
Use it when task completion, learning, safety, or state checks need proof that work really happened.
Open guide
Candidate journal

LCJ

Question
What might be worth learning?
What it does
LCJ keeps early learning evidence and candidate claims without turning them into trusted knowledge. It preserves status, confidence, source context, and repeated signals for later review.
Open this when
Use it when repeated website work leaves useful evidence, but the pattern is not ready for PKS or active guidance yet.
Open guide
Learning quality gate

ALP

Question
Is the candidate mature enough?
What it does
ALP scores repeated evidence, explains learning decisions, generates candidates, promotes useful knowledge, and rejects or delays weak patterns.
Open this when
Use it when observations should become reviewed learning, or when a candidate needs explanation, promotion, or revalidation.
Open guide
Outcome verification

CLS

Question
Did the action really work?
What it does
CLS closes the loop between a dispatched browser action and the observed result. It keeps goals, expected outcomes, verification state, and uncertainty visible.
Open this when
Use it when an agent should not continue just because a click, submit, login, or model switch was sent.
Open guide
Reviewed website memory

PKS

Question
What reusable website pattern is proven?
What it does
PKS stores curated website phenomena, recognition signals, playbooks, health, and safe-use notes. It is long-term orientation, not a raw event log.
Open this when
Use it when the same site behavior returns: consent banners, modals, login walls, fragile navigation, paywalls, or platform-specific UI patterns.
Open guide
Live operational state

OK

Question
What is true right now?
What it does
OK keeps current service facts, visible state, capabilities, freshness, confidence, and conflicts. It prevents agents from relying on stale assumptions about a tab or service.
Open this when
Use it when the next step depends on current login state, active model, plan, feature availability, route, or service capability.
Open guide
Task memory

ETM

Question
What task is running and what remains?
What it does
ETM keeps task profiles, task instances, work units, progress, resume state, completion rules, and Task URL Coverage for exhaustive work.
Open this when
Use it for large or recurring tasks where an agent must resume, prove coverage, track open units, or justify completion.
Open guide

The Learning Chain

The sequence maps responsibilities. Nova may enter the chain at different points, but each area has a clear job.

  1. Observe TOB records what actually happened while an agent used tools: reads, clicks, inputs, navigation, failures, and blockers.
  2. Collect LCJ keeps learning evidence and candidates without treating them as trusted knowledge yet.
  3. Review ALP scores repeated evidence, explains decisions, and promotes only candidates with enough support.
  4. Verify CLS checks whether a browser action really produced the expected outcome before that outcome feeds guidance or task progress.
  5. Store PKS and OK separate durable website behavior from current operational facts such as visible state, service capability, or freshness.
  6. Use PKS, OK, and ETM return guidance to later agents while still requiring the current page or task state to confirm the match.
  7. Prove completion ETM and TUC keep task progress, open units, completion rules, and URL coverage visible before a large task is declared done.
  8. Maintain ALP and PKS surface drift, stale knowledge, repeated failures, and candidates that should be patched, demoted, or retired.

Kinds of Learning

Nova uses several learning paths because evidence can be collected automatically, requested deliberately, reviewed semantically, or tied to a concrete task run.

  1. Automatic learning When safe actions repeat, selectors keep working, or clear website situations reappear, Nova can prepare candidates automatically. This reduces manual bookkeeping but does not make the knowledge active by itself.
  2. Deliberate Learn Mode An agent can ask Nova for learning-oriented instructions through nova.get_instructions and produce a structured PLATFORM_PLAYBOOK. These names are for agents and integrators, not normal user controls in the UI.
  3. Semantic learning Recurring situations such as consent dialogs, login walls, or risky repeated actions can become learning opportunities. Agent-facing tools such as nova.learn_resolve_opportunity help decide how to handle them.
  4. Task learning ETM keeps task profiles, task instances, progress, guidance, and completion rules. TUC extends this when a task needs evidence that relevant URLs or units were covered.

Where It Belongs

The learning area should stay readable by giving each concept a home.

Website behavior
belongs in PKS when it is reusable and reviewed.
Current service state
belongs in OK when it describes what is true right now.
Task progress
belongs in ETM when it describes a concrete run, open work, or completion.
URL coverage
belongs in ETM through TUC, because coverage is evidence for a task run.
User context for a domain
belongs in Browser Memory when the user wants Nova to remember notes, preferences, or session context across visits.
Reusable operator notes
belong in Operator Notes when a preference, research rule, or workflow fact should apply across tasks instead of one website pattern.
Research claim checking
belongs to EVM task guidance. It can write claim evidence into LCJ, but it is not a separate long-term learning pillar.
Agent awareness and safety hints
can support learning, but they are not learning pillars on their own.

Limits

Learning should make Nova more reliable, not less accountable. A candidate can remain untrusted, stored knowledge can be stale, task completion can be rejected, and sensitive actions still need the normal safety and approval model. The current page, current task state, and current user intent remain decisive.

Agent Tools

MCP tools for agents. These variables and tool names are intended for agents and integrators. They are not normal user controls in the interface.

Variable Meaning
nova.get_instructions Read Learn Mode instructions
nova.learn_suggest Suggest learning opportunities from evidence
nova.learn_generate Prepare learning candidates from evidence
nova.learn_promote Move reviewed knowledge toward active use
nova.learn_feedback Review learning history and decisions
nova.learn_resolve_opportunity Classify a semantic learning opportunity
nova.perceive Understand the visible page
nova.tab_claim Reserve a tab for work
nova.goal_register Record a work goal
nova.pks_get Read stored guidance
nova.pks_match Find known patterns
nova.ok_observe Report operational state
nova.memory_stats Review learning-memory statistics
nova.memory_add_candidate Record an explicit learning candidate
nova.memory_recall Recalls opt-in Browser Memory notes for a domain so an agent can see relevant user context before continuing work.
nova.memory_note Stores an explicit Browser Memory note, preference, or context item for later sessions when Browser Memory is enabled.
nova.memory_forget Deletes Browser Memory entries by id, domain, type, or all entries when the user or agent should remove stored context.
nova.operator_notes_query Finds reusable operator notes for a task, such as preferences, research heuristics, or workflow facts.
nova.operator_notes_store Stores a reusable note that should survive across sessions without being tied to one website phenomenon or task instance.
nova.task_search Search task memory
nova.task_match Recognize a recurring task
nova.task_instance_create Start a task run
nova.task_instance_get Read a task run
nova.task_instance_progress Report task progress
nova.task_instance_complete Check whether task completion is allowed
nova.coverage_scan Check URL coverage
nova.tools_bundle Loads curated tool groups for agents instead of relying on long single-tool discovery.
structuredContent.evm Task-mode research guidance for claim testing, source minimums, and unknown handling; it is not part of Learn Mode.
PLATFORM_PLAYBOOK Learn Mode deliverable that summarizes a mapped platform for future work.
pksAdvice Agent-facing guidance from matched PKS knowledge.
pksAutoLearn Best-effort PKS candidate created from a safe observed trigger, including scope and candidate identifiers.
pksHealthWarnings Warnings about degraded or stale PKS knowledge that agents should treat as caution signals.
pksSemanticLearning Higher-priority prompt when Nova detects a semantic website pattern that should be resolved or stored.
pksSemanticLearningResolution Result block that confirms how a semantic learning opportunity was resolved.
frameworkHints Advisory framework guidance for current-domain forms and controls when PKS has active hints.
serviceDiscovery Service-category hints from instructions when task keywords match known PKS service metadata.
learnMode.progress Learn-mode coverage block with active tasks, visited surfaces, interactions, and floor status.
evidenceSummary TOB summary that helps agents compare a completion claim with observed evidence.
okHints Hints that important operational state is missing, stale, or uncertain.
goalContext CLS resume anchor for the current goal, step, and verification state on a target.
taskAwareness Compact ETM context for matching profile, active instance, progress, and resume state.
taskAwareness.profileMatch Best ETM profile match with goal, guidance summary, required checks, and completion condition.
taskAwareness.activeInstances[] Active ETM task instances with status, progress, target URL, and resume hints.
completionAllowed Signal used when ETM or evidence checks decide whether a task can be accepted as complete.
verificationResults Structured outcome of task or evidence verification checks returned to agents.