Learning

Learning in Nova does not mean that an agent keeps arbitrary memories. Nova separates observed work, learning candidates, reviewed website knowledge, current operational state, and concrete task runs so later guidance stays useful and checkable.

This overview is the map for Nova’s learning area. Tool Observation Bus (TOB) records observed tool evidence. Lightweight Candidate Journal (LCJ) keeps learning evidence. Agent Learning Pipeline (ALP) reviews candidates and promotes the useful ones. Closed-Loop System (CLS) verifies action outcomes. Phenomenological Knowledge Store (PKS) stores curated website phenomena. Operational Knowledge (OK) keeps current service state. Episodic Task Memory (ETM) remembers task runs, progress, and completion conditions.

TUC stands for Task URL Coverage. TUC belongs inside ETM. It checks whether relevant URLs or work units were actually covered before a task with an exhaustive completion claim is accepted as complete.

Adjacent memory surfaces stay separate from the learning pillars. Browser Memory keeps opt-in user context for a domain, Operator Notes keep reusable cross-session notes, and Evidence Verification Mode (EVM) guides research-style claim checking through task instructions.

In short

Learning starts with observed evidence, not with an agent claim.
Automatic learning can prepare candidates, but candidates are not active trusted knowledge yet.
ALP decides whether repeated evidence is useful enough to be promoted.
CLS checks whether an action actually produced the expected result before the next step trusts it.
PKS, OK, and ETM keep different kinds of knowledge so guidance does not blur together.
TUC belongs to ETM because it checks coverage for concrete task runs.
Browser Memory and Operator Notes are adjacent memory surfaces, not replacements for PKS, OK, LCJ, or ETM.

Learning Areas

Each learning area answers a different question. The point is not to store more memory everywhere, but to keep evidence, review, current state, and task progress in the right place.

Observation layer

TOB

Question: What was actually observed?
What it does: TOB records tool evidence from real execution: reads, clicks, inputs, navigation, failures, blocked calls, and coverage-relevant observations. It gives later systems something stronger than an agent claim.
Open this when: Use it when task completion, learning, safety, or state checks need proof that work really happened.

Open guide

Candidate journal

LCJ

Question: What might be worth learning?
What it does: LCJ keeps early learning evidence and candidate claims without turning them into trusted knowledge. It preserves status, confidence, source context, and repeated signals for later review.
Open this when: Use it when repeated website work leaves useful evidence, but the pattern is not ready for PKS or active guidance yet.

Open guide

Learning quality gate

ALP

Question: Is the candidate mature enough?
What it does: ALP scores repeated evidence, explains learning decisions, generates candidates, promotes useful knowledge, and rejects or delays weak patterns.
Open this when: Use it when observations should become reviewed learning, or when a candidate needs explanation, promotion, or revalidation.

Open guide

Outcome verification

CLS

Question: Did the action really work?
What it does: CLS closes the loop between a dispatched browser action and the observed result. It keeps goals, expected outcomes, verification state, and uncertainty visible.
Open this when: Use it when an agent should not continue just because a click, submit, login, or model switch was sent.

Open guide

Reviewed website memory

PKS

Question: What reusable website pattern is proven?
What it does: PKS stores curated website phenomena, recognition signals, playbooks, health, and safe-use notes. It is long-term orientation, not a raw event log.
Open this when: Use it when the same site behavior returns: consent banners, modals, login walls, fragile navigation, paywalls, or platform-specific UI patterns.

Open guide

Live operational state

OK

Question: What is true right now?
What it does: OK keeps current service facts, visible state, capabilities, freshness, confidence, and conflicts. It prevents agents from relying on stale assumptions about a tab or service.
Open this when: Use it when the next step depends on current login state, active model, plan, feature availability, route, or service capability.

Open guide

Task memory

ETM

Question: What task is running and what remains?
What it does: ETM keeps task profiles, task instances, work units, progress, resume state, completion rules, and Task URL Coverage for exhaustive work.
Open this when: Use it for large or recurring tasks where an agent must resume, prove coverage, track open units, or justify completion.

Open guide

The Learning Chain

The sequence maps responsibilities. Nova may enter the chain at different points, but each area has a clear job.

Observe TOB records what actually happened while an agent used tools: reads, clicks, inputs, navigation, failures, and blockers.
Collect LCJ keeps learning evidence and candidates without treating them as trusted knowledge yet.
Review ALP scores repeated evidence, explains decisions, and promotes only candidates with enough support.
Verify CLS checks whether a browser action really produced the expected outcome before that outcome feeds guidance or task progress.
Store PKS and OK separate durable website behavior from current operational facts such as visible state, service capability, or freshness.
Use PKS, OK, and ETM return guidance to later agents while still requiring the current page or task state to confirm the match.
Prove completion ETM and TUC keep task progress, open units, completion rules, and URL coverage visible before a large task is declared done.
Maintain ALP and PKS surface drift, stale knowledge, repeated failures, and candidates that should be patched, demoted, or retired.

Kinds of Learning

Nova uses several learning paths because evidence can be collected automatically, requested deliberately, reviewed semantically, or tied to a concrete task run.

Automatic learning When safe actions repeat, selectors keep working, or clear website situations reappear, Nova can prepare candidates automatically. This reduces manual bookkeeping but does not make the knowledge active by itself.
Deliberate Learn Mode An agent can ask Nova for learning-oriented instructions through nova.get_instructions and produce a structured PLATFORM_PLAYBOOK. These names are for agents and integrators, not normal user controls in the UI.
Semantic learning Recurring situations such as consent dialogs, login walls, or risky repeated actions can become learning opportunities. Agent-facing tools such as nova.learn_resolve_opportunity help decide how to handle them.
Task learning ETM keeps task profiles, task instances, progress, guidance, and completion rules. TUC extends this when a task needs evidence that relevant URLs or units were covered.

Where It Belongs

The learning area should stay readable by giving each concept a home.

Website behavior: belongs in PKS when it is reusable and reviewed.
Current service state: belongs in OK when it describes what is true right now.
Task progress: belongs in ETM when it describes a concrete run, open work, or completion.
URL coverage: belongs in ETM through TUC, because coverage is evidence for a task run.
User context for a domain: belongs in Browser Memory when the user wants Nova to remember notes, preferences, or session context across visits.
Reusable operator notes: belong in Operator Notes when a preference, research rule, or workflow fact should apply across tasks instead of one website pattern.
Research claim checking: belongs to EVM task guidance. It can write claim evidence into LCJ, but it is not a separate long-term learning pillar.
Agent awareness and safety hints: can support learning, but they are not learning pillars on their own.

Limits

Learning should make Nova more reliable, not less accountable. A candidate can remain untrusted, stored knowledge can be stale, task completion can be rejected, and sensitive actions still need the normal safety and approval model. The current page, current task state, and current user intent remain decisive.

Agent Tools

MCP tools for agents. These variables and tool names are intended for agents and integrators. They are not normal user controls in the interface.

Variable	Meaning
`nova.get_instructions`	Read Learn Mode instructions
`nova.learn_suggest`	Suggest learning opportunities from evidence
`nova.learn_generate`	Prepare learning candidates from evidence
`nova.learn_promote`	Move reviewed knowledge toward active use
`nova.learn_feedback`	Review learning history and decisions
`nova.learn_resolve_opportunity`	Classify a semantic learning opportunity
`nova.perceive`	Understand the visible page
`nova.tab_claim`	Reserve a tab for work
`nova.goal_register`	Record a work goal
`nova.pks_get`	Read stored guidance
`nova.pks_match`	Find known patterns
`nova.ok_observe`	Report operational state
`nova.memory_stats`	Review learning-memory statistics
`nova.memory_add_candidate`	Record an explicit learning candidate
`nova.memory_recall`	Recalls opt-in Browser Memory notes for a domain so an agent can see relevant user context before continuing work.
`nova.memory_note`	Stores an explicit Browser Memory note, preference, or context item for later sessions when Browser Memory is enabled.
`nova.memory_forget`	Deletes Browser Memory entries by id, domain, type, or all entries when the user or agent should remove stored context.
`nova.operator_notes_query`	Finds reusable operator notes for a task, such as preferences, research heuristics, or workflow facts.
`nova.operator_notes_store`	Stores a reusable note that should survive across sessions without being tied to one website phenomenon or task instance.
`nova.task_search`	Search task memory
`nova.task_match`	Recognize a recurring task
`nova.task_instance_create`	Start a task run
`nova.task_instance_get`	Read a task run
`nova.task_instance_progress`	Report task progress
`nova.task_instance_complete`	Check whether task completion is allowed
`nova.coverage_scan`	Check URL coverage
`nova.tools_bundle`	Loads curated tool groups for agents instead of relying on long single-tool discovery.
`structuredContent.evm`	Task-mode research guidance for claim testing, source minimums, and unknown handling; it is not part of Learn Mode.
`PLATFORM_PLAYBOOK`	Learn Mode deliverable that summarizes a mapped platform for future work.
`pksAdvice`	Agent-facing guidance from matched PKS knowledge.
`pksAutoLearn`	Best-effort PKS candidate created from a safe observed trigger, including scope and candidate identifiers.
`pksHealthWarnings`	Warnings about degraded or stale PKS knowledge that agents should treat as caution signals.
`pksSemanticLearning`	Higher-priority prompt when Nova detects a semantic website pattern that should be resolved or stored.
`pksSemanticLearningResolution`	Result block that confirms how a semantic learning opportunity was resolved.
`frameworkHints`	Advisory framework guidance for current-domain forms and controls when PKS has active hints.
`serviceDiscovery`	Service-category hints from instructions when task keywords match known PKS service metadata.
`learnMode.progress`	Learn-mode coverage block with active tasks, visited surfaces, interactions, and floor status.
`evidenceSummary`	TOB summary that helps agents compare a completion claim with observed evidence.
`okHints`	Hints that important operational state is missing, stale, or uncertain.
`goalContext`	CLS resume anchor for the current goal, step, and verification state on a target.
`taskAwareness`	Compact ETM context for matching profile, active instance, progress, and resume state.
`taskAwareness.profileMatch`	Best ETM profile match with goal, guidance summary, required checks, and completion condition.
`taskAwareness.activeInstances[]`	Active ETM task instances with status, progress, target URL, and resume hints.
`completionAllowed`	Signal used when ETM or evidence checks decide whether a task can be accepted as complete.
`verificationResults`	Structured outcome of task or evidence verification checks returned to agents.

TOB Observe Real tool evidence
LCJ Collect Learning candidate
ALP Review Evidence and maturity
CLS Verify Outcome confirmed
PKS / OK / ETM Store Right memory, current proof

Learning map From observation to guidance

A signal becomes useful only after Nova keeps the evidence, checks the candidate, verifies outcomes, and stores the right kind of knowledge.

Evidence goes to TOB and LCJ.
Review happens through ALP.
Action outcomes are verified through CLS.
Website phenomena live in PKS.
Live facts live in OK.
Task runs and coverage live in ETM.

In short

Learning Areas

The Learning Chain

Kinds of Learning

Where It Belongs

Limits

Related Topics