Create an AI Agent-Ready Dataset
FactVerse AI Agent workflows need data that is traceable, fresh enough for the task, mapped to the right operational identity, and reviewed by the right owner. Use this recipe to prepare a DFS Pro dataset for AI Agent workflows.
Preparation flow
Outcome
At the end of this workflow, you should have:
- a defined Agent question or workflow;
- source systems connected or imported;
- mappings reviewed;
- a DFS Pro dataset created;
- profile and quality issues reviewed;
- steward assigned;
- dataset validated;
- source limitations documented for Agent users.
Step 1: Define the Agent workflow
Start with the operational task, not the data table.
Examples:
- summarize equipment status before inspection;
- prepare predictive maintenance evidence;
- compare recent alarms with work orders;
- explain why an energy or facility value changed;
- prepare a reviewed work-order draft.
Record:
- user role;
- site or tenant boundary;
- assets or systems involved;
- expected output;
- reviewer;
- allowed action boundary.
Step 2: Identify required evidence
List the evidence the Agent should use.
| Evidence type | Examples |
|---|---|
| Asset context | Asset ID, equipment name, aliases, location, system relationship. |
| Signals | Sensor values, meter values, status, event history. |
| Work records | Work orders, inspection records, close notes, operator actions. |
| Documents | SOP, maintenance manual, drawings, compliance notes. |
| Quality notes | Missing values, stale values, source limitations. |
Step 3: Connect or import source data
Use DFS Lite connectors for live or scheduled sources. Use DFS Pro dataset creation for imported or prepared data.
For each source, confirm:
- source owner;
- update frequency;
- timestamp field;
- identity field;
- unit and value range;
- permission to use the data in the Agent workflow.
Step 4: Map operational identity
AI Agent output is only useful when identity is stable.
Review:
- source equipment names;
- asset IDs;
- aliases;
- location hierarchy;
- point names;
- work-order asset references;
- document references.
If source systems use different names for the same asset, create or update mappings before validating the dataset.
Step 5: Create DFS Pro dataset
In Dataset Center:
- Create dataset.
- Choose source type.
- Add name and description.
- Add or confirm source ID.
- Save.
- Open detail page.
- Preview rows.
- Review profile.
Dataset description should include:
- source system;
- purpose;
- owner;
- expected refresh cadence;
- known limitations;
- intended Agent workflow.
Step 6: Review quality and profile
Check:
- row count;
- required columns;
- null ratio;
- distinct IDs;
- timestamp range;
- stale records;
- quality issues;
- schema changes.
For Agent workflows, document limitations in plain language. Example: "Work-order close notes are available from the CMMS feed, but part replacement details are only available after nightly sync."
Step 7: Assign steward and validate
Assign a steward who can approve the dataset for the Agent workflow.
Validate when:
- identity mapping is reviewed;
- source timestamps are understood;
- quality issues are acceptable for the task;
- dataset schema is stable enough;
- downstream reviewer accepts the assumptions.
Step 8: Link to Agent workflow
When using the dataset in an Agent workflow, keep these visible:
- dataset name and version;
- source timestamp;
- quality notes;
- reviewer;
- allowed output type;
- action approval requirement.
For MCP workflows, confirm scopes and endpoint access separately in the MCP documentation.
Failure handling
| Problem | Action |
|---|---|
| Agent task is too broad | Narrow the answer contract to one role, boundary, time window, and output type. |
| Evidence source is missing | Record it as unavailable and keep it out of the Agent answer until connected or imported. |
| Identity mapping is ambiguous | Resolve aliases and asset IDs before validating the dataset. |
| Quality issues affect the answer | Add plain-language limitations or block the dataset until source or mapping is corrected. |
| Reviewer cannot approve the dataset | Keep the Agent workflow in read-only validation and assign a steward before broader use. |
Related pages
| Page | Use |
|---|---|
| DFS Pro Datasets | Create and validate the dataset. |
| Prepare DFS Data for AI Agent Workflows | Prepare source connectors, mappings, quality notes, fusion outputs, and handoff records before Agent validation. |
| Data Quality | Review source freshness and quality. |
| FactVerse AI Agent Data Readiness | Check whether source evidence is ready for Agent use. |
| MCP Scope Matrix | Plan endpoint and scope access for MCP clients. |