Confidence Scoring & Explainability
Introduction & Philosophy
Our APIs provide detailed explanations for every resolution attempt, helping you understand both the confidence level and reasoning behind each result. This explainability system is actively developed in collaboration with our customers to provide flexible yet standardized insights into our AI agent's decision-making process.
How Confidence Scores Work
Confidence scores range from 0-100, representing how certain our agent is in the output results. This score is generated through a robust process that combines multiple scoring mechanisms behind the scenes, which we continually update and improve to enhance accuracy.
Factors Array Overview
The factors array breaks down the reasoning behind each confidence score into two categories:
Strengths: Evidence that supports the match or resolution
Limitations: Weaknesses or uncertainties that lower confidence
Each factor includes:
- Standardized codes for programmatic processing
- Impact classification (strength vs limitation)
- Natural language descriptions explaining the reasoning
- Severity levels for limitations (low, medium, high, critical)
Actions Array - Improving Your Results
Actions are suggested improvements that appear when confidence scores are lower due to unclear or insufficient input data. These provide actionable steps to get better results in future queries.
Common scenarios where actions help:
- Generic company names with multiple matches
- Missing location data when multiple entities exist
- Ambiguous legal structures (Inc. vs LLC with same name)
Real-World Example
Scenario: User searches for "Smith Construction" without location
Why low confidence: Even a human couldn't confidently distinguish between:
- Smith Construction Inc. (manufacturing company)
- Smith Construction LLC (residential contractor)
Suggested action: "Provide location (city/state) or legal structure to distinguish between multiple entities"
Schema Structure
{
"factors": [
{
"type": "limitation" | "strength",
"code": "string",
"description": "string",
"impact": "string",
"severity": "low" | "medium" | "high" | "critical"
}
],
"actions": [
{
"code": "string",
"description": "Detailed description of the recommended action"
}
]
}
Common Factor Codes
Limitation Codes (Lower Confidence)
no_location_provided
: User did not provide location in querymultiple_potential_entities
: Multiple entities match search criterianame_too_generic
: Name provided is too broad or commonlocation_mismatch
: Provided location doesn't match entity informationinsufficient_data
: Not enough information found during researchunsupported_entity_type
: Entity type not yet supported (government, person, etc.)no_legal_entity_confirmed
: No official legal entity was confirmed
Strength Codes (Higher Confidence)
name_exact_match
: Provided name exactly matches legal entity namelocation_confirmed
: Provided location matches registered locationjurisdiction_match
: Entity jurisdiction matches query jurisdictionlegal_entity_confirmed
: Valid legal entity confirmed through official sourcesmultiple_sources_corroboration
: Multiple authoritative sources confirmed detailsregistration_number_match
: Provided registration number matched entity
API-Specific Variations
Factor and action codes vary slightly between Entity Resolution and Domain Intelligence APIs, reflecting the different data inputs and resolution processes.
Learn more:
- Entity Resolution API for complete code references
- Domain Intelligence API for complete code references
Using Explainability Data
For Developers
- Parse factor codes programmatically to understand result quality
- Use action suggestions to improve subsequent queries
- Monitor limitation patterns to optimize your data inputs
For Business Users
- Review natural language descriptions to understand match confidence
- Follow action recommendations to get more accurate results
- Use factors to validate whether results meet your quality standards
Next Steps:
- Try it: Entity Resolution API - See confidence scoring in action
- Learn more: Entities and Candidates - Understanding when entities vs candidates are returned
- Advanced: Domain Intelligence API - Confidence scoring for domain analysis