Overview
Matching strategies control how Experio identifies duplicate or related entities across different data sources. When the same person, project, or organization appears in multiple documents, matching strategies determine whether they should be merged into a single entity or kept separate. Navigate to Admin > Graph > Matching Strategies.Strategy Types
Default Strategy
The default strategy applies to all entity types unless overridden. It defines the baseline matching behavior for your entire knowledge graph.Entity-Specific Strategies
Create custom strategies for specific entity types that need different matching rules. For example, person names may need fuzzy matching, while project codes need exact matching.Configuration Sections
Each matching strategy has four configuration areas:1. Matching Methods
Enable and weight different matching algorithms:| Method | Description |
|---|---|
| Exact | Case-insensitive exact string match |
| Vector Similarity | Semantic similarity using vector embeddings |
| Fuzzy | Approximate string matching (handles typos, abbreviations) |
| Phonetic | Sound-based matching (handles spelling variations) |
| Synonym | Matches using defined synonym lists |
2. Filters
Apply additional constraints to matching: Temporal Filter:- Max Distance (Days) — Maximum time difference between entities (default: 1825 / 5 years)
- Penalty Per Year — Score reduction per year of difference (default: 0.1)
- Date Attribute Names — Which entity attributes contain dates
- Parent Entity Types — Limit matching to entities with specific parent types
- Relationship Types — Consider only entities connected by specific relationships
3. Normalization
Preprocessing steps applied before matching:| Option | Description |
|---|---|
| Unicode Normalization | Standardize unicode characters |
| Remove Prefixes | Strip “The”, “A”, “An” from entity names |
| Remove Company Suffixes | Strip “Inc”, “LLC”, “Corp”, etc. |
| Expand Abbreviations | Expand common abbreviations to full forms |
| Punctuation Normalization | Standardize punctuation |
4. Thresholds
Define confidence levels that determine how matches are handled:| Threshold | Default | Behavior |
|---|---|---|
| Auto-Match | 0.9 | Matches above this score are merged automatically |
| LLM Disambiguation | 0.7 | Matches in this range are sent to the AI for review |
| Human Review | 0.5 | Matches in this range are queued for manual review |
Creating Entity-Specific Strategies
- Click Create New Strategy
- Select the entity type this strategy applies to
- Configure matching methods, filters, normalization, and thresholds
- Save the strategy