ARI Formalization
ARI solves a structured prediction problem over a bipartite graph between source and target field paths.
The objective is to infer a globally consistent mapping via maximum a posteriori (MAP) inference in a constrained graphical model.
Problem setup
Let the sets of source and target field paths be:
S={si}T={tj}A mapping is a binary relation R:
R⊆S×TDefine indicator variables:
xs,t∈{0,1},(s,t)∈S×Twhere xs,t=1 means that the source s and target t fields are compatible.
Then, equivalently, we can define R as:
R={(s,t)∣xs,t=1}MAP objective
ARI defines a Gibbs distribution over mappings:
P(R)∝exp(−E(R))and seeks the MAP solution:
R∗=argRminE(R)Energy decomposition
The energy decomposes into unary and pairwise terms:
E(x)=(s,t)∈S×T∑ψu(s,t)xs,t+(s,t)=(s′,t′)∑ψp((s,t),(s′,t′))xs,txs′,t′where:
- ψu encodes local compatibility
- ψp encodes structural consistency
This corresponds to a pairwise Markov random field over candidate matches.
Feature representation
Each candidate pair is mapped to a feature vector:
ϕ:S×T→Rddenoted by xs,t for pair (s,t):
xs,t=ϕ(s,t)Typical features include:
- Lexical similarity
- Structural context
- Ontology compatibility
- Embedding similarity: ϕemb(s,t)=⟨f(s),g(t)⟩
Unary scoring
Unary potentials (scores) are parameterized as:
ψu(s,t)=−fθ(xs,t)Examples:
- Linear: fθ=w⊤x
- Tree/MLP models (GBDT, neural scoring)
Candidate pruning retains:
Ck(s)=Top-kfθ(xs,t){t∈C(s)}Pairwise / structured scoring
Pairwise potentials capture dependencies:
ψp((s,t),(s′,t′))=−pθ((s,t),(s′,t′))Examples:
- Cross-encoder: pθ=hθ(s,t,s′,t′)
- Structured models (CRF / GNN): pθ=ψθ(G,(s,t),(s′,t′))
These enforce:
- Structural alignment
- Co-occurrence patterns
- Ontological consistency
Constrained optimization
The MAP problem can be written as an integer quadratic program.
Using ψu(s,t)=−fθ(xs,t) and ψp((s,t),(s′,t′))=−pθ((s,t),(s′,t′)), the MAP objective
R∗=argRminE(R)becomes (equivalently)
x∈{0,1}∣S∣×∣T∣max(s,t)∑fθ(xs,t)xs,t+(s,t)=(s′,t′)∑pθ((s,t),(s′,t′))xs,txs′,t′subject to:
One-to-one constraints
t∑xs,t≤1∀ss∑xs,t≤1∀tType / ontology constraints
xs,t=0if incompatible(s,t)Structural constraints
- Mutual exclusion: xs,t+xs′,t′≤1
- Hierarchical consistency: xs,t≤xparent(s),parent(t)
This yields an ILP / quadratic optimization problem.
Solution
The optimal mapping is:
R∗={(s,t)∈Ck∣xs,t=1}Training objective
The models are trained over heterogeneous datasets:
D=αDpre+βDgold+γDfeedback+δDnegOptimize:
L=λ1Lcontrastive+λ2Lhard-neg+λ3Lranking
References
Cohesive ARI Architecture