Back to Overview
Tech

Inside the Engine .

A small look at how the engine handles cultural data — what a record carries, the route from source to surface, and the gates that keep the bad stuff out. Enough to know what's holding it up. The deep internals stay deep.

Knowledge Graph

From dataset to connection.

It's not a database — it's a graph. Every artist, every venue, every work gets a stable ID and is semantically linked. Mozart doesn't sit somewhere alone — he connects to The Magic Flute, the Marionette Theatre, and the Festspiele.

Stable entity IDsEvery entity gets a unique fingerprint. Sources change, IDs don't.
Typed edgesCREATED, PERFORMED_AT, PART_OF, FEATURED_IN — the graph knows what each connection means.
Multi-hop traversalFind every festival that performs Mozart works — across three relationships, in one query.
Knowledge Graph
ARTIST
Mozart
WORK
Zauberflöte
VENUE
Staatsoper
FESTIVAL
Festspiele
Mozarteum Salzburg · Record
One recordFour facets
Facts
capacity1,580
addressSchwarzstraße 26
accessibleyes
Feel
intimacy0.72
energy0.41
formality0.88
Words
primaryDE · EN
extendedFR · IT · ES
scriptsJA · KO · ZH · AR
Receipts
confidence0.94
reviewed2026-04-12
sourceverified feed
Edited independently · Read together
The record

Four sides per record.

Every venue, festival, artist, work, production, and showtime carries the same four-sided structure. Edited independently, read together — so the API, the widget, and the signage all see one thing.

FactsNames, addresses, opening hours, capacity, accessibility flags. Written by humans or extracted from sources.
FeelMood, audience fit, intensity, themes. Scored on shared axes so a jazz club and a chamber concert compare on the same dimensions.
WordsTen languages, original script preserved for CJK and Arabic. Editable per language, per organisation.
ReceiptsWhich model produced what, when, with what confidence. Every machine-touched value can be traced.
The route

Source to surface.

Every record passes through the same five steps — same path, every time. A new country, language, or partner plugs into the same route without a rebuild.

IngestSchema.org JSON-LD, iCal feeds, CSV uploads, partner APIs, scraped sites, manual entry.
DedupeFive sources for the same place collapse to one. Spelling, script, abbreviation — all match.
EnrichScored on a fixed set of axes. No free-form labels.
TranslateTen language codes; original script preserved.
ServeOne record read by API, widgets, signage, dashboards, white-label CMS.
Pipeline · same route, every record
Ingest5 sources
Dedupe5 → 1
Enrichfixed axes
Translate10 languages
Serveevery surface
One record · every surface
Entity Resolution — Dedup Engine
Candidate Entities
MozarteumWebsite Scrape
Stiftung MozarteumMunicipal Feed
Mozarteum SalzburgPartner API
Matching...
Golden Record
Stiftung Mozarteum Salzburg
TypeConcert Hall
Confidence0.96
mozarteum_salzburg
3
Sources
1
Merged
−2
Duplicates
Entity resolution

One record per thing.

Five sources for the same venue collapse into one. Spelling variants, scripts, and abbreviations all match the same place — so what reads the data sees one truth, not five copies of it.

Fingerprint matchNames get normalised — diacritics stripped, articles removed, scripts unified — so the same venue across sources lands on the same key.
Cross-script linkingA venue named in German, English, Japanese, and Arabic resolves to one record. Original script preserved for display.
Provenance keptEvery merged source stays attached to the record. You can see who said what, when, and how it was reconciled.
Truth Score

Data quality is measurable.

Every record gets a truth score from 0 to 100 — based on source plausibility, graph coherence, and cross-reference agreement. Public sector partners get the number they need to defend funding decisions.

Source plausibilityOfficial organiser feeds weigh more than unverified web scrapes. The score reflects that.
Graph coherenceWhen three sources agree on a date, the score rises. When one contradicts, it drops.
Audit-readyScore history and contributing factors traceable per record — DPIA-ready.
SCHEMA.ORG
PARTNER.API
GRAPH.LINK
Truth Score
0TRUTH
MASTER
LOCAL OVERRIDE
Edit without breaking

Edit without breaking.

Your team edits one version, the network keeps another, and both stay current. You can rewrite a description for your audience without breaking what other partners see — and updates from the network keep flowing in underneath.

Your edits stay yoursEach organisation gets its own copy on top of the shared record. Edits are isolated — your changes never touch anyone else's view.
Inheritance, not duplicationA sub-organisation inherits its parent's edits unless it makes its own. No copy-pasting, no drift.
Live updates underneathTranslations, scores, and facts updated by the network keep flowing through — your local edits stay on top.