AI Semantic Matching

Resolving the "Tower of Babel" Problem

Prediction markets rely on natural language to define events. However, different platforms describe the same event in different ways.

Polymarket: "Will Bitcoin hit $100,000 in 2024?"
Kalshi: "Bitcoin Price > $100,000 on Dec 31, 2024"
Limitless: "BTC/USD > 100k @ 2024 End"

To a computer, these are three completely different strings. To a human, they are the same event. Oraclyst uses an AI Semantic Engine to bridge this gap.

The Matching Process

Ingestion: The system continuously scrapes active markets from all supported venues.
Vectorization: We use OpenAI embeddings (text-embedding-3-small) to convert market titles and resolution rules into high-dimensional vectors.
Cosine Similarity: The engine compares the vector of a new market against the vectors of existing markets in our database. It calculates a "Similarity Score" from 0.00 to 1.00.
Clustering Logic:
- Score > 0.98: The markets are considered identical. They are merged into a single Unified Event ID in the Oraclyst interface.
- Score 0.85 - 0.98: The markets are flagged as "Related." They are sent to a human verification queue for manual review to ensure the resolution sources (e.g., AP vs. Reuters) are compatible.
- Score < 0.85: The market is treated as a unique, standalone event.

This technology allows Oraclyst to aggregate liquidity without requiring manual data entry for thousands of markets.

PreviousThe Smart Router Engine NextVirtual Liquidity Vaults (VLV)

Last updated 39 minutes ago

Good afternoon

The Matching Process