Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Vector symbolic architectures (VSAs) are a family of information
representation techniques which enable composition, i.e., creating complex
information structures from atomic vectors via binding and superposition, and
have recently found wide ranging applications in various neurosymbolic
artificial intelligence (AI) systems. Recently, Raviv proposed the use of
random linear codes in VSAs, suggesting that their subcode structure enables
efficient binding, while preserving the quasi-orthogonality that is necessary
for neural processing. Yet, random linear codes are difficult to decode under
noise, which severely limits the resulting VSA's ability to support recovery,
i.e., the retrieval of information objects and their attributes from a noisy
compositional representation.
In this work we bridge this gap by utilizing coding theoretic tools. First,
we argue that the concatenation of Reed-Solomon and Hadamard codes is suitable
for VSA, due to the mutual quasi-orthogonality of the resulting codewords (a
folklore result). Second, we show that recovery of the resulting compositional
representations can be done by solving a problem we call histogram recovery. In
histogram recovery, a collection of $N$ histograms over a finite field is given
as input, and one must find a collection of Reed-Solomon codewords of length
$N$ whose entry-wise symbol frequencies obey those histograms. We present an
optimal solution to the histogram recovery problem by using algorithms related
to list-decoding, and analyze the resulting noise resilience. Our results give
rise to a noise-resilient VSA with formal guarantees regarding efficient
encoding, quasi-orthogonality, and recovery, without relying on any heuristics
or training, and while operating at improved parameters relative to similar
solutions such as the Hadamard code.
Key Contributions
This work bridges the gap in VSA information recovery by utilizing coding theoretic tools, specifically arguing for the concatenation of Reed-Solomon and Hadamard codes. This approach aims to enable efficient binding while preserving quasi-orthogonality, thereby improving noise resilience and the ability to recover information from noisy compositional representations.
Business Value
Improved information representation and recovery techniques could lead to more robust AI systems, particularly in applications requiring symbolic reasoning and compositionality under noisy conditions, such as robotics or complex data analysis.