Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Introduces Modular Linear Tokenization (MLT), a reversible and deterministic technique for encoding high-cardinality categorical identifiers into compact numerical vectors. MLT uses modular arithmetic and invertible linear transformations, offering explicit control over dimensionality and scalability while maintaining full reversibility.
Enables more efficient processing and modeling of large-scale categorical data, reducing memory footprint and computational costs in applications like recommendation systems and feature engineering.