NeFut Logo NeFut
Admin Login

[CS.DS] Nearly Optimal Solution for Internal Dictionary Matching

Published at: 2026-07-02 22:00 Last updated: 2026-07-04 11:13
#algorithm #optimization #Data Structure

We study the Internal Dictionary Matching (IDM) problem, where a dictionary $\mathcal{D}$ containing $d$ substrings and a text $T$ are given, and each query concerns the occurrences of patterns in $\mathcal{D}$ within another substring of $T$. We propose a novel $O(n)$-sized data structure named Basic Substring Structure (BASS), where $n$ is the length of the text $T$. With BASS, we can handle all types of queries in the IDM problem with nearly optimal query and preprocessing time. Specifically, our results include:

Blogger's Review: The introduction of the BASS data structure marks a significant breakthrough in the Internal Dictionary Matching problem, offering nearly optimal time complexity both theoretically and practically. This presents a new approach to handling large-scale text data, particularly in applications requiring frequent queries. Its efficient preprocessing and query times will greatly enhance the performance of text retrieval operations.

Original Source: https://arxiv.org/abs/2312.11873

[h] Back to Home