Skip to Main Content (Press Enter)

Logo UNIMI
  • ×
  • Home
  • People
  • Projects
  • Fields
  • Units
  • Outputs
  • Third Mission

Expertise & Skills
Logo UNIMI

|

Expertise & Skills

unimi.it
  • ×
  • Home
  • People
  • Projects
  • Fields
  • Units
  • Outputs
  • Third Mission
  1. Outputs

On Nonlinear Learned String Indexing

Academic Article
Publication Date:
2023
Citation:
On Nonlinear Learned String Indexing / P. Ferragina, M. Frasca, G. Cataldo MarinĂ², G. Vinciguerra. - In: IEEE ACCESS. - ISSN 2169-3536. - 11:(2023 Jul 14), pp. 74021-74034. [10.1109/ACCESS.2023.3295434]
abstract:
We investigate the potential of several artificial neural network architectures to be used as an index on a sorted set of strings, namely, as a mapping from a query string to (an estimate of) its lexicographic rank in the set, which allows solving some interesting string-search operations such as range and prefix searches. Our evaluation on a variety of real and synthetic datasets shows that learned models can beat the space vs error trade-off of the classic (possibly compressed) trie-based solutions for relatively dense datasets only, while being slower to be trained and queried. This leads us to conclude that learned models are not yet competitive with classic trie-based solutions, and thus cannot completely replace them, but possibly only integrate them. Although our study does not settle the question conclusively, it highlights appropriate methods, provides a baseline for comparison, and introduces several open problems, thereby serving as a starting point for future research.
IRIS type:
01 - Articolo su periodico
Keywords:
String dictionaries; string search; prefix search; tries; neural networks; machine learning; data structures; learned indexes;
List of contributors:
P. Ferragina, M. Frasca, G. Cataldo MarinĂ², G. Vinciguerra
Authors of the University:
FRASCA MARCO ( author )
Link to information sheet:
https://air.unimi.it/handle/2434/999394
Full Text:
https://air.unimi.it/retrieve/handle/2434/999394/2283349/On_Nonlinear_Learned_String_Indexing.pdf
Project:
Multi-criteria optimized data structures: from compressed indexes to learned indexes, and beyond
  • Research Areas

Research Areas

Concepts


Settore INF/01 - Informatica
  • Guide
  • Help
  • Accessibility
  • Privacy
  • Use of cookies
  • Legal notices

Powered by VIVO | Designed by Cineca | 26.5.1.0