Back-to-Front Online Lyndon Forest Construction

Badkobeh, Golnaz; Crochemore, Maxime; Ellert, Jonas and Nicaud, Cyril. 2022. 'Back-to-Front Online Lyndon Forest Construction'. In: 33rd Annual Symposium on Combinatorial Pattern Matching. Prague, Czech Republic 27–29 June 2022. [Conference or Workshop Item]

[img]
Preview
Text
rlt-submitted.pdf - Published Version
Available under License Creative Commons Attribution.

Download (884kB) | Preview

Abstract or Description

A Lyndon word is a word that is lexicographically smaller than all of its non-trivial rotations (e.g. ananas is a Lyndon word; banana is not a Lyndon word due to its smaller rotation abanan). The Lyndon forest (or equivalently Lyndon table) identifies maximal Lyndon factors of a word, and is of great combinatoric interest, e.g. when finding maximal repetitions in words. While optimal linear time algorithms for computing the Lyndon forest are known, none of them work in an online manner. We present algorithms that compute the Lyndon forest of a word in a reverse online manner, processing the input word from back to front. We assume a general ordered alphabet, i.e. the only elementary operations on symbols are comparisons of the form less-equal-greater. We start with a naive algorithm and show that, despite its quadratic worst-case behaviour, it already takes expected linear time on words drawn uniformly at random. We then introduce a much more sophisticated algorithm that takes linear time in the worst case. It borrows some ideas from the offline algorithm by Bille et al. (ICALP 2020), combined with new techniques that are necessary for the reverse online setting. While the back-to-front approach for this computation is rather natural (see Franek and Liut, PSC 2019), the steps required to achieve linear time are surprisingly intricate. We envision that our algorithm will be useful for the online computation of maximal repetitions in words.

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

https://doi.org/10.4230/LIPIcs.CPM.2022.10

Additional Information:

Supplementary Material Source Code: https://github.com/jonas-ellert/right-lyndon
archived at swh:1:dir:e0cf158eeec99338193603aded28b5ebc252e87b

Keywords:

Lyndon factorisation, Lyndon forest, Lyndon table, Lyndon array, right Lyndon tree, Cartesian tree, standard factorisation, online algorithms

Related URLs:

Departments, Centres and Research Units:

Computing

Dates:

DateEvent
30 March 2022Accepted
July 2022Published

Event Location:

Prague, Czech Republic

Date range:

27–29 June 2022

Item ID:

31914

Date Deposited:

15 Jun 2022 09:40

Last Modified:

29 Jun 2022 01:26

URI:

https://research.gold.ac.uk/id/eprint/31914

View statistics for this item...

Edit Record Edit Record (login required)