Show simple item record

dc.contributor.authorCotumaccio, Nicola
dc.date.accessioned2024-03-15T18:05:10Z
dc.date.available2024-03-15T18:05:10Z
dc.date.issued2024-02-07
dc.identifier.urihttp://hdl.handle.net/10222/83663
dc.description.abstractWe introduce a new paradigm in graph compression and formal language theory. We show that the ideas behind some of the most important data structures for compressing and indexing strings - such as the suffix array, the Burrows-Wheeler Transform and the FM-index - are much more general and provide a new approach to studying automata and regular languages, which retrospectively explains the impact of these data structures. We classify all automata and all regular languages by their propensity to be sorted. Our classification represents a useful parameterization simultaneously for diverse automata-related measures: (i) the encoding bit-complexity of automata/labeled graphs, (ii) the complexity of operations on regular languages (e.g. membership) and on labeled graphs (e.g. pattern matching), (iii) the complexity of NFA determinization by the powerset-construction algorithm. To the best of our knowledge, ours is the only parameterization of automata/labeled graphs capturing simultaneously all these aspects. We show that our parameterization has deep and unexpected consequences both in data compression (encoding, pattern matching) and in automata theory (nondeterminism, entanglement, minimization).en_US
dc.language.isoenen_US
dc.subjectData Compressionen_US
dc.subjectAutomata Theoryen_US
dc.subjectBurrows-Wheeler Transformen_US
dc.subjectFM-indexen_US
dc.subjectRegular Languagesen_US
dc.subjectNondeterminismen_US
dc.subjectMinimizationen_US
dc.subjectSuffix Treesen_US
dc.subjectSuffix Arraysen_US
dc.titleData Compression Meets Automata Theoryen_US
dc.typeThesisen_US
dc.date.defence2024-01-31
dc.contributor.departmentFaculty of Computer Scienceen_US
dc.contributor.degreeDoctor of Philosophyen_US
dc.contributor.external-examinerKunihiko Sadakaneen_US
dc.contributor.thesis-readerPierluigi Crescenzien_US
dc.contributor.thesis-readerRoberto Grossien_US
dc.contributor.thesis-readerMeng Heen_US
dc.contributor.thesis-readerAlexandru Tomescuen_US
dc.contributor.thesis-supervisorTravis Gagieen_US
dc.contributor.thesis-supervisorNicola Prezzaen_US
dc.contributor.ethics-approvalNot Applicableen_US
dc.contributor.manuscriptsNot Applicableen_US
dc.contributor.copyright-releaseNot Applicableen_US
 Find Full text

Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record