Vilém Zouhar,
Clara Meister,
Juan Gastaldi,
Li Du,
Mrinmaya Sachan,
Ryan Cotterell
(2023).
Tokenization and the Noiseless Channel.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
Clara Meister,
Tiago Pimentel,
Luca Malagutti,
Ryan Cotterell
(2023).
On the Efficacy of Sampling Adapters.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
Li Du,
Lucas Torroba Hennigen,
Tiago Pimentel,
Clara Meister,
Jason Eisner,
Ryan Cotterell
(2023).
A Measure-theoretic Characterization of Tight Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
Vilém Zouhar,
Clara Meister,
Juan Gastaldi,
Li Du,
Tim Vieira,
Mrinmaya Sachan,
Ryan Cotterell
(2023).
A Formal Perspective on Byte-Pair Encoding.
Findings of the Association for Computational Linguistics: ACL 2023.