MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

Published in Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS), 2024

Download paper here

Recommended citation: Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Valentin Hofmann, Tomasz Limisiewicz, Yulia Tsvetkov, Noah A. Smith. (2024). “MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.” Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS) 2024.
