Apr 30, 2020 · We present MLSUM, the first large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five ...
The dataset is built from online news outlets, and contains over 1.5M article-summary pairs in 5 languages: French, German, Spanish, Russian, and Turkish, ...
A large-scale MultiLingual SUMmarization dataset. Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages.
The large-scale MultiLingual SUMmarization corpus. GitHub repository: ThomasScialom/MLSUM.
This work presents MLSUM, the first large-scale MultiLingual SUMmarization dataset obtained from online newspapers, which contains 1.5M+ article/summary ...
MLSum is a multilingual summarization dataset crawled from different news websites. The GEM version supports the German and Spanish subset.
MLSUM: The Multilingual Summarization Corpus. Thomas Scialom, Paul-Alexis ... Abstract: We present MLSUM, the first large-scale MultiLingual SUMmarization dataset ...