T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition

Ushio, Asahi; Camacho-Collados, Jose

doi:10.18653/v1/2021.eacl-demos.7

Computer Science > Computation and Language

arXiv:2209.12616 (cs)

[Submitted on 9 Sep 2022]

Title:T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition

Authors:Asahi Ushio, Jose Camacho-Collados

View PDF

Abstract:Language model (LM) pretraining has led to consistent improvements in many NLP downstream tasks, including named entity recognition (NER). In this paper, we present T-NER (Transformer-based Named Entity Recognition), a Python library for NER LM finetuning. In addition to its practical utility, T-NER facilitates the study and investigation of the cross-domain and cross-lingual generalization ability of LMs finetuned on NER. Our library also provides a web app where users can get model predictions interactively for arbitrary text, which facilitates qualitative model evaluation for non-expert programmers. We show the potential of the library by compiling nine public NER datasets into a unified format and evaluating the cross-domain and cross-lingual performance across the datasets. The results from our initial experiments show that in-domain performance is generally competitive across datasets. However, cross-domain generalization is challenging even with a large pretrained LM, which has nevertheless capacity to learn domain-specific features if fine-tuned on a combined dataset. To facilitate future research, we also release all our LM checkpoints via the Hugging Face model hub.

Comments:	Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021): System Demonstrations
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2209.12616 [cs.CL]
	(or arXiv:2209.12616v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.12616
Related DOI:	https://doi.org/10.18653/v1/2021.eacl-demos.7

Submission history

From: Asahi Ushio [view email]
[v1] Fri, 9 Sep 2022 15:00:38 UTC (7,492 KB)

Computer Science > Computation and Language

Title:T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators